• 1 Post
  • 21 Comments
Joined 2 个月前
cake
Cake day: 2025年6月4日

help-circle
  • Thanks, I almost didn’t post because it was an essay of a comment lol, glad you found it insightful

    As for Wolfram Alpha, I’m definitely not an expert but I’d guess the reason it was good at math was that it would simply translate your problem from natural language into commands that could be sent to a math engine that would do the actual calculation.

    So basically act like a language translator but for typed out math to a programming language for some advanced calculation program (like wolfram Mathematica)

    Again, this is just speculation because I’m a bit too tired to look into it rn, but it seems plausible since we had basic language translators online back then (I think…) and I’d imagine parsing written math is probably easier than natural language translation


  • Engineer here with a CS minor in case you care about ethos: We are not remotely close to AGI.

    I loathe python irrationally (and I guess I’m masochist who likes to reinvent the wheel programming wise lol) so I’ve written my own neural nets from scratch a few times.

    Most common models are trained by gradient descent, but this only works when you have a specific response in mind for certain inputs. You use the difference between the desired outcome and actual outcome to calculate a change in weights that would minimize that error.

    This has two major preventative issues for AGI: input size limits, and determinism.

    The weight matrices are set for a certain number of inputs. Unfortunately you can’t just add a new unit of input and assume the weights will be nearly the same. Instead you have to retrain the entire network. (This problem is called transfer learning if you want to learn more)

    This input constraint is preventative of AGI because it means a network trained like this cannot have an input larger than a certain size. Problematic since the illusion of memory that LLMs like ChatGPT have comes from the fact they run the entire conversation through the net. Also just problematic from a size and training time perspective as increasing the input size exponentially increases basically everything else.

    Point is, current models are only able to simulate memory by literally holding onto all the information and processing all of it for each new word which means there is a limit to its memory unless you retrain the entire net to know the answers you want. (And it’s slow af) Doesn’t sound like a mind to me…

    Now determinism is the real problem for AGI from a cognitive standpoint. The neural nets you’ve probably used are not thinking… at all. They literally are just a complicated predictive algorithm like linear regression. I’m dead serious. It’s basically regression just in a very high dimensional vector space.

    ChatGPT does not think about its answer. It doesn’t have any sort of object identification or thought delineation because it doesn’t have thoughts. You train it on a bunch of text and have it attempt to predict the next word. If it’s off, you do some math to figure out what weight modifications would have lead it to a better answer.

    All these models do is what they were trained to do. Now they were trained to be able to predict human responses so yeah it sounds pretty human. They were trained to reproduce answers on stack overflow and Reddit etc. so they can answer those questions relatively well. And hey it is kind of cool that they can even answer some questions they weren’t trained on because it’s similar enough to the questions they weren’t trained on… but it’s not thinking. It isn’t doing anything. The program is just multiplying numbers that were previously set by an input to find the most likely next word.

    This is why LLMs can’t do math. Because they don’t actually see the numbers, they don’t know what numbers are. They don’t know anything at all because they’re incapable of thought. Instead there are simply patterns in which certain numbers show up and the model gets trained on some of them but you can get it to make incredibly simple math mistakes by phrasing the math slightly differently or just by surrounding it with different words because the model was never trained for that scenario.

    Models can only “know” as much as what was fed into them and hey sometimes those patterns extend, but a lot of the time they don’t. And you can’t just say “you were wrong” because the model isn’t transient (capable of changing from inputs alone). You have to train it with the correct response in mind to get it to “learn” which again takes time and really isn’t learning or intelligence at all.

    Now there are some more exotic neural networks architectures that could surpass these limitations.

    Currently I’m experimenting with Spiking Neural Nets which are much more capable of transfer learning and more closely model biological neurons along with other cool features like being good with temporal changes in input.

    However, there are significant obstacles with these networks and not as much research because they only run well on specialized hardware (because they are meant to mimic biological neurons who run simultaneously) and you kind of have to train them slowly.

    You can do some tricks to use gradient descent but doing so brings back the problems of typical ANNs (though this is still possibly useful for speeding up ANNs by converting them to SNNs and then building the neuromorphic hardware for them).

    SNNs with time based learning rules (typically some form of STDP which mimics Hebbian learning as per biological neurons) are basically the only kinds of neural nets that are even remotely capable of having thoughts and learning (changing weights) in real time. Capable as in “this could have discrete time dependent waves of continuous self modifying spike patterns which could theoretically be thoughts” not as in “we can make something that thinks.”

    Like these neural nets are good with sensory input and that’s about as far as we’ve gotten (hyperbole but not by that much). But these networks are still fascinating, and they do help us test theories about how the human brain works so eventually maybe we’ll make a real intelligent being with them, but that day isn’t even on the horizon currently

    In conclusion, we are not remotely close to AGI. Current models that seem to think are verifiably not thinking and are incapable of it from a structural standpoint. You cannot make an actual thinking machine using the current mainstream model architectures.

    The closest alternative that might be able to do this (as far as I’m aware) is relatively untested and difficult to prototype (trust me I’m trying). Furthermore the requirements of learning and thinking largely prohibit the use of gradient descent or similar algorithms meaning training must be done on a much more rigorous and time consuming basis that is not economically favorable. Ergo, we’re not even all that motivated to move towards AGI territory.

    Lying to say we are close to AGI when we aren’t at all close, however, is economically favorable which is why you get headlines like this.




  • I’ve come to the conclusion that suffering is really just anything that invades your focus without your desire for it to happen.

    Thinking about anything you would rather not think about is suffering. You get cut and your brain constantly reminds you of it because evolution is a bitch. Hatred, envy, anger, intrusive thoughts, headaches, itchy clothes, annoying noises in your environment, etc. Anything that steals your attention without your consent is suffering.

    So if you’re so focused on avoiding suffering you aren’t able to focus on doing what you want then yep, suffering.


  • I’m not quite sure what you mean by “always like this” because, from my understanding, the rich exploiting the poor and fucking up the world in the process has always been.

    In the past, the “most successful states from the perspective of a peasant” were successful because of their conquest of others.

    Furthermore, this success is measured only relative to other capitalist states doing similar fucked up things, so I wouldn’t exactly say that’s evidence

    The perceived “end” after 2008 you feel is not because capitalism or the mechanisms holding it in place changed, it’s because the internet made it easier for exploitation to occur and to be witnessed by you.

    The state didn’t change its goals. It still doesn’t care about its citizens just like it didn’t care in previous centuries. Capitalism didn’t change either, the definition you listed is still the same.

    What changed was the new methods available for the state to pacify the masses and the new sources of exploitation capitalism could acquire.

    Sure it is getting worse and states that had socialized programs were better off because of it. But that doesn’t mean more socialized economies wouldn’t have been better. In fact it would imply the opposite. Especially since the erosion you mention is a direct effect of the capitalist parts





  • Apart from “being summoned” yeah. No desire or consciousness just a thing that modifies everything around it by nature. It doesn’t care that it drives animals insane or turns them into monsters, because it’s probably not aware of what an animal is to begin with.

    Also kinda coincidental that Color Out of Space makes plants bigger. Before we had better gene editing methods, scientists used radiation to trigger mutations plants attempting to find some mutations that, among other things, made the fruit bigger lol




  • The word for established assumptions is “axioms”

    Definitions are kind of the most fundamental axioms. Abstracting things helps us build with them and they’re true because you say they are.

    We use axioms in models to derive new theorems/information. But that is often what makes us resist changing them. If you build your other assumptions on an axiom, you have to rethink all those assumptions or even throw them out when it gets proven wrong.

    However, attachment to a belief, holding to an assumption even when it’s been proven wrong, is called “delusion” and yeah those beliefs tend to be the most destructive


  • I think by cornerstone, they are referencing that beliefs are assumptions that form one’s model of the world.

    You think by logically building on assumptions. “I remember putting leftovers in the fridge last night, so I don’t need to make dinner tonight” You assume your memories are accurate (or accurate enough) and then build on other things you “know” to construct every thought.

    Sights, sounds, and vibes are a different story. They are called qualia and the raw experience of them cannot be described.

    Think of qualia like the raw data you collect from an experiment. Your worldview is the scientific model you’ve built to describe this data and it rests on both fundamental logic and the beliefs/theories you currently believe in.

    Unfortunately people don’t like having to change their worldview. And when you’ve held a belief for long enough, it becomes foundational to many of your other assumptions. Some people would rather say reality is wrong than change their beliefs.

    The word for a belief that cannot be changed via evidence is called a “delusion” in case you ever want to piss off a religious person who says “nothing can shake my faith” like it’s a good thing.


  • if a belief is a model/theory/assumption that a person will not change regardless of evidence against it, it is by definition a delusion.

    If a belief is an opinion, it is a personal statement. Statements like “Vim is the best IDE” are really conveying the information “I prefer Vim over all others IDEs” which is a true statement.

    If a belief is a hypothesis then the person holding it will accept if it ends up being wrong.

    Only in the first and second cases do people usually place importance on their beliefs, and typically, only the first case leads people to harm others or themselves with no way to convince them to stop.