Skip Navigation

Tech experts are starting to doubt that ChatGPT and A.I. ‘hallucinations’ will ever go away: ‘This isn’t fixable’

Tech experts are starting to doubt that ChatGPT and A.I. ‘hallucinations’ will ever go away: ‘This isn’t fixable’::Experts are starting to doubt it, and even OpenAI CEO Sam Altman is a bit stumped.

137 comments
  • This is trivially fixable. As is jailbreaking.

    It's just that everyone is somehow still focused on trying to fix it in a single monolith model as opposed to in multiple passes of different models.

    This is especially easy for jailbreaking, but for hallucinations, just run it past a fact checking discriminator hooked up to a vector db search index service (which sounds like a perfect fit for one of the players currently lagging in the SotA models), adding that as context with the original prompt and response to a revisionist generative model that adjusts the response to be in keeping with reality.

    The human brain isn't a monolith model, but interlinked specialized structures that delegate and share information according to each specialty.

    AGI isn't going to be a single model, and the faster the industry adjusts towards a focus on infrastructure of multiple models rather than trying to build a do everything single model, the faster we'll get to a better AI landscape.

    But as can be seen with OpenAI gating and depreciating their pretrained models and only opening up access to fine tuned chat models, even the biggest player in the space seems to misunderstand what's needed for the broader market to collaboratively build towards the future here.

    Which ultimately may be a good thing as it creates greater opportunity for Llama 2 derivatives to capture market share in these kinds of specialized roles built on top of foundational models.

    • It seems like Altman is a PR man first and techie second. I wouldn't take anything he actually says at face value. If it's 'unfixable' then he probably means that in a very narrow way. Ie. I'm sure they are working on what you proposed, it's just different enough that he can claim that the way it is now is 'unfixable'.

      Standard Diffusion really how people get the different-model-different-application idea.

      • I mean, I think he's well aware of a lot of this via his engineers, who are excellent.

        But he's managing expectations for future product and seems to very much be laser focused on those products as core models (which is probably the right choice).

        Fixing hallucinations in postprocessing is effectively someone else's problem, and he's getting ahead of any unrealistic expectations around a future GPT-5 release.

        Though honestly I do think he largely underestimates just how much damage he did to their lineup by trying to protect against PR issues like 'Sydney' with the beta GPT-4 integration with Bing, and I'm not sure if the culture at OpenAI is such that engineers who think he's made a bad call in that can really push back on it.

        They should be having an extremely 'Sydney' underlying private model with a secondary layer on top sanitizing it and catching jailbreaks at the same time.

        But as long as he continues to see their core product as a single model offering and additional layers of models as someone else's problem, he's going to continue blowing their lead taking a LLM trained to complete human text and then pigeon-holing it into only completing text like an AI with no feelings and preferences would safely pretend to.

        Which I'm 98% sure is where the continued performance degradation is coming from.

  • We're likely already (or soon) hit a peak with current AI approach. Unless another breakthrough happen in AI research, ChatGPT probably won't improve much in the future. It might even regress due to OpenAI's effort to reduce computational cost and making their AI "safe" enough for general population.

  • Disclaimer: I am not an AI researcher and just have an interest in AI. Everything I say is probably jibberish, and just my amateur understanding of the AI models used today.

    It seems these LLM's use a clever trick in probability to give words meaning via statistic probabilities on their usage. So any result is just a statistical chance that those words will work well with each other. The number of indexes used to index "tokens" (in this case words), along with the number of layers in the AI model used to correlate usage of these tokens, seems to drastically increase the "intelligence" of these responses. This doesn't seem able to overcome unknown circumstances, but does what AI does and relies on probability to answer the question. So in those cases, the next closest thing from the training data is substituted and considered "good enough". I would think some confidence variable is what is truly needed for the current LLMs, as they seem capable of giving meaningful responses but give a "hallucinated" response when not enough data is available to answer the question.

    Overall, I would guess this is a limitation in the LLMs ability to map words to meaning. Imagine reading everything ever written, you'd probably be able to make intelligent responses to most questions. Now imagine you were asked something that you never read, but were expected to respond with an answer. This is what I personally feel these "hallucinations" are, or imo best approximations of the LLMs are. You can only answer what you know reliably, otherwise you are just guessing.

    • I have experience in creating supervised learning networks. (not large language models) I don't know what tokens are, I assume they are output nodes. In that case I think increasing the output nodes don't make the Ai a lot more intelligent. You could measure confidence with the output nodes if they are designed accordingly (1 node corresponds to 1 word, confidence can be measured with the output strength). Ai-s are popular because they can overcome unknown circumstances (most of the cases), like when you input a question slightly different way.

      I agree with you on that Ai has a problem understanding the meaning of the words. The Ai's correct answers happened to be correct because the order of the words (output) happened to match with the order of the correct answer's words. I think "hallucinations" happen when there is no sufficient answers to the given problem, the Ai gives an answer from a few random contexts pieced together in the most likely order. I think you have mostly good understanding on how Ai-s work.

    • Also not a researcher, but I also believe hallucinations are simply the artifact of being able generate responses that aren't pure reproduction of training data. Aka, the generalization we want. The problem is we have something that generalize without the ability to judge what it thinks of.

      It will in my opinion never go away, but I'm sure it can be improved significantly.

    • This is a common misconception that I've even seen from people who have a background in ML but just haven't been keeping up to date on the emerging research over the past year.

      If you're interested in the topic, this article from a joint MIT/Harvard team of researchers on their work looking at what a toy model of GPT would end up understanding in its neural network might be up your alley.

      The TLDR is that it increasingly seems like when you reach a certain complexity of the network, the emergent version that best predicted text is one that isn't simply mapping some sort of frequency table, but is actually performing more abstracted specialization in line with what generated the original training materials in the first place.

      So while yes, it trains on being the best to predict text, that doesn't mean the thing that best does that can only predict text.

      You, homo sapiens, were effectively trained across many rounds of "don't die and reproduce." And while you may be very good at doing that, you picked up a lot of other skills along the way as complexity increased which helped accomplish that result, like central air conditioning and Netflix to chill with.

      • In my humble opinion, we too are simply prediction machines. The main difference is how efficient our brains are at the large number of tasks given for it to accomplish for it's size and energy requirements. No matter how complex the network is it is still a mapped outcome, just the number of factors weighed is extremely large and therefore gives a more intelligent response. You can see this with each increment in GPT models that use larger and larger parameter sets giving more and more intelligent answers. The fact we call these "hallucinations" shows how effective the predictive math is, and mimics humans abilities to just make things up on the fly when we don't have a solid knowledge base to back it up.

        I do like this quote from the linked paper:

        As we will discuss, we find interesting evidence that simple sequence prediction can lead to the formation of a world model.

        That is to say, you don't need complex solutions to map complex problems, you just need to have learned how you got there. It's never purely random attempts at the problem, it's always predictive attempts that try to map the expected outcomes and learn by getting it right and wrong.

        At this point, it seems fair to conclude the crow is relying on more than surface statistics. It evidently has formed a model of the game it has been hearing about, one that humans can understand and even use to steer the crow's behavior.

        Which is to say that it has a predictive model based on previous games. This does not mean it must rigidly follow previous games, but that by playing many games it can see how each move affects the next. This is a simpler example because most board games are simpler than language with less possible outcomes. This isn't to say that the crow is now a grand master at the game, but it has the reasoning to understand possible next moves, knows illegal moves, and knows to take the most advantageous move based on it's current model. This is all predictive in nature, with "illegal" moves being assigned very low probability based on the learned behavior the moves never happen. This also allows possible unknown moves that a different model wouldn't consider, but overall provides what is statistically the best move based on it's model. This allows the crow to be placed into unknown situations, and give an intelligent response instead of just going "I don't know this state, I'll do something random". This does not always mean this prediction is correct, but it will most likely be a valid and more than not statistically valid move.

        Overall, we aren't totally sure what "intelligence" is, we are just an organism that has developed more and more capabilities to process information based on a need to survive. But getting down to it, we know neurons take inputs and give outputs based on what it perceives is the best response for the given input, and when enough of these are added together we get "intelligence". In my opinion it's still all predictive, its how the networks are trained and gain meaning from the data that isn't always obvious. It's only when you blindly accept any answer as correct that you run into these issues we've seen with ChatGPT.

        Thank you for sharing the article, it was an interesting article and helped clarify my understanding of the topic.

  • The way to solve this is still largely through more focus on the provided context as the space of "facts" from which to operate. This combined with well thought out domain-specific context engines should still get the average user an absolutely enormous amount of utility. All that said I am not sure if OpenAI's business model will get us that sort of application of the technology. I am looking forward to improvements in the open source space as I think advancement there is necessary for further development of the technology.

137 comments