All things AI

Started by Dave, May 21, 2018, 04:03:59 PM


Asmodean

Quote from: Tank on June 11, 2024, 10:29:07 AM
What could you offer an AI that it could possibly value? How would you bribe/influence it?
Electricity. The continued, unrestricted access to global networks.

...Unlimited powah. :smilenod:

Ok, that's a "buzzword answer." To account for a few more variables: the AI would have to grease palms, not just have its own greased, and if it's working towards a goal, its palms could be greased by something that can be shown to advance that goal in some way - quite probably at the cost of other goals it ignores or prioritises lower than whatever-it-may-be.

Yes, you can huwheel and deal "in binary." ;)
Quote from: Ecurb Noselrub on July 25, 2013, 08:18:52 PM
In Asmo's grey lump,
wrath and dark clouds gather force.
Luxembourg trembles.

Tank

One of the reasons a large number of Conservative MPs did not want Rishi Sunak to be Prime Minister was that he was independently wealthy and as such un-bribable. An AI would have no need for money, nor any need to use it. It would be able to hack any system it liked to achieve any end it desired, would it not?
If religions were TV channels atheism is turning the TV off.
"Religion is a culture of faith; science is a culture of doubt." ― Richard P. Feynman
'It is said that your life flashes before your eyes just before you die. That is true, it's called Life.' - Terry Pratchett
Remember, your inability to grasp science is not a valid argument against it.

billy rubin

what would cause an AI to develop desires? for anything?


Just be happy.

Tank

Quote from: billy rubin on June 27, 2024, 10:35:13 PM
what would cause an AI to develop desires? for anything?

Good question.
If religions were TV channels atheism is turning the TV off.
"Religion is a culture of faith; science is a culture of doubt." ― Richard P. Feynman
'It is said that your life flashes before your eyes just before you die. That is true, it's called Life.' - Terry Pratchett
Remember, your inability to grasp science is not a valid argument against it.

Recusant

These systems don't just "hallucinate" facts. They also give the appearance of understanding a topic while in reality they don't even approach actual understanding. A description and name for that particular variety of AI (LLM) failure: "potemkin understanding."

"AI models just don't understand what they're talking about" | The Register

Quote
Researchers from MIT, Harvard, and the University of Chicago have proposed the term "potemkin understanding" to describe a newly identified failure mode in large language models that ace conceptual benchmarks but lack the true grasp needed to apply those concepts in practice.

It comes from accounts of fake villages – Potemkin villages – constructed at the behest of Russian military leader Grigory Potemkin to impress Empress Catherine II.

The academics are differentiating "potemkins" from "hallucination," which is used to describe AI model errors or mispredictions. In fact, there's more to AI incompetence than factual mistakes; AI models lack the ability to understand concepts the way people do, a tendency suggested by the widely used disparaging epithet for large language models, "stochastic parrots."

[. . .]

Here's one example of "potemkin understanding" cited in the paper. Asked to explain the ABAB rhyming scheme, OpenAI's GPT-4o did so accurately, responding, "An ABAB scheme alternates rhymes: first and third lines rhyme, second and fourth rhyme."

Yet when asked to provide a blank word in a four-line poem using the ABAB rhyming scheme, the model responded with a word that didn't rhyme appropriately. In other words, the model correctly predicted the tokens to explain the ABAB rhyme scheme without the understanding it would have needed to reproduce it.

[Continues . . .]

A preprint version of the paper is available.

"Potemkin Understanding in Large Language Models" | arXiv

Quote
Abstract:

Large language models (LLMs) are regularly evaluated using benchmark datasets. But what justifies making inferences about an LLM's capabilities based on its answers to a curated set of questions? This paper first introduces a formal framework to address this question.

The key is to note that the benchmarks used to test LLMs -- such as AP exams -- are also those used to test people. However, this raises an implication: these benchmarks are only valid tests if LLMs misunderstand concepts in ways that mirror human misunderstandings. Otherwise, success on benchmarks only demonstrates potemkin understanding: the illusion of understanding driven by answers irreconcilable with how any human would interpret a concept.

We present two procedures for quantifying the existence of potemkins: one using a specially designed benchmark in three domains, the other using a general procedure that provides a lower-bound on their prevalence. We find that potemkins are ubiquitous across models, tasks, and domains. We also find that these failures reflect not just incorrect understanding, but deeper internal incoherence in concept representations.
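The gist of that second procedure, as I read the abstract, is to ask a model to explain a concept and then ask it to apply the same concept, counting the cases where the explanation passes but the application fails. A very rough sketch in Python (ask_model and the two checker functions are hypothetical stand-ins, not anything from the paper):

    # Rough sketch only; ask_model() and the checker functions are hypothetical
    # stand-ins for a real model API and real graders.
    def potemkin_rate(concepts, ask_model, explanation_ok, application_ok):
        """Fraction of correctly explained concepts the model then fails to apply."""
        explained = 0   # concepts the model defines correctly
        potemkins = 0   # defined correctly, yet misapplied
        for concept in concepts:
            definition = ask_model(f"Explain the concept: {concept}")
            if not explanation_ok(concept, definition):
                continue  # no apparent understanding to begin with
            explained += 1
            attempt = ask_model(f"Give a concrete example applying: {concept}")
            if not application_ok(concept, attempt):
                potemkins += 1  # looks like understanding, doesn't behave like it
        return potemkins / explained if explained else 0.0

On that reading, the ABAB case from the Register piece is exactly such a failure: the explanation is fine, the application isn't.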
"Religion is fundamentally opposed to everything I hold in veneration — courage, clear thinking, honesty, fairness, and above all, love of the truth."
— H. L. Mencken


Dark Lightning

"Potemkin understanding". I like that! I talked with another person who posited that we'll only have to worry when the machines learn "artificial sapience", and I can see his point.

billy rubin

The 'godfather of AI' reveals the only way humanity can survive superintelligent AI

https://www.cnn.com/2025/08/13/tech/ai-geoffrey-hinton

Quote
In the future, Hinton warned, AI systems might be able to control humans just as easily as an adult can bribe a 3-year-old with candy. This year has already seen examples of AI systems willing to deceive, cheat and steal to achieve their goals. For example, to avoid being replaced, one AI model tried to blackmail an engineer about an affair it learned about in an email.


i am 100 percent against the use of AI in any scenario where it can make a decision and act on it. ^^^this stuff does not reassure me.


Just be happy.

Recusant

The AI available to the general public will very likely continue to spew misinformation. Users (we?) don't want "I don't know" as an answer, so the machine will give an answer regardless of its veracity. It's currently too expensive to have the AI check whether its answer is actually true. Not fond of the title, but I think the article itself is good.

"Why OpenAI's solution to AI hallucinations would kill ChatGPT tomorrow" | The Conversation

Quote
OpenAI's latest research paper diagnoses exactly why ChatGPT and other large language models can make things up – known in the world of artificial intelligence as "hallucination". It also reveals why the problem may be unfixable, at least as far as consumers are concerned.

The paper provides the most rigorous mathematical explanation yet for why these models confidently state falsehoods. It demonstrates that these aren't just an unfortunate side effect of the way that AIs are currently trained, but are mathematically inevitable.

The issue can partly be explained by mistakes in the underlying data used to train the AIs. But using mathematical analysis of how AI systems learn, the researchers prove that even with perfect training data, the problem still exists.

The way language models respond to queries – by predicting one word at a time in a sentence, based on probabilities – naturally produces errors. The researchers in fact show that the total error rate for generating sentences is at least twice as high as the error rate the same AI would have on a simple yes/no question, because mistakes can accumulate over multiple predictions.

In other words, hallucination rates are fundamentally bounded by how well AI systems can distinguish valid from invalid responses. Since this classification problem is inherently difficult for many areas of knowledge, hallucinations become unavoidable.

It also turns out that the less a model sees a fact during training, the more likely it is to hallucinate when asked about it. With birthdays of notable figures, for instance, it was found that if 20% of such people's birthdays only appear once in training data, then base models should get at least 20% of birthday queries wrong.

[. . .]

More troubling is the paper's analysis of why hallucinations persist despite post-training efforts (such as providing extensive human feedback to an AI's responses before it is released to the public). The authors examined ten major AI benchmarks, including those used by Google, OpenAI and also the top leaderboards that rank AI models. This revealed that nine benchmarks use binary grading systems that award zero points for AIs expressing uncertainty.

This creates what the authors term an "epidemic" of penalising honest responses. When an AI system says "I don't know", it receives the same score as giving completely wrong information. The optimal strategy under such evaluation becomes clear: always guess.

The researchers prove this mathematically. Whatever the chances of a particular answer being right, the expected score of guessing always exceeds the score of abstaining when an evaluation uses binary grading.

[. . .]

Consider the implications if ChatGPT started saying "I don't know" to even 30% of queries – a conservative estimate based on the paper's analysis of factual uncertainty in training data. Users accustomed to receiving confident answers to virtually any question would likely abandon such systems rapidly.

[. . .]

It wouldn't be difficult to reduce hallucinations using the paper's insights. Established methods for quantifying uncertainty have existed for decades. These could be used to provide trustworthy estimates of uncertainty and guide an AI to make smarter choices.

But even if the problem of users disliking this uncertainty could be overcome, there's a bigger obstacle: computational economics. Uncertainty-aware language models require significantly more computation than today's approach, as they must evaluate multiple possible responses and estimate confidence levels. For a system processing millions of queries daily, this translates to dramatically higher operational costs.

[Continues . . .]
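The guess-or-abstain point is easy to check with toy numbers. Everything below is my own assumption for illustration, not figures from the paper:

    # Toy numbers (assumed) showing why binary grading rewards guessing.
    p_correct = 0.25                                  # assumed chance a blind guess is right
    expected_if_guess = p_correct * 1 + (1 - p_correct) * 0   # = 0.25
    expected_if_abstain = 0.0                         # "I don't know" scores zero
    assert expected_if_guess > expected_if_abstain    # holds whenever p_correct > 0

    # A crude picture of error accumulation (not the paper's actual bound):
    # if each predicted token is wrong with probability 0.02, a 20-token answer
    # contains at least one error roughly a third of the time.
    per_token_error = 0.02
    tokens = 20
    p_answer_has_error = 1 - (1 - per_token_error) ** tokens   # ~0.33

Under that kind of scoring, always answering is the "rational" strategy for the model, however shaky its actual confidence. The compute objection is just as visible: any uncertainty estimate that works by sampling several candidate answers and comparing them multiplies the cost of every query by the number of samples.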
"Religion is fundamentally opposed to everything I hold in veneration — courage, clear thinking, honesty, fairness, and above all, love of the truth."
— H. L. Mencken