News | National
15 Apr 2025 16:42
NZCity News
NZCity CalculatorReturn to NZCity

  • Start Page
  • Personalise
  • Sport
  • Weather
  • Finance
  • Shopping
  • Jobs
  • Horoscopes
  • Lotto Results
  • Photo Gallery
  • Site Gallery
  • TVNow
  • Dating
  • SearchNZ
  • NZSearch
  • Crime.co.nz
  • RugbyLeague
  • Make Home
  • About NZCity
  • Contact NZCity
  • Your Privacy
  • Advertising
  • Login
  • Join for Free

  •   Home > News > National

    ChatGPT just passed the Turing test. But that doesn’t mean AI is now as smart as humans

    The Turing test has been popularised as the ultimate indicator of machine intelligence. But the truth is more complicated.

    Zena Assaad, Senior Lecturer, School of Engineering, Australian National University
    The Conversation


    There have been several headlines over the past week about an AI chatbot officially passing the Turing test.

    These news reports are based on a recent preprint study by two researchers at the University of California San Diego in which four large language models (LLMs) were put through the Turing test. One model – OpenAI’s GPT-4.5 – was deemed indistinguishable from a human more than 70% of the time.

    The Turing test has been popularised as the ultimate indicator of machine intelligence. However, there is disagreement about the validity of this test. In fact, it has a contentious history which calls into question how effective it really is at measuring machine intelligence.

    So what does this mean for the significance of this new study?

    What did the study find?

    The preprint study by cognitive scientists Cameron Jones and Benjamin Bergen was published in March, but has not yet been peer-reviewed. It tested 4 LLMs: ELIZA, GPT-4o, LLaMa-3.1-405B, and GPT-4.5.

    The tests consisted of participants completing eight rounds of conversations in which they acted as an interrogator exchanging text messages with two witnesses simultaneously. One witness was a human and the other an LLM. Some 284 participants were randomly assigned to be either the interrogator or the witness.

    Participants were required to interact with both witnesses simultaneously across a split screen for five minutes, with the test interface mimicking that of a conventional messaging interface. Following this interaction, they decided which witness was a human, and which was an AI chatbot.

    Participants judged GPT-4.5 to be the human 73% of the time, and LLaMa-3.1-405B to be the human 56% of the time. The other two models (ELIZA and GPT-4o) only fooled participants 23% and 21% of the time respectively.

    ChatGPT chat bot screen seen on smartphone and laptop display with Chat GPT login screen on the background.
    Participants in the study judged OpenAI’s GPT-4.5 to be human 73% of the time. Ascannio/Shutterstock

    What exactly is the Turing Test?

    The first iteration of the Turing test was presented by English mathematician and computer scientist Alan Turing in a 1948 paper titled “Intelligent Machinery”. It was originally proposed as an experiment involving three people playing chess with a theoretical machine referred to as a paper machine, two being players and one being an operator.

    In the 1950 publication “Computing Machinery and Intelligence”, Turing reintroduced the experiment as the “imitation game” and claimed it was a means of determining a machine’s ability to exhibit intelligent behaviour equivalent to a human. It involved three participants: Participant A was a woman, participant B a man and participant C either gender.

    Through a series of questions, participant C is required to determine whether “X is A and Y is B” or “X is B and Y is A”, with X and Y representing the two genders.

    Black and white portrait of a man with a combover wearing a suit.
    Alan Turing in 1951. Elliott & Fry/Wikipedia

    A proposition is then raised: “What will happen when a machine takes the part of A in this game? Will the interrogator decide wrongly as often when the game is played like this as he does when the game is played between a man and a woman?”

    These questions were intended to replace the ambiguous question, “Can machines think?”. Turing claimed this question was ambiguous because it required an understanding of the terms “machine” and “think”, of which “normal” uses of the words would render a response to the question inadequate.

    Over the years, this experiment was popularised as the Turing test. While the subject matter varied, the test remained a deliberation on whether “X is A and Y is B” or “X is B and Y is A”.

    Why is it contentious?

    While popularised as a means of testing machine intelligence, the Turing test is not unanimously accepted as an accurate means to do so. In fact, the test is frequently challenged.

    There are four main objections to the Turing test:

    1. Behaviour vs thinking. Some researchers argue the ability to “pass” the test is a matter of behaviour, not intelligence. Therefore it would not be contradictory to say a machine can pass the imitation game, but cannot think.
    2. Brains are not machines. Turing makes assertions the brain is a machine, claiming it can be explained in purely mechanical terms. Many academics refute this claim and question the validity of the test on this basis.
    3. Internal operations. As computers are not humans, their process for reaching a conclusion may not be comparable to a person’s, making the test inadequate because a direct comparison cannot work.
    4. Scope of the test. Some researchers believe only testing one behaviour is not enough to determine intelligence.
    A young girl writing at a desk, with a book open in front of her.
    Even though GPT-4.5 may have passed the Turing test, this doesn’t mean it’s as intelligent as humans. fizkes/Shutterstock

    So is an LLM as smart as a human?

    While the preprint article claims GPT-4.5 passed the Turing test, it also states:

    the Turing test is a measure of substitutability: whether a system can stand-in for a real person without […] noticing the difference.

    This implies the researchers do not support the idea of the Turing test being a legitimate indication of human intelligence. Rather, it is an indication of the imitation of human intelligence – an ode to the origins of the test.

    It is also worth noting that the conditions of the study were not without issue. For example, a five minute testing window is relatively short.

    In addition, each of the LLMs was prompted to adopt a particular persona, but it’s unclear what the details and impact of the “personas” were on the test.

    For now it is safe to say GPT-4.5 is not as intelligent as humans – although it may do a reasonable job of convincing some people otherwise.

    The Conversation

    Zena Assaad does not work for, consult, own shares in or receive funding from any company or organisation that would benefit from this article, and has disclosed no relevant affiliations beyond their academic appointment.

    This article is republished from The Conversation under a Creative Commons license.
    © 2025 TheConversation, NZCity

     Other National News
     15 Apr: Farewell to Mario Vargas Llosa, the end of an era for Latin America’s literary luminaries
     15 Apr: Getting AIs working toward human goals - study shows how to measure misalignment
     15 Apr: Who’s thriving, who’s struggling and who’s stuck at the kitchen table: how working lives are changing in the UK
     15 Apr: The Police watchdog wants a policy change, after two deaths in a chase in Otaki last year
     15 Apr: Police are investigating the discovery of a cannabis grow house - at an Auckland house fire
     15 Apr: Police say the 77-year-old man reported missing in the Mount Roskill area's been found
     15 Apr: Winter electricity prices are rising – how do we know we’re getting value for money?
     Top Stories

    RUGBY RUGBY
    Blues winger Katelyn Vaha'akolo has hailed the linear TV broadcast figures released for their Super Rugby Aupiki final win over Matatu More...


    BUSINESS BUSINESS
    Small and medium business confidence is on the rise -- reaching its highest point in five years More...



     Today's News

    Basketball:
    The Breakers have confirmed reports Rob Loe is returning to the Australian basketball league 16:37

    Rugby:
    Blues winger Katelyn Vaha'akolo has hailed the linear TV broadcast figures released for their Super Rugby Aupiki final win over Matatu 16:27

    Entertainment:
    Bella Thorne has accused Mickey Rourke of bruising her pelvic bone with a metal grinder while shooting a movie 16:26

    National:
    Farewell to Mario Vargas Llosa, the end of an era for Latin America’s literary luminaries 16:07

    International:
    Confusion over chip, smartphone tariffs unfolds in US as Trump foreshadows more changes 16:07

    International:
    Donald Trump hosts El Salvador's president Nayib Bukele amid deportations controversy 15:57

    Entertainment:
    Sir Mick Jagger is engaged to Melanie Hamrick - but they haven't made any wedding plans yet 15:56

    Health & Safety:
    A Coroner's found the failure of a baby's cot was the sole cause of his death 15:27

    Entertainment:
    David Beckham's ownership of Inter Miami has been the "biggest challenge" of his career 15:26

    Rugby League:
    Lachlan Galvin is facing the axe at Wests Tigers as the five-eighth's decision to knock back the club's contract extension sets the stage for a messy break-up 14:57


     News Search






    Power Search


    © 2025 New Zealand City Ltd