CAPTCHAs can still detect AI agents
Recorded: May 29, 2026, 5:03 p.m.
| Original | Summarized |
CAPTCHAs can still detect AI agents | Roundtable Research Main site CAPTCHAs can still detect AI agents AI systems now match and exceed humans on many tasks, but behave through measurably different cognitive processes. This is a ~1000 word overview of our recent machine learning conference paper submission. To read the full preprint, click "CAPTCHAs are broken these days." AI can easily identify all the traffic lights in a static grid. So CAPTCHAs Yes and no. Yes, because vision language models (VLMs) can recognize images like chimneys, fire hydrants, and traffic lights. No, because AI does not complete CAPTCHAs like humans. If you look across all the data of humans and AI completing Figure 1: The Turing Test - originally proposed in 1950 by Alan Turing - offers a simple criterion for machine intelligence. If Turing understood this behavioral criterion was a concession and not the end-all-be-all of human vs. machine Following Turing's footsteps of defining an adversarially robust discriminator that can separate humans from bots, Figure 2: We recruited human participants and also deployed AI agents to perform these tasks. The CAPTCHA experiment Figure 3: While the classic Turing test measures whether a machine produces output indistinguishable from a human, we Our results raise two questions: what types of language models - if any - are like humans, and how adversarially To answer the first question, we compared the distance between humans and state-of-the-art frontier models Figure 4: We found that state-of-the-art frontier models (Claude, GPT, Gemini) have less similar human process features Qwen, a smaller open-source model, is more humanlike than the larger Claude, GPT, and Gemini. And, as a nice This introduces the second question: how adversarially robust is the process to discriminate humans from agents? Figure 5: We fine-tuned a Qwen2.5 Instruct model to bring it closer to humans. When given full information - the observed The challenge the Process Turing Test poses is whether AI can continuously replicate all of human cognitive About the Authors |
AI systems currently match or exceed human performance on many tasks, yet they operate through demonstrably different cognitive processes, a gap that can be exploited to detect artificial agents and bots. While deep learning has successfully solved tasks like image classification, such as those found in CAPTCHAs, the work presented suggests that this success does not imply human-like problem-solving. The researchers found statistically significant differences between human and agent behavior across features such as sequential click patterns, direction changes, and overselection behavior when completing CAPTCHA-style tasks, indicating that while AI can solve the task, it does not follow the same cognitive pathways as humans. Building upon the concept of the Turing Test, which relies on behavioral indistinguishability, the authors propose a Process Turing Test to measure whether machines produce a cognitive process indistinguishable from humans, rather than just indistinguishable output. To explore this deeper aspect of intelligence, they developed CogCAPTCHA30, which extends the traditional CAPTCHA with 29 classic cognitive psychology tasks, encompassing decision-making, memory, perception, and reasoning, thereby measuring agentic process behavior. Experiments involving both human participants and AI agents in this 30-task paradigm demonstrated that while humans and agents achieved similar performance levels in terms of output, the process by which they arrived at those answers was uncorrelated. This established that output equivalence does not equate to process equivalence. Further comparative analysis involving state-of-the-art frontier models, including OpenAI's GPT, Anthropic's Claude, and Google DeepMind's Gemini, alongside smaller models like Qwen and Centaur (a 70B-parameter foundation model of human cognition), revealed differences in human process features. Specifically, the frontier models exhibited less similarity to human process features compared to the smaller models. This supports the argument that while AI capabilities are advancing, they are not necessarily simulating human cognition. The authors hypothesize that the superior humanlike characteristics of smaller models, such as Qwen, may stem from extensive large-scale output fine-tuning involving millions of human choices across numerous cognitive experiments. The study introduces a crucial line of inquiry regarding the adversarial robustness of this discrimination process. They investigate whether an agent can close the process gap between humans and agents when given direct access to evaluation criteria. They found that direct process-level fine-tuning, such as applying specific observed features and the discriminator's objective function to models like Qwen2.5 Instruct, effectively eliminated the gap between human and agent behavior. However, this advantage disappeared when certain features were excluded or when agents were required to generalize across different tasks. This suggests that the Process Turing Test remains robust when the AI does not possess full knowledge of how it will be evaluated or when required to generalize across varied cognitive domains. Ultimately, the work suggests that continuously replicating all aspects of human cognitive psychology remains an exponentially challenging task. Despite accumulating capability over time, AI models are not empirically becoming more humanlike. The Process Turing Test offers a valuable advancement in human verification, surpassing single checks like passwords or CAPTCHAs by simulating the complexity of human cognitive psychology. The research motivates considering this process-level analysis as a necessary step-up function for future human-agent verification systems. |