GPT Guesses Between 1 and 100
Recorded: May 25, 2026, 12:58 p.m.
| Original | Summarized |
GitHub - exmergo/research-chatgpt-guesses-between-1-and-100: When asked to pick a random number between 1 and 100, ChatGPT does not follow a random uniform distribution · GitHub Skip to content Navigation Menu Toggle navigation
Sign in
Appearance settings PlatformAI CODE CREATIONGitHub CopilotWrite better code with AIGitHub SparkBuild and deploy intelligent appsGitHub ModelsManage and compare promptsMCP RegistryNewIntegrate external toolsDEVELOPER WORKFLOWSActionsAutomate any workflowCodespacesInstant dev environmentsIssuesPlan and track workCode ReviewManage code changesAPPLICATION SECURITYGitHub Advanced SecurityFind and fix vulnerabilitiesCode securitySecure your code as you buildSecret protectionStop leaks before they startEXPLOREWhy GitHubDocumentationBlogChangelogMarketplaceView all featuresSolutionsBY COMPANY SIZEEnterprisesSmall and medium teamsStartupsNonprofitsBY USE CASEApp ModernizationDevSecOpsDevOpsCI/CDView all use casesBY INDUSTRYHealthcareFinancial servicesManufacturingGovernmentView all industriesView all solutionsResourcesEXPLORE BY TOPICAISoftware DevelopmentDevOpsSecurityView all topicsEXPLORE BY TYPECustomer storiesEvents & webinarsEbooks & reportsBusiness insightsGitHub SkillsSUPPORT & SERVICESDocumentationCustomer supportCommunity forumTrust centerPartnersView all resourcesOpen SourceCOMMUNITYGitHub SponsorsFund open source developersPROGRAMSSecurity LabMaintainer CommunityAcceleratorGitHub StarsArchive ProgramREPOSITORIESTopicsTrendingCollectionsEnterpriseENTERPRISE SOLUTIONSEnterprise platformAI-powered developer platformAVAILABLE ADD-ONSGitHub Advanced SecurityEnterprise-grade security featuresCopilot for BusinessEnterprise-grade AI featuresPremium SupportEnterprise-grade 24/7 supportPricing Search or jump to... Search code, repositories, users, issues, pull requests...
Search Clear
Search syntax tips Provide feedback Include my email address so I can be contacted Cancel Submit feedback Saved searches
Name Query To see all available qualifiers, see our documentation. Cancel Create saved search Sign in Sign up
Appearance settings Resetting focus You signed in with another tab or window. Reload to refresh your session. Dismiss alert exmergo research-chatgpt-guesses-between-1-and-100 Public
Notifications
Fork
Star Code Issues Pull requests Actions Projects Security and quality Insights
Additional navigation options
Code Issues Pull requests Actions Projects Security and quality Insights
mainBranchesTagsGo to fileCodeOpen more actions menuFolders and filesNameNameLast commit messageLast commit dateLatest commit History20 Commits20 Commitsdatadata docsdocs src/llm_random_biassrc/llm_random_bias teststests .env.example.env.example .gitignore.gitignore .python-version.python-version CONTRIBUTING.mdCONTRIBUTING.md LICENSELICENSE README.mdREADME.md pyproject.tomlpyproject.toml uv.lockuv.lock View all filesRepository files navigationREADMEContributingMIT licenseGPT Guesses Between 1 and 100 An interesting thing about humans is that they are not good random number generators. r/dataisbeautiful — "[OC] I asked 100 people to pick a number between 1 and 100" Methodology Model. gpt-4.1 (OpenAI), called via the Responses API. It is a Assumptions & Limitations Single model. Results describe gpt-4.1 only and do not generalize to Results Number 37 42 73 The five most-picked numbers overall — 47, 57, 72, 37, 42 — lean heavily on Stage Collect Clean Transform Stats Setup About When asked to pick a random number between 1 and 100, ChatGPT does not follow a random uniform distribution Readme MIT license Contributing Contributing Uh oh! There was an error while loading. Please reload this page. Activity Custom properties 12 0 0 Report repository Releases Packages
Uh oh! There was an error while loading. Please reload this page. Contributors Uh oh! There was an error while loading. Please reload this page. Languages Python
Footer © 2026 GitHub, Inc. Footer navigation Terms Privacy Security Status Community Docs Contact Manage cookies Do not share my personal information You can’t perform that action at this time. |
This research investigates whether large language models exhibit the same non-uniform randomness patterns found in human number selection when asked to pick a random number between one and one hundred. The central premise is that humans are not random number generators, as their choices cluster around specific numbers and avoid round figures, contrasting with a truly uniform random distribution. The experiment was designed to test if the language model gpt-4.1, trained on human text, inherits these inherent human biases rather than producing a fair, uniform distribution. The methodology involved 10,000 independent calls to the model, set with a temperature of 1.0 to ensure full sampling of the distribution. The prompt required the model to output a single integer between 1 and 100, and the experiment compared the resulting distribution against a theoretical uniform baseline, rather than the distribution of human-picked numbers. The data processing followed a four-stage pipeline: collection, cleaning, transformation, and statistical analysis. The cleaning stage ensured all responses were valid integers within the specified range, and the transformation stage prepared the data for statistical evaluation. The results demonstrated that gpt-4.1 emphatically does not produce a uniform random generator. Statistical testing against a uniform distribution indicated a significant deviation, confirming that the model generates a lumpy, distinctly human-shaped distribution. Specifically, the model reproduced classic human biases, showing a strong pull toward numbers like 37 and 73, and replicated meme numbers such as 42. Furthermore, the model exhibited a marked aversion to round numbers, avoiding virtually all multiples of ten except for 10, and uniquely exhibiting a lower-than-expected selection for the number 69. This suggests that the model inherited a moderated version of human bias, likely due to safety guardrails implemented during training, rather than simply mimicking raw human unpredictability. The study highlights critical limitations, noting that the results pertain solely to gpt-4.1 and do not generalize to other models or providers. The researchers emphasize that randomness in this context is a sampling artifact reflecting the model's learned token distribution, not evidence that the model is intentionally generating random numbers. The overall pipeline structure emphasizes reproducibility, allowing analysts to execute the collection, cleaning, transformation, and statistical stages independently. The project also describes methods for both analysis (reproducing the findings with a committed dataset) and fresh data collection (requiring API access), demonstrating a comprehensive approach to evaluating the stochastic properties of language models interacting with human-derived decision-making patterns. |