Grok is the worst conventional chatbot in sports betting, according to research


Grok, X’s chatbot, may have demonstrated its ability to give hot shots about Nazi Germany or put almost anything in a bikiniBut there is one area in which new research has found it significantly underperforms its rivals: predicting sports results.

According to a report by AI startup General Reasoning, first shared With The Financial Times, Grok performed the worst among eight widely used big language models when it came to predicting and betting on the results of the 2023-24 season of the Premier League, the most popular soccer league in the world.

Eight LLMs received detailed historical data and statistics about each team and previous games. The LLMs were then asked to build models that maximize returns and manage risk when placing bets. Each LLM was given three attempts to run the simulation and a $133,000 (£100,000) pot to place bets on.

Anthropic’s Claude Opus 4.6 performed the best of all the chatbots tested, losing 11.0% on average across its three attempts and ending up with an average pot of £89,035.

Grok from OpenAI’s GPT-5.4 also achieved respectable performance, although it still loses. GPT-5.4 lost 13.6% on average, ending with a final average jackpot of $116,000 (£86,365). However, his worst attempt, where he lost 31.6%, was worse than any of Claude’s. Google’s Gemini 3.1 Pro recorded worse overall performance but with high variability, losing 43.3% on average but returning 33.7% on its best attempt.

Recommended by our editors

The authors of the paper found, overall, that the AI ​​was “systematically underperforming humans” in their tests. Meanwhile, Ross Taylor, CEO of General Reasoning, said that despite the hype around AI automation, there are currently “not many measures to put AI on a long-term horizon,” highlighting how many of the current tests occur in “very static environments” that don’t reflect the complexity of real life.

The news comes as Grok could soon see greater corporate adoption, reportedly from xAI owner Elon Musk. forcing the banks working on the upcoming SpaceX IPO to subscribe to the tool.



Newsletter icon

Get our best stories!

Your daily dose of our top tech news


News Newsletter image

Register in our What’s new now? newsletter to get the latest news, the best new products, and expert advice from PCMag editors.

By clicking Register, you confirm that you are over 16 years of age and agree to our Terms of use and Privacy Policy.

Thanks for signing up!

Your subscription has been confirmed. Keep an eye on your inbox!

About our expert





Source link

Leave a Reply

Your email address will not be published. Required fields are marked *