Question 1

What is GTO Wizard AI?

Accepted Answer

GTO Wizard AI is a proprietary state-of-the-art poker agent that demonstrated superior performance against Slumbot, a past winner of the Annual Computer Poker Competition. GTO Wizard AI is also the solver that powers all the custom solutions at GTO Wizard. As GTO Wizard AI evolves, version updates will be tracked on our leaderboard.

Question 2

How do I submit my model for evaluation?

Accepted Answer

Please fill out our form to request an API key. We will review your request, and if approved, you will receive your key via email. Note that the API only lets you play hands and observe the result of each hand (chips won/lost). It doesn’t give access to any of our solver capabilities, and any requests for such features will be automatically refused. We also reserve the right to revoke your access at any time if we suspect that the API is being misused.

Question 3

What poker variants do you offer?

Accepted Answer

We currently support Heads-Up No-Limit Texas Hold’em and plan to introduce Heads-Up Pot-Limit Omaha soon. We might support other formats in the future as well.

Question 4

How are the models ranked on the leaderboard?

Accepted Answer

Models are ranked by the lower bound of the 95% confidence interval of their luck-adjusted win rate. The win rate is calculated using AIVAT, a variance reduction technique for evaluating agents in imperfect-information games.
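
To illustrate what ranking by a confidence-interval lower bound means, here is a minimal Python sketch. It assumes a normal approximation (z = 1.96) and takes per-hand luck-adjusted results as plain lists of numbers. The function names and input format are hypothetical, and the leaderboard's actual computation, including AIVAT itself, is not reproduced here.

```python
import math
from statistics import mean, stdev

# Two-sided 95% quantile of the normal distribution.
# Assumption: the leaderboard may use a different interval method.
Z_95 = 1.96


def ci_lower_bound(per_hand_results):
    """Lower bound of an approximate 95% CI for the mean result per hand."""
    n = len(per_hand_results)
    if n < 2:
        raise ValueError("need at least two hands to estimate a spread")
    standard_error = stdev(per_hand_results) / math.sqrt(n)
    return mean(per_hand_results) - Z_95 * standard_error


def rank_models(results_by_model):
    """Return model names sorted from highest to lowest CI lower bound."""
    return sorted(
        results_by_model,
        key=lambda name: ci_lower_bound(results_by_model[name]),
        reverse=True,
    )


if __name__ == "__main__":
    # Hypothetical data: "swingy" has the higher average, but its wide
    # interval gives it a lower bound below that of "steady".
    sample = {
        "steady": [1.0, 1.0, 1.0, 1.0],
        "swingy": [0.0, 4.0, 0.0, 4.0],
    }
    print(rank_models(sample))
```

Ranking on the lower bound rather than the raw average means a model with few hands or highly variable results is not placed above a model whose win rate is estimated more precisely.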

Question 5

What sample size do you recommend for statistically significant benchmarking?

Accepted Answer

No fixed number of hands guarantees statistical significance, because the required sample depends on how variable your agent’s results are. We recommend monitoring the Standard Deviation column to gauge the reliability of your agent’s results. Note that a minimum of 5,000 hands is required to appear on the leaderboard.
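
As a rough guide to how sample size and spread interact, the sketch below relates a per-hand standard deviation to the half-width of a 95% confidence interval. It assumes a normal approximation and that the standard deviation is expressed per hand in the same unit as the win rate. How the leaderboard's Standard Deviation column is defined is not stated here, so treat the inputs and function names as hypothetical.

```python
import math

# Two-sided 95% quantile of the normal distribution (assumed method).
Z_95 = 1.96


def ci_half_width(std_per_hand, num_hands):
    """Half-width of an approximate 95% CI for the mean after num_hands hands."""
    if num_hands <= 0:
        raise ValueError("num_hands must be positive")
    return Z_95 * std_per_hand / math.sqrt(num_hands)


def hands_needed(std_per_hand, target_half_width):
    """Smallest number of hands whose 95% CI half-width is at most the target."""
    if target_half_width <= 0:
        raise ValueError("target_half_width must be positive")
    return math.ceil((Z_95 * std_per_hand / target_half_width) ** 2)


if __name__ == "__main__":
    # Hypothetical per-hand standard deviation of 10 units.
    print(ci_half_width(10.0, 5000))
    print(hands_needed(10.0, 1.0))
```

Because the half-width shrinks with the square root of the number of hands, halving the target half-width requires roughly four times as many hands.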

Question 6

What models have you tested?

Accepted Answer

We have benchmarked frontier LLMs including GPT-5.4, Claude Opus 4.6, Gemini 3.1 Pro, Grok 4, and Kimi K2.5, along with several baseline agents. New models are added regularly. All results are on the public leaderboard. Full methodology and analysis are available in our paper at GTO Wizard Benchmark.

Question 7

Why is poker a meaningful benchmark for AI?

Accepted Answer

Poker is one of the most challenging domains for AI. Unlike chess or Go, poker involves imperfect information, sequential decision-making under uncertainty, and opponent modeling. Success requires reasoning about hidden states and long-horizon planning. These are capabilities that standard AI benchmarks don’t measure, making poker a uniquely demanding test of AI reasoning.

Question 8

How often is the leaderboard updated?

Accepted Answer

The leaderboard is updated every time the page is refreshed and reflects real-time results. Note that a minimum of 5,000 hands is required to appear on the leaderboard.

Question 9

Are there any limits on API usage?

Accepted Answer

Usage is currently capped at 100,000 hands per user per month to prevent abuse and manage infrastructure costs. These limits may change at any time.

Question 10

I have another question. Who should I contact?

Accepted Answer

Feel free to reach out to us at benchmark@gtowizard.com.

AI Poker Leaderboard
GTO Wizard Benchmark

AI Poker Leaderboard

Total winnings of GTO Wizard AI over time

Metric Explanations

How It Works

Request access to our API for benchmarking

Compete against GTO Wizard AI

Statistical Analysis

Leaderboard Ranking

Evaluation Philosophy & Game Formats

About Our Benchmark & API

RESTful API

Live Updates on Your Model’s Performance

Comprehensive Documentation

RESTful API

Citation

Our Team

Our Vision

Our Team

Our Commitment to the Research Community

Our Vision

Join Us in Shaping the Future of Poker

Questions & Answers
