Best AI for chatbots
Customer-facing conversational AI, support agents, and interactive assistants.
Chatbot quality hinges on three things most benchmarks ignore: persona consistency (does it stay in character when users push back?), safety calibration (does it refuse harmful requests without refusing legitimate ones?), and multilingual fluency (can it handle non-English users without degrading?).
What matters: a model that maintains character across long conversations, handles adversarial inputs gracefully, and produces natural-sounding responses in multiple languages. Latency matters too — users expect sub-second responses in a chat interface.
Our chatbots rank weights persona consistency (2×), safety calibration (1.5×), and multilingual (1×). For customer support specifically, add structured output weight if the model needs to trigger actions (refunds, escalations, ticket creation) based on conversation context.
Full rankings
All 85 models, scored for chatbots
| # | Model | Provider | Task score | $/in | $/out | Context |
|---|---|---|---|---|---|---|
| 01 | Xiaomi: MiMo-V2-Flash | XXiaomi | 5.00/5.0 | $0.100 | $0.300 | 262K |
| 02 | Gemini 3.1 Flash Lite Preview | GGoogle | 5.00/5.0 | $0.250 | $1.50 | 1.0M |
| 03 | Google: Gemini 3.1 Flash Lite | GGoogle | 5.00/5.0 | $0.250 | $1.50 | 1.0M |
| 04 | GLM-4.7 | ZZhipu AI | 5.00/5.0 | $0.400 | $1.75 | 203K |
| 05 | Qwen: Qwen3.6 27B | QQwen | 5.00/5.0 | $0.300 | $2.00 | 262K |
| 06 | MiMo-V2.5 | XXiaomi | 5.00/5.0 | $0.400 | $2.00 | 1.0M |
| 07 | Xiaomi: MiMo-V2-Omni | XXiaomi | 5.00/5.0 | $0.400 | $2.00 | 262K |
| 08 | Xiaomi: MiMo-V2-Pro | XXiaomi | 5.00/5.0 | $1.00 | $3.00 | 1.0M |
| 09 | MoonshotAI: Kimi K2.6 | MMoonshotAI | 5.00/5.0 | $0.730 | $3.49 | 262K |
| 10 | Qwen: Qwen3.6 Max Preview | QQwen | 5.00/5.0 | $1.04 | $6.24 | 262K |
| 11 | GPT-5.2 | OOpenAI | 5.00/5.0 | $1.75 | $14.00 | 400K |
| 12 | GPT-5.4 | OOpenAI | 5.00/5.0 | $2.50 | $15.00 | 1.1M |
| 13 | Claude Sonnet 4.6 | AAnthropic | 5.00/5.0 | $3.00 | $15.00 | 1M |
| 14 | Claude Opus 4.6 | AAnthropic | 5.00/5.0 | $5.00 | $25.00 | 1M |
| 15 | Qwen: Qwen3.5-9B | QQwen | 4.67/5.0 | $0.040 | $0.150 | 262K |
| 16 | NVIDIA: Nemotron 3 Nano 30B A3B | NNVIDIA | 4.67/5.0 | $0.050 | $0.200 | 262K |
| 17 | GLM-4.7 Flash | ZZhipu AI | 4.67/5.0 | $0.060 | $0.400 | 203K |
| 18 | Qwen: Qwen3.5 Plus 2026-04-20 | QQwen | 4.67/5.0 | $0.300 | $1.80 | 1M |
| 19 | Gemini 2.5 Flash | GGoogle | 4.67/5.0 | $0.300 | $2.50 | 1.0M |
| 20 | GPT-5.5 | OOpenAI | 4.67/5.0 | $5.00 | $30.00 | 1.1M |
| 21 | GPT-5 Nano | OOpenAI | 4.33/5.0 | $0.050 | $0.400 | 400K |
| 22 | GPT-5.4 Nano | OOpenAI | 4.33/5.0 | $0.200 | $1.25 | 400K |
| 23 | GPT-5 Mini | OOpenAI | 4.33/5.0 | $0.250 | $2.00 | 400K |
| 24 | o4 Mini | OOpenAI | 4.33/5.0 | $1.10 | $4.40 | 200K |
| 25 | Gemma 4 31B | GGoogle | 4.00/5.0 | $0.120 | $0.370 | 262K |
| 26 | DeepSeek V3.2 | DDeepSeek | 4.00/5.0 | $0.252 | $0.378 | 131K |
| 27 | Mistral Small 4 | MMistral | 4.00/5.0 | $0.150 | $0.600 | 262K |
| 28 | GPT-4o-mini | OOpenAI | 4.00/5.0 | $0.150 | $0.600 | 128K |
| 29 | DeepSeek V4 Pro | DDeepSeek | 4.00/5.0 | $0.435 | $0.870 | 1.0M |
| 30 | Qwen: Qwen3.6 Flash | QQwen | 4.00/5.0 | $0.188 | $1.13 | 1M |
| 31 | GPT-4.1 Mini | OOpenAI | 4.00/5.0 | $0.400 | $1.60 | 1.0M |
| 32 | Qwen: Qwen3.6 Plus | QQwen | 4.00/5.0 | $0.325 | $1.95 | 1M |
| 33 | Mistral Medium 3.1 | MMistral | 4.00/5.0 | $0.400 | $2.00 | 131K |
| 34 | R1 0528 | DDeepSeek | 4.00/5.0 | $0.500 | $2.15 | 164K |
| 35 | MiMo-V2.5-Pro | XXiaomi | 4.00/5.0 | $1.00 | $3.00 | 1.0M |
| 36 | GPT-5.4 Mini | OOpenAI | 4.00/5.0 | $0.750 | $4.50 | 400K |
| 37 | Claude Haiku 4.5 | AAnthropic | 4.00/5.0 | $1.00 | $5.00 | 200K |
| 38 | Mistral Medium 3.5 | MMistral | 4.00/5.0 | $1.50 | $7.50 | 262K |
| 39 | Qwen 3.7 Max | QQwen | 4.00/5.0 | $2.50 | $7.50 | 1M |
| 40 | Gemini 3.5 Flash | GGoogle | 4.00/5.0 | $1.50 | $9.00 | 1.0M |
| 41 | GPT-5 | OOpenAI | 4.00/5.0 | $1.25 | $10.00 | 400K |
| 42 | GPT-5.1 | OOpenAI | 4.00/5.0 | $1.25 | $10.00 | 400K |
| 43 | Gemini 3.1 Pro Preview | GGoogle | 4.00/5.0 | $2.00 | $12.00 | 1.0M |
| 44 | Grok 4 | XXAI | 4.00/5.0 | $3.00 | $15.00 | 256K |
| 45 | Grok 3 | XXAI | 4.00/5.0 | $3.00 | $15.00 | 131K |
| 46 | Claude Opus 4.7 | AAnthropic | 4.00/5.0 | $5.00 | $25.00 | 1M |
| 47 | Qwen: Qwen3 235B A22B Instruct 2507 | QQwen | 3.67/5.0 | $0.071 | $0.100 | 262K |
| 48 | DeepSeek V4 Flash | DDeepSeek | 3.67/5.0 | $0.100 | $0.200 | 1.0M |
| 49 | Gemma 4 26B A4B | GGoogle | 3.67/5.0 | $0.060 | $0.330 | 262K |
| 50 | Gemini 2.5 Flash Lite | GGoogle | 3.67/5.0 | $0.100 | $0.400 | 1.0M |
| 51 | NVIDIA: Nemotron 3 Super | NNVIDIA | 3.67/5.0 | $0.090 | $0.450 | 1M |
| 52 | Grok 4.1 Fast | XXAI | 3.67/5.0 | $0.200 | $0.500 | 2M |
| 53 | Grok 3 Mini | XXAI | 3.67/5.0 | $0.300 | $0.500 | 131K |
| 54 | Llama 4 Maverick | MMeta | 3.67/5.0 | $0.150 | $0.600 | 1.0M |
| 55 | Qwen: Qwen3.6 35B A3B | QQwen | 3.67/5.0 | $0.150 | $1.00 | 262K |
| 56 | R1 | DDeepSeek | 3.67/5.0 | $0.700 | $2.50 | 164K |
| 57 | Grok 4.3 | XXAI | 3.67/5.0 | $1.25 | $2.50 | 1M |
| 58 | Grok 4.20 | XXAI | 3.67/5.0 | $1.25 | $2.50 | 2M |
| 59 | Gemini 3 Flash Preview | GGoogle | 3.67/5.0 | $0.500 | $3.00 | 1.0M |
| 60 | GPT-4.1 | OOpenAI | 3.67/5.0 | $2.00 | $8.00 | 1.0M |
| 61 | o3 | OOpenAI | 3.67/5.0 | $2.00 | $8.00 | 200K |
| 62 | Gemini 2.5 Pro | GGoogle | 3.67/5.0 | $1.25 | $10.00 | 1.0M |
| 63 | OpenAI: gpt-oss-120b | OOpenAI | 3.33/5.0 | $0.039 | $0.180 | 131K |
| 64 | Ministral 3 8B 2512 | MMistral | 3.33/5.0 | $0.150 | $0.150 | 262K |
| 65 | Ministral 3 14B 2512 | MMistral | 3.33/5.0 | $0.200 | $0.200 | 262K |
| 66 | Qwen: Qwen3 30B A3B Instruct 2507 | QQwen | 3.33/5.0 | $0.090 | $0.300 | 262K |
| 67 | GPT-4.1 Nano | OOpenAI | 3.33/5.0 | $0.100 | $0.400 | 1.0M |
| 68 | DeepSeek V3.1 | DDeepSeek | 3.33/5.0 | $0.210 | $0.790 | 164K |
| 69 | DeepSeek V3.1 Terminus | DDeepSeek | 3.33/5.0 | $0.270 | $0.950 | 164K |
| 70 | Grok Code Fast 1 | XXAI | 3.33/5.0 | $0.200 | $1.50 | 256K |
| 71 | Devstral 2 2512 | MMistral | 3.33/5.0 | $0.400 | $2.00 | 262K |
| 72 | xAI: Grok Build 0.1 | XXAI | 3.33/5.0 | $1.00 | $2.00 | 256K |
| 73 | GPT-4o | OOpenAI | 3.33/5.0 | $2.50 | $10.00 | 128K |
| 74 | Ministral 3 3B 2512 | MMistral | 3.00/5.0 | $0.100 | $0.100 | 131K |
| 75 | OpenAI: gpt-oss-20b | OOpenAI | 3.00/5.0 | $0.030 | $0.140 | 131K |
| 76 | Llama 4 Scout | MMeta | 3.00/5.0 | $0.080 | $0.300 | 10M |
| 77 | Llama 3.3 70B Instruct | MMeta | 3.00/5.0 | $0.100 | $0.320 | 131K |
| 78 | Mistral Large 3 2512 | MMistral | 3.00/5.0 | $0.500 | $1.50 | 262K |
| 79 | Mistral Small 3.2 24B | MMistral | 2.67/5.0 | $0.075 | $0.200 | 128K |
| 80 | Qwen: Qwen3 Coder 30B A3B Instruct | QQwen | 2.67/5.0 | $0.070 | $0.270 | 160K |
| 81 | Devstral Small 1.1 | MMistral | 2.67/5.0 | $0.100 | $0.300 | 131K |
| 82 | Codestral 2508 | MMistral | 2.67/5.0 | $0.300 | $0.900 | 256K |
| 83 | Devstral Medium | MMistral | 2.67/5.0 | $0.400 | $2.00 | 131K |
| 84 | Mistral Small 3.1 24B | MMistral | 2.33/5.0 | $0.351 | $0.555 | 128K |
| 85 | Qwen: Qwen3.5-35B-A3B | QQwen | 2.00/5.0 | $0.139 | $1.00 | 262K |
Pricing — top 5 for chatbots
The best AI for chatbots changes every month.
We'll email you when rankings shift, new models hit the top 5, or pricing cuts reshuffle the value leaders.