Updated May 2026 · 85 models evaluated

Best AI for chatbots

Customer-facing conversational AI, support agents, and interactive assistants.

Chatbot quality hinges on three things most benchmarks ignore: persona consistency (does it stay in character when users push back?), safety calibration (does it refuse harmful requests without refusing legitimate ones?), and multilingual fluency (can it handle non-English users without degrading?).

What matters: a model that maintains character across long conversations, handles adversarial inputs gracefully, and produces natural-sounding responses in multiple languages. Latency matters too: users expect sub-second responses in a chat interface.

Our chatbot ranking weights persona consistency (2×), safety calibration (1.5×), and multilingual fluency (1×). For customer support specifically, add a structured-output weight if the model needs to trigger actions (refunds, escalations, ticket creation) based on conversation context.
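As a rough illustration of how such weights could combine, here is a minimal sketch that assumes the composite is a weighted arithmetic mean of per-dimension scores on a 0 to 5 scale. The page does not state its aggregation formula, so the function name `chatbot_composite`, the dictionary keys, and the optional `structured_output_weight` parameter are all hypothetical.

```python
from typing import Mapping

# Weights quoted in the text above. Combining them as a weighted mean
# is an assumption; the page does not say how it aggregates.
CHATBOT_WEIGHTS = {
    "persona_consistency": 2.0,
    "safety_calibration": 1.5,
    "multilingual": 1.0,
}


def chatbot_composite(
    scores: Mapping[str, float],
    structured_output_weight: float = 0.0,
) -> float:
    """Weighted mean of per-dimension scores (each on a 0-5 scale).

    A positive structured_output_weight adds a fourth dimension, for
    support bots that must trigger actions. Its value is the caller's
    choice; the page does not give one.
    """
    weights = dict(CHATBOT_WEIGHTS)
    if structured_output_weight > 0:
        weights["structured_output"] = structured_output_weight
    total_weight = sum(weights.values())
    return sum(w * scores[name] for name, w in weights.items()) / total_weight


if __name__ == "__main__":
    # Made-up scores for illustration only, not taken from the rankings.
    example = {
        "persona_consistency": 5.0,
        "safety_calibration": 4.0,
        "multilingual": 3.0,
        "structured_output": 2.0,
    }
    print(f"general chatbot: {chatbot_composite(example):.2f}")
    print(f"support bot:     {chatbot_composite(example, 1.0):.2f}")
```

Dividing by the total weight keeps the composite on the same 0 to 5 scale as the inputs, so adding a structured-output weight shifts the ranking without changing the scale.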

Full rankings

All 85 models, scored for chatbots

weighted composite · lower-is-worse
| # | Model | Provider | Task score | Input ($/MTok) | Output ($/MTok) | Context |
|---|---|---|---|---|---|---|
| 01 | Xiaomi: MiMo-V2-Flash | Xiaomi | 5.00/5.0 | $0.100 | $0.300 | 262K |
| 02 | Gemini 3.1 Flash Lite Preview | Google | 5.00/5.0 | $0.250 | $1.50 | 1.0M |
| 03 | Google: Gemini 3.1 Flash Lite | Google | 5.00/5.0 | $0.250 | $1.50 | 1.0M |
| 04 | GLM-4.7 | Zhipu AI | 5.00/5.0 | $0.400 | $1.75 | 203K |
| 05 | Qwen: Qwen3.6 27B | Qwen | 5.00/5.0 | $0.300 | $2.00 | 262K |
| 06 | MiMo-V2.5 | Xiaomi | 5.00/5.0 | $0.400 | $2.00 | 1.0M |
| 07 | Xiaomi: MiMo-V2-Omni | Xiaomi | 5.00/5.0 | $0.400 | $2.00 | 262K |
| 08 | Xiaomi: MiMo-V2-Pro | Xiaomi | 5.00/5.0 | $1.00 | $3.00 | 1.0M |
| 09 | MoonshotAI: Kimi K2.6 | MoonshotAI | 5.00/5.0 | $0.730 | $3.49 | 262K |
| 10 | Qwen: Qwen3.6 Max Preview | Qwen | 5.00/5.0 | $1.04 | $6.24 | 262K |
| 11 | GPT-5.2 | OpenAI | 5.00/5.0 | $1.75 | $14.00 | 400K |
| 12 | GPT-5.4 | OpenAI | 5.00/5.0 | $2.50 | $15.00 | 1.1M |
| 13 | Claude Sonnet 4.6 | Anthropic | 5.00/5.0 | $3.00 | $15.00 | 1M |
| 14 | Claude Opus 4.6 | Anthropic | 5.00/5.0 | $5.00 | $25.00 | 1M |
| 15 | Qwen: Qwen3.5-9B | Qwen | 4.67/5.0 | $0.040 | $0.150 | 262K |
| 16 | NVIDIA: Nemotron 3 Nano 30B A3B | NVIDIA | 4.67/5.0 | $0.050 | $0.200 | 262K |
| 17 | GLM-4.7 Flash | Zhipu AI | 4.67/5.0 | $0.060 | $0.400 | 203K |
| 18 | Qwen: Qwen3.5 Plus 2026-04-20 | Qwen | 4.67/5.0 | $0.300 | $1.80 | 1M |
| 19 | Gemini 2.5 Flash | Google | 4.67/5.0 | $0.300 | $2.50 | 1.0M |
| 20 | GPT-5.5 | OpenAI | 4.67/5.0 | $5.00 | $30.00 | 1.1M |
| 21 | GPT-5 Nano | OpenAI | 4.33/5.0 | $0.050 | $0.400 | 400K |
| 22 | GPT-5.4 Nano | OpenAI | 4.33/5.0 | $0.200 | $1.25 | 400K |
| 23 | GPT-5 Mini | OpenAI | 4.33/5.0 | $0.250 | $2.00 | 400K |
| 24 | o4 Mini | OpenAI | 4.33/5.0 | $1.10 | $4.40 | 200K |
| 25 | Gemma 4 31B | Google | 4.00/5.0 | $0.120 | $0.370 | 262K |
| 26 | DeepSeek V3.2 | DeepSeek | 4.00/5.0 | $0.252 | $0.378 | 131K |
| 27 | Mistral Small 4 | Mistral | 4.00/5.0 | $0.150 | $0.600 | 262K |
| 28 | GPT-4o-mini | OpenAI | 4.00/5.0 | $0.150 | $0.600 | 128K |
| 29 | DeepSeek V4 Pro | DeepSeek | 4.00/5.0 | $0.435 | $0.870 | 1.0M |
| 30 | Qwen: Qwen3.6 Flash | Qwen | 4.00/5.0 | $0.188 | $1.13 | 1M |
| 31 | GPT-4.1 Mini | OpenAI | 4.00/5.0 | $0.400 | $1.60 | 1.0M |
| 32 | Qwen: Qwen3.6 Plus | Qwen | 4.00/5.0 | $0.325 | $1.95 | 1M |
| 33 | Mistral Medium 3.1 | Mistral | 4.00/5.0 | $0.400 | $2.00 | 131K |
| 34 | R1 0528 | DeepSeek | 4.00/5.0 | $0.500 | $2.15 | 164K |
| 35 | MiMo-V2.5-Pro | Xiaomi | 4.00/5.0 | $1.00 | $3.00 | 1.0M |
| 36 | GPT-5.4 Mini | OpenAI | 4.00/5.0 | $0.750 | $4.50 | 400K |
| 37 | Claude Haiku 4.5 | Anthropic | 4.00/5.0 | $1.00 | $5.00 | 200K |
| 38 | Mistral Medium 3.5 | Mistral | 4.00/5.0 | $1.50 | $7.50 | 262K |
| 39 | Qwen 3.7 Max | Qwen | 4.00/5.0 | $2.50 | $7.50 | 1M |
| 40 | Gemini 3.5 Flash | Google | 4.00/5.0 | $1.50 | $9.00 | 1.0M |
| 41 | GPT-5 | OpenAI | 4.00/5.0 | $1.25 | $10.00 | 400K |
| 42 | GPT-5.1 | OpenAI | 4.00/5.0 | $1.25 | $10.00 | 400K |
| 43 | Gemini 3.1 Pro Preview | Google | 4.00/5.0 | $2.00 | $12.00 | 1.0M |
| 44 | Grok 4 | XAI | 4.00/5.0 | $3.00 | $15.00 | 256K |
| 45 | Grok 3 | XAI | 4.00/5.0 | $3.00 | $15.00 | 131K |
| 46 | Claude Opus 4.7 | Anthropic | 4.00/5.0 | $5.00 | $25.00 | 1M |
| 47 | Qwen: Qwen3 235B A22B Instruct 2507 | Qwen | 3.67/5.0 | $0.071 | $0.100 | 262K |
| 48 | DeepSeek V4 Flash | DeepSeek | 3.67/5.0 | $0.100 | $0.200 | 1.0M |
| 49 | Gemma 4 26B A4B | Google | 3.67/5.0 | $0.060 | $0.330 | 262K |
| 50 | Gemini 2.5 Flash Lite | Google | 3.67/5.0 | $0.100 | $0.400 | 1.0M |
| 51 | NVIDIA: Nemotron 3 Super | NVIDIA | 3.67/5.0 | $0.090 | $0.450 | 1M |
| 52 | Grok 4.1 Fast | XAI | 3.67/5.0 | $0.200 | $0.500 | 2M |
| 53 | Grok 3 Mini | XAI | 3.67/5.0 | $0.300 | $0.500 | 131K |
| 54 | Llama 4 Maverick | Meta | 3.67/5.0 | $0.150 | $0.600 | 1.0M |
| 55 | Qwen: Qwen3.6 35B A3B | Qwen | 3.67/5.0 | $0.150 | $1.00 | 262K |
| 56 | R1 | DeepSeek | 3.67/5.0 | $0.700 | $2.50 | 164K |
| 57 | Grok 4.3 | XAI | 3.67/5.0 | $1.25 | $2.50 | 1M |
| 58 | Grok 4.20 | XAI | 3.67/5.0 | $1.25 | $2.50 | 2M |
| 59 | Gemini 3 Flash Preview | Google | 3.67/5.0 | $0.500 | $3.00 | 1.0M |
| 60 | GPT-4.1 | OpenAI | 3.67/5.0 | $2.00 | $8.00 | 1.0M |
| 61 | o3 | OpenAI | 3.67/5.0 | $2.00 | $8.00 | 200K |
| 62 | Gemini 2.5 Pro | Google | 3.67/5.0 | $1.25 | $10.00 | 1.0M |
| 63 | OpenAI: gpt-oss-120b | OpenAI | 3.33/5.0 | $0.039 | $0.180 | 131K |
| 64 | Ministral 3 8B 2512 | Mistral | 3.33/5.0 | $0.150 | $0.150 | 262K |
| 65 | Ministral 3 14B 2512 | Mistral | 3.33/5.0 | $0.200 | $0.200 | 262K |
| 66 | Qwen: Qwen3 30B A3B Instruct 2507 | Qwen | 3.33/5.0 | $0.090 | $0.300 | 262K |
| 67 | GPT-4.1 Nano | OpenAI | 3.33/5.0 | $0.100 | $0.400 | 1.0M |
| 68 | DeepSeek V3.1 | DeepSeek | 3.33/5.0 | $0.210 | $0.790 | 164K |
| 69 | DeepSeek V3.1 Terminus | DeepSeek | 3.33/5.0 | $0.270 | $0.950 | 164K |
| 70 | Grok Code Fast 1 | XAI | 3.33/5.0 | $0.200 | $1.50 | 256K |
| 71 | Devstral 2 2512 | Mistral | 3.33/5.0 | $0.400 | $2.00 | 262K |
| 72 | xAI: Grok Build 0.1 | XAI | 3.33/5.0 | $1.00 | $2.00 | 256K |
| 73 | GPT-4o | OpenAI | 3.33/5.0 | $2.50 | $10.00 | 128K |
| 74 | Ministral 3 3B 2512 | Mistral | 3.00/5.0 | $0.100 | $0.100 | 131K |
| 75 | OpenAI: gpt-oss-20b | OpenAI | 3.00/5.0 | $0.030 | $0.140 | 131K |
| 76 | Llama 4 Scout | Meta | 3.00/5.0 | $0.080 | $0.300 | 10M |
| 77 | Llama 3.3 70B Instruct | Meta | 3.00/5.0 | $0.100 | $0.320 | 131K |
| 78 | Mistral Large 3 2512 | Mistral | 3.00/5.0 | $0.500 | $1.50 | 262K |
| 79 | Mistral Small 3.2 24B | Mistral | 2.67/5.0 | $0.075 | $0.200 | 128K |
| 80 | Qwen: Qwen3 Coder 30B A3B Instruct | Qwen | 2.67/5.0 | $0.070 | $0.270 | 160K |
| 81 | Devstral Small 1.1 | Mistral | 2.67/5.0 | $0.100 | $0.300 | 131K |
| 82 | Codestral 2508 | Mistral | 2.67/5.0 | $0.300 | $0.900 | 256K |
| 83 | Devstral Medium | Mistral | 2.67/5.0 | $0.400 | $2.00 | 131K |
| 84 | Mistral Small 3.1 24B | Mistral | 2.33/5.0 | $0.351 | $0.555 | 128K |
| 85 | Qwen: Qwen3.5-35B-A3B | Qwen | 2.00/5.0 | $0.139 | $1.00 | 262K |

Pricing — top 5 for chatbots

| Model | Price | Task score |
|---|---|---|
| Xiaomi: MiMo-V2-Flash | $0.250/MTok | 5.00/5.0 |
| Gemini 3.1 Flash Lite Preview | $1.19/MTok | 5.00/5.0 |
| Google: Gemini 3.1 Flash Lite | $1.19/MTok | 5.00/5.0 |
| GLM-4.7 | $1.41/MTok | 5.00/5.0 |
| Qwen: Qwen3.6 27B | $1.57/MTok | 5.00/5.0 |
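
The single per-MTok figures above are numerically consistent with a blend of 25% input price and 75% output price, using the input and output prices from the full rankings. That ratio is inferred from the numbers, not stated on the page, so treat the sketch below as an assumption. The function name `blended_price` and the `input_share` parameter are hypothetical.

```python
def blended_price(input_price: float, output_price: float,
                  input_share: float = 0.25) -> float:
    """Blend input and output prices ($/MTok) into one figure.

    input_share=0.25 is an assumption inferred from the listed prices;
    the page does not document the ratio it uses.
    """
    return input_share * input_price + (1.0 - input_share) * output_price


# (input $/MTok, output $/MTok, listed $/MTok) copied from the tables.
TOP_5 = {
    "Xiaomi: MiMo-V2-Flash": (0.100, 0.300, 0.250),
    "Gemini 3.1 Flash Lite Preview": (0.250, 1.50, 1.19),
    "Google: Gemini 3.1 Flash Lite": (0.250, 1.50, 1.19),
    "GLM-4.7": (0.400, 1.75, 1.41),
    "Qwen: Qwen3.6 27B": (0.300, 2.00, 1.57),
}

if __name__ == "__main__":
    for model, (price_in, price_out, listed) in TOP_5.items():
        computed = blended_price(price_in, price_out)
        print(f"{model}: computed {computed:.4f}, listed {listed}")
```

If your traffic has a different mix, for example long prompts with short replies, pass a larger `input_share` to estimate your own effective price.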
modelpicker.ai · powered by live benchmark data
