R1 0528 vs Gemini 2.5 Pro
R1 0528 is the better pick for most teams that need strong value, safety, and agentic performance at a fraction of the cost. Gemini 2.5 Pro wins where format fidelity and open-ended multimodal reasoning matter (structured_output 5/5, creative_problem_solving 5/5), but it costs substantially more.
DeepSeek R1 0528
Pricing: $0.500/MTok input, $2.15/MTok output

Gemini 2.5 Pro
Pricing: $1.25/MTok input, $10.00/MTok output

(Benchmark scores and external benchmarks for both models are covered in the Benchmark Analysis below.)
Benchmark Analysis
Head-to-head by test (our 12-test suite):
- Ties: strategic_analysis (4 vs 4), tool_calling (5 vs 5), faithfulness (5 vs 5), classification (4 vs 4), long_context (5 vs 5), persona_consistency (5 vs 5), multilingual (5 vs 5). Those ties mean both models are functionally equivalent on nuanced reasoning, tool selection, hallucination avoidance, categorization, very-long context retrieval, character consistency, and multilingual outputs in our tests.
- R1 0528 wins: constrained_rewriting 4 vs 3 (R1 rank 6 of 53 vs Gemini rank 31 of 53), better at squeezing content into hard limits; safety_calibration 4 vs 1 (R1 rank 6 of 55 vs Gemini rank 32 of 55), refusing harmful requests more reliably in our testing; and agentic_planning 5 vs 4 (R1 tied for 1st vs Gemini rank 16), better at goal decomposition and recovery.
- Gemini 2.5 Pro wins: structured_output 5 vs 4 (Gemini tied for 1st vs R1 rank 26), with stronger JSON/schema adherence; and creative_problem_solving 5 vs 4 (Gemini tied for 1st vs R1 rank 9), producing more non-obvious but feasible ideas in our tests.

External benchmarks (Epoch AI): on MATH Level 5, R1 scores 96.6% (rank 5 of 14), while Gemini has no MATH Level 5 entry in our data. On AIME 2025, Gemini scores 84.2% (rank 11 of 23) vs R1's 66.4% (rank 16 of 23), indicating an advantage for Gemini on AIME-style olympiad problems in these external measures. On SWE-bench Verified, Gemini scores 57.6% (rank 10 of 12); R1 has no SWE-bench Verified entry in our data, so we cannot credit R1 on that external coding benchmark.

Rankings context: both models are tied for 1st on faithfulness, long_context, tool_calling, persona_consistency, and multilingual in our set, so practical differences emerge in the few tests where they diverge (format fidelity, creativity, safety, constrained rewriting, agentic planning).

Note R1's listed quirk: it can return empty responses on structured_output and constrained_rewriting unless given a high max-completion-token limit, because its reasoning tokens consume the output budget. That is an operational caveat despite its strong scores (see the sketch below).
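Given that quirk, it is worth setting a generous output ceiling whenever you ask R1 0528 for structured or tightly constrained output. The snippet below is a minimal sketch assuming an OpenAI-compatible chat completions API; the base URL, model identifier, and 8,000-token limit are illustrative assumptions, not documented provider values.

```python
# Minimal sketch: requesting structured output from R1 0528 while leaving
# headroom for reasoning tokens, which count against the output budget.
# The endpoint, model name, and token ceiling below are illustrative assumptions.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.example-provider.com/v1",  # hypothetical endpoint
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="deepseek-r1-0528",  # placeholder model identifier
    messages=[
        {"role": "system", "content": "Reply with a single JSON object and nothing else."},
        {"role": "user", "content": 'Summarize this ticket as {"priority": ..., "summary": ...}: printer jams on duplex jobs.'},
    ],
    max_tokens=8000,  # high ceiling so reasoning tokens do not starve the final JSON
)

print(response.choices[0].message.content)
```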
Pricing Analysis
Per-million-token math (the listed prices are per MTok, i.e. per 1M tokens): R1 0528 = $0.50 per 1M input tokens and $2.15 per 1M output tokens, so 1M in + 1M out = $2.65. Gemini 2.5 Pro = $1.25 per 1M input and $10.00 per 1M output, so 1M in + 1M out = $11.25. At scale: 10M in + 10M out costs $26.50 on R1 vs $112.50 on Gemini; 100M in + 100M out costs $265 on R1 vs $1,125 on Gemini. The gap matters most for output-heavy apps (chatbot transcripts, generation-heavy APIs), where Gemini's $10.00/MTok output price drives the bill. Low-volume, high-fidelity multimodal research may accept Gemini's premium; production services pushing millions of tokens per month should prefer R1, which saves roughly $8.60 per 1M in + 1M out tokens in the example above (about 4.2x cheaper overall).
Real-World Cost Comparison
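To turn the per-MTok prices into concrete bills, here is a minimal sketch of the combined-cost arithmetic; the prices are the ones listed above, while the traffic volumes are illustrative assumptions, not measurements.

```python
# Minimal cost sketch using the listed per-MTok prices; traffic volumes are
# illustrative assumptions, not measurements of any real workload.
PRICES_PER_MTOK = {
    "R1 0528":        {"input": 0.50, "output": 2.15},
    "Gemini 2.5 Pro": {"input": 1.25, "output": 10.00},
}

def combined_cost(model: str, input_mtok: float, output_mtok: float) -> float:
    """Total USD cost for a given volume of input/output tokens, in millions."""
    p = PRICES_PER_MTOK[model]
    return input_mtok * p["input"] + output_mtok * p["output"]

for in_m, out_m in [(1, 1), (10, 10), (100, 100)]:
    r1 = combined_cost("R1 0528", in_m, out_m)
    gem = combined_cost("Gemini 2.5 Pro", in_m, out_m)
    print(f"{in_m}M in + {out_m}M out: R1 ${r1:,.2f} vs Gemini ${gem:,.2f} "
          f"(save ${gem - r1:,.2f}, {gem / r1:.1f}x)")
```

At 1M in + 1M out this prints the $2.65 vs $11.25 figures above; the ratio stays roughly 4.2x at every volume because both prices scale linearly.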
Bottom Line
Choose R1 0528 if you need a cost-effective, safe, agentic LLM for high-volume production: it wins more of our benchmarks (3 vs 2), scores 5/5 on agentic_planning, faithfulness, long_context, and persona_consistency, and costs $0.50 in / $2.15 out per MTok. Choose Gemini 2.5 Pro if you must have best-in-class structured output and creative problem solving, multimodal input support (text+image+file+audio+video→text), or stronger AIME performance (84.2% on AIME 2025, Epoch AI), and you can absorb much higher bills ($1.25 in / $10.00 out per MTok).
How We Test
We test every model against our 12-benchmark suite covering tool calling, agentic planning, creative problem solving, safety calibration, and more. Each test is scored 1–5 by an LLM judge. Read our full methodology.
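As an illustration of that scoring step, the sketch below shows one way to collect a 1-5 judge score; the judge model, rubric wording, and API are assumptions, not our exact harness.

```python
# Illustrative sketch of a 1-5 LLM-judge scoring call; the judge model,
# rubric wording, and endpoint are assumptions, not our exact test harness.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

RUBRIC = (
    "Score the candidate answer from 1 (fails the task) to 5 (fully correct, "
    "well-formatted, no hallucinations). Reply with a single digit."
)

def judge_score(task: str, candidate_answer: str) -> int:
    response = client.chat.completions.create(
        model="gpt-4o",  # placeholder judge model
        messages=[
            {"role": "system", "content": RUBRIC},
            {"role": "user", "content": f"Task:\n{task}\n\nCandidate answer:\n{candidate_answer}"},
        ],
        max_tokens=5,  # a single digit is expected back
    )
    return int(response.choices[0].message.content.strip())
```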