Benchmark Status: This evaluation is based on the Sarvam AI web version only, from a single prompt run with no re-prompting. It has not been validated in real projects and remains incomplete until stable API access is available.

All Reports

Detailed content is organized here track-wise. Use the sections below for fast navigation.

Reasoning Systems Coding Benchmark Timeline

Reasoning (7)

Section average: 7.89 / 10

Systems (1)

Section average: 6.00 / 10

Coding (2)

Section average: 8.46 / 10

Benchmark Timeline