Aider Polyglot

Coding

The practitioner favorite for code editing: 225 hard Exercism exercises across six languages, solved end to end through the aider tool and checked by unit tests. Higher is better.

The board has not been refreshed since November 2025, so current frontier models (Claude Fable 5, Claude Opus 4.8, GPT-5.5) do not appear yet. It remains the reference for the prior generation.

Each model attempts the 225 hardest Exercism practice exercises spanning C++, Go, Java, JavaScript, Python and Rust, driving aider end to end. The model must emit its changes in a structured edit format (diff, whole-file, or architect mode), and solutions are checked by running each exercise's unit tests. One retry is allowed after the model sees the test failures, and percent correct is the share of tasks passing after that second attempt. Every run also publishes its total cost in USD (shown here divided by 225 as cost per task), which makes the board a clean score-versus-cost frontier. All runs are stored as YAML in the aider GitHub repo, and community result PRs are accepted.
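The scoring arithmetic described above can be sketched in a few lines of Python. This is only an illustration: the `Run` class, its field names, and the `frontier` helper are hypothetical and do not mirror aider's actual YAML schema, and the sample numbers are invented rather than taken from the board.

```python
from dataclasses import dataclass

TOTAL_TASKS = 225  # number of exercises in the benchmark, per the description


@dataclass
class Run:
    """One benchmark run. Field names are hypothetical, not aider's schema."""
    model: str
    passed_after_retry: int  # tasks whose unit tests pass by the second attempt
    total_cost_usd: float    # total spend for the whole run


def percent_correct(run: Run) -> float:
    """Share of the 225 tasks passing after the second attempt, as a percentage."""
    return 100.0 * run.passed_after_retry / TOTAL_TASKS


def cost_per_task(run: Run) -> float:
    """Total run cost divided evenly across all 225 tasks."""
    return run.total_cost_usd / TOTAL_TASKS


def frontier(runs: list[Run]) -> list[Run]:
    """Keep runs that no other run beats on both cost and score.

    A run is dropped when some other run is at least as cheap and at least
    as accurate, and strictly better on one of the two.
    """
    keep = []
    for r in runs:
        dominated = any(
            o.total_cost_usd <= r.total_cost_usd
            and o.passed_after_retry >= r.passed_after_retry
            and (
                o.total_cost_usd < r.total_cost_usd
                or o.passed_after_retry > r.passed_after_retry
            )
            for o in runs
        )
        if not dominated:
            keep.append(r)
    return sorted(keep, key=cost_per_task)


if __name__ == "__main__":
    # Invented example values, not real leaderboard results.
    runs = [
        Run("model-a", passed_after_retry=180, total_cost_usd=45.0),
        Run("model-b", passed_after_retry=171, total_cost_usd=90.0),
        Run("model-c", passed_after_retry=135, total_cost_usd=9.0),
    ]
    for r in frontier(runs):
        print(f"{r.model}: {percent_correct(r):.1f}% at ${cost_per_task(r):.2f}/task")
```

In this made-up data, `model-b` costs more than `model-a` and solves fewer tasks, so it falls off the frontier, while the cheap `model-c` and the accurate `model-a` both stay on it.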

Details:
  • Category: Coding
  • Created by: Aider
  • Models tested: 11
  • Configs tested: 17
  • Leader: OpenAI GPT-5
  • Top score: 88%

Updated November 2025