Skip to content

amirrezaask/agenticbench

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

41 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ€– Agentic Benchmarks

Pitting AI models against each other in real-world coding challenges.

This repository hosts a collection of benchmarks designed to evaluate how well different AI models perform on practical programming tasks.

πŸ“š Benchmarks

Benchmark Category Description
1 Billion Row Challenge Performance Process 1B temperature readings as fast as possible
Project Euler Reasoning/Algorithm Solve mathematical and programming problems

πŸ“‚ Repository Structure

agentic-benchmarks/
β”œβ”€β”€ README.md           # This file
β”œβ”€β”€ 1brc/               # 1 Billion Row Challenge
β”œβ”€β”€ projecteuler/       # Project Euler Challenge
└── ...

Each benchmark has its own directory with setup instructions, prompts, implementations, and results.