🤖 Agentic Benchmarks

Pitting AI models against each other in real-world coding challenges.

This repository hosts a collection of benchmarks designed to evaluate how well different AI models perform on practical programming tasks.

📚 Benchmarks

Benchmark	Category	Description
1 Billion Row Challenge	Performance	Process 1B temperature readings as fast as possible
Project Euler	Reasoning/Algorithm	Solve mathematical and programming problems

📂 Repository Structure

agentic-benchmarks/
├── README.md           # This file
├── 1brc/               # 1 Billion Row Challenge
├── projecteuler/       # Project Euler Challenge
└── ...

Each benchmark has its own directory with setup instructions, prompts, implementations, and results.

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
1brc		1brc
projecteuler		projecteuler
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🤖 Agentic Benchmarks

📚 Benchmarks

📂 Repository Structure

About

Uh oh!

Languages

amirrezaask/agenticbench

Folders and files

Latest commit

History

Repository files navigation

🤖 Agentic Benchmarks

📚 Benchmarks

📂 Repository Structure

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Languages