Experiments & Evaluations Index

Repository Index

This repository serves as an index of experimental projects, evaluations, proof-of-concepts, templates, patterns, and exploratory ideas related to AI/LLM development and workflows.

About this Index

This collection brings together various experimental repositories exploring AI agent workflows, LLM capabilities, evaluation frameworks, and development patterns. These repositories represent hands-on experiments, proof-of-concepts, benchmarking efforts, and reusable templates for AI-driven development.

AI Agent Development

Agent Workflows & Patterns

Repository	Description	Date
Agent Handover Demo	Demonstration of agent handover patterns	2025
Agent Network Expander Template	Template for expanding agent networks	2025
Agent Task Repo Pattern With MCP	Repository pattern for agent tasks using MCP	2025
AI Agent UN	AI agent unified namespace	2025
AI Agent Workspace Spec	Agent workspace specification	Mar 2025

Development Templates

Repository	Description	Date
AI Assistant Template	Template for AI assistant development	2025
AI Dev Prompts Example	Example development prompts for AI	2025
AI Development Template	General AI development template	2025

LLM Evaluation & Benchmarking

LLM Capabilities & Testing

Repository	Description	Date
Bias Censorship Eval Tests	Evaluation tests for bias and censorship	2025
LLM Evaluation Prompts	Prompts for evaluating LLMs	2025
Assistant Self Ideation	Demo of AI "self ideation" in practice	Feb 2025

LLM Experiments

Repository	Description	Date
LLM Experiment Notebook	Notebook of LLM experiments	2025
LLM Long Codegen Test	Testing long-form code generation	2025
LLM Max Token Length	Maximum token length exploration	Feb 2025
Single Shot Brevity Training	Training for concise single-shot responses	2025
One Prompt AI Book	Experiment in generating content from single prompts	2025
Long AI Prompting Experiment	Testing experiments with extended prompts	2025

Hugging Face Spaces

Repository	Description	Date
Single Shot Brevity Training	Brevity training interface	2025
LLM Long Code Generation Experiment	Long-form code generation experiment	2025
Max Output Tokens Analysis	Maximum output tokens analysis	Feb 2025

Speech-to-Text & Audio Processing

STT Benchmarks & Evaluation

Repository	Description	Date
Local ASR STT Benchmark	Local ASR and STT benchmarking	2025
Long Form Audio Eval	Evaluation of long-form audio transcription	2025
Personal STT Benchmarking	Personal speech-to-text benchmarking	2025
STT Voice Note Evaluation	Evaluation of STT for voice notes	2025

Hugging Face Spaces

Repository	Description	Date
Single Podcast ASR Eval	Single podcast ASR evaluation	2025
STT Comparison	Speech-to-text comparison tool	2025
Local STT Eval One Sample	Local STT evaluation with single samples	2025
Whisper Fine-Tune Eval	Interactive evaluation of fine-tuned Whisper models	2025

Hugging Face Datasets

Repository	Link	Description	Date
Podcast ASR Evaluation		Dataset for podcast ASR evaluation	2025
Whisper Fine-Tune One Shot Eval		WER and accuracy evaluation comparing fine-tuned Whisper (Tiny, Base, Small, Medium) vs stock models on 1 hour of audio, inference on Modal A100	2025

Audio Processing Experiments

Repository	Link	Description	Date
Crying Baby Audio Scrub		Audio processing for baby noise removal	2025
Audio Context Pipeline Model		Notes and model for audio context pipeline	Apr 2025

Specialized Applications

OSINT & Intelligence

Repository	Link	Description	Date
OSINT Missile Intelligence Agent		OSINT-focused intelligence agent	2025

Data Analysis

Repository	Link	Description	Date
GHG EBITDA Correlations		Analysis of greenhouse gas and EBITDA correlations	2025

Testing & Documentation

Test Repositories

Repository	Link	Description	Date
Test Markdown Docs		Test repository for markdown documentation	2025
Test System Prompts		Test repository for system prompts	2025

Note: This is a focused index covering experimental AI/LLM development projects. For a higher-level collection of all repository indexes and other projects, see the GitHub Master Index.

Author

Daniel Rosehill Contact: [email protected] Website: danielrosehill.com

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Experiments & Evaluations Index

Repository Index

About this Index

AI Agent Development

Agent Workflows & Patterns

Development Templates

LLM Evaluation & Benchmarking

LLM Capabilities & Testing

LLM Experiments

Hugging Face Spaces

Speech-to-Text & Audio Processing

STT Benchmarks & Evaluation

Hugging Face Spaces

Hugging Face Datasets

Audio Processing Experiments

Specialized Applications

OSINT & Intelligence

Data Analysis

Testing & Documentation

Test Repositories

Author

About

Uh oh!

danielrosehill/Experiments-And-Evaluations-Index

Folders and files

Latest commit

History

Repository files navigation

Experiments & Evaluations Index

Repository Index

About this Index

AI Agent Development

Agent Workflows & Patterns

Development Templates

LLM Evaluation & Benchmarking

LLM Capabilities & Testing

LLM Experiments

Hugging Face Spaces

Speech-to-Text & Audio Processing

STT Benchmarks & Evaluation

Hugging Face Spaces

Hugging Face Datasets

Audio Processing Experiments

Specialized Applications

OSINT & Intelligence

Data Analysis

Testing & Documentation

Test Repositories

Author

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks