note: Agentlog is in beta, and is under heavy development. Breaking changes may occur.
live demo: https://agentlog.scalebase.io
agentlog is a Go platform that wraps AI APIs (starting with Google Gemini) with multi-variation execution, database logging, and use case-specific implementations. It enables you to run the same AI prompt with different configurations, compare results, and implement domain-specific AI solutions like procurement management, legal analysis, and more.
Experimentation mode to compare multiple models:

Detailed execution logs, flow graphs and flame graphs for each execution:

Execution/Comparison metrics on every execution:

Batteries included functions with customizations:

"Stateless" agent self tracking:

... And more!
Get AgentLog running in 2 steps:
Update the config.env file with your local db and gemini api keys and then:
make run-serverThis starts the agentlog backend on localhost:8080 with REST API endpoints for multi-variation AI execution, function calling, and database logging.
2025/07/24 20:36:54 Go SDK disabled - using REST API for all Gemini calls
π agentlog HTTP Server starting on port 8080
π‘ Health check: http://localhost:8080/health
π§ API endpoints:
POST /api/execute - Multi-variation execution
GET /api/execution-runs - Execution history
GET /api/configurations - List API configurations
GET /api/functions - List function definitions
POST /api/functions - Create function definition
GET /api/functions/{id} - Get function by ID
PUT /api/functions/{id} - Update function
DELETE /api/functions/{id} - Delete function
POST /api/functions/test/{id} - Test function execution
GET /api/database/stats - Database statistics
GET /api/database/tables - Database tables
π‘ Use X-Use-Mock: true header for mock responses
π Set GEMINI_API_KEY in config.env for real API calls
2025/07/24 20:36:57 π Listing function definitions from database
2025/07/24 20:36:57 β
Successfully loaded 2 function definitions from database
make frontend-startThis launches the React Native development server with the mobile interface for configuring AI models, executing prompts, and viewing results.
You now have an AI experimentation platform running locally. The frontend will connect to the backend automatically.
When building AI agents with Gemini (or any LLM), you need visibility and control for debugging and optimization. Most implementations lack:
- Traceability & Monitoring - No logging of AI interactions
- Configuration Flexibility - Can't easily adjust temperature, tokens, system prompts
- Parallel Testing - No way to run multiple model variations simultaneously
- Centralized Management - No unified platform to track and compare executions
AgentLog is an AI experimentation platform that gives you control over your Gemini agents:
- Configure Gemini API keys and any custom function API keys
- Adjust model parameters: temperature, max tokens, top-P, top-K
- Customize system prompts and context for different use cases
- Set up parallel executions with variation testing
- Run multiple AI model configurations simultaneously
- Compare results side-by-side with analysis
- Database logging of every API call and response
- Track function calls, execution times, and model performance
- Add custom functions for external API integrations
- Built-in support for weather APIs, Neo4j graph databases
- Create domain-specific AI workflows (procurement, legal analysis, etc.)
- Function call tracing and debugging capabilities
- React Native mobile app for platform management
- Real-time execution monitoring with loading states
- Historical analysis with searchable execution logs
- Database inspection tools for debugging
- MySQL database with audit trails
- RESTful API architecture for integration
- Multi-variation execution engine
- Production deployment capabilities
Result: Instead of blind AI development, you get a platform with the tools needed to build, test, and optimize agents with visibility into their behavior.
- π Multi-Variation Execution: Run the same prompt with different configurations simultaneously
- ποΈ Database Logging: Every API call and response logged to MySQL database
- π Result Comparison: Analyze and compare results across variations
- βοΈ Configuration Support: Support for different models, temperatures, system prompts, and more
- π‘οΈ Type-Safe Operations: Uses sqlc for generated type-safe SQL queries
- π§© Interface Architecture: Clean, extensible interfaces for different use cases
- π’ AI Procurement Manager: Solution for vendor evaluation, contract analysis, negotiation strategies
- π Framework: Easy to implement new domains (legal, content, risk assessment, etc.)
- π Factory Pattern: Simple instantiation of different implementations
- π Plugin System: Extensible architecture for custom functionality
- π Analytics & Insights: Performance metrics, cost analysis, model comparison
- π Multi-Provider Support: Extensible to support different AI providers
- π§ͺ A/B Testing: Built-in experimentation framework for AI prompts
- π Audit Trail: Compliance and audit logging
agentlog Platform
βββ π― Interface Layer
β βββ MultiVariationExecutor
β βββ ExecutionLogger
β βββ ConfigurationManager
β βββ ResultComparator
β βββ Use Case Interfaces
βββ π’ Domain Implementations
β βββ ProcurementManager
β βββ LegalAnalyzer (extensible)
β βββ ContentGenerator (extensible)
β βββ RiskAssessor (extensible)
βββ π Factory & Adapters
β βββ agentlogFactory
β βββ ClientAdapter
β βββ MockFactory (testing)
βββ ποΈ Database Layer (MySQL + sqlc)
βββ π§ Core Client (AI API Wrapper)
βββ π Analytics & Comparison Engine
The make run-api command starts a persistent HTTP server that provides REST API endpoints for the frontend mobile app.
GET /health- Health check with status informationPOST /api/execute- Multi-variation execution endpointGET /api/execution-runs- Get execution historyGET /api/database/stats- Database statisticsGET /api/database/tables- List database tables
- Mock Mode Support: Add
X-Use-Mock: trueheader for mock responses - Real API Integration: Uses real Gemini API when API key is configured
- CORS Enabled: Ready for frontend integration
- Database Logging: All executions logged to MySQL when available
# Start the server
make run-server
# Test health endpoint
curl http://localhost:8080/health
# Test execution with mock data
curl -X POST http://localhost:8080/api/execute \
-H "Content-Type: application/json" \
-H "X-Use-Mock: true" \
-d '{
"execution_run_name": "test",
"base_prompt": "Write a story about AI",
"configurations": [{
"id": "test-1",
"variation_name": "creative",
"model_name": "gemini-1.5-flash",
"temperature": 0.8
}]
}'The HTTP server is designed to work with the React Native frontend:
- Start Backend:
make run-server(runs on localhost:8080) - Start Frontend:
make frontend-start - Configure: Set backend URL in mobile app settings
- Use: Configure AI models and execute multi-variation prompts
