
Welcome to Murmur Wiki 🌟

Welcome to the official wiki for Murmur, a multi-agent system that orchestrates specialized AI agents backed by local LLM models. This wiki is the central knowledge base for understanding, using, and contributing to the project.

Table of Contents

  • Overview
  • System Architecture
  • Getting Started
  • Configuration Guide
  • Development Guidelines
  • Troubleshooting
  • FAQ

Overview

Murmur is an orchestration system that coordinates multiple Large Language Models (LLMs) served locally through Ollama. It runs each user query through a pipeline of specialized agents to produce accurate, reliable responses.

Key Features

  • 🤖 Multi-agent architecture with specialized roles
  • 🔄 Asynchronous processing pipeline
  • 💾 Local LLM integration via REST API
  • 🧠 Conversation memory management
  • 📊 Confidence-based response evaluation
  • ⚠️ Comprehensive error handling

Current Status

  • Version: 1.1.0
  • Stability: Beta
  • Python Support: 3.7+
  • License: MIT

System Architecture

Agent Pipeline

User Input → Interpreter → Reasoner → Generator → Critic → Final Response

Each agent in the pipeline serves a specific purpose (a minimal code sketch follows this list):

  1. Interpreter Agent

    • Model: mistral-nemo:latest
    • Purpose: Analyzes user intent and context
    • Key functions:
      • Intent recognition
      • Context analysis
      • Requirement identification
  2. Reasoner Agent

    • Model: llama3.2-vision:11b
    • Purpose: Develops logical approach
    • Key functions:
      • Problem decomposition
      • Solution strategy development
      • Consideration analysis
  3. Generator Agent

    • Model: gemma2:9b
    • Purpose: Creates initial responses
    • Key functions:
      • Content generation
      • Response structuring
      • Context incorporation
  4. Critic Agent

    • Model: llama3.2-vision:11b
    • Purpose: Reviews and refines content
    • Key functions:
      • Accuracy verification
      • Clarity assessment
      • Content refinement
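
In code terms, the pipeline is strictly sequential: each agent receives the output of the previous stage as its context. The sketch below is illustrative only; the real orchestrator's method and attribute names may differ.

# Minimal sketch of the sequential agent pipeline (illustrative, not the exact implementation)
class AgentOrchestrator:
    def __init__(self, agents):
        # Ordered list: Interpreter -> Reasoner -> Generator -> Critic
        self.agents = agents

    async def process(self, user_input: str) -> str:
        context = user_input
        for agent in self.agents:
            # Each agent transforms the context produced by the previous stage
            context = await agent.run(context)
        return context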

Technical Components

  • LocalLLMConnector: Manages communication with local LLM server
  • AgentOrchestrator: Coordinates agent pipeline execution
  • Message System: Handles inter-agent communication
  • Confidence Scoring: Evaluates response quality
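
As a rough illustration of how the message system and confidence scoring could fit together, here is a hypothetical message structure passed between agents (the field names are assumptions, not the project's actual schema):

from dataclasses import dataclass, field
import time

@dataclass
class Message:
    sender: str       # name of the agent that produced the content, e.g. "Interpreter"
    content: str      # text handed to the next stage
    confidence: float # quality estimate in the range 0.0-1.0
    timestamp: float = field(default_factory=time.time)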

Getting Started

Prerequisites

  • Python 3.7+
  • Local LLM server (Ollama)
  • Required packages: aiohttp (asyncio is part of the Python standard library)

Installation Steps

  1. Clone the repository:
git clone https://github.com/KazKozDev/murmur.git
cd murmur
  2. Install dependencies:
pip install -r requirements.txt
  3. Configure local LLM server:
# Default server URL: http://localhost:11434
# Modify in config if needed

Basic Usage

# Start the application
python src/main.py

# Enter queries when prompted
Enter your message: What is the capital of France?

# Review response and confidence score
Response: The capital of France is Paris.
Confidence: 0.95

Configuration Guide

Model Configuration

Models can be configured in the AgentOrchestrator initialization:

self.agents = {
    AgentRole.INTERPRETER: BaseAgent("Interpreter", "mistral-nemo:latest", AgentRole.INTERPRETER),
    AgentRole.REASONER: BaseAgent("Reasoner", "llama3.2-vision:11b", AgentRole.REASONER),
    AgentRole.GENERATOR: BaseAgent("Generator", "gemma2:9b", AgentRole.GENERATOR),
    AgentRole.CRITIC: BaseAgent("Critic", "llama3.2-vision:11b", AgentRole.CRITIC)
}
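
The AgentRole values referenced above map to the four pipeline stages. A hypothetical sketch of such an enum (the actual definition lives in the source code and may differ):

from enum import Enum, auto

class AgentRole(Enum):
    INTERPRETER = auto()
    REASONER = auto()
    GENERATOR = auto()
    CRITIC = auto()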

Server Configuration

Default server settings in LocalLLMConnector:

base_url = "http://localhost:11434"
timeout = 30.0
max_retries = 3
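
A minimal sketch of how a connector with these settings might call Ollama's /api/generate endpoint using aiohttp; the retry loop and method name are simplifying assumptions, not the exact implementation:

import asyncio
import aiohttp

class LocalLLMConnector:
    def __init__(self, base_url="http://localhost:11434", timeout=30.0, max_retries=3):
        self.base_url = base_url
        self.timeout = aiohttp.ClientTimeout(total=timeout)
        self.max_retries = max_retries

    async def generate(self, model: str, prompt: str) -> str:
        payload = {"model": model, "prompt": prompt, "stream": False}
        for attempt in range(1, self.max_retries + 1):
            try:
                async with aiohttp.ClientSession(timeout=self.timeout) as session:
                    async with session.post(f"{self.base_url}/api/generate", json=payload) as resp:
                        resp.raise_for_status()
                        data = await resp.json()
                        return data.get("response", "")
            except aiohttp.ClientError:
                if attempt == self.max_retries:
                    raise
                await asyncio.sleep(attempt)  # simple linear backoff before retrying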

Development Guidelines

Code Style

  • Follow PEP 8 guidelines
  • Use Black formatter
  • Sort imports with isort
  • Include type hints
  • Write comprehensive docstrings

Testing

  • Write unit tests for new features
  • Ensure test coverage > 80%
  • Run tests before submitting PR:
pytest tests/

Pull Request Process

  1. Fork the repository
  2. Create feature branch
  3. Implement changes
  4. Add tests
  5. Submit PR

Troubleshooting

Common Issues

  1. Connection Errors

    • Verify LLM server is running
    • Check server URL configuration
    • Ensure network connectivity (see the quick check after this list)
  2. High Resource Usage

    • Reduce conversation memory size
    • Adjust model configurations
    • Monitor system resources
  3. Low Confidence Scores

    • Check input quality
    • Verify model availability
    • Review agent configurations
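
For connection errors, a quick way to confirm that the Ollama server is reachable and to list the models it exposes (assumes the default URL; adjust if you changed it):

# Prints the models reported by Ollama's /api/tags endpoint
import json
import urllib.request

with urllib.request.urlopen("http://localhost:11434/api/tags", timeout=5) as resp:
    print(json.dumps(json.load(resp), indent=2))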

Debug Mode

Enable debug logging:

import logging

logging.basicConfig(level=logging.DEBUG)

FAQ

Q: Can I use different LLM models?
A: Yes, any model compatible with the Ollama API can be configured.

Q: How is conversation context maintained?
A: Through a deque-based memory system with configurable size.
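
A rough illustration of what a deque-based memory with a configurable size can look like (class and method names here are hypothetical):

from collections import deque

class ConversationMemory:
    def __init__(self, max_turns: int = 10):
        # Oldest turns are dropped automatically once the limit is reached
        self.turns = deque(maxlen=max_turns)

    def add(self, role: str, text: str) -> None:
        self.turns.append((role, text))

    def as_context(self) -> str:
        return "\n".join(f"{role}: {text}" for role, text in self.turns)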

Q: What's the recommended hardware?
A: Minimum 16GB RAM, modern multi-core CPU, and SSD storage.


For more information, visit our GitHub repository or contact the maintainers.