This project implements a chatbot with human-like memory characteristics, using Pinecone as a vector store and Ollama with the `llama3.2:latest` model for natural language processing. The system emulates memory degradation over time (based on an exponential forgetting curve) and assigns stronger imprints to important memories, mimicking how humans prioritize significant events.
Current Version: 1.1.0 (Semantic Versioning: MAJOR.MINOR.PATCH)
- Memory Degradation: Older memories fade via an exponential decay function with a configurable half-life (default: 90 days); see the sketch after this list.
- Importance Weighting: Memories deemed important (via sentiment analysis) receive a stronger imprint, making them more retrievable. (Done ✅)
- Persistent Storage: Conversation history is stored in Pinecone, allowing scalable, long-term memory.
- Contextual Responses: The chatbot retrieves relevant past interactions to inform its responses implicitly, avoiding explicit mentions of memory loss or repetition. (Done ✅)
- Dynamic User Identity Detection: The user's identity is detected explicitly by the LLM from the user's input and stored as metadata in Pinecone. (Done ✅)
- Debug Mode: Optional debug output for monitoring importance scores and memory operations. (New ✨)
- PEP8 Compliant: Code follows Python's PEP8 style guide for readability and maintainability. (New ✨)
- Versioning System: Version information is maintained in the source code and displayed at startup.
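For illustration, here is a minimal sketch of the half-life decay, in the spirit of the `calculate_decay_factor` helper referenced under Customization below; the actual implementation in `llm-memory.py` may differ in detail:

```python
import math
import time

def calculate_decay_factor(stored_at: float, half_life_days: float = 90.0) -> float:
    """Return a weight in (0, 1]: 1.0 for a brand-new memory,
    0.5 after one half-life, 0.25 after two, and so on."""
    elapsed_days = (time.time() - stored_at) / 86400  # seconds -> days
    return math.exp(-math.log(2) * elapsed_days / half_life_days)

# A memory stored 90 days ago decays to ~0.5 of its original weight.
print(calculate_decay_factor(time.time() - 90 * 86400))  # ~0.5
```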
- Python 3.8+
- Pinecone Account: Sign up at Pinecone and obtain an API key.
- Ollama: Installed locally with the `llama3.2:latest` model pulled (`ollama pull llama3.2:latest`).
- Git: To clone this repository.
- Clone the Repository:

```bash
git clone https://github.com/MLidstrom/llm-memory.git
cd llm-memory
```

- Install Dependencies:

```bash
pip install pinecone-client ollama langchain langchain_pinecone langchain-huggingface sentence-transformers numpy python-dotenv nltk
```

- Set Up Pinecone:
  - Create a `.env` file in the project root with your Pinecone API key (the sketch after these steps shows how it is loaded at runtime):

```
PINECONE_API_KEY=your-actual-api-key-here
```
- Run Ollama Locally:
  - Start the Ollama server with the `llama3.2:latest` model:

```bash
ollama run llama3.2:latest
```

- Run the Chatbot:

```bash
python llm-memory.py
```

  - Type your messages in the prompt (`You:`).
  - Type `exit` to stop the chatbot.
- Debug Mode:
  - To enable debug mode, open `llm-memory.py` and set `DEBUG = True` near the top of the file.
  - With debug mode enabled, you'll see additional output such as importance scores for each conversation.
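To sanity-check the setup end to end, a short script along these lines should work. It is not part of the chatbot itself; it assumes the v3+ `pinecone` client API and only verifies that your key and the local model respond:

```python
import os

import ollama
from dotenv import load_dotenv
from pinecone import Pinecone

load_dotenv()  # reads PINECONE_API_KEY from the .env file

# Verify the Pinecone credentials work (v3+ client API).
pc = Pinecone(api_key=os.environ["PINECONE_API_KEY"])
print(pc.list_indexes().names())  # the chatbot's index should appear here

# Verify Ollama is serving the model with a one-line round trip.
reply = ollama.chat(
    model="llama3.2:latest",
    messages=[{"role": "user", "content": "Say hello in one short sentence."}],
)
print(reply["message"]["content"])
```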
- Pinecone Vector Store: Stores conversation embeddings with metadata (text, timestamp, importance, user identity); a storage sketch follows this list.
- Embedding Model: Uses `sentence-transformers/all-MiniLM-L6-v2` to generate 384-dimensional vectors.
- Memory Degradation: Applies an exponential decay factor based on the time elapsed since the memory was stored.
- Importance Scoring: Uses sentiment analysis (VADER) to assign an importance score (−1.0 to 1.0) to each memory, amplifying its embedding and retrieval weight.
- User Identity Detection: The user's identity is explicitly identified by the LLM based on user disclosures.
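Putting these pieces together, storing a single memory could look roughly like the sketch below. This is an illustration, not the project's actual code: the index name and `user` value are placeholders, the importance-based amplification of the embedding itself is omitted, and VADER needs a one-time `nltk.download("vader_lexicon")`.

```python
import os
import time
import uuid

from dotenv import load_dotenv
from langchain_huggingface import HuggingFaceEmbeddings
from nltk.sentiment import SentimentIntensityAnalyzer
from pinecone import Pinecone

load_dotenv()

embedder = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2")
analyzer = SentimentIntensityAnalyzer()  # requires: nltk.download("vader_lexicon")

def store_memory(index, text: str, user: str) -> None:
    """Embed the text, score its importance, and upsert it with metadata."""
    vector = embedder.embed_query(text)                      # 384-dimensional embedding
    importance = analyzer.polarity_scores(text)["compound"]  # -1.0 .. 1.0
    index.upsert(vectors=[{
        "id": str(uuid.uuid4()),
        "values": vector,
        "metadata": {"text": text, "timestamp": time.time(),
                     "importance": importance, "user": user},
    }])

pc = Pinecone(api_key=os.environ["PINECONE_API_KEY"])
store_memory(pc.Index("llm-memory"), "I just got married!", user="alice")
```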
This project uses Semantic Versioning:
- MAJOR version for incompatible API changes
- MINOR version for functionality added in a backward-compatible manner
- PATCH version for backward-compatible bug fixes
Version information is displayed at startup showing the current version, Pinecone index, model information, and debug status.
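One simple way to wire this up (the variable and function names here are illustrative, not necessarily those used in `llm-memory.py`):

```python
__version__ = "1.1.0"  # MAJOR.MINOR.PATCH

def print_banner(index_name: str, model: str, debug: bool) -> None:
    """Print the startup banner with version, index, model, and debug status."""
    print(f"llm-memory v{__version__}")
    print(f"Pinecone index: {index_name} | Model: {model} | Debug: {'on' if debug else 'off'}")

print_banner("llm-memory", "llama3.2:latest", debug=False)
```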
- Half-Life: Adjust `half_life_days` in `calculate_decay_factor` (default: 90 days) to control the memory fade rate.
- Importance Logic: Modify `calculate_importance` to use custom rules.
- Decay Threshold: Change `decay_threshold` in `retrieve_context` (default: 0.05) to filter out faded memories; see the retrieval sketch below.
- Debug Mode: Set `DEBUG = True` in the source code to enable diagnostic output.
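To make these knobs concrete, here is one plausible sketch of decay- and importance-weighted retrieval; treat it as a reading of what `retrieve_context` might do, not its actual code:

```python
import math
import time

def retrieve_context(index, query_vector, decay_threshold: float = 0.05,
                     half_life_days: float = 90.0, top_k: int = 10):
    """Query Pinecone, re-weight matches by decay and importance,
    and drop memories whose decay factor fell below the threshold."""
    results = index.query(vector=query_vector, top_k=top_k, include_metadata=True)
    weighted = []
    for match in results.matches:
        meta = match.metadata
        elapsed_days = (time.time() - meta["timestamp"]) / 86400
        decay = math.exp(-math.log(2) * elapsed_days / half_life_days)
        if decay < decay_threshold:
            continue  # this memory has effectively faded away
        weight = match.score * decay * (1.0 + meta["importance"])
        weighted.append((weight, meta["text"]))
    weighted.sort(reverse=True)
    return [text for _, text in weighted]
```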
- Implement a repetition boost for frequently mentioned topics.
- Periodically prune highly decayed memories from Pinecone (sketched below).
- Support multi-user memory with user-specific filtering.
- Version-based memory migrations for handling breaking changes.
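None of these exist yet; for the pruning idea specifically, one possible shape (assuming a serverless index, whose `list()` method pages through vector IDs) might be:

```python
import math
import time

def prune_faded_memories(index, decay_threshold: float = 0.05,
                         half_life_days: float = 90.0) -> int:
    """Delete vectors whose decay factor has dropped below the threshold."""
    deleted = 0
    for id_batch in index.list():  # pages of vector IDs (serverless indexes)
        fetched = index.fetch(ids=list(id_batch))
        stale = []
        for vec_id, vec in fetched.vectors.items():
            elapsed_days = (time.time() - vec.metadata["timestamp"]) / 86400
            if math.exp(-math.log(2) * elapsed_days / half_life_days) < decay_threshold:
                stale.append(vec_id)
        if stale:
            index.delete(ids=stale)
            deleted += len(stale)
    return deleted
```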
- `pinecone-client`: Pinecone vector store integration.
- `ollama`: Local LLM inference with `llama3.2:latest`.
- `langchain` & `langchain_pinecone`: Framework for embeddings and vector store management.
- `langchain-huggingface`: HuggingFace embeddings integration for LangChain.
- `sentence-transformers`: Embedding generation.
- `numpy`: Numerical operations for decay calculations.
- `python-dotenv`: Load environment variables from the `.env` file.
- `nltk`: Sentiment analysis for importance scoring.
This project is licensed under the MIT License. See LICENSE for details.
Contributions are welcome! Please submit a pull request or open an issue for suggestions, bug reports, or enhancements.
This project follows PEP8 style guidelines. When contributing, please ensure your code passes PEP8 validation.
- Built with Pinecone for vector storage.
- Powered by Ollama and the `llama3.2:latest` model.
- Inspired by human memory research and the forgetting curve.
Last Updated: March 12, 2025