Skip to content

Commit 081c059

Browse files
authored
Optimizing-rag-performance-in-librechat.md
This PR adds a detailed blog post written by Henk van Ess that covers how to optimize Retrieval-Augmented Generation (RAG) performance in LibreChat. The guide walks through: Improving vector database performance (PostgreSQL/pgvector) Chunking strategies (CHUNK_SIZE / CHUNK_OVERLAP) Embedding provider options (OpenAI, Azure, Ollama) Retrieval settings (RAG_API_TOP_K) Monitoring and server resource tips It's designed to help developers fine-tune their LibreChat instances for speed and quality. All content is based on hands-on testing and is Markdown-formatted for blog use. Looking forward to feedback — happy to revise if needed!
1 parent 38a0266 commit 081c059

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

pages/blog/optimizing-rag-performance-in-librechat.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -67,7 +67,7 @@ Check with `\di` again. Look for a `hnsw` or `ivfflat` index type.
6767
docker stats vectordb
6868
```
6969

70-
Watch for memory or CPU saturation. PostgreSQL benefits from abundant RAM.
70+
Watch for memory and/or CPU saturation. PostgreSQL benefits from abundant RAM.
7171

7272
#### Optional: Set resource limits in `docker-compose.override.yml`
7373

0 commit comments

Comments
 (0)