Skip to content

Commit 87396d6

Browse files
Fjfduncan
andauthored
🪙 feat: Add check_embedding_ctx_length Flag (#161)
* Add environment variable for disabling 'check_embedding_ctx_length'. * Update README.md --------- Co-authored-by: duncan <[email protected]>
1 parent 2b31473 commit 87396d6

File tree

2 files changed

+5
-0
lines changed

2 files changed

+5
-0
lines changed

README.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -85,6 +85,7 @@ The following environment variables are required to run the application:
8585
- `AWS_ACCESS_KEY_ID`: (Optional) needed for bedrock embeddings
8686
- `AWS_SECRET_ACCESS_KEY`: (Optional) needed for bedrock embeddings
8787
- `GOOGLE_APPLICATION_CREDENTIALS`: (Optional) needed for Google VertexAI embeddings
88+
- `RAG_CHECK_EMBEDDING_CTX_LENGTH` (Optional) Default is true, disabling this will send raw input to the embedder, use this for custom embedding models.
8889

8990
Make sure to set these environment variables before running the application. You can set them in a `.env` file or as system environment variables.
9091

app/config.py

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -179,6 +179,8 @@ async def dispatch(self, request, call_next):
179179
AWS_ACCESS_KEY_ID = get_env_variable("AWS_ACCESS_KEY_ID", "")
180180
AWS_SECRET_ACCESS_KEY = get_env_variable("AWS_SECRET_ACCESS_KEY", "")
181181
GOOGLE_APPLICATION_CREDENTIALS = get_env_variable("GOOGLE_APPLICATION_CREDENTIALS", "")
182+
env_value = get_env_variable("RAG_CHECK_EMBEDDING_CTX_LENGTH", "True").lower()
183+
RAG_CHECK_EMBEDDING_CTX_LENGTH = True if env_value == "true" else False
182184

183185
## Embeddings
184186

@@ -193,6 +195,7 @@ def init_embeddings(provider, model):
193195
openai_api_base=RAG_OPENAI_BASEURL,
194196
openai_proxy=RAG_OPENAI_PROXY,
195197
chunk_size=EMBEDDINGS_CHUNK_SIZE,
198+
check_embedding_ctx_length=RAG_CHECK_EMBEDDING_CTX_LENGTH,
196199
)
197200
elif provider == EmbeddingsProvider.AZURE:
198201
from langchain_openai import AzureOpenAIEmbeddings
@@ -203,6 +206,7 @@ def init_embeddings(provider, model):
203206
azure_endpoint=RAG_AZURE_OPENAI_ENDPOINT,
204207
api_version=RAG_AZURE_OPENAI_API_VERSION,
205208
chunk_size=EMBEDDINGS_CHUNK_SIZE,
209+
check_embedding_ctx_length=RAG_CHECK_EMBEDDING_CTX_LENGTH,
206210
)
207211
elif provider == EmbeddingsProvider.HUGGINGFACE:
208212
from langchain_huggingface import HuggingFaceEmbeddings

0 commit comments

Comments
 (0)