🪙 feat: Add `check_embedding_ctx_length` Flag #161

Fjf · 2025-06-20T07:59:50Z

When using an OpenAI compatible endpoint with e.g., vLLM, the referenced models may not be one of the default openai models. The langchain OpenAIEmbeddings will then pre-tokenize the input using a local tiktoken tokenizer, before sending this to the endpoint as tokenized input.
This of course will not work if the tokenizer of the vLLM model is different from the openai tokenizer.

Disabling this check will directly send input to the given endpoint without checking the tokenized input first.
For vLLM I have checked, and giving input larger than the context length will not cause any errors, but it will simply be truncated and will return an embedding as expected.

danny-avila · 2025-07-04T12:52:58Z

@Fjf can you also edit the README to include this new variable?

Fjf · 2025-07-04T14:09:14Z

@danny-avila updated

Add environment variable for disabling 'check_embedding_ctx_length'.

e428240

Fjf mentioned this pull request Jun 24, 2025

Embedding Generation Failure with OpenAI-Compatible Providers (DeepInfra) #163

Closed

Update README.md

f7d36d5

danny-avila changed the title ~~Add environment variable for disabling 'check_embedding_ctx_length'.~~ 🪙 feat: Add check_embedding_ctx_length Flag Jul 4, 2025

danny-avila approved these changes Jul 4, 2025

View reviewed changes

danny-avila merged commit 87396d6 into danny-avila:main Jul 4, 2025
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

🪙 feat: Add `check_embedding_ctx_length` Flag #161

🪙 feat: Add `check_embedding_ctx_length` Flag #161

Uh oh!

Fjf commented Jun 20, 2025

Uh oh!

danny-avila commented Jul 4, 2025

Uh oh!

Fjf commented Jul 4, 2025

Uh oh!

Uh oh!

Uh oh!

🪙 feat: Add check_embedding_ctx_length Flag #161

🪙 feat: Add check_embedding_ctx_length Flag #161

Uh oh!

Conversation

Fjf commented Jun 20, 2025

Uh oh!

danny-avila commented Jul 4, 2025

Uh oh!

Fjf commented Jul 4, 2025

Uh oh!

Uh oh!

Uh oh!

🪙 feat: Add `check_embedding_ctx_length` Flag #161

🪙 feat: Add `check_embedding_ctx_length` Flag #161