
feat: Support AWS Bedrock custom inference profiles #8801


Open · devopsotrator wants to merge 6 commits into main

Conversation

devopsotrator commented:

🎉 Support AWS Bedrock Custom Inference Profiles

Problem

AWS Bedrock custom inference profiles have ARNs that contain no model-name information, so LibreChat cannot recognize the underlying model's capabilities. This makes model-specific features such as thinking and the temperature, topP, and topK parameters unavailable.
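
For context, a foundation model ID names the model directly, while an application inference profile ARN exposes only an opaque profile ID. The values below are illustrative:

# Foundation model ID: model and version are readable from the string
anthropic.claude-3-sonnet-20240229-v1:0

# Custom inference profile ARN: no model information, only a profile ID
arn:aws:bedrock:us-west-2:123456789012:application-inference-profile/if7f34w3k1mv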

Solution

  • Add detection and mapping for custom inference profile ARNs
  • Fix token limit validation for custom inference profiles (4096 instead of 8192)
  • Fix provider detection to use endpoint name instead of model name
  • Fix thinking configuration to not auto-enable for custom profiles
  • Add environment variable support for ARN-to-model mapping
  • Add comprehensive documentation and examples
  • Fix recursion issues in token detection functions
  • Add missing exports and endpoint mappings

Key Features

  • ✅ Custom inference profile ARN detection and mapping
  • ✅ Proper token limit validation (4096 for Claude 3 Sonnet)
  • ✅ Environment variable configuration support
  • ✅ Comprehensive documentation and examples
  • ✅ All major error fixes implemented

Configuration

Users can now map custom inference profile ARNs to their underlying model IDs via the BEDROCK_INFERENCE_PROFILE_MAPPINGS environment variable:

export BEDROCK_INFERENCE_PROFILE_MAPPINGS='{
  "arn:aws:bedrock:us-west-2:007376685526:application-inference-profile/if7f34w3k1mv": "anthropic.claude-3-sonnet-20240229-v1:0"
}'
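
Because the value is a JSON object, multiple profiles can presumably be mapped at once, one entry per ARN. The ARNs and model IDs below are placeholders:

export BEDROCK_INFERENCE_PROFILE_MAPPINGS='{
  "arn:aws:bedrock:us-west-2:123456789012:application-inference-profile/abc123": "anthropic.claude-3-sonnet-20240229-v1:0",
  "arn:aws:bedrock:us-west-2:123456789012:application-inference-profile/def456": "anthropic.claude-3-haiku-20240307-v1:0"
}'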

Issues Resolved

  • ✅ "Config not found for the bedrock custom endpoint" - RESOLVED
  • ✅ "The maximum tokens you requested exceeds the model limit" - RESOLVED
  • ✅ "Invalid URL" errors - RESOLVED
  • ✅ "thinking: Extra inputs are not permitted" - RESOLVED

Testing

The changes are covered by unit tests in api/utils/tokens.spec.js and have been verified end-to-end with custom inference profile ARNs.

Closes #6710

github-advanced-security bot commented:

ESLint found more than 20 potential problems in the proposed changes. Check the Files changed tab for more details.

danny-avila (Owner) commented:

Thanks for this PR!

Can you resolve the ESLint issues?

Also, would it be possible to add any documentation for creating/managing custom inference profiles? I attempted to implement them in LC myself but hit blockers there. This would help me test your implementation so I can merge it.

danny-avila (Owner) commented:

Also, the tests you added in api/utils/tokens.spec.js are failing.

danny-avila marked this pull request as draft on August 1, 2025 at 15:16
Nikita Fedkin added 3 commits August 7, 2025 08:51
- Fix getModelMaxTokens to return undefined instead of 4096 for invalid inputs
- Export BEDROCK_INFERENCE_PROFILE_MAPPINGS for test access
- Update test imports to include missing functions and variables
- Fix test expectations to match actual function behavior
- Add comprehensive documentation for AWS Bedrock custom inference profiles
- Include Python script example for creating inference profiles
- Add troubleshooting guide and best practices
- Document environment variable configuration
- Add usage examples and API integration guide

All tests now passing (80/80)
Resolves ESLint issues related to our changes
- Fix tag format from 'Key=Project,Value=LibreChat' to 'key=Project,value=LibreChat'
- Update documentation with correct AWS CLI syntax
- Clean up Python script and fix model ARN format
- Add example for multiple tags in single --tags parameter
- Test and verify the corrected command works correctly

The AWS CLI command now works as documented.
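
For reference, a minimal sketch of the corrected command, assuming the standard aws bedrock create-inference-profile flags; the profile name, region, foundation-model ARN, and tag values are placeholders:

aws bedrock create-inference-profile \
  --inference-profile-name librechat-claude-3-sonnet \
  --model-source copyFrom=arn:aws:bedrock:us-west-2::foundation-model/anthropic.claude-3-sonnet-20240229-v1:0 \
  --tags key=Project,value=LibreChat key=Environment,value=dev

Note the lowercase key=/value= shorthand, and that multiple tags share a single --tags parameter.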
…iles

- Remove Method 3 (AWS Console) as custom inference profiles cannot be created from console
- Add note clarifying that profiles can only be created via API calls
- Update section to reflect only 2 valid methods: AWS CLI and Python SDK
- Fix documentation accuracy for AWS Bedrock custom inference profiles
ronak21691 commented:

Thanks for raising this PR. Would love to see this in main 👍

devopsotrator marked this pull request as ready for review on August 8, 2025 at 07:50
Successfully merging this pull request may close: [Enhancement]: Support AWS Bedrock custom inference profile (#6710)