forked from BerriAI/litellm
Update Prisma Migrations #3
Open

github-actions wants to merge 46 commits into main from feat/prisma-migration-
Conversation
Redis Session Patch (COMPLETE)

- Problem: conversation context lost due to the 10-second batch-processing delay
- Solution: Redis-based immediate session storage with graceful fallback
- Status: production-ready with comprehensive testing

As discussed in BerriAI#12364
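The approach is straightforward: write the session to Redis the moment a response is available, and fall back to the existing batch path if Redis is unavailable. A minimal sketch, assuming a redis-py client; the helper name, key format, and TTL are illustrative, not LiteLLM internals:

```python
import json
import redis

def store_session_immediately(redis_client: "redis.Redis", response_id: str,
                              messages: list, ttl: int = 3600) -> bool:
    """Write the conversation to Redis as soon as the response is ready,
    instead of waiting for the 10-second batch writer."""
    try:
        redis_client.setex(f"session:{response_id}", ttl, json.dumps(messages))
        return True
    except redis.RedisError:
        # Graceful fallback: the existing batch path persists the session later.
        return False
```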
Fixes streaming ID inconsistency where streaming responses used raw provider IDs while non-streaming responses used properly encoded IDs with provider context.

Changes:
- Updated LiteLLMCompletionStreamingIterator to accept provider context
- Added _encode_chunk_id() method using the same logic as non-streaming responses
- Modified chunk transformation to encode all streaming item_ids with the resp_ prefix
- Updated handlers to pass custom_llm_provider and litellm_metadata to the streaming iterator

Impact:
- Streaming chunk IDs now use the format resp_<base64_encoded_provider_context>
- Enables session continuity when using streaming response IDs as previous_response_id
- Allows provider detection and load balancing with streaming responses
- Maintains backward compatibility with existing streaming functionality

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>
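The encoded format is described as resp_<base64_encoded_provider_context>. A hedged sketch of what such an encoding could look like, assuming a JSON payload for the provider context; the helper names and payload keys are assumptions, not the actual _encode_chunk_id() implementation:

```python
import base64
import json
from typing import Optional

def encode_response_id(raw_id: str, custom_llm_provider: str, model_id: Optional[str]) -> str:
    """Hypothetical stand-in for the described _encode_chunk_id() logic:
    bundle the raw provider ID with provider context and emit a
    resp_-prefixed, base64-encoded ID."""
    payload = {
        "id": raw_id,
        "custom_llm_provider": custom_llm_provider,
        "model_id": model_id,
    }
    encoded = base64.urlsafe_b64encode(json.dumps(payload).encode()).decode()
    return f"resp_{encoded}"

def decode_response_id(encoded_id: str) -> dict:
    """Reverse operation, e.g. for provider detection and load balancing."""
    assert encoded_id.startswith("resp_")
    return json.loads(base64.urlsafe_b64decode(encoded_id[len("resp_"):]))
```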
This resolves a MyPy type-checking error where model_id could be None but wasn't explicitly typed as Optional[str].
Prevents the 'Item None has no attribute get' error by checking for None before accessing the litellm_metadata dictionary.
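Both fixes are small defensive changes; a minimal sketch of the pattern (the function name and metadata key are assumed):

```python
from typing import Optional

def get_model_id(litellm_metadata: Optional[dict]) -> Optional[str]:
    # Explicit Optional[str] return type satisfies MyPy, and the None check
    # avoids the 'Item None has no attribute "get"' error when metadata is absent.
    if litellm_metadata is None:
        return None
    return litellm_metadata.get("model_id")
```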
Adds unit and E2E tests to verify streaming chunk IDs are properly encoded with a consistent format across streaming responses.

## Tests Added

### Unit Test (test_reasoning_content_transformation.py)
- `test_streaming_chunk_id_encoding()`: Validates that the `_encode_chunk_id()` method correctly encodes chunk IDs with the `resp_` prefix and provider context

### E2E Tests (test_e2e_openai_responses_api.py)
- `test_streaming_id_consistency_across_chunks()`: Tests that all streaming chunk IDs are properly encoded across multiple chunks in a real streaming response
- `test_streaming_response_id_as_previous_response_id()`: Tests the core use case of using streaming response IDs for session continuity with `previous_response_id`

## Key Testing Approach
- Uses **Gemini** (a non-OpenAI model) to exercise the transformation logic rather than the OpenAI passthrough, since the streaming ID consistency issue occurs when LiteLLM transforms responses rather than passing them through to the native OpenAI Responses API
- Validates that streaming chunk IDs now use the same encoding as non-streaming responses
- Verifies session continuity works with streaming responses

Addresses @ishaan-jaff's request for unit tests covering the streaming ID consistency fix.

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>
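A pytest-style sketch of what the unit test checks, reusing the hypothetical encode_response_id/decode_response_id helpers from the sketch above (the real test exercises _encode_chunk_id on the streaming iterator):

```python
def test_streaming_chunk_id_encoding():
    encoded = encode_response_id("chatcmpl-abc123", "gemini", "model-group-1")

    # Encoded IDs must carry the resp_ prefix used by non-streaming responses.
    assert encoded.startswith("resp_")

    # The provider context must be recoverable, enabling session continuity
    # and provider-aware load balancing from a streaming response ID.
    decoded = decode_response_id(encoded)
    assert decoded["id"] == "chatcmpl-abc123"
    assert decoded["custom_llm_provider"] == "gemini"
```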
Removes unused imports to fix CI linting errors:
- GenericResponseOutputItem
- OutputFunctionToolCall
Remove streaming ID consistency E2E tests as requested by @ishaan-jaff. Keep only the mock/unit test in test_reasoning_content_transformation.py
This reverts the streaming chunk ID encoding changes to understand the original issue better.

Original behavior:
- Streaming chunks: raw provider IDs
- Streaming final response: raw IDs (PROBLEM!)
- Non-streaming final response: encoded IDs (correct)

The real issue: streaming final response IDs were not encoded, breaking session continuity.
…ehavior

Fixes streaming ID inconsistency to match OpenAI's Responses API behavior:
- Streaming chunks: raw message IDs (like OpenAI's msg_xxx)
- Final response: encoded IDs (like OpenAI's resp_xxx)

This enables session continuity by ensuring streaming final response IDs have the same encoded format as non-streaming responses, allowing them to be used as previous_response_id in follow-up requests.

Changes:
- Add custom_llm_provider and litellm_metadata to LiteLLMCompletionStreamingIterator
- Update handlers to pass provider context to the streaming iterator
- Apply _update_responses_api_response_id_with_model_id to the final streaming response
- Keep streaming chunks as raw IDs to match the OpenAI format

Impact:
- Session continuity works with streaming responses
- Load balancing can detect the provider from streaming final response IDs
- Format matches OpenAI's Responses API exactly

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>
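A sketch of where the encoding is applied under this behavior: chunks pass through with raw IDs, and only the final aggregated response gets the encoded ID. It reuses the hypothetical encode_response_id helper from the earlier sketch; the class and method names are illustrative, not the actual LiteLLMCompletionStreamingIterator API:

```python
from typing import Optional

class StreamingIdSketch:
    """Illustrative only: chunk IDs stay raw (msg_xxx-style), and only the
    final aggregated response gets the encoded resp_xxx ID."""

    def __init__(self, custom_llm_provider: str, model_id: Optional[str]):
        self.custom_llm_provider = custom_llm_provider
        self.model_id = model_id

    def transform_chunk(self, chunk: dict) -> dict:
        # Streaming chunks keep raw message IDs, matching OpenAI's format.
        return chunk

    def finalize(self, final_response: dict) -> dict:
        # Only the final response ID is encoded, so it can be reused as
        # previous_response_id in follow-up requests.
        final_response["id"] = encode_response_id(
            final_response["id"], self.custom_llm_provider, self.model_id
        )
        return final_response
```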
Updates the unit test to verify streaming chunk IDs are raw (not encoded) to match OpenAI's Responses API format:
- Streaming chunks: raw message IDs (like msg_xxx)
- Final response: encoded IDs (like resp_xxx)

This reflects the correct behavior implemented in the fix.
- Add test_responses_api.py for testing multiple providers
- Add responses_api_config.yaml with Claude, DeepSeek, and Gemini
- Add RESPONSES_API_TEST_README.md with setup instructions
- Tests session management with Redis for context retention
- Validates basic responses, streaming, and session linking (see the usage sketch below)
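A sketch of the session-linking flow these tests cover, assuming a LiteLLM proxy running locally with responses_api_config.yaml loaded and the OpenAI SDK's Responses client pointed at it; the base URL, API key, and model alias are placeholders:

```python
from openai import OpenAI

# Points at a locally running LiteLLM proxy loaded with responses_api_config.yaml
# (base URL, key, and model alias are placeholders).
client = OpenAI(base_url="http://localhost:4000", api_key="sk-local-test")

# Basic response.
first = client.responses.create(model="gemini-flash", input="Remember the number 42.")

# Session linking: pass the previous response ID so the proxy can restore the
# conversation context (kept in Redis in this setup).
follow_up = client.responses.create(
    model="gemini-flash",
    input="What number did I ask you to remember?",
    previous_response_id=first.id,
)
print(follow_up.output_text)
```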
The Responses API wasn't storing sessions in Redis for streaming requests, only for non-streaming ones. This caused context to be lost when using previous_response_id with streaming responses.

Changes:
- Add _store_session_in_redis method to the streaming iterator
- Store the full conversation history immediately when the stream completes
- Pass litellm_completion_request to the streaming iterator for message history
- Ensures streaming behaves identically to non-streaming for session storage

This fixes the timing issue where a delay was needed between requests to allow batch processing to store sessions.

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>
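A sketch of how the streaming-side storage could hook in, reusing store_session_immediately() from the earlier Redis sketch: buffer the streamed text, then persist the full conversation as soon as the provider stream is exhausted. The class, attribute, and chunk-field names here are assumptions, not the actual streaming iterator internals:

```python
class SessionStoringStreamSketch:
    """Wraps the provider stream; once it is exhausted, the full conversation
    history is written to Redis immediately, so streaming behaves the same as
    non-streaming for previous_response_id lookups."""

    def __init__(self, stream, response_id: str, request_messages: list, redis_client):
        self._stream = stream
        self._response_id = response_id
        self._messages = list(request_messages)  # from litellm_completion_request
        self._redis = redis_client
        self._assistant_text: list = []

    def __aiter__(self):
        return self

    async def __anext__(self):
        try:
            chunk = await self._stream.__anext__()
        except StopAsyncIteration:
            # Stream finished: store the session right away instead of waiting
            # for the batch writer.
            self._store_session_in_redis()
            raise
        self._assistant_text.append(chunk.get("delta", ""))
        return chunk

    def _store_session_in_redis(self) -> None:
        history = self._messages + [
            {"role": "assistant", "content": "".join(self._assistant_text)}
        ]
        store_session_immediately(self._redis, self._response_id, history)
```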
…y' into chore/merge-streaming-id-consistency
Port streaming ID consistency fixes from `jatorre/feature/streaming-id-consistency` to `main`
…ssion-timing Jatorre/fix/responses api redis session timing
Configure scheduler with memory leak prevention settings
fix mcp_table server_name error & remove arm64 platform from dockerfile
Automatic sync from upstream BerriAI/litellm

Preparing for the v1.78.5-stable release.

Strategy: accept all upstream changes (main is a mirror).
Branch force-pushed from 3361271 to 1db91eb
Branch force-pushed from 044e260 to 5fde83d
Auto-generated migration based on schema.prisma changes.
Generated files: