Add exponential retry logic for gemini models #764

gabrielibagon · 2024-11-15T23:48:35Z

This is to avoid the following error with long context on Gemini models due to insufficient quota:

Error: 429 Resource exhausted. Please try again later. Please refer to https://cloud.google.com/vertex-ai/generative-ai/docs/quotas#error-code-429 for more details.

This approach uses exponential backoff retries when encountering a ResourceExhausted error.

HuanzhiMao

Thank you very much for the PR @gabrielibagon! It is very helpful.

gabrielibagon and others added 4 commits November 15, 2024 15:45

Add exponential retry logic for gemini models

94059f2

Update pyproject.toml

9798ea6

Merge branch 'main' into pr/gabrielibagon/764

d85c0da

use generate_with_backoff for prompting mode as well

56d6a59

HuanzhiMao added the BFCL-General General BFCL Issue label Nov 17, 2024

HuanzhiMao approved these changes Nov 17, 2024

View reviewed changes

HuanzhiMao merged commit 291904c into ShishirPatil:main Nov 17, 2024

CharlieJCJ mentioned this pull request Nov 17, 2024

[BFCL] Leaderboard Update, 11/17/2024 #748

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add exponential retry logic for gemini models #764

Add exponential retry logic for gemini models #764

Uh oh!

gabrielibagon commented Nov 15, 2024

Uh oh!

HuanzhiMao left a comment

Uh oh!

Uh oh!

Add exponential retry logic for gemini models #764

Add exponential retry logic for gemini models #764

Uh oh!

Conversation

gabrielibagon commented Nov 15, 2024

Uh oh!

HuanzhiMao left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!