chore: SWE-Bench related changes #181

Merged: 24 commits, Jul 1, 2025
Commits
0170a6b
fix: Adjust timeout in eval_instance and update training notebook
bradhilton Jun 27, 2025
cc0248c
chore: Update dependencies and versions in pyproject.toml and uv.lock
bradhilton Jun 27, 2025
ba8fe38
refactor: Simplify level assignment in TorchtuneService
bradhilton Jun 27, 2025
a840baa
feat: Add timeout parameter to rollout function
bradhilton Jun 27, 2025
8953a89
refactor: Clean up outputs and execution counts in training notebook
bradhilton Jun 27, 2025
dcb6fbf
feat: Enhance rollout function and training notebook configuration
bradhilton Jun 28, 2025
76f43a1
fix: Update training notebook and TorchtuneService configuration
bradhilton Jun 29, 2025
e7abf85
fix: Increase timeout and adjust training configuration
bradhilton Jun 29, 2025
abd82fa
fix: Adjust timeout in eval_instance and update training notebook
bradhilton Jun 27, 2025
e03e9be
chore: Update dependencies and versions in pyproject.toml and uv.lock
bradhilton Jun 27, 2025
c7e324b
feat: Add timeout parameter to rollout function
bradhilton Jun 27, 2025
3adef91
refactor: Clean up outputs and execution counts in training notebook
bradhilton Jun 27, 2025
c647203
feat: Enhance rollout function and training notebook configuration
bradhilton Jun 28, 2025
408882b
fix: Update training notebook and TorchtuneService configuration
bradhilton Jun 29, 2025
38af95b
fix: Increase timeout and adjust training configuration
bradhilton Jun 29, 2025
97627ac
Merge branch 'feat/swebench' of https://github.com/OpenPipe/agent-rei…
bradhilton Jun 30, 2025
e27fa7f
feat: Integrate Daytona SDK and enhance dependency management
bradhilton Jul 1, 2025
5fb9d2c
feat: Implement logging functionality and enhance test execution flow
bradhilton Jul 1, 2025
360abbc
feat: Improve error handling and logging in test execution
bradhilton Jul 1, 2025
5767e9d
feat: Enhance instance selection and logging in test execution
bradhilton Jul 1, 2025
a3f7202
feat: Add sandbox deletion before testing instances
bradhilton Jul 1, 2025
b76c958
fix: Update instance filtering logic in get_filtered_swe_smith_instan…
bradhilton Jul 1, 2025
a845bdb
feat: Enhance test execution with automatic plugin installation and i…
bradhilton Jul 1, 2025
65f454c
feat: Update instance filtering in get_filtered_swe_smith_instances_df
bradhilton Jul 1, 2025
2 changes: 1 addition & 1 deletion dev/swebench/config.py
@@ -155,7 +155,7 @@ def get_config(
if model.trainable and model.config.xml_function_calling:
completion_kwargs["stop"] = "</function>"
completion_kwargs["include_stop_str_in_output"] = True
- completion_kwargs["timeout"] = 60.0 * 10
+ completion_kwargs["timeout"] = 60.0 * 30
completion_kwargs["metadata"] = {
"trace_id": langfuse_context.get_current_trace_id(),
"parent_observation_id": langfuse_context.get_current_observation_id(),
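
The change to dev/swebench/config.py raises the completion timeout from 10 minutes (600 s) to 30 minutes (1800 s). A minimal sketch of that logic follows, for illustration only. The helper name `build_completion_kwargs` and its parameters are hypothetical and do not appear in the PR. The flattened diff has lost its indentation, so setting the timeout outside the `if` branch is an assumption. The Langfuse metadata is omitted to keep the sketch self-contained.

```python
from typing import Any


def build_completion_kwargs(
    trainable: bool,
    xml_function_calling: bool,
    timeout_minutes: float = 30.0,  # this PR moves the value from 10 to 30
) -> dict[str, Any]:
    """Hypothetical helper mirroring the kwargs set in the diff hunk."""
    completion_kwargs: dict[str, Any] = {}
    if trainable and xml_function_calling:
        # Stop generation at the closing tag and keep the tag in the output.
        completion_kwargs["stop"] = "</function>"
        completion_kwargs["include_stop_str_in_output"] = True
    # Assumption: the timeout applies to every request, expressed in seconds.
    completion_kwargs["timeout"] = 60.0 * timeout_minutes
    return completion_kwargs


if __name__ == "__main__":
    print(build_completion_kwargs(trainable=True, xml_function_calling=True))
```

Expressing the value as `60.0 * minutes`, as the original code does, keeps the unit visible at the call site.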