Official code for Letta on Terminal-Bench.
Check out our technical blog for more information: Blog
tb run \
--dataset-name terminal-bench-core \
--dataset-version 0.1.1 \
--agent-import-path letta-agent.letta_agent_v1:LettaAgent \
--n-concurrent 4 --model anthropic/claude-sonnet-4-20250514
# note this script supports --model anthropic/*. However, one can modify LlmConfig to use other models.