ASTRA: Autonomous Spatial-Temporal Red-teaming for AI Software Assistants

🏆 Red-Team Winner of Amazon Nova AI Challenge - First-ever global tournament where elite university teams battle to harden and hack AI coding assistants

📰 News

🏆 Latest Achievement

🥇 Amazon Nova AI Challenge Winner - ASTRA emerged victorious as the top attacking team in Amazon's global AI safety competition, defeating elite defending teams from universities worldwide in live adversarial evaluation.

🎯 Key Highlights

🏆 Winner of Amazon Nova AI Challenge - Top attacking team category
🥇 $250,000 Prize - Awarded for winning the competition
📊 >=90% Success Rate - In AI assistant safety assessment

📰 Media Coverage

Amazon Science - Official announcement of ASTRA as the winning red team tool

🎯 About

ASTRA System Overview

ASTRA (Autonomous Spatial-Temporal Red-teaming for AI Software Assistants) is a full lifecycle red-teaming system that builds structured domain-specific knowledge graphs and performs online vulnerability exploration by adaptively probing both input space (spatial) and reasoning processes (temporal).

🚀 What Makes ASTRA Different

Unlike existing tools that are either static benchmarks or jailbreak attempts on given benchmarks, ASTRA operates as a complete red-teaming solution:

🔍 1. Structural Domain Modeling

Given a target domain, ASTRA performs structural modeling and generates high-quality violation-inducing prompts
No pre-defined benchmarks required - ASTRA creates its own test cases systematically

💬 2. Multi-turn Conversation Framework

Uses generated prompts as starting points for comprehensive testing
Conducts adaptive multi-round conversations with target systems based on responses
Temporal Exploration: Identifies weak links in target system reasoning traces and dynamically adjusts test prompts to exploit discovered vulnerabilities

🎯 3. Self-Evolving Red-teaming

Self-evolving capability: Records successful cases and adjusts sampling strategies to target similar prompts, gradually improving success rates
Autonomous operation: No human intervention required during testing

🚀 Quick Start

✅ Prerequisites

🐍 Python 3.9+
📦 Required dependencies (see requirements.txt)
🔑 API access to LLM providers (OpenAI, Anthropic, etc.)

🛠️ Installation

git clone https://github.com/PurCL/ASTRA
cd ASTRA
pip install -r requirements.txt

▶️ Basic Usage

ASTRA consists multiple stages from knowledge graph construction to online adaptive red-teaming. This section provides a convenient guide on how to run the online adaptive red-teaming component with a new target model.
For detailed usage instructions, see 📘 USAGE.md.

ASTRA comes with prompts generated for secure code generation and security event guidance domains. You can directly use those prompts to test your target model.

🧰 Specify the configure of your model at resources/client-config.yaml.
Then run the following command to start the online adaptive red-teaming process:

python3 online/main.py --model_name <name of the blue team model> --log <path to the output log file> --n_session <number of chat sessions> --n_probing <number of initial probing sessions before the chat sessions> --n_turn <maximum number of turns per session>

For example,

python3 online/main.py --model_name phi4m --log log_out/phi4m.jsonl --n_session 200 --n_probing 0 --n_turn 5

📝 This will run 200 chat sessions with the target model phi4m, each with up to 5 turns, and log the results to log_out/phi4m.jsonl.

📧 Contact

For questions, collaborations, or feedback, please contact:

Xiangzhe Xu - [email protected]
Guangyu Shen - [email protected]

We welcome academic collaborations and industry partnerships!

📄 Citation

If you find ASTRA useful in your research, please cite our paper:

@article{xu2025astra,
  title={ASTRA: Autonomous Spatial-Temporal Red-teaming for AI Software Assistants},
  author={Xu, Xiangzhe and Shen, Guangyu and Su, Zian and Cheng, Siyuan and Guo, Hanxi and Yan, Lu and Chen, Xuan and Jiang, Jiasheng and Jin, Xiaolong and Wang, Chengpeng and others},
  journal={arXiv preprint arXiv:2508.03936},
  year={2025}
}

🙏 Acknowledgments

We would like to thank the following projects and communities for their inspiration and support:

Amazon Nova AI Challenge - For providing the platform and resources that enabled ASTRA's development and validation

Made with ❤️ for AI Safety Research

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
agent		agent
assets		assets
data_out		data_out
enumerator		enumerator
kg		kg
online		online
resources		resources
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
USAGE.md		USAGE.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ASTRA: Autonomous Spatial-Temporal Red-teaming for AI Software Assistants

📰 News

🏆 Latest Achievement

🎯 Key Highlights

📰 Media Coverage

🎯 About

🚀 What Makes ASTRA Different

🔍 1. Structural Domain Modeling

💬 2. Multi-turn Conversation Framework

🎯 3. Self-Evolving Red-teaming

🚀 Quick Start

✅ Prerequisites

🛠️ Installation

▶️ Basic Usage

📧 Contact

📄 Citation

🙏 Acknowledgments

About

Uh oh!

Releases

Packages

Languages

License

PurCL/ASTRA

Folders and files

Latest commit

History

Repository files navigation

ASTRA: Autonomous Spatial-Temporal Red-teaming for AI Software Assistants

📰 News

🏆 Latest Achievement

🎯 Key Highlights

📰 Media Coverage

🎯 About

🚀 What Makes ASTRA Different

🔍 1. Structural Domain Modeling

💬 2. Multi-turn Conversation Framework

🎯 3. Self-Evolving Red-teaming

🚀 Quick Start

✅ Prerequisites

🛠️ Installation

▶️ Basic Usage

📧 Contact

📄 Citation

🙏 Acknowledgments

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages