RepoAudit

RepoAudit is a repo-level bug detector for general bugs. Currently, it supports the detection of diverse bug types (such as Null Pointer Dereference, Memory Leak, and Use After Free) in multiple programming languages (including C/C++, Java, Python, and Go). It leverages LLMSCAN to parse the codebase and uses LLM to mimic the process of manual code auditing. Compared with existing code auditing tools, RepoAudit offers the following advantages:

🛡️ Compilation-Free Analysis
🌍 Multi-Lingual Support
🐞 Multiple Bug Type Detection
⚙️ Customization Support

News 📰

[June 2025] The preprint of "An LLM Agent for Functional Bug Detection in Network Protocols" has been released, providing the technical details of rfcscan!

[May 2025] 🎉 Our paper "RepoAudit: Automated Code Auditing with Multi-Agent LLM Framework" has been accepted at ICML 2025! 🏆

[March 2025] RepoAudit has helped identify over 100 bugs in open-source projects this quarter!

Agents in RepoAudit

RepoAudit is a multi-agent framework for code auditing. We offer two agent instances in our current version:

MetaScanAgent in metascan.py: Scan the project using tree-sitter–powered parsing-based analyzers and obtains the basic syntactic properties of the program.
DFBScanAgent in dfbscan.py: Perform inter-procedural data-flow analysis as described in this preprint. It detects data-flow bugs, including source-must-not-reach-sink bugs (e.g., Null Pointer Dereference) and source-must-reach-sink bugs (e.g., Memory Leak).

We are keeping implementing more agents and will open-source them very soon. Utilizing DFBScanAgent and other agents, we have discovered hundred of confirmed and fixed bugs in open-source community. You can refer to this bug list.

Installation

Create and activate a conda environment with Python 3.13:

conda create -n repoaudit python=3.13
conda activate repoaudit

Install the required dependencies:

cd RepoAudit
pip install -r requirements.txt

Ensure you have the Tree-sitter library and language bindings installed:
```
cd lib
python build.py
```

Configure the OpenAI API key and Anthropic API key:

export OPENAI_API_KEY=xxxxxx >> ~/.bashrc
export ANTHROPIC_API_KEY=xxxxxx >> ~/.bashrc

Quick Start

We have prepared several benchmark programs in the benchmark directory for a quick start. Some of these are submodules, so you may need to initialize them using the following commands:
```
cd RepoAudit
git submodule update --init --recursive
```
We provide the script src/run_repoaudit.sh to scan files in the benchmark/Java/toy/NPD directory. You can run the following commands:
```
cd src
sh run_repoaudit.sh  # Run the agent DFBScanAgent
```
After the scanning is complete, you can check the resulting JSON and log files.

Parallel Auditing Support

For a large repository, a sequential analysis process may be quite time-consuming. To accelerate the analysis, you can choose parallel auditing. Specifically, you can set the option --max-neural-workers to a larger value. By default, this option is set to 30 for parallel auditing. Also, we have set the parsing-based analysis in a parallel mode by default, which is determined by the option --max-symbolic-workers. The default maximal number of workers is 30.

Website, Documentation and Papers

We have open-sourced the implementation of dfbscan. Other agents in RepoAudit will be released soon. For more information, please visit our website: RepoAudit: Auditing Code As Human.

For more details about tool usage, project architecture, and extensions of RepoAudit, please refer to the following documents:

User Guide: Detailed instructions on installation, configuration, and usage of RepoAudit, including CLI and webUI usage.
Project Architecture: In-depth explanation of RepoAudit's multi-agent framework, including parsing-based analyzers/tools, LLM-driven tools, and agent memory designs.
Extension: Guidelines for customizing RepoAudit to support new bug types and programming languages.
DeepWiki: All-in-one documentation generated by Devin.

If you find our research or tools helpful, please cite the following papers. More technical reports/research papers will be released in the future.

@inproceedings{repoaudit2025,
  title={RepoAudit: An Autonomous LLM-Agent for Repository-Level Code Auditing},
  author={Guo, Jinyao* and Wang, Chengpeng* and Xu, Xiangzhe and Su, Zian and Zhang, Xiangyu},
  booktitle={Proceedings of the 42nd International Conference on Machine Learning},
  year={2025},
  note={*Equal contribution}
}

@article{rfcscan2025,
  title={An LLM Agent for Functional Bug Detection in Network Protocols},
  author={Zheng, Mingwei and Wang, Chengpeng and Liu, Xuwei and Guo, Jinyao and Feng, Shiwei and Zhang, Xiangyu},
  journal={arXiv preprint arXiv:2506.00714},
  year={2025}
}

License

This project is licensed under MIT license.

Contact

For any questions or suggestions, please submit issues or pull requests on GitHub. You can also reach out to our maintainers:

Chengpeng Wang (Purdue University) - [email protected]
Jinyao Guo (Purdue University) - [email protected]
Zhuo Zhang (Columbia University) - [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
.github/workflows		.github/workflows
benchmark		benchmark
docs		docs
img		img
lib		lib
src		src
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

RepoAudit

News 📰

Agents in RepoAudit

Installation

Quick Start

Parallel Auditing Support

Website, Documentation and Papers

License

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 7

Languages

License

PurCL/RepoAudit

Folders and files

Latest commit

History

Repository files navigation

RepoAudit

News 📰

Agents in RepoAudit

Installation

Quick Start

Parallel Auditing Support

Website, Documentation and Papers

License

Contact

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 7

Languages

Packages