🚩
Working from school
Junior undergrad at Peking University📚 Focus on Scalable Oversight / AI Safety / AI Alignment
-
Peking University
- Beijing
- https://cby-pku.github.io/
- https://scholar.google.com/citations?user=o23sDqkAAAAJ&hl=zh-CN&oi=ao
Pinned Loading
-
PKU-Alignment/llms-resist-alignment
PKU-Alignment/llms-resist-alignment Public[ACL2025 Best Paper] Language Models Resist Alignment
-
PKU-Alignment/aligner
PKU-Alignment/aligner Public[NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct
-
AlignmentSurvey
AlignmentSurvey PublicForked from PKU-Alignment/AlignmentSurvey
[ACM Computing Surveys] AI Alignment: A Comprehensive Survey
-
PKU-Alignment/align-anything
PKU-Alignment/align-anything PublicAlign Anything: Training All-modality Model with Feedback
-
DeceptionSurvey
DeceptionSurvey PublicForked from deceptionsurvey/DeceptionSurvey
Shadow of Intelligence: A Comprehensive Survey of AI Deception
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.