Skip to content

Automatically crawl arXiv papers daily and summarize them using AI. Illustrating them using GitHub Pages.

License

dw-dengwei/daily-arXiv-ai-enhanced

Repository files navigation

🚀 daily-arXiv-ai-enhanced

Caution

若您所在法域对学术数据有审查要求,谨慎运行本代码;任何二次分发版本必须履行合规审查(包括但不限于原始论文合规性、AI合规性)义务,否则一切法律后果由下游自行承担。

Caution

If your jurisdiction has censorship requirements for academic data, run this code with caution; any secondary distribution version must remove the entrance accessible to China and fulfill the content review obligations, otherwise all legal consequences will be borne by the downstream.

This innovative tool transforms how you stay updated with arXiv papers by combining automated crawling with AI-powered summarization.

✨ Key Features

🎯 Zero Infrastructure Required

  • Leverages GitHub Actions and Pages - no server needed
  • Completely free to deploy and use

🤖 Smart AI Summarization

  • Daily paper crawling with DeepSeek-powered summaries
  • Cost-effective: Only ~0.2 CNY per day

💫 Smart Reading Experience

  • Personalized paper highlighting based on your interests
  • Cross-device compatibility (desktop & mobile)
  • Local preference storage for privacy
  • Flexible date range filtering

👉 Try it now! - No installation required

video-1080p.mp4

How to use

This repo will daily crawl arXiv papers about cs.CV, cs.GR, cs.CL and cs.AI, and use DeepSeek to summarize the papers in Chinese. If you wish to crawl other arXiv categories, use other LLMs, or other languages, please follow the instructions. Otherwise, you can directly use this repo in https://dw-dengwei.github.io/daily-arXiv-ai-enhanced/. Please star it if you like :)

Instructions:

  1. Fork this repo to your own account
  2. Go to: your-own-repo -> Settings -> Secrets and variables -> Actions
  3. Go to Secrets. Secrets are encrypted and used for sensitive data
  4. Create two repository secrets named OPENAI_API_KEY and OPENAI_BASE_URL, and input corresponding values.
  5. [Optional] Set a password in secrets.ACCESS_PASSWORD if you do not wish others to access your page. (see #64)
  6. Go to Variables. Variables are shown as plain text and are used for non-sensitive data
  7. Create the following repository variables:
    1. CATEGORIES: separate the categories with ",", such as "cs.CL, cs.CV"
    2. LANGUAGE: such as "Chinese" or "English"
    3. MODEL_NAME: such as "deepseek-chat"
    4. EMAIL: your email for push to GitHub
    5. NAME: your name for push to GitHub
  8. Go to your-own-repo -> Actions -> arXiv-daily-ai-enhanced
  9. You can manually click Run workflow to test if it works well (it may take about one hour). By default, this action will automatically run every day. You can modify it in .github/workflows/run.yml
  10. Set up GitHub pages: Go to your own repo -> Settings -> Pages. In Build and deployment, set Source="Deploy from a branch", Branch="main", "/(root)". Wait for a few minutes, go to https://<username>.github.io/daily-arXiv-ai-enhanced/. Please see this issue for more precise instructions.

Plans

See https://github.com/users/dw-dengwei/projects/3

Contributors

Thanks to the following special contributors for contributing code, discovering bugs, and sharing useful ideas for this project!!!

JianGuanTHU
JianGuanTHU

Chi-hong22
Chi-hong22

chaozg
chaozg

quantum-ctrl
quantum-ctrl

Zhao2z
Zhao2z

eclipse0922
eclipse0922

xuemian168
xuemian168

Acknowledgement

We sincerely thank the following individuals and organizations for their promotion and support!!!

Github_Daily
Github_Daily

AIGCLINK
AIGCLINK

阮一峰的网络日志
阮一峰的网络日志
科技爱好者周刊
(第 353 期)

《HelloGitHub》第 111 期
《HelloGitHub》
月刊第 111 期

Star history

Stargazers over time

About

Automatically crawl arXiv papers daily and summarize them using AI. Illustrating them using GitHub Pages.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published