CitationGenerator: This tool is designed for Zlin's students to simplify citation management.
CitationGenerator has two modes, controlled by setting GetPDF in docx_gen.py. You can set GetPDF=True to crawl citation PDF files and generate a Word document for each paper (note that some PDF files require database access and cannot be crawled). Alternatively, you can set GetPDF=False to generate Word documents directly using only local citation PDF files.
I advise you to run docx_gen.py with GetPDF=True first to download the PDF files that can be crawled and get an initial Word document. Then, download the remaining PDFs manually using the web hyperlinks in the Word documents. After that, run with GetPDF=False to generate the final Word document. This way, you won't even need to manually adjust the hyperlinks in the Word documents.
It is recommended to create a Python virtual environment with conda to install CitationGenerator.
git clone https://github.com/chenhengzh/CitationGenerator.git
cd CitationGenerator
conda create -n citgen python=3.10
conda activate citgen
pip install -r requirements.txt

- Put citation data in paper_list/, such as paper_list/papername/data/
- Set GetPDF=True in docx_gen.py and run docx_gen.py. This may take several hours, so using tmux is recommended.
- Download the remaining PDFs manually into paper_list/papername/
- Set GetPDF=False and run docx_gen.py again to update the hyperlinks in Word documents.
Some data has been provided in the paper_list/ directory; you can use it to test first.
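Before the manual download step, it can help to see how many PDFs the first run actually fetched for each paper. The script below is a minimal sketch and is not part of CitationGenerator: the helper name count_pdfs is hypothetical, and it assumes PDFs sit directly inside each paper_list/papername/ folder, as described in the steps above.

```python
from pathlib import Path


def count_pdfs(paper_list_dir):
    """Return {paper folder name: number of PDF files found in that folder}.

    Hypothetical helper, not part of CitationGenerator.
    """
    root = Path(paper_list_dir)
    counts = {}
    for paper_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        # Assumption: PDFs are stored directly in paper_list/<papername>/,
        # so subfolders such as data/ are not searched.
        pdfs = [
            f for f in paper_dir.iterdir()
            if f.is_file() and f.suffix.lower() == ".pdf"
        ]
        counts[paper_dir.name] = len(pdfs)
    return counts


if __name__ == "__main__":
    root = Path("paper_list")
    if root.is_dir():
        for name, n in count_pdfs(root).items():
            print(f"{name}: {n} PDF file(s)")
    else:
        print("paper_list/ not found; run this from the repository root.")
```

Running it from the repository root after the GetPDF=True pass, and again after the manual downloads, gives a quick check that the new files landed in the right folders before the GetPDF=False pass.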