Skip to content

Pull requests: huggingface/datatrove

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

updateRayPipelineExecutor naming conventions
#390 opened Sep 20, 2025 by Tavish9 Loading…
fixes #388
#389 opened Sep 5, 2025 by zinccat Loading…
fix: typos
#386 opened Aug 30, 2025 by DeVikingMark Loading…
Add support to load HF dataset from disk
#385 opened Aug 29, 2025 by iamgroot42 Loading…
chore(ci): upgrade checkout to v5
#384 opened Aug 27, 2025 by zkpepe Loading…
docs: fixed a broken link in the documentation stats
#383 opened Aug 26, 2025 by Olexandr88 Loading…
ensure folder_path has consistent usage
#366 opened May 8, 2025 by hynky1999 Loading…
fix bos token missing
#346 opened Feb 13, 2025 by jquesnelle Loading…
Add open-source text extraction libraries
#293 opened Sep 27, 2024 by garrethlee Loading…
Mersenne prime hashing fix.
#200 opened May 28, 2024 by Apsod Loading…
Linewise filters
#125 opened Mar 14, 2024 by guipenedo Draft
ProTip! Type g i on any issue or pull request to go back to the issue listing page.