pdf2dataset
Easily turn large sets of PDF URLs into a PDF dataset.
Python
Emerging
GitHub
Stars: 8
Forks: n/a
Contributors: 8
Last push: 17mo ago
Recent commits
Release 1.0.2 add content type filtering (58c51f2, vinyesm, 17mo ago)
filter on content type (3502120, vinyesm, 17mo ago)
Update bench_8M.sh (7097632, Marina Vinyes, 17mo ago)
Release 1.0.1 logging + compute_hash + readme (aca9a8f, vinyesm, 17mo ago)
fix tests (502788a, vinyesm, 17mo ago)
fmt (cd65a64, vinyesm, 17mo ago)
fixes #1 (7062db4, vinyesm, 17mo ago)
add compute hash (0c0159c, vinyesm, 17mo ago)
Top contributors
Builders behind this project.
rom1504: 253 commits
vinyesm: 49 commits
borisdayma: 9 commits
gabrielilharco: 6 commits
GeorgiosSmyrnis: 4 commits
dependabot[bot]: 4 commits
Skylion007: 2 commits
bryant1410: 2 commits