Spaces: lhoestq/Common-Crawl-Pipeline-Creator
1 contributor
History: 10 commits
Latest commit: lhoestq (HF staff), "workaround dataframe bug", 0b712fa, 4 months ago
Name                         Size       Scan  Last commit               Updated
data                         -          -     view pipeline result      4 months ago
images                       -          -     view pipeline result      4 months ago
output_text_extraction-2k    -          -     view pipeline result      4 months ago
output_text_extraction-full  -          -     stream on full warc       4 months ago
.gitattributes               1.52 kB    Safe  initial commit            4 months ago
README.md                    263 Bytes  Safe  update readme             4 months ago
app.py                       32.2 kB    Safe  workaround dataframe bug  4 months ago
requirements.txt             72 Bytes   Safe  update requirements.txt   4 months ago