Pinned Repositories
chatnoir-api
🔍 Simple, type-safe access to the ChatNoir search API.
chatnoir-chat
chatnoir-copycat
CopyCat is a resource for deduplication in TREC-style experimental setups.
chatnoir-pyterrier
🔍 Use the ChatNoir search engine in PyTerrier.
chatnoir-resiliparse
A robust web archive analytics toolkit
chatnoir-warc-dl
This pipeline extracts data from WARC files on a CPU cluster and streams it to a GPU server for processing.
chatnoir2-indexer
ChatNoir Indexer
chatnoir2-mapfile-generator
ChatNoir HDFS Map File Generator
chatnoir2-webclient
ChatNoir Web Frontend
web-content-extraction-benchmark
Web Content Extraction Benchmark
ChatNoir's Repositories
chatnoir-eu/chatnoir-resiliparse
A robust web archive analytics toolkit
chatnoir-eu/web-content-extraction-benchmark
Web Content Extraction Benchmark
chatnoir-eu/chatnoir2-indexer
ChatNoir Indexer
chatnoir-eu/chatnoir-copycat
CopyCat is a resource for deduplication in TREC-style experimental setups.
chatnoir-eu/chatnoir2-webclient
ChatNoir Web Frontend
chatnoir-eu/chatnoir-api
🔍 Simple, type-safe access to the ChatNoir search API.
chatnoir-eu/chatnoir-warc-dl
This pipeline extracts data from WARC files on a CPU cluster and streams it to a GPU server for processing.
chatnoir-eu/chatnoir2-mapfile-generator
ChatNoir HDFS Map File Generator
chatnoir-eu/chatnoir-chat
chatnoir-eu/chatnoir-pyterrier
🔍 Use the ChatNoir search engine in PyTerrier.
chatnoir-eu/webis-uuid
Webis UUID Generation Tool
chatnoir-eu/aitools3-ie-stopwords
chatnoir-eu/aitools4-aq-web-page-content-extraction
chatnoir-eu/aitools3-ie-languagedetection
chatnoir-eu/chatnoir-warc-indexer
ChatNoir Indexer