wikidump-parser
There are 4 repositories under wikidump-parser topic.
david-smejkal/wiki2txt
A tool to extract plain (unformatted) multilingual text, redirects, links and categories from wikipedia backups (dumps). Designed to prepare clean training data for AI training / Machine Learning software.
shaltielshmid/ZimReaderSharp
A .NET library designed for streamlined reading and handling of ZIM files.
SirCremefresh/wiki-to-neo4j-csv-parser
Convert Wikipedia dumps to Neo4j loadable CSVs, efficiently transforming Wikipedia data for graph database usage.
HappyBravo/RAG_KG_LLM_Experiment
A repo for my MS Project titled "Fake-news detection".