Pinned Repositories
ad-blog
AD Blog Posts (currently: student project reports)
completesearch
Search engine for semi-structured data (text and structured data) that efficiently provides a range of intelligent search features (keyword search, autocompletion, faceted search, error-tolerant search, synonym search, semantic search), even on very large datasets.
pdfact
A basic tool that extracts the structure from the PDF files of scientific articles.
pdftotext-plus-plus
A fast and accurate command line tool for extracting text from PDF files.
qlever
Very fast SPARQL engine that can handle very large knowledge graphs like the complete Wikidata, offers context-sensitive autocompletion for SPARQL queries, and supports combining SPARQL with text search. It is faster than engines like Blazegraph or Virtuoso, especially for queries involving large result sets.
dotfiles
My dotfiles
icecite
The repository of Icecite, a research paper management system.
pdf-drawer
A Python project for drawing text and shapes into existing PDF files.
pdf-text-extraction-benchmark
A project for benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body text of PDF documents, especially scientific articles.
semantic-roles-detection
ckorzen's Repositories
ckorzen/pdf-text-extraction-benchmark
A project for benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body text of PDF documents, especially scientific articles.
ckorzen/icecite
The repository of Icecite, a research paper management system.
ckorzen/dotfiles
My dotfiles
ckorzen/pdf-drawer
A Python project for drawing text and shapes into existing PDF files.