ckorzen

Pinned Repositories

ad-blog
AD Blog Posts (currently: student project reports)
Language:HTML0 8 237
completesearch
Search engine for semi-structured data (text and structured data) that provides all kinds of intelligent search features (keyword search, autocompletion, faceted search, error-tolerant search, synonym search, semantic search) very efficiently also on very large data.
Language:C++22 5 26
pdfact
A basic tool that extracts the structure from the PDF files of scientific articles.
Language:Java74 7 611
pdftotext-plus-plus
A fast and accurate command line tool for extracting text from PDF files.
Language:C++17 4 190
qlever
Very fast SPARQL Engine, which can handle very large knowledge graphs like the complete Wikidata, offers context-sensitive autocompletion for SPARQL queries, and allows combination with text search. It's faster than engines like Blazegraph or Virtuoso, especially for queries involving large result sets.
Language:C++636 26 62190
dotfiles
My dotfiles
0 1 00
icecite
The repository of Icecite, a research paper management system.
Language:Dart14 2 52
pdf-drawer
A Python project for drawing text and shapes into existing PDF files.
Language:Python0 1 01
pdf-text-extraction-benchmark
A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF documents, especially from scientific articles.
Language:TeX69 6 211
semantic-roles-detection
Language:Python10

ckorzen's Repositories

ckorzen/pdf-text-extraction-benchmark
A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF documents, especially from scientific articles.
Language:TeX69 6 211
ckorzen/icecite
The repository of Icecite, a research paper management system.
Language:Dart14 2 52
ckorzen/dotfiles
My dotfiles
0 1 00
ckorzen/pdf-drawer
A Python project for drawing text and shapes into existing PDF files.
Language:Python0 1 01