malteos
NLP researcher. Currently working on: Document similarity, LLMs, scientific & legal document processing
Berlin, Germany
Pinned Repositories
aspect-document-similarity
Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020
awesome-document-similarity
A curated list of resources on document similarity measures (papers, tutorials, code, ...)
clp-transfer
Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning
legal-document-similarity
Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal Literature Recommendations"
llm-datasets
A collection of datasets for language model pretraining, including scripts for downloading, preprocessing, and sampling.
pytorch-bert-document-classification
Enriching BERT with Knowledge Graph Embedding for Document Classification (PyTorch)
scincl
Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)
semantic-document-relations
Implementation, trained models and result data for the paper "Pairwise Multi-Class Document Classification for Semantic Relations between Wikipedia Articles"
oldp
Open Legal Data Platform
malteos's Repositories
malteos/CmdLineSlideShow
Command-line script for generating rich slide shows from a set of images with transition effects and audio, using ImageMagick and FFmpeg.
malteos/Leaflet.Sim
Leaflet.Sim is a framework for location-based simulations on Leaflet maps. It visualises moving markers, which can change their style, and events over time.
malteos/kibana-reallybettermap
Multiple locations for Kibana's bettermap panel
malteos/news-visualization
News visualization with Elasticsearch and Kibana, including NER, sentiment analysis, and geolocations.
malteos/Wikipedia2Lucene
Import a Wikipedia XML Dump from HDFS to Lucene index or Elasticsearch and retrieve similar Wikipedia articles based on Lucene's MoreLikeThis query.
malteos/apps-android-wikipedia
GitHub mirror of "apps/android/wikipedia" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing)
malteos/Autogenerated-Legal-Comment-for-German-Law
malteos/citolytics-demo
Simple demo script for building a MediaWiki-Citolytics demo based on Wikipedia's simplewiki
malteos/citolytics-docker
malteos/codefor.de
The website for Code for Germany. Includes the blog, projects list and basic info about the group.
malteos/ideenwerkstatt
malteos/IMDBScaperJS
Scraping movie data from IMDB.com with NodeJS
malteos/mediawiki-extensions-CirrusSearch
GitHub mirror of MediaWiki extension CirrusSearch - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing)
malteos/ms-com
Private website
malteos/mschwarzer.github.io
Build a Jekyll blog in minutes, without touching the command line.
malteos/NeuralTapeArt
Neural Artistic Style in Python
malteos/news-please
news-please - an integrated web crawler and information extractor for news that just works.
malteos/Stanko.github.io
My personal blog