ÚFAL
Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Prague, Czech Republic
Pinned Repositories
acl2019_nested_ner
Source code for paper Neural Architectures for Nested NER through Linearization
morphodita
MorphoDiTa: Morphologic Dictionary and Tagger
mtmonkey
Distributed infrastructure for Machine Translation web services (using Moses, Python, JSON-RPC/web interface)
neuralmonkey
An open-source tool for sequence learning in NLP built on TensorFlow.
public-license-selector
Tool that will help you select the right open license for your data or software
SimulStreaming
treex
Treex NLP framework
udpipe
UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files
unilib
Embeddable C++17 Unicode library offering UTF encodings, general category info, simple and full casing, normalization forms, and combining marks stripping.
whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
ÚFAL's Repositories
ufal/whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
ufal/unilib
Embeddable C++17 Unicode library offering UTF encodings, general category info, simple and full casing, normalization forms, and combining marks stripping.
ufal/morphodita
MorphoDiTa: Morphologic Dictionary and Tagger
ufal/factgenie
Lightweight self-hosted span annotation tool
ufal/npfl138
Materials for Deep Learning – ÚFAL course NPFL138
ufal/treex
Treex NLP framework
ufal/clarin-dspace
clarin-dspace digital repository based on DSpace and LINDAT/CLARIN DSpace
ufal/lindat-translation
Frontend of LINDAT translation service
ufal/microrestd
MicroRestD is a small C++11 cross-platform REST server built on top of libmicrohttpd http://www.gnu.org/software/libmicrohttpd/.
ufal/npfl139
Materials for Deep Reinforcement Learning – ÚFAL course NPFL139
ufal/hamledt
Makefiles, scenarios and support scripts for the development of HamleDT within the Treex infrastructure
ufal/dockerized-nginx-with-shibboleth
ufal/cpp_builtem
C++ Builtem is a cross-platform Makefile-based build system for C++11
ufal/Glitter
Lexical suprisal estimation and visualisation tool
ufal/maskit
MasKIT: A tool for pseudonymization and anonymization of Czech legal texts.
ufal/UMR
ufal/cpp_utils
UFAL C++ Utils
ufal/R_BEGINNERS_SHORT
tidyverse summerschool
ufal/didaktikon
Exponát pro Didaktikon
ufal/dspace-angular
DSpace 7.x (and above) User Interface built on Angular.io
ufal/edupo
EduPo: Generování české poezie v edukačním a multimediálním prostředí
ufal/hickok
ufal/lindat-shortener
ufal/media-newton
Tools for converting newtonmedia XML format to TEI
ufal/ParlaStats
Parliamentary debates statistics presentation
ufal/ponk
Assistant for clear official communication
ufal/SEEM-CZ
A repository for the project SEEM-CZ: Epistemic and Evidential Markers in Czech
ufal/soudec
Source Detection and Classification
ufal/TEIPipe
Tools for annotating TEI files linguistically
ufal/tsd2025-gec
An official implementation from the Refining Czech GEC: Insights from a Multi-Experiment Approach paper