codemurt
Data Scientist | Computational Linguistics, HSE University
Yekaterinburg, Russian Federation
codemurt's Stars
deepseek-ai/DeepSeek-R1
browser-use/browser-use
Make websites accessible for AI agents
DS4SD/docling
Get your documents ready for gen AI
huggingface/open-r1
Fully open reproduction of DeepSeek-R1
pydantic/pydantic
Data validation using Python type hints
huggingface/smolagents
🤗 smolagents: a barebones library for agents that think in python code.
HabitRPG/habitica
A habit tracker app which treats your goals like a Role Playing Game.
Flowseal/zapret-discord-youtube
LibreTranslate/LibreTranslate
Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to setup.
gruns/icecream
🍦 Never use print() to debug again.
DrewThomasson/ebook2audiobook
Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages!
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
abbodi1406/KMS_VL_ALL_AIO
Smart Activation Script
oumi-ai/oumi
Everything you need to build state-of-the-art foundation models, end-to-end.
ggerganov/ggwave
Tiny data-over-sound library
simplescaling/s1
s1: Simple test-time scaling
OpenNMT/CTranslate2
Fast inference engine for Transformer models
agentica-project/deepscaler
Democratizing Reinforcement Learning for LLMs
beir-cellar/beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
AnswerDotAI/ModernBERT
Bringing BERT into modernity via both architecture changes and scaling
NJU-PCALab/STAR
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
huggingface/picotron
Minimalistic 4D-parallelism distributed training framework for education purpose
facebookresearch/SONAR
SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.
EmilStenstrom/conllu
A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.
facebookresearch/stopes
A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB team.
jpjacobpadilla/Google-Colab-Selenium
The best way to use Selenium in Google Colab Notebooks!
wannaphong/ttsmms
TTS with The Massively Multilingual Speech (MMS) project
salute-developers/GigaAM
Foundational Model for Speech Recognition Tasks
PavelLaptev/Fliege-mono
A free monospace font
timarkh/tsakorpus
Yet another search platform for linguistic corpora.