VitalyRomanov
Interested in creating smart applications using the tools of ML and Data Analysis. Specialize in NLP.
VitalyRomanov's Stars
donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
eugeneyan/applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
chroma-core/chroma
The AI-native open-source embedding database
cayleygraph/cayley
An open-source graph database
evidentlyai/evidently
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
argilla-io/argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
ganeshrvel/openmtp
OpenMTP - Advanced Android File Transfer Application for macOS
paperswithcode/galai
Model API for GALACTICA
memgraph/memgraph
Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.
freedmand/semantra
Multi-tool for semantic search
Barre/privaxy
Privaxy is the next generation tracker and advertisement blocker. It blocks ads and trackers by MITMing HTTP(s) traffic. Also check out my new project, https://www.merklemap.com/
UniversalDataTool/universal-data-tool
Collaborate & label any type of data, images, text, or documents, in an easy web interface or desktop app.
https-deeplearning-ai/machine-learning-engineering-for-production-public
Public repo for DeepLearning.AI MLEP Specialization
intel/intel-extension-for-pytorch
A Python package that extends the official PyTorch to obtain better performance on Intel platforms
NVIDIA/aistore
AIStore: scalable storage for AI applications
awslabs/sockeye
Sequence-to-sequence framework with a focus on Neural Machine Translation based on PyTorch
castorini/anserini
Anserini is a Lucene toolkit for reproducible information retrieval research
tborychowski/self-hosted-cookbook
A cookbook of docker-compose based recipes for self-hosted applications and services.
GoogleCloudPlatform/mlops-on-gcp
NicolasLM/bplustree
An on-disk B+tree for Python 3
Koziev/NLP_Datasets
My NLP datasets for the Russian language
allenai/ir_datasets
Provides a common interface to many IR ranking datasets.
vijaydwivedi75/gnn-lspe
Source code for GNN-LSPE (Graph Neural Networks with Learnable Structural and Positional Representations), ICLR 2022
epitron/mitm-adblock
A fast adblocking proxy server (which works on HTTPS connections)
pmezard/adblock
AdBlockPlus parser, matcher and transparent HTTP/HTTPS proxy
helliun/targetedSummarization
TextReducer - A Tool for Summarization and Information Extraction
grill-lab/DL-Hard
Deep Learning Hard (DL-HARD) is a new annotated dataset extending the TREC Deep Learning benchmark.
Oxid15/theme
Minimalistic CLI labeling tool for text classification
UniversalDependencies/UD_Tatar-NMCTT