yurakuratov's Stars
nomic-ai/gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
ibraheemdev/modern-unix
A collection of modern/faster/saner alternatives to common Unix commands.
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
benfred/py-spy
Sampling profiler for Python programs
muesli/duf
Disk Usage/Free Utility - a better 'df' alternative
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
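The idea behind LoRA can be sketched in a few lines of numpy: the frozen weight W is augmented with a trainable low-rank product B·A, with B initialized to zero so the adapted model starts out identical to the pretrained one. This is a minimal illustration of the math, not the loralib API; all dimensions here are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
d, k, r = 16, 16, 4                  # layer dims and low rank (hypothetical values)

W = rng.normal(size=(d, k))          # pretrained weight, kept frozen
A = rng.normal(size=(r, k)) * 0.01   # trainable down-projection
B = np.zeros((d, r))                 # trainable up-projection, zero-initialized

# adapted forward pass: h = (W + B @ A) @ x
x = rng.normal(size=(k,))
h = W @ x + B @ (A @ x)

# because B starts at zero, the output initially matches the frozen model
assert np.allclose(h, W @ x)
```

Only A and B receive gradient updates during fine-tuning, which is why the number of trainable parameters is r·(d + k) instead of d·k.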
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
arogozhnikov/einops
Flexible and powerful tensor operations for readable and reliable code (for PyTorch, JAX, TensorFlow and others).
NVIDIA/FasterTransformer
Transformer-related optimizations, including BERT and GPT.
mosaicml/composer
Supercharge Your Model Training
Lightning-AI/lit-gpt
Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
pytorch/ignite
High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.
HazyResearch/flash-attention
Fast and memory-efficient exact attention
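"Exact attention" means flash-attention returns the same result as the textbook softmax attention below; it just computes it tile-by-tile so the full n×n score matrix is never materialized. A reference numpy sketch of that computation (shapes hypothetical):

```python
import numpy as np

def attention(q, k, v):
    # reference O(n^2)-memory computation; flash-attention produces the
    # same output without materializing the full score matrix
    scores = q @ k.T / np.sqrt(q.shape[-1])
    scores -= scores.max(axis=-1, keepdims=True)  # numerically stable softmax
    w = np.exp(scores)
    w /= w.sum(axis=-1, keepdims=True)
    return w @ v

rng = np.random.default_rng(0)
q, k, v = (rng.normal(size=(8, 4)) for _ in range(3))
out = attention(q, k, v)
assert out.shape == (8, 4)
```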
baaivision/EVA
EVA Series: Visual Representation Fantasies from BAAI
facebookarchive/MemNN
Memory Networks implementations
pytorch/tnt
A lightweight library for PyTorch training tools and utilities
JonasGeiping/cramming
Cramming the training of a (BERT-type) language model into limited compute.
mryab/efficient-dl-systems
Efficient Deep Learning Systems course materials (HSE, YSDA)
facebookresearch/contriever
Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning
tanelp/tiny-diffusion
A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.
stanford-crfm/BioMedLM
ofirpress/attention_with_linear_biases
Code for the ALiBi method for transformer language models (ICLR 2022)
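ALiBi replaces positional embeddings with a static, head-specific linear penalty added to attention scores: queries attend less to keys that are farther away. A numpy sketch of the bias construction, following the paper's description (slope values and sequence length here are illustrative):

```python
import numpy as np

def alibi_slopes(n_heads):
    # geometric slopes 2^(-8*i/n) for heads i = 1..n, as described in the paper
    start = 2 ** (-8 / n_heads)
    return np.array([start ** (h + 1) for h in range(n_heads)])

def alibi_bias(seq_len, slope):
    # bias[i, j] = -slope * (i - j) for keys j <= i; future keys are masked out
    i = np.arange(seq_len)[:, None]
    j = np.arange(seq_len)[None, :]
    bias = -slope * (i - j).astype(float)
    bias[j > i] = -np.inf  # causal mask
    return bias

b = alibi_bias(4, 0.5)
assert b[2, 0] == -1.0   # distance 2, slope 0.5
assert b[1, 1] == 0.0    # zero penalty at the current position
```

The bias matrix is simply added to q·kᵀ scores before the softmax, so no learned position parameters are needed and the model extrapolates to longer sequences at inference time.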
lucidrains/recurrent-memory-transformer-pytorch
Implementation of Recurrent Memory Transformer (NeurIPS 2022 paper) in PyTorch.
google-research/meliad
tk-rusch/LEM
Official code for Long Expressive Memory (ICLR 2022, Spotlight)
sismetanin/sentiment-analysis-in-russian
Fine-tuned Multilingual BERT and Multilingual USE for sentiment analysis in Russian. RuReviews, RuSentiment, the Kaggle Russian News Dataset, LINIS Crowd, and RuTweetCorp were used as training data.
mlcommons/training_results_v2.1
This repository contains the results and code for the MLPerf™ Training v2.1 benchmark.
AIRI-Institute/al_toolbox
Active learning