Pinned Repositories
dmitrymailk's Repositories
dmitrymailk/mt_bench_ru
dmitrymailk/verbalist
dmitrymailk/ssh_remote
dmitrymailk/any_ner
dmitrymailk/audio
dmitrymailk/bot
dmitrymailk/cython_lesson
dmitrymailk/data_processing
dmitrymailk/DeepSpeedExamples
Example models using DeepSpeed
dmitrymailk/dolly
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
dmitrymailk/efficient-dl-systems
dmitrymailk/efficient_transformers
dmitrymailk/fucking_haskel
dmitrymailk/go_volsu
dmitrymailk/gpus_go_brrrr
dmitrymailk/k3s_pod_network_issue
dmitrymailk/llama.cpp
Port of Facebook's LLaMA model in C/C++
dmitrymailk/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
dmitrymailk/neuroevolution
dmitrymailk/open_orca_ru
dmitrymailk/python-performance
Repository for the book Fast Python - published by Manning
dmitrymailk/pytorch_cpp
dmitrymailk/ru_chatGPT
dmitrymailk/ru_lm
Большая языковая модель для следования инструкциям на русском языке
dmitrymailk/ru_lm_hydra
dmitrymailk/t5_optimization
dmitrymailk/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
dmitrymailk/tensorrt_devcontainer
dmitrymailk/volsu-schedule-2
dmitrymailk/volsu_comp