BobaZooba's Stars
fastapi/fastapi
FastAPI framework, high performance, easy to learn, fast to code, ready for production
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
exo-explore/exo
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
huggingface/trl
Train transformer language models with reinforcement learning.
huggingface/text-generation-inference
Large Language Model Text Generation Inference
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
aiogram/aiogram
aiogram is a modern and fully asynchronous framework for Telegram Bot API written in Python using asyncio
AutoGPTQ/AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
argilla-io/argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
airtai/faststream
FastStream is a powerful and easy-to-use Python framework for building asynchronous services interacting with event streams such as Apache Kafka, RabbitMQ, NATS and Redis.
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
argilla-io/distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
ktorio/ktor-samples
Sample projects for Ktor
dstackai/dstack
dstack is an open-source alternative to Kubernetes, designed to simplify development, training, and deployment of AI across any cloud or on-prem. It supports NVIDIA, AMD, and TPU.
taskiq-python/taskiq
Distributed task queue with full async support
saveourtool/diktat
Strict coding standard for Kotlin and a custom set of rules for detecting code smells, code style issues and bugs
iam-abbas/FastAPI-Production-Boilerplate
A scalable and production ready boilerplate for FastAPI
soupslurpr/Transcribro
Private and on-device speech recognition keyboard and service for Android.
BobaZooba/xllm
🦖 X—LLM: Cutting Edge & Easy LLM Finetuning
EulerSearch/embedding_studio
Embedding Studio is a framework which allows you transform your Vector Database into a feature-rich Search Engine.
argilla-io/notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach
BobaZooba/DeepNLP
Курс по глубокому обучению в обработке естественных языков для магистров компьютерной лингвистики Высшей Школы Экономики
qdrant/java-client
Official Java client for Qdrant
KompleteAI/xllm
🦖 X—LLM: Simple & Cutting Edge LLM Finetuning
BobaZooba/xllm-demo
Demo project using XLLM
BobaZooba/wgpt
This repository features an example of how to utilize the xllm library. Included is a solution for a common type of assessment given to LLM engineers, who typically earn between $120,000 to $140,000 annually
BobaZooba/shurale
Conversation AI model for open domain dialogs
KompleteAI/shurale
Code for training