scihobbit's Stars
meta-llama/llama-cookbook
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services
johnowhitaker/aiaiart
Course content and resources for the AIAIART course.
hhaji/Deep-Learning
Course: Deep Learning
AnswerDotAI/fsdp_qlora
Training LLMs with QLoRA + FSDP
lucidrains/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
kuutsav/information-retrieval
Neural information retrieval / Semantic search / Bi-encoders
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
mlflow/mlflow
Open source platform for the machine learning lifecycle
pliang279/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
sebastianruder/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
chiphuyen/dmls-book
Summaries and resources for Designing Machine Learning Systems book (Chip Huyen, O'Reilly 2022)
modular/mojo
The Mojo Programming Language
outerbounds/dsbook
Code samples for the Effective Data Science Infrastructure book
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
kuprel/min-dalle
min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch
borisdayma/dalle-mini
DALL·E Mini - Generate images from a text prompt
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
JonasGeiping/cramming
Cramming the training of a (BERT-type) language model into limited compute.
run-llama/llama_index
LlamaIndex is the leading framework for building LLM-powered agents over your data.
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Codium-ai/AlphaCodium
Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""
abacaj/fine-tune-mistral
Fine-tune mistral-7B on 3090s, a100s, h100s
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
unslothai/unsloth
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
isafulf/inbox_cleaner
A python script to help manage a Gmail inbox by filtering out promotional emails using GPT-3 or GPT-4.
codecrafters-io/build-your-own-x
Master programming by recreating your favorite technologies from scratch.
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.