scihobbit

scihobbit's Stars

meta-llama/llama-cookbook
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services
Language:Jupyter Notebook16.2k2.3k
johnowhitaker/aiaiart
Course content and resources for the AIAIART course.
Language:Jupyter Notebook57047
hhaji/Deep-Learning
Course: Deep Learning
Language:Jupyter Notebook188105
AnswerDotAI/fsdp_qlora
Training LLMs with QLoRA + FSDP
Language:Jupyter Notebook1.4k188
lucidrains/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
Language:Python11.2k1.1k
kuutsav/information-retrieval
Neural information retrieval / Semantic search / Bi-encoders
Language:Jupyter Notebook16921
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
Language:Python20k1.7k
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Language:Jupyter Notebook40k5.3k
mlflow/mlflow
Open source platform for the machine learning lifecycle
Language:Python19.5k4.3k
pliang279/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
6.3k868
sebastianruder/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Language:Python22.8k3.6k
chiphuyen/dmls-book
Summaries and resources for Designing Machine Learning Systems book (Chip Huyen, O'Reilly 2022)
2.5k377
modular/mojo
The Mojo Programming Language
Language:Mojo23.7k2.6k
outerbounds/dsbook
Code samples for the Effective Data Science Infrastructure book
Language:Python11330
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python15.5k1.5k
kuprel/min-dalle
min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch
Language:Python3.5k255
borisdayma/dalle-mini
DALL·E Mini - Generate images from a text prompt
Language:Python14.8k1.2k
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Language:Python9.1k648
JonasGeiping/cramming
Cramming the training of a (BERT-type) language model into limited compute.
Language:Python1.3k100
run-llama/llama_index
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Language:Python39k5.6k
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Language:Python37.2k3.3k
Codium-ai/AlphaCodium
Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""
Language:Python3.7k280
abacaj/fine-tune-mistral
Fine-tune mistral-7B on 3090s, a100s, h100s
Language:Python70563
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Language:Python5.8k528
unslothai/unsloth
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
Language:Python30.2k2k
isafulf/inbox_cleaner
A python script to help manage a Gmail inbox by filtering out promotional emails using GPT-3 or GPT-4.
Language:Python42923
codecrafters-io/build-your-own-x
Master programming by recreating your favorite technologies from scratch.
Language:Markdown336k31.1k
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Language:Python8.2k504
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Language:Python11.6k1.2k
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Language:Python6k518