karmi's Stars
microsoft/generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
tinygrad/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
exelban/stats
macOS system monitor in your menu bar
ml-explore/mlx
MLX: An array framework for Apple silicon
karpathy/nn-zero-to-hero
Neural Networks: Zero to Hero
ludwig-ai/ludwig
Low-code framework for building custom LLMs, neural networks, and other AI models
OpenAccess-AI-Collective/axolotl
Go ahead and axolotl questions
udlbook/udlbook
Understanding Deep Learning - Simon J.D. Prince
ml-explore/mlx-examples
Examples in the MLX framework
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
cbh123/narrator
David Attenborough narrates your life
Arize-ai/phoenix
AI Observability & Evaluation
Instruction-Tuning-with-GPT-4/GPT-4-LLM
Instruction Tuning with GPT-4
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
jaymody/picoGPT
An unnecessarily tiny implementation of GPT-2 in NumPy.
huggingface/text-embeddings-inference
A blazing fast inference solution for text embeddings models
karpathy/makemore
An autoregressive character-level language model for making more things
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
wandb/examples
Example deep learning projects that use wandb's features.
taoyds/spider
scripts and baselines for Spider: Yale complex and cross-domain semantic parsing and text-to-SQL challenge
jxmorris12/vec2text
utilities for decoding deep representations (like sentence embeddings) back to text
ClickHouse/ClickBench
ClickBench: a Benchmark For Analytical Databases
sdadas/polish-nlp-resources
Pre-trained models and language resources for Natural Language Processing in Polish
nelson-liu/lost-in-the-middle
Code and data for "Lost in the Middle: How Language Models Use Long Contexts"
replicate/latent-consistency-model
Run Latent Consistency Models on your Mac
PAIR-code/scatter-gl
Interactive 3D / 2D webgl-accelerated scatter plot point renderer
mulab-mir/song-describer-dataset
The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.
nlp-uoregon/Okapi
Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
glami/glami-1m
The largest multilingual image-text classification dataset. It contains fashion products.
cbuescher/rankEvalDemo
Some scripts and code snippets for ranking evaluation