Motsepe-Jr's Stars
joey00072/QuasiQ
QuasiQ is a really simple quantum computer
Motsepe-Jr/bafoGPT
A series of language models for the Zulu language
Motsepe-Jr/mini_RLHF
Minimalist PPO with machine feedback
mikex86/LibreCuda
Motsepe-Jr/gpt2
changchencc/dreamerv2_pytorch
asg017/sqlite-vec
A vector search SQLite extension that runs anywhere!
aimerou/awesome-ai-papers
A curated list of the most impressive AI papers
ejmejm/CLAgent
Continual learning agent
danijar/dreamer
Dream to Control: Learning Behaviors by Latent Imagination
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
karpathy/LLM101n
LLM101n: Let's build a Storyteller
unslothai/unsloth
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
allenai/dolma
Data and tools for generating and inspecting OLMo pre-training data.
axolotl-ai-cloud/axolotl
Go ahead and axolotl questions
karpathy/llm.c
LLM training in simple, raw C/CUDA
Chinese-Tiny-LLM/Chinese-Tiny-LLM
nus-apr/auto-code-rover
A project-structure-aware autonomous software engineer aiming for autonomous program improvement. Resolves 30.67% of tasks (pass@1) on SWE-bench lite and 38.40% of tasks (pass@1) on SWE-bench verified, with each task costing less than $0.70.
All-Hands-AI/OpenHands
🙌 OpenHands: Code Less, Make More
imoneoi/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
valkey-io/valkey
A flexible distributed key-value datastore that supports both caching and beyond-caching workloads.
joey00072/ohara
A collection of autoregressive model implementations
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
gleam-lang/gleam
⭐️ A friendly language for building type-safe, scalable systems!
turboderp/exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
Zjh-819/LLMDataHub
A quick guide to trending instruction-finetuning datasets
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
NVlabs/tiny-cuda-nn
Lightning fast C++/CUDA neural network framework
ray-project/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.