zafstojano's Stars
nvbn/thefuck
Magnificent app which corrects your previous console command.
junegunn/fzf
:cherry_blossom: A command-line fuzzy finder
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
karpathy/LLM101n
LLM101n: Let's build a Storyteller
meta-llama/llama3
The official Meta Llama 3 GitHub site
astral-sh/uv
An extremely fast Python package and project manager, written in Rust.
ItzCrazyKns/Perplexica
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
sympy/sympy
A computer algebra system written in pure Python
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
facebookresearch/fairscale
PyTorch extensions for high performance and large scale training.
SeldonIO/alibi-detect
Algorithms for outlier, adversarial and drift detection
stanford-crfm/helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in HEIM (https://arxiv.org/abs/2311.04287) and vision-language models in VHELM (https://arxiv.org/abs/2410.07112).
kha-white/manga-ocr
Optical character recognition for Japanese text, with the main focus being Japanese manga
igrek51/wat
Deep inspection of Python objects
mzucker/page_dewarp
Text page dewarping using a "cubic sheet" model
huggingface/nanotron
Minimalistic large language model 3D-parallelism training
EleutherAI/math-lm
lucidrains/alphafold3-pytorch
Implementation of Alphafold 3 in Pytorch
likejazz/llama3.np
llama3.np is a pure NumPy implementation for Llama 3 model.
openai/automated-interpretability
rmislam/PythonSIFT
A clean and concise Python implementation of SIFT (Scale-Invariant Feature Transform)
huggingface/lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
EleutherAI/cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
Aleph-Alpha/magma
MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multilingual models from Aleph Alpha check out our website https://app.aleph-alpha.com
AlignmentResearch/tuned-lens
Tools for understanding how transformer predictions are built layer-by-layer
openai/sparse_autoencoder
Aleph-Alpha/intelligence-layer-sdk
a unified framework for leveraging LLMs
zafstojano/wordgamebench
Evaluating language models on word puzzle games