zafstojano

Machine Learning Engineer

zafstojano's Stars

nvbn/thefuck
Magnificent app which corrects your previous console command.
Language:Python85.1k 832 7363.4k
junegunn/fzf
:cherry_blossom: A command-line fuzzy finder
Language:Go64.6k 394 2.8k2.4k
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language:Python55k 452 1325.7k
karpathy/LLM101n
LLM101n: Let's build a Storyteller
29.4k 2.3k 01.6k
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language:Python26.7k 220 2513k
astral-sh/uv
An extremely fast Python package and project manager, written in Rust.
Language:Rust22.9k 49 3.5k664
ItzCrazyKns/Perplexica
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
Language:TypeScript14k 101 2761.3k
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Language:Jupyter Notebook13.4k 93 161.1k
sympy/sympy
A computer algebra system written in pure Python
Language:Python12.9k 291 13.3k4.4k
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++8.4k 92 1.9k949
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Language:Python6.9k 123 434998
facebookresearch/fairscale
PyTorch extensions for high performance and large scale training.
Language:Python3.2k 46 359279
SeldonIO/alibi-detect
Algorithms for outlier, adversarial and drift detection
Language:Python2.2k 40 357223
stanford-crfm/helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in HEIM (https://arxiv.org/abs/2311.04287) and vision-language models in VHELM (https://arxiv.org/abs/2410.07112).
Language:Python1.9k 34 1.1k244
kha-white/manga-ocr
Optical character recognition for Japanese text, with the main focus being Japanese manga
Language:Python1.7k 18 6588
igrek51/wat
Deep inspection of Python objects
Language:Python1.5k 7 1422
mzucker/page_dewarp
Text page dewarping using a "cubic sheet" model
Language:Python1.4k 42 25239
huggingface/nanotron
Minimalistic large language model 3D-parallelism training
Language:Python1.2k 41 76111
EleutherAI/math-lm
Language:Python1k 15 4478
lucidrains/alphafold3-pytorch
Implementation of Alphafold 3 in Pytorch
Language:Python1k 45 47118
likejazz/llama3.np
llama3.np is a pure NumPy implementation for Llama 3 model.
Language:Python963 13 474
openai/automated-interpretability
Language:Python953 16 20113
rmislam/PythonSIFT
A clean and concise Python implementation of SIFT (Scale-Invariant Feature Transform)
Language:Python921 9 33255
huggingface/lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
Language:Python726 29 13682
EleutherAI/cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
Language:Python689 13 1435
Aleph-Alpha/magma
MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multilingual models from Aleph Alpha check out our website https://app.aleph-alpha.com
Language:Python477 11 3455
AlignmentResearch/tuned-lens
Tools for understanding how transformer predictions are built layer-by-layer
Language:Python415 7 5344
openai/sparse_autoencoder
Language:Python290 11 1332
Aleph-Alpha/intelligence-layer-sdk
a unified framework for leveraging LLMs
Language:Python524
zafstojano/wordgamebench
Evaluating language models on word puzzle games
Language:Python80

zafstojano

zafstojano's Stars

nvbn/thefuck

junegunn/fzf

labmlai/annotated_deep_learning_paper_implementations

karpathy/LLM101n

meta-llama/llama3

astral-sh/uv

ItzCrazyKns/Perplexica

naklecha/llama3-from-scratch

sympy/sympy

NVIDIA/TensorRT-LLM

EleutherAI/gpt-neox

facebookresearch/fairscale

SeldonIO/alibi-detect

stanford-crfm/helm

kha-white/manga-ocr

igrek51/wat

mzucker/page_dewarp

huggingface/nanotron

EleutherAI/math-lm

lucidrains/alphafold3-pytorch

likejazz/llama3.np

openai/automated-interpretability

rmislam/PythonSIFT

huggingface/lighteval

EleutherAI/cookbook

Aleph-Alpha/magma

AlignmentResearch/tuned-lens

openai/sparse_autoencoder

Aleph-Alpha/intelligence-layer-sdk

zafstojano/wordgamebench