jingairpi's Stars
KellerJordan/modded-nanogpt
NanoGPT (124M) quality in 2.67B tokens
AI-Hypercomputer/gpu-recipes
Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.
nicolaka/netshoot
a Docker + Kubernetes network trouble-shooting swiss-army container
jax-ml/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
SzymonOzog/GPU_Programming
apache/yunikorn-core
Apache YuniKorn Core
openucx/ucx
Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
magicproduct/hash-hop
Long context evaluation for large language models
OpenAutoCoder/Agentless
Agentless🐱: an agentless approach to automatically solve software development problems
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
halanelson/Essential-Math-For-AI
This repository contains the supplementary material associated with my book: Essential Math for AI published by O'Reilly Media
gpu-mode/lectures
Material for gpu-mode lectures
imbue-ai/cluster-health
amjadmajid/BabyTorch
BabyTorch is a minimalist deep-learning framework with a similar API to PyTorch. This minimalist design encourages learners explore and understand the underlying algorithms and mechanics of deep learning processes. It is design such that when learners are ready to switch to PyTorch they only need to remove the word `baby`.
codecrafters-io/build-your-own-x
Master programming by recreating your favorite technologies from scratch.
google/neper
neper is a Linux networking performance tool.
bbycroft/llm-viz
3D Visualization of an GPT-style LLM
microsoft/AI-For-Beginners
12 Weeks, 24 Lessons, AI for All!
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
adam-maj/deep-learning
A deep-dive on the entire history of deep-learning
adam-maj/tiny-gpu
A minimal GPU design in Verilog to learn how GPUs work from the ground up
ray-project/kuberay
A toolkit to run Ray applications on Kubernetes
minio/minio
MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.
3b1b/videos
Code for the manim-generated scenes used in 3blue1brown videos
ibm-granite/granite-code-models
Granite Code Models: A Family of Open Foundation Models for Code Intelligence
Mozilla-Ocho/llamafile
Distribute and run LLMs with a single file.
mit-han-lab/qserve
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
rauhul/ece408
Applied Parallel Programming UIUC FA 2017
facebookincubator/dynolog
Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system like the linux kernel, CPU, disks, Intel PT, GPUs etc. Dynolog also integrates with pytorch and can trigger traces for distributed training applications.
meta-llama/llama3
The official Meta Llama 3 GitHub site