Eugene29

Eugene29's Stars

microsoft/vscode
Visual Studio Code
Language:TypeScript166k 3.3k 188k29.9k
commaai/openpilot
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 275+ supported cars.
Language:Python51.3k 1.3k 2.8k9.3k
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python36.1k 348 2.9k4.2k
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python14.9k 123 1.2k1.4k
arogozhnikov/einops
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
Language:Python8.6k 69 186356
Project-MONAI/MONAI
AI Toolkit for Healthcare Imaging
Language:Python6k 89 3.1k1.1k
OpenDriveLab/End-to-end-Autonomous-Driving
[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving
2.6k 61 1248
facebookresearch/fvcore
Collection of common code that's shared among different research projects in FAIR computer vision team.
Language:Python2.1k 40 82228
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
Language:Cuda1.9k 32 3290
fastmachinelearning/hls4ml
Machine learning on FPGAs using HLS
Language:C++1.3k 56 425423
ThereminGoat/switch-scores
PDF Repository of switch score sheets.
1.1k 70 1165
huggingface/optimum-quanto
A pytorch quantization backend for optimum
Language:Python859 8 14466
trevor-vincent/awesome-high-performance-computing
A curated list of awesome high performance computing resources
720 24 674
tspeterkim/flash-attention-minimal
Flash Attention in ~100 lines of CUDA (forward pass only)
Language:Cuda671 4 658
chaitjo/efficient-gnns
Code and resources on scalable and efficient Graph Neural Networks
Language:Python527 20 264
feifeibear/long-context-attention
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
Language:Python393 5 2129
huggingface/picotron
Minimalistic 4D-parallelism distributed training framework for education purpose
Language:Python28519
argonne-lcf/ai-science-training-series
Language:Jupyter Notebook212 30 3633
uuudown/Tartan
Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite
Language:Cuda63 3 312
argonne-lcf/ALCF_Hands_on_HPC_Workshop
The ALCF hosts a regular simulation, data, and learning workshop to help users scale their applications. This repository contains the examples used in the workshop.
Language:HTML59 21 346
argonne-lcf/user-guides
ALCF Systems User Documentation
Language:HTML23 40 7130
saforem2/ezpz
Train across all your devices, ezpz 🍋
Language:Python13 3 43
argonne-lcf/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Language:Python10 2 1112
argonne-lcf/ml_communications
ML communications benchmark
Language:Python52
mindee/Problem-of-BatchNorm
Playground repository to highlight the problem of BatchNorm layers for an blog article
Language:Python5 2 00
noahgift/ssh-tips-tricks
SSH Tips and Tricks
5 3 16
saforem2/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Language:Python5 0 00
Drug-Repurposing-GNN/FlyKD
FlyKD, a novel Graph Knowledge Distillation on the Fly with Curriculum Learning for link prediction tasks.
Language:Python3 0 00
saforem2/wordplay
Playing with words
Language:Python3 2 01
Eugene29/PointNET_PMT
PMT Event reconstruction using PointNET
Language:Python2 1 00