Eugene29's Stars
microsoft/vscode
Visual Studio Code
commaai/openpilot
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 275+ supported cars.
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
arogozhnikov/einops
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
Project-MONAI/MONAI
AI Toolkit for Healthcare Imaging
OpenDriveLab/End-to-end-Autonomous-Driving
[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving
facebookresearch/fvcore
Collection of common code that's shared among different research projects in FAIR computer vision team.
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
fastmachinelearning/hls4ml
Machine learning on FPGAs using HLS
ThereminGoat/switch-scores
PDF Repository of switch score sheets.
huggingface/optimum-quanto
A pytorch quantization backend for optimum
trevor-vincent/awesome-high-performance-computing
A curated list of awesome high performance computing resources
tspeterkim/flash-attention-minimal
Flash Attention in ~100 lines of CUDA (forward pass only)
chaitjo/efficient-gnns
Code and resources on scalable and efficient Graph Neural Networks
feifeibear/long-context-attention
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
huggingface/picotron
Minimalistic 4D-parallelism distributed training framework for education purpose
argonne-lcf/ai-science-training-series
uuudown/Tartan
Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite
argonne-lcf/ALCF_Hands_on_HPC_Workshop
The ALCF hosts a regular simulation, data, and learning workshop to help users scale their applications. This repository contains the examples used in the workshop.
argonne-lcf/user-guides
ALCF Systems User Documentation
saforem2/ezpz
Train across all your devices, ezpz 🍋
argonne-lcf/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
argonne-lcf/ml_communications
ML communications benchmark
mindee/Problem-of-BatchNorm
Playground repository to highlight the problem of BatchNorm layers for an blog article
noahgift/ssh-tips-tricks
SSH Tips and Tricks
saforem2/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Drug-Repurposing-GNN/FlyKD
FlyKD, a novel Graph Knowledge Distillation on the Fly with Curriculum Learning for link prediction tasks.
saforem2/wordplay
Playing with words
Eugene29/PointNET_PMT
PMT Event reconstruction using PointNET