hwang595
Incoming Assistant Professor of CS@Rutgers; Core Member of LLM360.ai; Doing something adventurous now.
Carnegie Mellon UniversityPittsburgh, PA
hwang595's Stars
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
prakhar1989/awesome-courses
:books: List of awesome university courses for learning Computer Science!
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
numpy/numpy
The fundamental package for scientific computing with Python.
karpathy/llama2.c
Inference Llama 2 in one file of pure C
openai/gpt-3
GPT-3: Language Models are Few-Shot Learners
academicpages/academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
jadore801120/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
FedML-AI/FedML
FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale.
NVIDIA/nccl
Optimized primitives for collective multi-GPU communication
pytorch/torchtitan
A PyTorch native library for large model training
soumith/convnet-benchmarks
Easy benchmarking of all publicly accessible implementations of convnets
Trusted-AI/AIX360
Interpretability and explainability of data and machine learning models
dmlc/ps-lite
A lightweight parameter server interface
openai/blocksparse
Efficient GPU kernels for block-sparse matrix multiplication and convolution
mpi4py/mpi4py
Python bindings for MPI
tushar-semwal/awesome-federated-computing
:books: :eyeglasses: A collection of research papers, codes, tutorials and blogs on Federated Computing/Learning.
yhyu13/AlphaGOZero-python-tensorflow
Congratulation to DeepMind! This is a reengineering implementation (on behalf of many other git repo in /support/) of DeepMind's Oct19th publication: [Mastering the Game of Go without Human Knowledge]. The supervised learning approach is more practical for individuals. (This repository has single purpose of education only)
leopard-ai/betty
Betty: an automatic differentiation library for generalized meta-learning and multilevel optimization
cheind/py-lapsolver
Fast linear assignment problem (LAP) solvers for Python based on c-extensions
FALCONN-LIB/FFHT
Fast Fast Hadamard Transform
daleroberts/hdmedians
High-dimensional medians (medoid, geometric median, etc.). Fast implementations in Python.
cnellington/Contextualized
An SKLearn-style toolbox for estimating and analyzing models, distributions, and functions with context-specific parameters.
tanyuqian/redco
NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference
stsievert/pytorch_ps_mpi
PyTorch parameter server with MPI
Hacker0912/SysML-gpu