Pinned Repositories
ao
The torchao repository contains api's and workflows for quantization and pruning gpu models.
benchmark
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
builder
Continuous builder and binary build scripts for pytorch
CS513-Final-Project
Final Project for CS 513 Data Curation
cuda-training-series
Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)
cudnn-frontend
cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it
driss_torch
Cuda extensions for PyTorch
helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110).
simple_cuda
Learnings + Exercises from the PMPP book!
transformer_nuggets
A place to store reusable transformer components of my own creation or found on the interwebs
drisspg's Repositories
drisspg/transformer_nuggets
A place to store reusable transformer components of my own creation or found on the interwebs
drisspg/driss_torch
Cuda extensions for PyTorch
drisspg/helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110).
drisspg/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
drisspg/simple_cuda
Learnings + Exercises from the PMPP book!
drisspg/ao
The torchao repository contains api's and workflows for quantization and pruning gpu models.
drisspg/benchmark
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
drisspg/builder
Continuous builder and binary build scripts for pytorch
drisspg/cuda-training-series
Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)
drisspg/cudnn-frontend
cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it
drisspg/cutlass
CUDA Templates for Linear Algebra Subroutines
drisspg/drisspg
drisspg/feature_preview_1.12_core
drisspg/funnel
A small classifier and server
drisspg/gensim
Topic Modelling for Humans
drisspg/glow
Compiler for Neural Network hardware accelerators
drisspg/extension-cpp
C++ extensions in PyTorch
drisspg/flash-attention
Fast and memory-efficient exact attention
drisspg/lit-gpt
Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
drisspg/lit-llama
Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code
drisspg/MacOSPythonDevSetup
This is my current playlist and order of operations for setting up new Mac OS computer dev environment.
drisspg/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
drisspg/thinc
🔮 A refreshing functional take on deep learning, compatible with your favorite libraries
drisspg/tlparse
TORCH_LOGS parser for PT2
drisspg/torchtitan
A native PyTorch Library for large model training
drisspg/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
drisspg/triton
Development repository for the Triton language and compiler
drisspg/tutorials
PyTorch tutorials.
drisspg/ufmt
Safe, atomic formatting with black and µsort
drisspg/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.