RossM's Stars
BorealisAI/flora-opt
This is the official repository for the paper "Flora: Low-Rank Adapters Are Secretly Gradient Compressors" in ICML 2024.
imbue-ai/carbs
Cost aware hyperparameter tuning algorithm
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
Stability-AI/sd3-ref
czczup/FAST
Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation
notAI-tech/NudeNet
Lightweight nudity detection
HigherOrderCO/Bend
A massively parallel, high-level programming language
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
mirage-project/mirage
Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA
llvm/torch-mlir
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
google-deepmind/penzai
A JAX research toolkit for building, editing, and visualizing neural networks.
aredden/torch-cublas-hgemm
PyTorch half precision gemm lib w/ fused optional bias + optional relu/gelu
mlir-rs/melior
The rustic MLIR bindings in Rust
ionelmc/python-lazy-object-proxy
A fast and thorough lazy object proxy.
GraphPKU/PiSSA
PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models(NeurIPS 2024 Spotlight)
facebookresearch/schedule_free
Schedule-Free Optimization in PyTorch
jiaweizzhao/GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
segmind/segmoe
guillaumeBellec/multitask
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
f-dangel/cockpit
Cockpit: A Practical Debugging Tool for Training Deep Neural Networks
Nerogar/OneTrainer
OneTrainer is a one-stop solution for all your stable diffusion training needs.
kohya-ss/sd-scripts
SHI-Labs/NATTEN
Neighborhood Attention Extension. Bringing attention to a neighborhood near you!
SHI-Labs/Neighborhood-Attention-Transformer
Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022
Algomancer/Bayesian-Flow-Networks
A simple implimentation of Bayesian Flow Networks (BFN)
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
dotnet/TorchSharp
A .NET library that provides access to the library that powers PyTorch.
SciSharp/NumSharp
High Performance Computation for N-D Tensors in .NET, similar API to NumPy.
SciSharp/Tensor.NET
A lightweight and high-performance tensor library which provides numpy-like operations but .NET style interfaces. It supports generic tensor, Linq, C# native slices and so on. (Qushui student project))