Pinned Repositories
AM207
cme213_material_2013
CME 213 Class Material
cryptocurrency-derivatives-pricing-and-delta-neutral-volatility-trading
This project is to download and analyze cryptocurrency option data available on Deribit via a public API. Data are collected on an Ubuntu remote server with the implementation of Python3, Shell and SQLite and are then analyzed locally with Python3.
DL_packt
nbdev-tutorial
nbdev tutorial
Python-Financial-Tools
Providing financial analysis tools to the Python open-source community.
ssg-dataset
Open reproducible dataset on static site generators (SSG) popularity.
triton-rs
vector-search-class-notes
Class notes for the course "Long Term Memory in AI - Vector Search and Databases" COS 495 @ Princeton Fall 2023
jeromeku's Repositories
jeromeku/triton-rs
jeromeku/accelerated-scan
Accelerated First Order Parallel Associative Scan
jeromeku/ao
torchao: PyTorch Architecture Optimization (AO). A repository to host AO techniques and performant kernels that work with PyTorch.
jeromeku/api-design
LivingSocial API Design Guide
jeromeku/AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
jeromeku/candle
Minimalist ML framework for Rust
jeromeku/colab-connect
Connect to Google Colab VM from your local VSCode
jeromeku/colab-test
jeromeku/cutlass
CUDA Templates for Linear Algebra Subroutines
jeromeku/CutlassProgramming
jeromeku/EVT_AE
Artifacts of EVT ASPLOS'24
jeromeku/extension_builder
jeromeku/FlagAttention
A collection of memory efficient attention operators implemented in the Triton language.
jeromeku/fsdp_qlora
Training LLMs with QLoRA + FSDP
jeromeku/GaLore
jeromeku/GEMM_MMA
Optimize GEMM with tensorcore step by step
jeromeku/haystack
:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
jeromeku/LLM-Training-Puzzles
What would you do with 1000 H100s...
jeromeku/neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
jeromeku/packing-cat
jeromeku/punica
Serving multiple LoRA finetuned LLM as one
jeromeku/pybind_example
jeromeku/rust-telemetry-workshop
A workshop that introduces participants to a comprehensive toolkit to detect, troubleshoot and resolve issues with Rust applications.
jeromeku/stable-fast
Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
jeromeku/toydb
Distributed SQL database in Rust, written as a learning project
jeromeku/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
jeromeku/trident
A performance library for machine learning applications.
jeromeku/triton
Development repository for the Triton language and compiler
jeromeku/triton-aot
jeromeku/unsloth
5X faster 60% less memory QLoRA finetuning