Pinned Repositories
Ampere_Persistent_Cache_Eval
AX6S-unlock
MP-SPDZ
Versatile framework for multi-party computation
overcoming-catastrophic
tvm-models-baseline
YangWang92.github.io
YangWang92's Repositories
YangWang92/abliterator
Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens
YangWang92/Accel-NASBench
Accel-NASBench: A Surrogate Benchmark for Accelerator-Aware NAS
YangWang92/AI-Youtube-Shorts-Generator
A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
YangWang92/anole
Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation
YangWang92/AutoFP8
YangWang92/chameleon-meta
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
YangWang92/ConvSSM
YangWang92/Coyote
Framework providing operating system abstractions and a range of shared networking (RDMA, TCP/IP) and memory services to common modern heterogeneous platforms.
YangWang92/enso
Ensō is a high-performance streaming interface for NIC-application communication.
YangWang92/fast-hadamard-transform
Fast Hadamard transform in CUDA, with a PyTorch interface
YangWang92/fast_pytorch_kmeans
This is a pytorch implementation of k-means clustering algorithm
YangWang92/hqq
Official implementation of Half-Quadratic Quantization (HQQ)
YangWang92/kotomamba
Mamba training library developed by kotoba technologies
YangWang92/KVQuant
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
YangWang92/Latte
Latte: Latent Diffusion Transformer for Video Generation.
YangWang92/LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
YangWang92/llm-reasoners
A library for advanced large language model reasoning
YangWang92/LongMamba
YangWang92/Mamba_SSM
A simple implementation of [Mamba: Linear-Time Sequence Modeling with Selective State Spaces](https://arxiv.org/abs/2312.00752)
YangWang92/mojo
The Mojo Programming Language
YangWang92/quanto
A pytorch Quantization Toolkit
YangWang92/quip-sharp
YangWang92/sae-auto-interp
YangWang92/SpinQuant
Code repo for the paper "SpinQuant LLM quantization with learned rotations"
YangWang92/TiledCUDA
TiledCUDA is a highly efficient kernel template library designed to elevate CUDA C’s level of abstraction for processing tiles.
YangWang92/TrustLLM
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models
YangWang92/VideoLingo
Netflix级字幕切割翻译、精确对齐和个性化配音,一键全自动视频搬运
YangWang92/Vim
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
YangWang92/VMamba
VMamba: Visual State Space Models
YangWang92/xmir-patcher
Firmware patcher for Xiaomi routers