Pinned Repositories
ABR-gym
MAPPO
mini-s
pensieve
Pensieve-PPO
The simplest implementation of Pensieve (SIGCOMM '17) via state-of-the-art RL algorithms, including PPO, DQN, and SAC
QR-DNN
This repository implements the quantization and reconstruction algorithm (QR-DNN) from the FPL 2018 paper "RNA: An Accurate Residual Network Accelerator for Quantized and Reconstructed Deep Neural Networks"
recurrent_maskable
rtp
RTP: Rethinking Tensor Parallelism with Memory Deduplication
starlink-trace-tracker
unsloth
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
wdlctc's Repositories
wdlctc/mini-s
wdlctc/starlink-trace-tracker
wdlctc/recurrent_maskable
wdlctc/rtp
RTP: Rethinking Tensor Parallelism with Memory Deduplication
wdlctc/Pensieve-PPO
The simplest implementation of Pensieve (SIGCOMM '17) via state-of-the-art RL algorithms, including PPO, DQN, and SAC
wdlctc/unsloth
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
wdlctc/coconet
wdlctc/colab
wdlctc/efficient_cross_entropy
wdlctc/FlexFlow
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
wdlctc/LASP
Linear Attention Sequence Parallelism (LASP)
wdlctc/LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
wdlctc/nccl_test
wdlctc/neuraloperator
Learning in infinite dimensions with neural operators.
wdlctc/Open-Sora-old
Building your own video generation model like OpenAI's Sora
wdlctc/OpenDiT
OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference
wdlctc/peft_minis
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
wdlctc/PyTAG
wdlctc/regicide
wdlctc/sevenvice
wdlctc/SIMPLE
Selfplay In MultiPlayer Environments
wdlctc/Speculative-Sampling
Implementation of Speculative Sampling as described in "Accelerating Large Language Model Decoding with Speculative Sampling" by DeepMind
wdlctc/streaming-llm
Efficient Streaming Language Models with Attention Sinks
wdlctc/tensorly
TensorLy: Tensor Learning in Python.
wdlctc/tltorch
TensorLy-Torch: Deep Tensor Learning with TensorLy and PyTorch
wdlctc/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
wdlctc/triton
Development repository for the Triton language and compiler
wdlctc/VisionLLaMA
wdlctc/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
wdlctc/wdlctc.github.io