Pinned Repositories
AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
CursorCore
CursorCore: Assist Programming through Aligning Anything
CursorWeb
CursorWeb: Implement popular features of Cursor in the browser.
Deepseek-Coder-MoE
Sparse Deepseek-Coder.
Jamba.c
LightBinPack
A lightweight library for solving packing problems in LLM training
Mixtral.c
Typst-Coder
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
TechxGenus's Repositories
TechxGenus/CursorCore
CursorCore: Assist Programming through Aligning Anything
TechxGenus/Typst-Coder
TechxGenus/CursorWeb
CursorWeb: Implement popular features of Cursor in the browser.
TechxGenus/LightBinPack
A lightweight library for solving packing problems in LLM training
TechxGenus/LightDPO
TechxGenus/aider
aider is AI pair programming in your terminal
TechxGenus/continue
⏩ Continue is the leading open-source AI code assistant. Connect any model and any context to build custom autocomplete and chat experiences inside VS Code and JetBrains.
TechxGenus/DeepSeek-R1
TechxGenus/Janus
TechxGenus/RadixFlexAttention
TechxGenus/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
TechxGenus/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
TechxGenus/DeepSeek-V2-Utils
TechxGenus/DeepSeek-V3
TechxGenus/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
TechxGenus/flash-attention
Fast and memory-efficient exact attention
TechxGenus/Liger-Kernel
Efficient Triton Kernels for LLM Training
TechxGenus/llama.vscode
VS Code extension for local LLM-assisted code/text completion
TechxGenus/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
TechxGenus/numba
NumPy-aware dynamic Python compiler using LLVM
TechxGenus/Pattention
TechxGenus/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
TechxGenus/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
TechxGenus/ring-flash-attention
Ring attention implementation with flash attention
TechxGenus/sglang
SGLang is a fast serving framework for large language models and vision language models.
TechxGenus/TinyZero
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
TechxGenus/torchtitan
A PyTorch native library for large model training
TechxGenus/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
TechxGenus/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
TechxGenus/vscode
Visual Studio Code