Pinned Repositories
OneBit
The homepage of OneBit model quantization framework.
Werewolf
OS_Notes
AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
bert
TensorFlow code and pre-trained models for BERT
BitNet
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
chinese-poetry
The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。 🤪 😜 阿里招p6/p7 Python Golang | gaojunqi@outlook.com | 上海张江
ChromeController
Comprehensive wrapper and execution manager for the Chrome browser using the Chrome Debugging Protocol.
chromium
The official GitHub mirror of the Chromium source
corpus_process_script
chinese and english corpus process script, python, c++, java
xuyuzhuang11's Repositories
xuyuzhuang11/xuyuzhuang11.github.io
Yuzhuang XU's Personal Academic Homepage
xuyuzhuang11/StyleMT
xuyuzhuang11/OneBit
The homepage of OneBit model quantization framework.
xuyuzhuang11/hithesis
嗨!thesis!哈尔滨工业大学毕业论文LaTeX模板
xuyuzhuang11/lm-evaluation-harness
A framework for few-shot evaluation of language models.
xuyuzhuang11/AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
xuyuzhuang11/FasterTransformer
Transformer related optimization, including BERT, GPT
xuyuzhuang11/Werewolf
xuyuzhuang11/thuthesis
LaTeX Thesis Template for Tsinghua University
xuyuzhuang11/LMOps
General technology for enabling AI capabilities w/ LLMs and MLLMs
xuyuzhuang11/SGEMM_CUDA
Fast CUDA matrix multiplication from scratch
xuyuzhuang11/Megatron-LM
Ongoing research training transformer models at scale
xuyuzhuang11/LLaMA-Factory
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM3)
xuyuzhuang11/BitNet
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
xuyuzhuang11/instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
xuyuzhuang11/mlc-llm
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
xuyuzhuang11/OmniQuant
OmniQuant is a simple and powerful quantization technique for LLMs.
xuyuzhuang11/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
xuyuzhuang11/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
xuyuzhuang11/iterm2-with-oh-my-zsh
iTerm2 + Oh My Zsh 打造舒适终端体验
xuyuzhuang11/fairseq-pro
From Shuo Wang
xuyuzhuang11/speech-to-speech-translation
xuyuzhuang11/THU-PPT-Theme
清华主题PPT模板
xuyuzhuang11/sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
xuyuzhuang11/NVIDIA_SGEMM_PRACTICE
Step-by-step optimization of CUDA SGEMM
xuyuzhuang11/ChromeController
Comprehensive wrapper and execution manager for the Chrome browser using the Chrome Debugging Protocol.
xuyuzhuang11/chromium
The official GitHub mirror of the Chromium source
xuyuzhuang11/gcn
Implementation of Graph Convolutional Networks in TensorFlow
xuyuzhuang11/OS_Notes
xuyuzhuang11/bert
TensorFlow code and pre-trained models for BERT