Pinned Repositories
AutoSmoothQuant
An easy-to-use package for implementing SmoothQuant for LLMs
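As a rough illustration of the idea behind SmoothQuant (not this package's API), the technique migrates activation outliers into the weights with a per-channel scale so that activations become easier to quantize while the layer output is unchanged. A minimal NumPy sketch; `alpha` and the tensor shapes here are illustrative assumptions:

```python
import numpy as np

def smooth(X, W, alpha=0.5):
    # Per-input-channel smoothing scale (SmoothQuant idea):
    # s_j = max|X_j|^alpha / max|W_j|^(1 - alpha)
    act_max = np.abs(X).max(axis=0)          # (C,) activation range per channel
    w_max = np.abs(W).max(axis=1)            # (C,) weight range per channel
    s = act_max**alpha / w_max**(1 - alpha)
    X_s = X / s                              # smoothed activations
    W_s = W * s[:, None]                     # weights absorb the scale
    return X_s, W_s, s

rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))                  # (tokens, channels)
W = rng.normal(size=(8, 3))                  # (channels, out_features)
X_s, W_s, s = smooth(X, W)
# The smoothing is mathematically lossless: X @ W == X_s @ W_s.
assert np.allclose(X @ W, X_s @ W_s)
```

The equivalence holds because `X_s @ W_s = X diag(1/s) diag(s) W = X W`; quantization is then applied to the smoothed tensors.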
lightseq
LightSeq: A High-Performance Library for Sequence Processing and Generation
academicpages.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
books_and_wiki_en_clean_format_and_shard
QQQ
QQQ is a hardware-optimized W4A8 (4-bit weight, 8-bit activation) quantization solution for LLMs.
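To make the W4A8 label concrete (a generic sketch of symmetric uniform quantization, not QQQ's actual kernels or packing format), weights are quantized to signed INT4 and activations to signed INT8, the matmul runs on integers, and the two scales are applied afterward:

```python
import numpy as np

def quantize_sym(t, bits, axis=None):
    # Symmetric uniform quantization to signed integers with 2^(bits-1)-1 levels.
    qmax = 2 ** (bits - 1) - 1
    scale = np.abs(t).max(axis=axis, keepdims=axis is not None) / qmax
    q = np.clip(np.round(t / scale), -qmax - 1, qmax)
    return q.astype(np.int32), scale

rng = np.random.default_rng(1)
W = rng.normal(size=(8, 4))                      # (in_features, out_features)
X = rng.normal(size=(2, 8))                      # (tokens, in_features)
Wq, sw = quantize_sym(W, bits=4, axis=0)         # per-output-channel INT4 weights
Xq, sx = quantize_sym(X, bits=8)                 # per-tensor INT8 activations
# Integer matmul, then dequantize with the activation and weight scales.
Y = (Xq @ Wq) * sx * sw
# INT4 weights are lossy, so Y only approximates X @ W.
rel_err = np.linalg.norm(Y - X @ W) / np.linalg.norm(X @ W)
assert rel_err < 0.5
```

Real W4A8 kernels pack two INT4 values per byte and fuse dequantization into the GEMM epilogue; this sketch only shows the arithmetic.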
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
marlin
FP16xINT4 LLM inference kernel that achieves near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens.
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
HandH1998's Repositories
HandH1998/QQQ
QQQ is a hardware-optimized W4A8 (4-bit weight, 8-bit activation) quantization solution for LLMs.
HandH1998/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
HandH1998/academicpages.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
HandH1998/books_and_wiki_en_clean_format_and_shard
HandH1998/BUAA_Course
HandH1998/BUAA_Course_Sharing
Beihang University (BUAA) course assignment and materials sharing project — classified material must not go online, and what goes online must not be classified!!!
HandH1998/manifold_distillation
HandH1998/mct_former
HandH1998/pregenerate_bert_train_corpus
HandH1998/carInsurancePred
HandH1998/easy-scrape
HandH1998/Entity-Relation-Extraction
Entity and Relation Extraction Based on TensorFlow and BERT. A pipeline-style entity and relation extraction system; solution for the information extraction task of the 2019 Language and Intelligence Challenge (Schema-based Knowledge Extraction, SKE 2019).
HandH1998/HandH1998.github.io
Personal homepage
HandH1998/Java_learn
HandH1998/JS_learn
HandH1998/lightseq
LightSeq: A High-Performance Library for Sequence Processing and Generation
HandH1998/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
HandH1998/marlin
FP16xINT4 LLM inference kernel that achieves near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens.
HandH1998/matplotlib
HandH1998/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
HandH1998/ML_practice
HandH1998/net2net
HandH1998/NLP-Tutorials
Simple implementations of NLP models. Tutorials are written in Chinese on my website https://mofanpy.com
HandH1998/NN-CUDA-Example
Several simple examples for popular neural network toolkits calling custom CUDA operators.
HandH1998/smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
HandH1998/soln-ml
A research framework for fast prototyping of AutoML algorithms.
HandH1998/test
just for test
HandH1998/tmp_bert_mlkd
HandH1998/zh-NER-TF
A very simple BiLSTM-CRF model for Chinese Named Entity Recognition (TensorFlow)