itsucks's Stars
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
google-research/google-research
Google Research
ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
microsoft/onnxruntime
ONNX Runtime: a cross-platform, high-performance ML inference and training accelerator
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
NVIDIA/FasterTransformer
Transformer-related optimizations, including BERT and GPT
baidu/lac
Baidu NLP: word segmentation, part-of-speech tagging, named entity recognition, and word importance
princeton-nlp/SimCSE
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
dongrixinyu/JioNLP
A Chinese NLP preprocessing and parsing toolkit: accurate, efficient, and easy to use. www.jionlp.com
bytedance/lightseq
LightSeq: A High-Performance Library for Sequence Processing and Generation
huawei-noah/Pretrained-Language-Model
Pretrained language models and related optimization techniques developed by Huawei Noah's Ark Lab.
onnx/onnx-tensorrt
ONNX-TensorRT: TensorRT backend for ONNX
FreedomIntelligence/LLMZoo
⚡LLM Zoo is a project that provides data, models, and evaluation benchmarks for large language models.⚡
FasterDecoding/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
IST-DASLab/gptq
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
flexflow/FlexFlow
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
mit-han-lab/smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
huawei-noah/bolt
Bolt is a deep learning library with high performance and heterogeneous flexibility.
lucidrains/RETRO-pytorch
Implementation of RETRO, DeepMind's retrieval-based attention network, in PyTorch
SafeAILab/EAGLE
Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)
kpe/bert-for-tf2
A Keras/TensorFlow 2.0 implementation of BERT, ALBERT, and adapter-BERT.
shinyke/Time-NLP
Temporal semantic recognition in Chinese sentences: identifying the times mentioned in an utterance by analyzing the Chinese text.
fastnlp/CPT
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation
xkzhangsan/xk-time
xk-time is a toolkit for time conversion, time calculation, time formatting, time parsing, calendars, cron expressions, and time NLP. Built on Java 8 (JSR-310), it is thread-safe and simple to use, offers more than 70 common date formatting templates, supports both the Java 8 time classes and Date, and is lightweight with no third-party dependencies.
NetEase-FuXi/EET
Easy and Efficient Transformer: a scalable inference solution for large NLP models
kssteven418/I-BERT
[ICML'21 Oral] I-BERT: Integer-only BERT Quantization
timoschick/dino
This repository contains the code for "Generating Datasets with Pretrained Language Models".
ptillet/torch-blocksparse
Block-sparse primitives for PyTorch
ModelTC/mqbench-paper
ncoop57/retrofit
A little Python library for retrofitting autoregressive decoder transformers to use DeepMind's RETRO framework: https://arxiv.org/pdf/2112.04426.pdf