tsingcoo's Stars
chroma-core/chroma
The AI-native open-source embedding database
ZNLP/BigTranslate
BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages
lemon0830/TIM
Code for "Teaching LM to Translate with Comparison"
wxjiao/ParroT
The ParroT framework enhances and regulates translation abilities during chat, built on open-source LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human-written translation and evaluation data.
deep-spin/hallucinations-in-nmt
hongbinye/Cognitive-Mirage-Hallucinations-in-LLMs
Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models"
lancopku/label-words-are-anchors
Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
FreedomIntelligence/GrammarGPT
The code and data for GrammarGPT.
HillZhang1999/llm-hallucination-survey
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
HillZhang1999/MuCGEC
Open-sourced MuCGEC Chinese error correction dataset and SOTA text correction models; Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction"
DefTruth/Awesome-LLM-Inference
📖 A curated list of awesome LLM inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, continuous batching, FlashAttention, PagedAttention, etc.
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
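The generic idea behind weight-only quantization schemes like AWQ can be sketched as a per-channel round-trip; note this is only a minimal illustration, and AWQ itself additionally rescales salient channels using activation statistics, which is omitted here (function and variable names are illustrative, not the repo's API).

```python
import numpy as np

def quantize_per_channel(W, n_bits=4):
    # W: (out_channels, in_features); one symmetric scale per output channel.
    qmax = 2 ** (n_bits - 1) - 1
    scale = np.abs(W).max(axis=1, keepdims=True) / qmax
    # Round to the nearest integer grid point and clip to the int range.
    Wq = np.clip(np.round(W / scale), -qmax - 1, qmax).astype(np.int8)
    return Wq, scale

def dequantize(Wq, scale):
    # Recover an approximation of the original float weights.
    return Wq.astype(np.float32) * scale
```

Rounding error per element is bounded by half the channel's scale, which is why keeping the scales of important channels small (AWQ's contribution) matters.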
kimiyoung/transformer-xl
bojone/bytepiece
A purer tokenizer with a higher compression rate
intro-llm/intro-llm.github.io
website
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
IST-DASLab/gptq
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
QingruZhang/AdaLoRA
AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).
microsoft/gpt-MT
xverse-ai/XVERSE-13B
XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.
Unbabel/COMET
A Neural Framework for MT Evaluation
QwenLM/Qwen
The official repo of Qwen (通义千问), the chat & pretrained large language models proposed by Alibaba Cloud.
fkodom/grouped-query-attention-pytorch
(Unofficial) PyTorch implementation of grouped-query attention (GQA) from "GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints" (https://arxiv.org/pdf/2305.13245.pdf)
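The core of GQA is that several query heads share one key/value head, shrinking the KV cache. A minimal NumPy sketch, assuming the number of query heads is a multiple of the number of KV heads (shapes and names are illustrative, not the repo's API):

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def grouped_query_attention(q, k, v):
    # q: (n_q_heads, seq, d); k, v: (n_kv_heads, seq, d)
    n_q_heads, seq, d = q.shape
    n_kv_heads = k.shape[0]
    group = n_q_heads // n_kv_heads
    # Repeat each KV head so every query head in its group attends to it.
    k = np.repeat(k, group, axis=0)
    v = np.repeat(v, group, axis=0)
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(d)
    return softmax(scores) @ v
```

With n_kv_heads = 1 this reduces to multi-query attention; with n_kv_heads = n_q_heads it is standard multi-head attention.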
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
explosion/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
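LoRA freezes the pretrained weight W and learns a low-rank update scaled by alpha/r. A minimal forward-pass sketch of that parameterization (names are illustrative, not loralib's API):

```python
import numpy as np

def lora_forward(x, W, A, B, alpha):
    # x: (batch, in_features); W: (out, in) frozen pretrained weights
    # A: (r, in), B: (out, r) are the only trained parameters
    r = A.shape[0]
    delta = (alpha / r) * (B @ A)  # rank-r update, scaled by alpha/r
    return x @ (W + delta).T
```

At merge time W + delta can be folded into a single matrix, so inference costs nothing extra.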
DAMO-NLP-MT/PolyLM
ggerganov/llama.cpp
LLM inference in C/C++