xiaogp's Stars
oobabooga/text-generation-webui
A Gradio web UI for Large Language Models.
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
ymcui/Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
VikParuchuri/marker
Convert PDF to markdown quickly with high accuracy
THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
LlamaFamily/Llama-Chinese
Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
google/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
QwenLM/Qwen2
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
THUDM/GLM
GLM (General Language Model)
LinkSoul-AI/Chinese-Llama-2-7b
开源社区第一个能下载、能运行的中文 LLaMA2 模型!
Micro-sheep/efinance
efinance 是一个可以快速获取基金、股票、债券、期货数据的 Python 库,回测以及量化交易的好帮手!🚀🚀🚀
hkust-nlp/ceval
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
THUDM/P-tuning
A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.
ZhuiyiTechnology/roformer
Rotary Transformer
ShannonAI/mrc-for-flat-nested-ner
Code for ACL 2020 paper `A Unified MRC Framework for Named Entity Recognition`
triton-inference-server/client
Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.
triton-inference-server/python_backend
Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.
beader/tianchi_nl2sql
追一科技首届中文NL2SQL挑战赛决赛第3名方案+代码
owenliang/qwen-vllm
通义千问VLLM推理部署DEMO
pypdfium2-team/pypdfium2
Python bindings to PDFium
bojone/CoSENT
比Sentence-BERT更有效的句向量方案
ethanyanjiali/minChatGPT
A minimum example of aligning language models with RLHF similar to ChatGPT
wangzhegeek/DSSM-Lookalike
jayli/langchain-ChatGLM