TissueC
A researcher focused in LLM. Graduated from Tsinghua University.
CoAI of Tsinghua University @thu-coaiBeijing
TissueC's Stars
vict0rsch/PaperMemory
Your browser's reference manager: automatic paper detection (Arxiv, OpenReview & more), publication venue matching and code repository discovery! Also enhances ArXiv: BibTex citation, Markdown link, direct download and more!
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
multimodal-art-projection/MAP-NEO
huggingface/lighteval
LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
elastic/elasticsearch
Free and Open, Distributed, RESTful Search Engine
chujiezheng/LLM-Extrapolation
Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"
deepseek-ai/DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
microsoft/mup
maximal update parametrization (µP)
allenai/fm-cheatsheet
Website for hosting the Open Foundation Models Cheat Sheet.
QwenLM/Qwen2
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
outlines-dev/outlines
Structured Text Generation
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
OpenBMB/MiniCPM
MiniCPM-2B: An end-side LLM outperforming Llama2-13B.
pjlab-sys4nlp/llama-moe
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training
qinyiwei/InfoBench
deepseek-ai/DeepSeek-MoE
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
paralym/COIG-CQIA
YJiangcm/FollowBench
Code for "FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models (ACL 2024)"
alibaba/Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
thu-coai/CritiqueLLM
arcee-ai/mergekit
Tools for merging pretrained large language models.
IEIT-Yuan/Yuan-2.0
Yuan 2.0 Large Language Model
BlackSamorez/tensor_parallel
Automatically split your PyTorch models on multiple GPUs for training & inference
thu-coai/BPO
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
yangjianxin1/LongQLoRA
LongQLoRA: Extent Context Length of LLMs Efficiently
deepseek-ai/DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
ray-project/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT