TissueC

A researcher focused in LLM. Graduated from Tsinghua University.

CoAI of Tsinghua University @thu-coaiBeijing

TissueC's Stars

vict0rsch/PaperMemory
Your browser's reference manager: automatic paper detection (Arxiv, OpenReview & more), publication venue matching and code repository discovery! Also enhances ArXiv: BibTex citation, Markdown link, direct download and more!
Language:JavaScript47717
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
Language:Python3.7k271
multimodal-art-projection/MAP-NEO
Language:Python75772
huggingface/lighteval
LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
Language:Python47958
elastic/elasticsearch
Free and Open, Distributed, RESTful Search Engine
Language:Java68.6k24.4k
chujiezheng/LLM-Extrapolation
Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"
Language:Python512
deepseek-ai/DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Language:Python1.9k179
microsoft/mup
maximal update parametrization (µP)
Language:Jupyter Notebook1.2k88
allenai/fm-cheatsheet
Website for hosting the Open Foundation Models Cheat Sheet.
Language:JavaScript25118
QwenLM/Qwen2
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
Language:Shell6.2k353
outlines-dev/outlines
Structured Text Generation
Language:Python7.2k370
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
Language:Python4.2k399
OpenBMB/MiniCPM
MiniCPM-2B: An end-side LLM outperforming Llama2-13B.
Language:Python4.4k322
pjlab-sys4nlp/llama-moe
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training
Language:Python80242
qinyiwei/InfoBench
Language:Python406
deepseek-ai/DeepSeek-MoE
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Language:Python92241
paralym/COIG-CQIA
715
YJiangcm/FollowBench
Code for "FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models (ACL 2024)"
Language:Python588
alibaba/Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
Language:Python56278
thu-coai/CritiqueLLM
Language:Python107
arcee-ai/mergekit
Tools for merging pretrained large language models.
Language:Python4.1k356
IEIT-Yuan/Yuan-2.0
Yuan 2.0 Large Language Model
Language:Python67184
BlackSamorez/tensor_parallel
Automatically split your PyTorch models on multiple GPUs for training & inference
Language:Python60337
thu-coai/BPO
Language:Python26014
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Language:Python3.3k345
yangjianxin1/LongQLoRA
LongQLoRA: Extent Context Length of LLMs Efficiently
Language:Python15111
deepseek-ai/DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
Language:Python6.1k439
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
Language:Python7.5k458
ray-project/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Language:Python32.1k5.5k
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
Language:C++5.7k877

TissueC

TissueC's Stars

vict0rsch/PaperMemory

THUDM/GLM-4

multimodal-art-projection/MAP-NEO

huggingface/lighteval

elastic/elasticsearch

chujiezheng/LLM-Extrapolation

deepseek-ai/DeepSeek-VL

microsoft/mup

allenai/fm-cheatsheet

QwenLM/Qwen2

outlines-dev/outlines

allenai/OLMo

OpenBMB/MiniCPM

pjlab-sys4nlp/llama-moe

qinyiwei/InfoBench

deepseek-ai/DeepSeek-MoE

paralym/COIG-CQIA

YJiangcm/FollowBench

alibaba/Pai-Megatron-Patch

thu-coai/CritiqueLLM

arcee-ai/mergekit

IEIT-Yuan/Yuan-2.0

BlackSamorez/tensor_parallel

thu-coai/BPO

open-compass/opencompass

yangjianxin1/LongQLoRA

deepseek-ai/DeepSeek-Coder

01-ai/Yi

ray-project/ray

NVIDIA/FasterTransformer