cslydia's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools so that you can focus on what matters.
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
meta-llama/llama3
The official Meta Llama 3 GitHub site
ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
QwenLM/Qwen
The official repo of Qwen (通义千问), the chat and pretrained large language models developed by Alibaba Cloud.
opendatalab/MinerU
A one-stop, open-source, high-quality data-extraction tool; supports PDF, webpage, and e-book extraction.
THUDM/ChatGLM3
ChatGLM3 series: open bilingual chat LLMs
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
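For reference, the basic encode/decode round trip looks like this; a minimal sketch, where "cl100k_base" is one of the library's built-in encodings:

```python
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")           # encoding used by recent OpenAI chat models
tokens = enc.encode("tiktoken is a fast BPE tokeniser")
print(tokens)                                        # list of integer token ids
print(enc.decode(tokens))                            # round-trips back to the original string
```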
OpenMOSS/MOSS
An open-source tool-augmented conversational language model from Fudan University
cleanlab/cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
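A minimal sketch of cleanlab's label-issue detection; the toy labels and probabilities below are made up for illustration, and in practice pred_probs would come from a classifier's cross-validated predictions:

```python
import numpy as np
from cleanlab.filter import find_label_issues

labels = np.array([0, 1, 1, 0, 1])       # noisy observed labels
pred_probs = np.array([                  # out-of-sample class probabilities
    [0.9, 0.1],
    [0.2, 0.8],
    [0.8, 0.2],                          # model disagrees with label 1 here
    [0.7, 0.3],
    [0.1, 0.9],
])
issue_idx = find_label_issues(
    labels=labels,
    pred_probs=pred_probs,
    return_indices_ranked_by="self_confidence",
)
print(issue_idx)                         # indices of examples most likely mislabeled
```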
THUDM/GLM-130B
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
baichuan-inc/Baichuan-7B
A large-scale 7B pretrained language model developed by Baichuan Inc.
google/BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
FranxYao/chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
stanford-crfm/helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in HEIM (https://arxiv.org/abs/2311.04287) and vision-language models in VHELM (https://arxiv.org/abs/2410.07112).
hkust-nlp/ceval
Official GitHub repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
microsoft/CodeXGLUE
CodeXGLUE: a benchmark dataset for code understanding and generation
microsoft/mup
maximal update parametrization (µP)
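A hedged sketch of the workflow described in the mup README: replace the output layer with MuReadout, register base shapes against small proxy models, and use a µP-aware optimizer so learning rates tuned on a narrow model transfer to a wide one. The make_model helper and the widths here are illustrative, not from the repo:

```python
import torch.nn as nn
from mup import MuReadout, set_base_shapes, MuAdam

def make_model(width):
    return nn.Sequential(
        nn.Linear(64, width),
        nn.ReLU(),
        MuReadout(width, 10),            # µP-corrected output layer
    )

model = make_model(width=1024)           # the model you actually train
base = make_model(width=64)              # proxy model defining base shapes
delta = make_model(width=128)            # second proxy, to infer which dims scale
set_base_shapes(model, base, delta=delta)
opt = MuAdam(model.parameters(), lr=1e-2)  # lr tuned on the small proxy transfers
```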
bigscience-workshop/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
hendrycks/test
Measuring Massive Multitask Language Understanding | ICLR 2021
Duxiaoman-DI/XuanYuan
XuanYuan: Du Xiaoman's Chinese financial-dialogue large language model
ORDINAND/The-Art-of-Asking-ChatGPT-for-High-Quality-Answers-A-complete-Guide-to-Prompt-Engineering-Technique
ChatGPT prompting techniques
bigscience-workshop/bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
ZhuiyiTechnology/roformer
Rotary Transformer
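The core idea here, rotary position embedding (RoPE), rotates each pair of query/key feature dimensions by an angle that grows with the position index. A minimal NumPy sketch of that idea; the rope helper is illustrative, not the repo's code:

```python
import numpy as np

def rope(x, base=10000.0):
    """x: (seq_len, dim) with even dim; returns x with rotary embedding applied."""
    seq_len, dim = x.shape
    inv_freq = base ** (-np.arange(0, dim, 2) / dim)   # theta_i, shape (dim/2,)
    angles = np.outer(np.arange(seq_len), inv_freq)    # (seq_len, dim/2)
    cos, sin = np.cos(angles), np.sin(angles)
    x1, x2 = x[:, 0::2], x[:, 1::2]                    # interleaved dimension pairs
    out = np.empty_like(x)
    out[:, 0::2] = x1 * cos - x2 * sin                 # 2-D rotation per pair
    out[:, 1::2] = x1 * sin + x2 * cos
    return out

q = rope(np.random.randn(8, 16))                       # apply to queries (and keys)
```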
bigcode-project/bigcode-dataset
thu-coai/COLDataset
The official repository of the paper "COLD: A Benchmark for Chinese Offensive Language Detection"
thaumstrial/FinetuneGLMWithPeft
A simple implementation of using LoRA from the peft library to fine-tune ChatGLM-6B
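A hedged sketch of that pattern using the peft API; "query_key_value" is ChatGLM-6B's fused attention projection, but target_modules varies by architecture, and the hyperparameters below are illustrative rather than taken from this repo:

```python
from transformers import AutoModel
from peft import LoraConfig, get_peft_model

model = AutoModel.from_pretrained("THUDM/chatglm-6b", trust_remote_code=True)
config = LoraConfig(
    r=8,                                 # rank of the low-rank update
    lora_alpha=32,                       # scaling factor for the update
    lora_dropout=0.05,
    target_modules=["query_key_value"],  # ChatGLM-6B's fused attention projection
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, config)
model.print_trainable_parameters()       # only the adapter weights are trained
```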
OpenLMLab/ChatZoo
A lightweight local website for displaying the performance of different chat models.
dqxiu/KAssess
RyanBurnell/revealing-LLM-capabilities
Code and data for the paper "Revealing the structure of language model capabilities"