haonan-li's Stars
chinese-poetry/chinese-poetry
The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
ymcui/Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
TransformerOptimus/SuperAGI
<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
dair-ai/ML-Papers-of-the-Week
🔥Highlighting the top ML papers every week.
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
HazyResearch/flash-attention
Fast and memory-efficient exact attention
hiyouga/ChatGLM-Efficient-Tuning
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调
yuchenlin/rebiber
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
openai/human-eval
Code for the paper "Evaluating Large Language Models Trained on Code"
gururise/AlpacaDataCleaned
Alpaca dataset from Stanford, cleaned and curated
SJTU-LIT/ceval
Official github repo for C-Eval, a Chinese evaluation suite for foundation models
Cerebras/modelzoo
primeqa/primeqa
The prime repository for state-of-the-art Multilingual Question Answering research and development.
haonan-li/CMMLU
CMMLU: Measuring massive multitask language understanding in Chinese
peci1/nvidia-htop
A tool for enriching the output of nvidia-smi.
ryanzhumich/Contrastive-Learning-NLP-Papers
Paper List for Contrastive Learning for Natural Language Processing
google-research-datasets/tydiqa
TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and without the use of translation, and is designed for the training and evaluation of automatic question answering systems. This repository provides evaluation code and a baseline system for the dataset.
deepmind/xquad
apple/ml-mkqa
We introduce MKQA, an open-domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically diverse languages (260k question-answer pairs in total). The goal of this dataset is to provide a challenging benchmark for question answering quality across a wide set of languages. Please refer to our paper for details, MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering
zwhe99/MAPS-mt
[TACL 2024] MAPS enables LLMs🤖 to mimic the human😁 translation process.
mbzuai-nlp/bactrian-x
A Multilingual Replicable Instruction-Following Model
abrazinskas/SelSum
Abstractive opinion summarization system (SelSum) and the largest dataset of Amazon product summaries (AmaSum). EMNLP 2021 conference paper.
ChunhuaLiu596/WAX
The respository describing a novel datasets for word association explanations
haonan-li/QA-Datasets
A summary of QA datasets.