kongya's Stars
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
epfml/dynamic-sparse-flash-attention
OpenBMB/ToolBench
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.
Watchful1/PushshiftDumps
Example scripts for the pushshift dump files
EleutherAI/the-pile
nouhadziri/THRED
The implementation of the paper "Augmenting Neural Response Generation with Context-Aware Topical Attention"
togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
databrickslabs/dolly
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
OpenMOSS/MOSS
An open-source tool-augmented conversational language model from Fudan University
lamini-ai/lamini
X-PLUG/mPLUG-Owl
mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
LAION-AI/Open-Instruction-Generalist
Open Instruction Generalist is an assistant trained on a massive set of synthetic instructions to perform millions of tasks.
X-PLUG/ChatPLUG
A Chinese Open-Domain Dialogue System
yoheinakajima/babyagi
nichtdax/awesome-totally-open-chatgpt
A list of totally open alternatives to ChatGPT
yaodongC/awesome-instruction-dataset
A collection of open-source datasets for training instruction-following LLMs (ChatGPT, LLaMA, Alpaca)
Instruction-Tuning-with-GPT-4/GPT-4-LLM
Instruction Tuning with GPT-4
orhonovich/unnatural-instructions
thunlp/UltraChat
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
radi-cho/datasetGPT
A command-line interface to generate textual and conversational datasets with LLMs.
radi-cho/botbots
A dataset of diverse dialogues between two ChatGPT (gpt-3.5-turbo) instances with system messages written by GPT-4, covering various contexts and tasks (task-oriented dialogue systems, abstract reasoning, brainstorming).
teknium1/GPTeacher
A collection of modular datasets generated by GPT-4: General-Instruct, Roleplay-Instruct, Code-Instruct, and Toolformer
Facico/Chinese-Vicuna
Chinese-Vicuna: a Chinese instruction-following LLaMA-based model; a low-resource Chinese LLaMA + LoRA approach with a structure modeled after Alpaca
PhoebusSi/Alpaca-CoT
We unify the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, P-Tuning) for easy use, providing a fine-tuning platform that makes it easy for researchers to get started with large models. We welcome open-source enthusiasts to open any meaningful PR on this repo and to integrate as many LLM-related technologies as possible.
project-baize/baize-chatbot
Let ChatGPT teach your own chatbot in hours with a single GPU!
microsoft/ContextualSP
Multiple paper open-source codes of the Microsoft Research Asia DKI group