pskun's Stars
GanjinZero/RRHF
[NeurIPS 2023] RRHF & Wombat
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Instruction-Tuning-with-GPT-4/GPT-4-LLM
Instruction Tuning with GPT-4
AI4Finance-Foundation/FinNLP
Democratizing Internet-scale financial data.
Tencent/TencentPretrain
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
CVI-SZU/Linly
Chinese-LLaMA 1 & 2 and Chinese-Falcon base models; the ChatFlow Chinese dialogue model; a Chinese OpenLLaMA model; NLP pre-training and instruction fine-tuning datasets
IllDepence/unarXive
A dataset based on all arXiv publications, pre-processed for NLP, including structured full text and a citation network
radi-cho/datasetGPT
A command-line interface to generate textual and conversational datasets with LLMs.
radi-cho/botbots
A dataset featuring diverse dialogues between two ChatGPT (gpt-3.5-turbo) instances with system messages written by GPT-4. Covering various contexts and tasks (task-oriented dialogue systems, abstract reasoning, brainstorming).
yaodongC/awesome-instruction-dataset
A collection of open-source datasets for training instruction-following LLMs (ChatGPT, LLaMA, Alpaca)
thunlp/UltraChat
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
AetherCortex/Llama-X
Open Academic Research on Improving LLaMA to SOTA LLM
bigcode-project/bigcode-dataset
sahil280114/codealpaca
databrickslabs/dolly
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
getcursor/cursor
The AI Code Editor
ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)
PhoebusSi/Alpaca-CoT
We unify the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, P-Tuning) for easy use. We welcome open-source enthusiasts to open any meaningful PR against this repo and integrate as many LLM-related technologies as possible. We have built a fine-tuning platform that makes it easy for researchers to get started with large models, and we welcome any meaningful PRs from open-source contributors!
mymusise/ChatGLM-Tuning
A fine-tuning scheme based on ChatGLM-6B + LoRA
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
ad-freiburg/large-qa-datasets
A collection of large question answering datasets
arian-askari/ChatGPT-RetrievalQA
A dataset for training and evaluating question-answering retrieval models on ChatGPT responses, with the option to train and evaluate on real human responses as well.
togethercomputer/OpenChatKit
c-box/KnowledgeLifecycle
Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey"
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
atfortes/Awesome-LLM-Reasoning
Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI o1 🍓
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)