Pinned Repositories
ace-data-prep
ACE 2005 Corpus Preprocessing
ACE2005-toolkit
ACE2005 dataset processing
adaptive-highlight
adversarial
Code and hyperparameters for the paper "Generative Adversarial Networks"
Adversarial-Learning-for-Generative-Conversational-Agents
This repository contains a new adversarial training method for Generative Conversational Agents
AGIEval
AGiXT
AGiXT is a dynamic AI Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
ai-for-grant-writing
A curated list of resources for using LLMs to develop more competitive grant applications.
Image-Captioning
TensorFlow implementation of Text-guided Attention for Image Captioning using scheduled sampling as a learning approach. Generates captions for unseen images through an end-to-end process.
MemN2N
End-To-End Memory Networks in Theano
dapeng2018's Repositories
dapeng2018/AGIEval
dapeng2018/AGiXT
AGiXT is a dynamic AI Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
dapeng2018/ai-for-grant-writing
A curated list of resources for using LLMs to develop more competitive grant applications.
dapeng2018/auto-openai-prompter
dapeng2018/awesome-chatgpt-prompts
A curated collection of ChatGPT prompts for using ChatGPT more effectively.
dapeng2018/awesome-chatgpt-prompts-zh
A Chinese-language guide to prompting ChatGPT, with usage guides for a variety of scenarios. Learn how to make it do what you ask.
dapeng2018/chatgpt-corpus
ChatGPT Chinese corpus: dialogue, novel, and customer-service corpora for training large models.
dapeng2018/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU deployment
dapeng2018/chinese_chatbot_corpus
Public Chinese chat corpus, containing multiple datasets
dapeng2018/Feishu-OpenAI
A useful reference summary for LLM SDKs.
dapeng2018/Fengshenbang-LM
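Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center at the IDEA Research Institute, intended to serve as infrastructure for Chinese AIGC and cognitive intelligence.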
dapeng2018/flan-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
dapeng2018/GPT-4-LLM
Instruction Tuning with GPT-4
dapeng2018/GPTFuzz
Official repo for GPTFUZZER: Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts
dapeng2018/kenlm
KenLM: Faster and Smaller Language Model Queries
dapeng2018/List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words
Blacklisted word list
dapeng2018/lit-gpt
Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
dapeng2018/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
dapeng2018/MedicalGPT
The wiki describes many training datasets.
dapeng2018/MNBVC
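MNBVC (Massive Never-ending BT Vast Chinese corpus) is a very large-scale Chinese corpus that aims to match the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures, and even text written in "Martian script" (火星文). It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wikis, classical poetry, song lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.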
dapeng2018/multi-turn-chatbot-gpt-sagemaker
Multi-Turn Chatbot with GPT-Neo and SageMaker: A conversational AI system for engaging and informative interactions with users.
dapeng2018/musiclm.github.io
dapeng2018/nlvr
Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with the truth-value of the sentence.
dapeng2018/Qwen-7B-20230803
Tongyi Qianwen (Qwen) 7B model
dapeng2018/sft_datasets
A collection of open-source SFT datasets, updated on an ongoing basis.
dapeng2018/speechbrain
Audio toolkit
dapeng2018/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
dapeng2018/video-bgm-generation
Video Background Music Generation with Controllable Music Transformer (ACM MM 2021 Best Paper Award)
dapeng2018/WizardLM
Family of instruction-following LLMs powered by Evol-Instruct: WizardLM, WizardCoder and WizardMath
dapeng2018/YuLan-Chat
YuLan-Chat: An Open-Source Bilingual Chatbot