kaijyunL's Stars
AnswerDotAI/rerankers
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
ShishirPatil/gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
hymie122/RAG-Survey
A collection of awesome papers on RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in the paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
stylellm/stylellm_models
StyleLLM, a writing-style LLM: a text style transfer project based on large language models. #text-embellishment #polishing #style-imitation
openai/openai-cookbook
Examples and guides for using the OpenAI API
yangjianxin1/Firefly
Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
mistralai/mistral-inference
Official inference library for Mistral models
karpathy/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
DLLXW/baby-llama2-chinese
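A repository for pretraining from scratch and then SFT-ing a small-parameter Chinese LLaMa2; a single 24 GB GPU is enough to produce a chat-llama2 with basic Chinese question-answering ability.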
LlamaFamily/Llama-Chinese
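Llama Chinese community. An online Llama3 demo and fine-tuned models are now available, the latest Llama3 learning resources are collected in real time, and all code has been updated for Llama3. Aims to build the best Chinese Llama LLM, fully open source and available for commercial use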
ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
git-cloner/llama-lora-fine-tuning
Llama fine-tuning with LoRA
xmxoxo/BERT-train2deploy
BERT models from training to deployment
macanv/BERT-BiLSTM-CRF-NER
TensorFlow solution for the NER task using a BiLSTM-CRF model with Google BERT fine-tuning, plus private server services
RandyPen/TextCluster
Short-text clustering preprocessing module
xuanzebi/BERT-CH-NER
BERT-based Chinese named entity recognition
zhang17173/Event-Extraction
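Event extraction from legal judgment documents and its applications, including word segmentation, part-of-speech tagging, named entity recognition, event element extraction, and judgment outcome prediction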
THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM
dorianbrown/rank_bm25
A Collection of BM25 Algorithms in Python
MachineLP/TextMatch
QAmatch (qa_match) / text matching / text classification / text embedding / text clustering / text retrieval (bow/tfidf/ngram tf-idf/bert/albert/bm25/…/nn/gbdt/xgb/kmeans/dbscan/faiss/….)
csuldw/AntSpider
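Source code for collecting 10 million Douban movie, review, celebrity, and rating records (includes a downloadable dataset of ten million movie records)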
lanbing510/DouBanSpider
A crawler for Douban Books
baabaaox/ScrapyDouban
Scrapy crawler for Douban Movies and Douban Books
LG-1/video_music_book_datasets
NLP NER datasets video/music/book bio
haibincoder/AddressCrawer
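Scrapes place names for cities across China, including streets, villages, residential communities, shops, scenic spots, and more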
dabeaz-course/python-mastery
Advanced Python Mastery (course by @dabeaz)
gongminmin/awesome-aigc
A list of awesome AIGC works
taishan1994/prompt_text_classification
Prompt-based Chinese text classification.