Starrylun's Stars
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
ray-project/ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
hiroi-sora/Umi-OCR
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
BuilderIO/gpt-crawler
Crawl a site to generate knowledge files to create your own custom GPT from a URL
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
state-spaces/mamba
Mamba SSM architecture
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
leptonai/search_with_lepton
Building a quick conversation-based search demo with Lepton AI.
imoneoi/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
tyxsspa/AnyText
Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>
MaartenGr/KeyBERT
Minimal keyword extraction with BERT
Docta-ai/docta
A Doctor for your data
Kedreamix/Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
KoljaB/RealtimeTTS
Converts text to speech in realtime
OpenLLMAI/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
zhangbc/eBooks
eBook分享大集合:主要以IT领域经典书籍收藏,以备不时之需。
explosion/spacy-models
💫 Models for the spaCy Natural Language Processing (NLP) library
NUS-HPC-AI-Lab/OpenDiT
OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference
lucidrains/self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
0x648/luna-ai
Luna AI - 全自动的 AI 直播系统
SkyworkAI/Skywork
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation methods, etc. 天工系列模型在3.2TB高质量多语言和代码数据上进行预训练。我们开源了模型参数,训练数据,评估数据,评估方法。
hao-ai-lab/LookaheadDecoding
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
open-compass/MixtralKit
A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI
nonamestreet/weixin_public_corpus
微信公众号语料库
charent/Phi2-mini-Chinese
Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.
intro-llm/intro-llm.github.io
website
togethercomputer/stripedhyena
Repository for StripedHyena, a state-of-the-art beyond Transformer architecture
bhattbhavesh91/voice-assistant-whisper-chatgpt
This repository will guide you to create your own Smart Virtual Assistant like Google Assistant using Open AI's ChatGPT, Whisper. The entire solution is created using Python & Gradio.
MengLingchao/Chinese_financial_sentiment_dictionary
A Chinese financial sentiment word dictionary
bobo0810/LearnDeepSpeed
DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)