hiyouga's Stars
kamranahmedse/developer-roadmap
Interactive roadmaps, guides and other educational content to help developers grow in their careers.
microsoft/generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi, Qwen & Gemma LLMs 2-5x faster with 80% less memory
HqWu-HITCS/Awesome-Chinese-LLM
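A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are inexpensive to train, including base models, domain-specific fine-tuning and applications, datasets, and tutorials.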
state-spaces/mamba
Mamba SSM architecture
dair-ai/ML-Papers-of-the-Week
🔥Highlighting the top ML papers every week.
dair-ai/ML-Papers-Explained
Explanations of key concepts in ML
pytorch-labs/gpt-fast
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
YaoFANGUK/video-subtitle-remover
AI-based tool for removing hard-coded subtitles and text-like watermarks from images and videos, producing output files at the original resolution. Runs locally, with no third-party API required.
pharmapsychotic/clip-interrogator
Image to prompt with BLIP and CLIP
deepseek-ai/DeepSeek-LLM
DeepSeek LLM: Let there be answers
zwq2018/Data-Copilot
Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow
yule-BUAA/MergeLM
Codebase for Merging Language Models (ICML 2024)
IEIT-Yuan/Yuan-2.0
Yuan 2.0 Large Language Model
tatsu-lab/gpt_paper_assistant
GPT-4-based personalized arXiv paper assistant bot
deepglint/unicom
MLCD & UNICOM: Large-Scale Visual Representation Model
Re-Align/URIAL
yxli2123/LoftQ
Yuchen413/text2image_safety
xverse-ai/XVERSE-65B
XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.
chentong0/factoid-wiki
Dense X Retrieval: What Retrieval Granularity Should We Use?
DocAILab/IDP-system
Intelligent Document Processing System
LiteSSLHub/DisCo
This is the public repository of EMNLP 2023 paper "DisCo: Co-training Distilled Student Models for Semi-supervised Text Mining"
OpenSUM/BiGAE
Code Repo for EMNLP'23 paper "Bipartite Graph Pre-training for Unsupervised Extractive Summarization with Graph Convolutional Auto-Encoders"
OpenSUM/CPSUM
Code and Data Repo for COLING'22 paper "Noise-injected Consistency Training and Entropy-constrained Pseudo Labeling for Semi-supervised Extractive Summarization"
tding1/Efficient-LLM-Survey
The Efficiency Spectrum of LLM
MrYxJ/enhance_long
This tool (enhance_long) aims to enhance the long-context extrapolation capability of Llama 2 at the lowest cost, preferably without training, and can be used directly in the LLM inference phase.
Alab-NII/chain-of-thought
Research papers about Chain of Thought (CoT)
snowmeow2/Blue-arXiv-Theme
Blue theme for arXiv website