WPU93's Stars
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
rasbt/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
HumanAIGC/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
cumulo-autumn/StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
WongKinYiu/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
crownpku/Awesome-Chinese-NLP
A curated list of resources for Chinese NLP 中文自然语言处理相关资料
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
shibing624/pycorrector
pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,LLaMA等模型应用在纠错场景,开箱即用。
alibaba-damo-academy/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
layerdiffusion/sd-forge-layerdiffuse
[WIP] Layer Diffusion for WebUI (via Forge)
DLLXW/baby-llama2-chinese
用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库;24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.
k2-fsa/sherpa-ncnn
Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A etc.
xcfcode/Summarization-Papers
Summarization Papers
NUS-HPC-AI-Lab/Neural-Network-Diffusion
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters
horseee/DeepCache
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
alibaba/animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
baegwangbin/DSINE
[CVPR 2024 Oral] Rethinking Inductive Biases for Surface Normal Estimation
dvlab-research/LLaMA-VID
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
hitcslj/Awesome-AIGC-3D
A curated list of awesome AIGC 3D papers
wenet-e2e/WenetSpeech
A 10000+ hours dataset for Chinese speech recognition
DiffusionGPT/DiffusionGPT
cylnlp/dialogsum
DialogSum: A Real-life Scenario Dialogue Summarization Dataset - Findings of ACL 2021
microsoft/DialogLM
Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."
aliyun/alibabacloud-aiacc-demo
alibabacloud-aiacc-demo
BADBADBADBOY/baipiaoOCR
convert paddleOCR to torchOCR, ppocr-v3,ppocr-v4, onnx, openvino
hahahawu/VCSum
Dataset for Findings of ACL 23 "VCSum: A Versatile Chinese Meeting Summarization Dataset"