zsLin177's Stars
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
streamlit/streamlit
Streamlit — A faster way to build and share data apps.
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
2noise/ChatTTS
A generative speech model for daily dialogue.
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
fishaudio/fish-speech
Brand new TTS solution
QwenLM/Qwen2
Qwen2 is the large language model series developed by the Qwen team, Alibaba Cloud.
FunAudioLLM/CosyVoice
Multilingual large voice generation model with full-stack support for inference, training, and deployment.
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
wdndev/llm_interview_note
Notes on knowledge and interview questions for large language model (LLM) algorithm (application) engineers.
315386775/DeepLearing-Interview-Awesome-2024
A collection of AIGC, CV, and LLM interview questions and answers, along with new ideas, questions, resources, and projects from work and research.
HillZhang1999/llm-hallucination-survey
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
k2-fsa/icefall
HITsz-TMG/UMOE-Scaling-Unified-Multimodal-LLMs
Code for "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"
jitsi/jiwer
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
microsoft/CLAP
Learning audio concepts from natural language supervision
wenet-e2e/WeTextProcessing
Text Normalization & Inverse Text Normalization
EmbraceAGI/AIGC_Interview
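📚 AIGC job-hunting interview experiences, essential fundamentals, prompt engineering, ChatGPT, Stable Diffusion, prompts, embeddings, fine-tuning: everything you need to know for an AIGC job search.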
YuanGongND/ltu
Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".
tsroten/zhon
Constants used in Chinese text processing
microsoft/Pengi
An Audio Language model for Audio Tasks
naginoa/LLMs_interview_notes
LLMs interview notes and answers: this repository mainly collects interview questions and reference answers for large language model (LLM) algorithm engineers.
pkunlp-icler/FastV
[ECCV 2024] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models
k2-fsa/fast_rnnt
A torch implementation of a recursion which turns out to be useful for RNN-T.
LaMP-Benchmark/LaMP
Code for papers on Large Language Models Personalization (LaMP)
cdancette/rubi.bootstrap.pytorch
NeurIPS 2019 Paper: RUBi: Reducing Unimodal Biases for Visual Question Answering
LightChen233/OpenSLU
nyukat/greedy_multimodal_learning
Characterizing and overcoming the greedy nature of learning in multi-modal deep neural networks
amazon-science/tofueval
Mashiro009/slidespeech_dl