zsLin177

zsLin177's Stars

microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python34.6k 343 2.7k4k
streamlit/streamlit
Streamlit — A faster way to build and share data apps.
Language:Python34.5k 319 4.5k3k
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Language:Python32k 169 4.7k2.4k
2noise/ChatTTS
A generative speech model for daily dialogue.
Language:Python30.4k 172 4983.3k
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Language:Python30.3k 194 4.7k3.7k
fishaudio/fish-speech
Brand new TTS solution
Language:Python7.4k 61 329588
QwenLM/Qwen2
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
Language:Shell7.3k 42 756442
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Language:Python4.6k 50 320461
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
Language:Python2.5k 33 106240
wdndev/llm_interview_note
主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题
Language:HTML2.4k 10 6282
315386775/DeepLearing-Interview-Awesome-2024
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓，同时包含工作和科研过程中的新想法、新问题、新资源与新项目
1.5k 24 0156
HillZhang1999/llm-hallucination-survey
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
888 12 347
k2-fsa/icefall
Language:Python886 48 641286
HITsz-TMG/UMOE-Scaling-Unified-Multimodal-LLMs
The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"
Language:Python750 11 934
jitsi/jiwer
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Language:Python594 15 4593
microsoft/CLAP
Learning audio concepts from natural language supervision
Language:Python455 14 2035
wenet-e2e/WeTextProcessing
Text Normalization & Inverse Text Normalization
Language:Python444 10 10866
EmbraceAGI/AIGC_Interview
📚 AIGC 求职面经、必备基础知识、提示词工程、ChatGPT、Stable Diffusion、Prompt、Embedding、Fintune 等 AIGC 求职你所需要知道的一切~
397 2 034
YuanGongND/ltu
Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".
Language:Python356 14 4730
tsroten/zhon
Constants used in Chinese text processing
Language:Python355 18 2945
microsoft/Pengi
An Audio Language model for Audio Tasks
Language:Python281 14 1315
naginoa/LLMs_interview_notes
LLMs interview notes and answers:该仓库主要记录大模型（LLMs）算法工程师相关的面试题和参考答案
248 0 081
pkunlp-icler/FastV
[ECCV 2024] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models
Language:Python201 3 229
k2-fsa/fast_rnnt
A torch implementation of a recursion which turns out to be useful for RNN-T.
Language:Python137 9 2122
LaMP-Benchmark/LaMP
Codes for papers on Large Language Models Personalization (LaMP)
Language:Python98 2 62
cdancette/rubi.bootstrap.pytorch
NeurIPS 2019 Paper: RUBi : Reducing Unimodal Biases for Visual Question Answering
Language:Python59 6 1714
LightChen233/OpenSLU
Language:Python33 2 03
nyukat/greedy_multimodal_learning
Characterizing and overcoming the greedy nature of learning in multi-modal deep neural networks
Language:Python27 2 33
amazon-science/tofueval
24 5 22
Mashiro009/slidespeech_dl
Language:Python14 1 50