sunyui's Stars
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
FoundationVision/GLEE
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
ollama/ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
Whiffe/SCB-dataset
Student Classroom Behavior dataset
ykk648/AI_power
AI toolbox and pretrain models.
linexjlin/GPTs
leaked prompts of GPTs
adobe-research/custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
adobe-research/MakeItTalk
THUDM/MathGLM
Official Pytorch Implementation for MathGLM
hkust-nlp/ceval
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
zhayujie/chatgpt-on-wechat
基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT-o1/ Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
public-apis/public-apis
A collective list of free APIs
BuilderIO/gpt-crawler
Crawl a site to generate knowledge files to create your own custom GPT from a URL
abi/screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
krikristoophe/whisper_flutter_plus
Ready to use whisper.cpp models implementation for iOS and Android
azkadev/whisper
Whisper Dart is a cross platform library for dart and flutter that allows converting audio to text / speech to text / inference from Open AI models
lyledean1/flutter_whisper.cpp
Flutter App That Can Transcribe Audio Offline/On Device with Whisper C++ Bindings via Rust
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
kabachuha/sd-webui-text2video
Auto1111 extension implementing text2video diffusion models (like ModelScope or VideoCrafter) using only Auto1111 webui dependencies
Stability-AI/generative-models
Generative Models by Stability AI
electron/rebuild
Package to rebuild native Node.js modules against the currently installed Electron version
sheng895/androidtts
PaddleSpeech TTS Android Demo 的改进,实现了中英文混合模型的推理和中英文混合 c++ 前端
lym0302/paddlespeech_tts_cpp
PaddleSpeech TTS cpp
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
PlayVoice/EmotiVoice
网易语音克隆,EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine