sunyui

sunyui's Stars

QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language:Python14.7k1.2k
FoundationVision/GLEE
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
Language:Python1.1k86
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook6.5k797
ollama/ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
Language:Go103k8.2k
Whiffe/SCB-dataset
Student Classroom Behavior dataset
Language:Python22721
ykk648/AI_power
AI toolbox and pretrain models.
Language:Python376
linexjlin/GPTs
leaked prompts of GPTs
29k3.9k
adobe-research/custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
Language:Python1.9k139
adobe-research/MakeItTalk
Language:Jupyter Notebook499307
THUDM/MathGLM
Official Pytorch Implementation for MathGLM
Language:Python32025
hkust-nlp/ceval
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
Language:Python1.6k79
zhayujie/chatgpt-on-wechat
基于大模型搭建的聊天机器人，同时支持微信公众号、企业微信应用、飞书、钉钉等接入，可选择GPT3.5/GPT-4o/GPT-o1/ Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI，能处理文本、语音和图片，访问操作系统和互联网，支持基于自有知识库进行定制企业智能客服。
Language:Python31.9k8.3k
public-apis/public-apis
A collective list of free APIs
Language:Python320k34k
BuilderIO/gpt-crawler
Crawl a site to generate knowledge files to create your own custom GPT from a URL
Language:TypeScript19k2k
abi/screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
Language:Python65.5k8k
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Language:Python3.6k304
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python72.8k8.7k
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python12.9k1.4k
krikristoophe/whisper_flutter_plus
Ready to use whisper.cpp models implementation for iOS and Android
Language:C++1811
azkadev/whisper
Whisper Dart is a cross platform library for dart and flutter that allows converting audio to text / speech to text / inference from Open AI models
Language:C++56135
lyledean1/flutter_whisper.cpp
Flutter App That Can Transcribe Audio Offline/On Device with Whisper C++ Bindings via Rust
Language:C++11611
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Python39.5k5.1k
kabachuha/sd-webui-text2video
Auto1111 extension implementing text2video diffusion models (like ModelScope or VideoCrafter) using only Auto1111 webui dependencies
Language:Python1.3k108
Stability-AI/generative-models
Generative Models by Stability AI
Language:Python24.9k2.8k
electron/rebuild
Package to rebuild native Node.js modules against the currently installed Electron version
Language:TypeScript1k175
sheng895/androidtts
PaddleSpeech TTS Android Demo 的改进，实现了中英文混合模型的推理和中英文混合 c++ 前端
Language:C++3610
lym0302/paddlespeech_tts_cpp
PaddleSpeech TTS cpp
Language:Python3612
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Language:Python11.3k1.9k
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language:Python7.5k639
PlayVoice/EmotiVoice
网易语音克隆，EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language:Python61