phoenixluo's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
babysor/MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
microsoft/autogen
A programming framework for agentic AI 🤖
zhayujie/chatgpt-on-wechat
基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT-o1/ Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
BuilderIO/gpt-crawler
Crawl a site to generate knowledge files to create your own custom GPT from a URL
HumanAIGC/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
magento/magento2
Prior to making any Submission(s), you must sign an Adobe Contributor License Agreement, available here at: https://opensource.adobe.com/cla.html. All Submissions you make to Adobe Inc. and its affiliates, assigns and subsidiaries (collectively “Adobe”) are subject to the terms of the Adobe Contributor License Agreement.
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
i18next/react-i18next
Internationalization for react done right. Using the i18next i18n ecosystem.
voicepaw/so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
OthersideAI/self-operating-computer
A framework to enable multimodal models to operate a computer.
microsoft/UFO
A UI-Focused Agent for Windows OS Interaction.
JoeanAmier/TikTokDownloader
TikTok 主页/合辑/直播/视频/图集/原声;抖音主页/视频/图集/收藏/直播/原声/合集/评论/账号/搜索/热榜数据采集工具
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
i18next/next-i18next
The easiest way to translate your NextJs apps.
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
serp-ai/bark-with-voice-clone
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
markshust/docker-magento
Mark Shust's Docker Configuration for Magento
yerfor/GeneFace
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
chenxwh/insanely-fast-whisper
Incredibly fast Whisper-large-v3
medusajs/nextjs-starter-medusa
A performant frontend ecommerce starter template with Next.js 14 and Medusa.
tldraw/make-real-starter
Make it real
3lang3/react-vant
React mobile UI Components base on Vant
TaxyAI/browser-extension
Automate your browser with GPT-4
strapi/blocks-react-renderer
A React renderer for the Strapi's Blocks rich text editor