korykim's Stars
ggerganov/llama.cpp
LLM inference in C/C++
abi/screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
FiloSottile/mkcert
A simple zero-config tool to make locally trusted development certificates with any names you'd like.
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
open-webui/open-webui
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
vbenjs/vue-vben-admin
A modern vue admin. It is based on Vue3, vite and TypeScript. It's fast!
hiroi-sora/Umi-OCR
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
magic-research/magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
sml2h3/ddddocr
带带弟弟 通用验证码识别OCR pypi版
coder/coder
Provision remote development environments via Terraform
deepseek-ai/DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
rany2/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
alibaba-damo-academy/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
google-gemini/cookbook
A collection of guides and examples for the Gemini API.
yinkaisheng/Python-UIAutomation-for-Windows
(Donot use 3.7.6,3.8.1):snake:Python 3 wrapper of Microsoft UIAutomation. Support UIAutomation for MFC, WindowsForm, WPF, Modern UI(Metro UI), Qt, IE, Firefox, Chrome ...
alibaba-damo-academy/FunClip
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
juhaku/utoipa
Simple, Fast, Code first and Compile time generated OpenAPI documentation for Rust
ahmetoner/whisper-asr-webservice
OpenAI Whisper ASR Webservice API
ali-vilab/dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
yerfor/GeneFacePlusPlus
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
Chenyme/Chenyme-AAVT
这是一个全自动(音频)视频翻译项目。利用Whisper识别声音,AI大模型翻译字幕,最后合并字幕视频,生成翻译后的视频。
NoviScl/Design2Code
FuAdmin/fu-admin
采用当前最流行的技术栈 Vben Vue Vue3 Python Django Ninja(Fast Api 和 Django的结合)开发的后端管理系统
chatbookai/ai-to-pptx
Ai-to-pptx是一个使用AI技术(ChatGpt和Gemini)制作PPTX的助手,支持在线修改和导出PPTX。 主要功能: 1 使用ChatGPT等大语言模型来生成大纲 2 生成的内容允许用户再次修改 3 生成PPTX的时候可以选择不同的模板 4 支持在线修改PPTX的文字内容,样式,图片等 5 支持导出PPTX,PDF,PNG等多种格式
tjardoo/openai-client
OpenAI Dive is an unofficial async Rust library that allows you to interact with the OpenAI API.
lybbn/django-vue-lyadmin
django vue3 python3 terminal ssh monitor crontab and other modules开箱即用后台管理系统,RABC权限控制、内置服务监控面板、终端服务webssh、微服务框架、支付第三方登录等
SeaQL/sea-orm-tutorial
ebook for SeaORM tutorial
sunmh207/xunfei-spark-python
科大讯飞星火模型SDK