jbang2004's Stars
2noise/ChatTTS
A generative speech model for daily dialogue.
vbenjs/vue-vben-admin
A modern vue admin panel built with Vue3, Shadcn UI, Vite, TypeScript, and Monorepo. It's fast!
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
wasp-lang/wasp
The fastest way to develop full-stack web apps with React & Node.js.
stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Doriandarko/claude-engineer
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabilities of a large language model with practical file system operations and web search functionality.
anthropics/courses
Anthropic's educational courses
Huanshere/VideoLingo
Netflix-level subtitle cutting, translation, alignment, and even dubbing - a one-click, fully automated AI subtitle team for video reposting
FunAudioLLM/CosyVoice
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
YaoFANGUK/video-subtitle-remover
AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or pictures, producing output files at the original resolution with the subtitles or watermarks removed. Runs locally, with no third-party API required.
kf-liu/The-Art-of-Linear-Algebra-zh-CN
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone", Chinese edition of The Art of Linear Algebra. PRs welcome.
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
poloclub/transformer-explainer
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
open-mmlab/mmdeploy
OpenMMLab Model Deployment Framework
xenova/whisper-web
ML-powered speech recognition directly in your browser
TMElyralab/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
solidSpoon/DashPlayer
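A video player tailored for English learners that helps you improve your English with ease by watching videos and immersing yourself in real-world contexts. #AmericanTVSeries #Player #Listening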
Kedreamix/Linly-Dubbing
Intelligent multilingual AI video dubbing and translation tool - Linly-Dubbing: "AI-empowered, language without borders"
Nutlope/notesGPT
Record voice notes & transcribe, summarize, and get tasks
CyberAlbSecOP/Awesome_GPT_Super_Prompting
ChatGPT Jailbreaks, GPT Assistants Prompt Leaks, GPTs Prompt Injection, LLM Prompt Security, Super Prompts, Prompt Hack, Prompt Security, AI Prompt Engineering, Adversarial Machine Learning.
Amery2010/TalkWithGemini
Deploy your private Gemini application for free with one click, supporting Gemini 1.5 Pro, Gemini 1.5 Flash, Gemini Pro and Gemini Pro Vision models.
ZHO-ZHO-ZHO/ComfyUI-Gemini
Using Gemini in ComfyUI
idiap/coqui-ai-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
ayushpai/AI-Math-Notes
Open Source AI Math Notes
johndpope/VASA-1-hack
Using Claude Sonnet 3.5 to forward (reverse) engineer code from VASA white paper - WIP - (this is for La Raza 🎷)
shuaijiang/Whisper-Finetune
Fine-tune the Whisper speech recognition model, with support for training without timestamp data, training with timestamp data, and training without speech data. Also supports accelerated inference and Web, Windows desktop, and Android deployment.
sevagh/demucs.cpp
C++17 port of Demucs v3 (hybrid) and v4 (hybrid transformer) models with ggml and Eigen3
lovemefan/SenseVoice-python
SenseVoice-python: An enterprise-grade open-source multilingual ASR system from FunASR, with onnxruntime
dengcunqin/noise-reduction
noise reduction
Megasu/bilibili-nuxt3
Nuxt3 version of bilibili (哔哩哔哩)