chenboos5's Stars
NJU-PCALab/STAR
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
wangzhiyaoo/SVFR
Official implementation of SVFR.
zaidmukaddam/scira
Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel AI SDK! Search with models like Grok 2.0.
Crayon-Shinchan/AnyDressing
Official implementation of "AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models"
Snowfallingplum/SHMT
[NeurIPS 2024] SHMT: Self-supervised Hierarchical Makeup Transfer via Latent Diffusion Models
bytedance/LatentSync
Taming Stable Diffusion for Lip Sync!
SleeeepyZhou/LiveDevAgents
2024.12 CAMEL-AI Hackathon. Multi-Agent danmaku game engine.
deepseek-ai/DeepSeek-V3
thinkany-ai/rag-search
RAG Search API
Genesis-Embodied-AI/Genesis
A generative world for general-purpose robotics & embodied AI learning.
openai-translator/openai-translator
基于 ChatGPT API 的划词翻译浏览器插件和跨平台桌面端应用 - Browser extension and cross-platform desktop application for translation based on ChatGPT API.
stackblitz-labs/bolt.diy
Prompt, run, edit, and deploy full-stack web applications using any LLM you want!
Tencent/HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
vietnh1009/ASCII-generator
ASCII generator (image to text, image to image, video to video)
ali-vilab/In-Context-LoRA
Official repository of In-Context LoRA for Diffusion Transformers
zhanghao5683934/Meihu-Beautyface-sdk
美狐美颜sdk,支持美颜滤镜(Beauty Filter)、面具特效(Mask the special effects)、贴纸(Software/Hardware Encoder) 、滤镜(LUTs)
wuhaoyu1990/MagicCamera
Real-time Filter Camera&VideoRecorder And ImageEditor With Face Beauty For Android---包含美颜等40余种实时滤镜相机,可拍照、录像、图片修改
Tencent/Hunyuan3D-1
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
pagefaultgames/pokerogue
A browser based Pokémon fangame heavily inspired by the roguelite genre.
rhymes-ai/Allegro
Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.
facebookresearch/spiritlm
Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".
ddean2009/MoneyPrinterPlus
AI一键批量生成各类短视频,自动批量混剪短视频,自动把视频发布到抖音,快手,小红书,视频号上,赚钱从来没有这么容易过! 支持本地语音模型chatTTS,fasterwhisper,GPTSoVITS,支持云语音:Azure,阿里云,腾讯云。支持Stable diffusion,comfyUI直接AI生图。Generate short videos with one click using AI LLM,print money together! support:chatTTS,faster-whisper,GPTSoVITS,Azure,tencent Cloud,Ali Cloud.
wxh1996/VideoAgent
F4bwDP6a6W/FLY_US
美国大学备考资料 How to apply US colleges
shyjal/visual-try-on
A chrome extension to easily do visual trials of clothing from any e-commerce store. Here is the easy to use install option 👇
muxinc/stream.new
The repo for https://stream.new
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Ceelog/DictionaryByGPT4
一本 GPT4 生成的单词书📚,超过 8000 个单词分析,涵盖了词义、例句、词根词缀、变形、文化背景、记忆技巧和小故事
alchaincyf/img2046
图像魔方 - 一个强大的图像编辑和AI图片生成工具