Pinned Repositories
Angular-fullstack-admin-theme
Angular Fullstack RC 5
angular-fullstack-multi-tenant
angular-graphql-nestjs-postgres-starter-kit
🚀Angular 8 + GraphQL + NestJS + Postgres Starter Kit
animate.css
Cross-browser CSS3 animations. Plug and play. Do a little dance.
bootbox
Wrappers for JavaScript alert(), confirm() and other flexible dialogs using Twitter's bootstrap framework
CustomTShirt
Fabric.js
fabric.curvedText
Curved text for fabric.js
POS
Directory watcher with post data
RaspberryPI-Node-AI
RaspberryPI 3, Nodejs, Sox, eSpeak, OpenCV
videoChat
WebRTC, Socket.io and Node.js
imomin's Repositories
imomin/background-removal-js
imomin/bark-TTS
🔊 Text-Prompted Generative Audio Model
imomin/calendso
The open-source Calendly alternative.
imomin/audio2photoreal
Code and dataset for photorealistic Codec Avatars driven from audio
imomin/ChatGPTFromKB
Intelligent customer support bot
imomin/cosmic-media-extension
Search millions of high-quality royalty-free stock photos, images, and videos from popular online media services.
imomin/DPE
[CVPR 2023] DPE: Disentanglement of Pose and Expression for General Video Portrait Editing
imomin/faster-whisper
Faster Whisper transcription with CTranslate2
imomin/GeneFace
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
imomin/llm-answer-engine
Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Mixtral, Langchain, OpenAI, Brave & Serper
imomin/OpenVoice
Instant voice cloning by MyShell
imomin/pywinassistant
The first open source Large Action Model generalist Artificial Narrow Intelligence that controls completely human user interfaces by only using natural language. PyWinAssistant utilizes Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models.
imomin/Real-Time-Accent-Conversion
Real Time Foreign Accent Conversion
imomin/roomGPT
Upload a photo of your room to generate your dream room with AI.
imomin/roop
one-click deepfake (face swap)
imomin/SadTalker
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
imomin/SadTalker-Video-Lip-Sync
本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。
imomin/Scrapegraph-ai
Python scraper based on AI
imomin/ShortGPT
AI framework for automating video and short content creation
imomin/spleeter
Deezer source separation library including pretrained models.
imomin/stable-diffusion-webui
Stable Diffusion web UI
imomin/StableVideo
[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing
imomin/STIT
imomin/Storyblocks
✨ Experience the enchantment of Story Block: an open-source project merging AI text generation and image synthesis to create captivating video narratives. 📚🎥 Watch as your text prompts come to life with stunning visuals, exploring new frontiers in storytelling!
imomin/storyteller
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
imomin/text2cinemagraph
Official Pytorch implementation of Text2Cinemagraph: Synthesizing Artistic Cinemagraphs from Text
imomin/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
imomin/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
imomin/wesper-demo
imomin/whisper.cpp
Port of OpenAI's Whisper model in C/C++