zsxkib
Born too late to explore the earth. Born too early to explore the universe. Born just in time for the AI uprising.
Replicate, Edinburgh
Pinned Repositories
cog-comfyui
Run ComfyUI with an API
AICoverGen
A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.
Cog-in-Colab-Notebook-Examples
A few Colab notebooks that showcase a hacky way to run Cog containers in Google Colab
cog-yolo-world
InstantID
Replicate Repo for InstantID: Instant Faceswap AI Avatars in Seconds 🔥
playground-v2-1024px-aesthetic
Playground v2 is a diffusion-based text-to-image generative model. The model was trained from scratch by the research team at Playground.
PuLID
ST-MFNet
[IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull
voice-cloning-create-dataset
Create your own RVC v2 dataset from a YouTube video
voice-cloning-training
Voice data <= 10 mins can also be used to train a good VC model!
zsxkib's Repositories
zsxkib/InstantID
Replicate Repo for InstantID: Instant Faceswap AI Avatars in Seconds 🔥
zsxkib/PuLID
zsxkib/cog-yolo-world
zsxkib/IC-Light
More relighting!
zsxkib/sd3-on-apple-silicon
Run Stable Diffusion on Apple Silicon
zsxkib/FlashFace
zsxkib/cog-aura-sr
AuraSR: GAN-based Super-Resolution for real-world
zsxkib/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
zsxkib/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
zsxkib/cog-aya-101
Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model
zsxkib/hololive-style-bert-vits2
🎙️Hololive text-to-speech and voice-to-voice (Japanese🇯🇵 + English🇬🇧)
zsxkib/TalkNet-ASD
ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
zsxkib/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio clip, and a sequence of V-Kps images.
zsxkib/Arc2Face
Arc2Face: A Foundation Model of Human Faces
zsxkib/Arc2Face-replicate
zsxkib/cog-aura-sr-v2
AuraSR v2: Second-gen GAN-based Super-Resolution for real-world applications
zsxkib/cog-comfyui
Run ComfyUI with an API
zsxkib/conda-envs-in-cog
How to use Conda with Replicate Cog to easily manage packages in your projects. Step-by-step examples included!
zsxkib/animate-diff-scene-assembler
Dkamacho’s Scene Assembler
zsxkib/animatediff-cli-prompt-travel
AnimateDiff prompt travel
zsxkib/cog-blip-3
zsxkib/cog-idefics3
Idefics3-8B-Llama3: A powerful multimodal AI model by Hugging Face that integrates image and text inputs to enhance visual reasoning and text generation
zsxkib/cog-qwen-2
Attempt at cog wrapper for QwenLM/Qwen2
zsxkib/cog-stable-diffusion-3-with-instantx-controlnets
A template for running Stable Diffusion 3 InstantX/SD3-Controlnet-Canny with Cog
zsxkib/cog-wd-tagger
zsxkib/EvTexture
[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution
zsxkib/Kandinsky-2
Kandinsky 2 — multilingual text2image latent diffusion model
zsxkib/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
zsxkib/ToonCrafter
A research paper on generative cartoon interpolation
zsxkib/YOLO-World
Real-Time Open-Vocabulary Object Detection