Pinned Repositories
AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
bark
🔊 Text-Prompted Generative Audio Model
Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image (uncensored)
Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.
Live2Diff
Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.
medicat_installer
Medicat Installer Repo
OpenClap-Format
OpenClap is a file format for the age of AI content production
openjourney
A fine-tuned model based on Stable Diffusion to create images in the style of Midjourney
parler-tts
Inference and training library for high-quality TTS models.
StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Tecknomancer's Repositories
Tecknomancer/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Tecknomancer/bark
🔊 Text-Prompted Generative Audio Model
Tecknomancer/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image (uncensored)
Tecknomancer/Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.
Tecknomancer/Live2Diff
Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.
Tecknomancer/medicat_installer
Medicat Installer Repo
Tecknomancer/OpenClap-Format
OpenClap is a file format for the age of AI content production
Tecknomancer/openjourney
A fine-tuned model based on Stable Diffusion to create images in the style of Midjourney
Tecknomancer/parler-tts
Inference and training library for high-quality TTS models.
Tecknomancer/StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Tecknomancer/Video-UP-Scaler4K
Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Super Resolution, SRMD, RealSR, Anime4K, RIFE, IFRNet, CAIN, DAIN, and ACNet.
Tecknomancer/video2x
A lossless video/GIF/image upscaler achieved with waifu2x, Anime4K, SRMD and RealSR. Started in Hack the Valley II, 2018.
Tecknomancer/Wav2Lip-GFPGAN
High quality Lip sync