Pinned Repositories
CodeActAgent-Gradio
UnOfficial Gradio Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.
ControlLoRA-Chinese
A Light Neural Network To Control Stable Diffusion Spatial Information tuned by Chinese
docvqa-gen
Question Answering dataset generator of Document Visual in English and Chinese
Genshin-Impact-BookQA-LLM
A Genshin Impact Book Question Answer Project supported by LLM
Genshin-Impact-Character-Instruction
Genshin Impact Character Instruction Models tuned by Lora on LLM
Genshin-Impact-Fan-Video
一个《原神》AI驱动视频项目,利用LLM API生成角色互动文案,VITS技术进行语音合成,并结合先进的文生图和视频合成技术,创造出游戏角色之间有趣的场景。最终产出为短视频。
Sbert-ChineseExample
Sentence-Transformers Information Retrieval example on Chinese
Stable-Diffusion-Chinese-Extend
A fine tune version of Stable Diffusion model on self-translate 10k diffusiondb Chinese Corpus and "extend" it
Stable-Diffusion-Pokemon
A demo of fine tune Stable Diffusion on Pokemon-Blip-Captions in English, Japanese and Chinese Corpus
tableQA-Chinese
Unsupervised tableQA and databaseQA on chinese finance question and tabular data
svjack's Repositories
svjack/BRIA-Background-Removal
svjack/LVCD
The official code of paper "LVCD: Reference-based Lineart Video Colorization with Diffusion Models"
svjack/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
svjack/MasaCtrl
[ICCV 2023] Consistent Image Synthesis and Editing
svjack/ConsisID
Identity-Preserving Text-to-Video Generation by Frequency Decomposition
svjack/PySceneDetect
:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.
svjack/qwen2vl-flux
svjack/OminiControl
A minimal and universal controller for FLUX.1.
svjack/ComfyUI-LTXVideo
LTX-Video Support for ComfyUI
svjack/IOPaint
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
svjack/APISR
APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)
svjack/ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
svjack/AnimateLCM
[SIGGRAPH ASIA 2024 TCS] AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data
svjack/Motion-I2V
[SIGGRAPH 2024] Motion I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling
svjack/MotionCtrl
Official Code for MotionCtrl [SIGGRAPH 2024]
svjack/Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
svjack/MotionDirector
[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
svjack/comfyui-animatediff
AnimateDiff for ComfyUI
svjack/AnimateDiff-MotionDirector
MotionDirector Training For AnimateDiff. Train a MotionLoRA and run it on any compatible AnimateDiff UI.
svjack/ComfyUI-ComfyCouple
Attention Couple made easier for ComfyUI.
svjack/katna
Tool for automating common video key-frame extraction, video compression and Image Auto-crop/Image-resize tasks
svjack/SVD_Xtend
Stable Video Diffusion Training Code and Extensions.
svjack/Regional-Prompting-FLUX
Training-free Regional Prompting for Diffusion Transformers 🔥
svjack/ComfyScript
A Python frontend and library for ComfyUI
svjack/Perturbed-Attention-Guidance
Official implementation of "Perturbed-Attention Guidance"
svjack/temporalnet-xl
svjack/FRESCO
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
svjack/ToonCrafter_with_SketchGuidance_fp16
This repository is an implementation that recreates the SketchGuidance feature of "ToonCrafter".
svjack/ToonCrafterSimple
simple tooncrafter implementation for inference
svjack/CogvideX-Interpolation
Keyframe Interpolation with CogvideoX