Pinned Repositories
CodeActAgent-Gradio
UnOfficial Gradio Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.
ControlLoRA-Chinese
A Light Neural Network To Control Stable Diffusion Spatial Information tuned by Chinese
docvqa-gen
Question Answering dataset generator of Document Visual in English and Chinese
Genshin-Impact-BookQA-LLM
A Genshin Impact Book Question Answer Project supported by LLM
GLM-Open-Dialogue
A enhanced Open Dialogue Context Generator supported by General Language Model Pretraining with Autoregressive Blank Infilling
PhotoWCT
Unofficial implementation of "A Closed-form Solution to Photorealistic Image Stylization"
Sbert-ChineseExample
Sentence-Transformers Information Retrieval example on Chinese
Stable-Diffusion-Chinese-Extend
A fine tune version of Stable Diffusion model on self-translate 10k diffusiondb Chinese Corpus and "extend" it
Stable-Diffusion-Pokemon
A demo of fine tune Stable Diffusion on Pokemon-Blip-Captions in English, Japanese and Chinese Corpus
tableQA-Chinese
Unsupervised tableQA and databaseQA on chinese finance question and tabular data
svjack's Repositories
svjack/Genshin-Impact-Fan-Video
一个《原神》AI驱动视频项目,利用LLM API生成角色互动文案,VITS技术进行语音合成,并结合先进的文生图和视频合成技术,创造出游戏角色之间有趣的场景。最终产出为短视频。
svjack/AnimateDiff-MotionDirector
MotionDirector Training For AnimateDiff. Train a MotionLoRA and run it on any compatible AnimateDiff UI.
svjack/APISR
APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)
svjack/COCOCO
Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility.
svjack/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
svjack/cogvideox-factory
Memory optimized finetuning scripts for CogVideoX using TorchAO and DeepSpeed
svjack/CogvideX-Interpolation
Keyframe Interpolation with CogvideoX
svjack/ComfyScript
A Python frontend and library for ComfyUI
svjack/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
svjack/comfyui-animatediff
AnimateDiff for ComfyUI
svjack/ComfyUI-ComfyCouple
Attention Couple made easier for ComfyUI.
svjack/ComfyUI-segment-anything-2
ComfyUI nodes to use segment-anything-2
svjack/FasterCache
FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality
svjack/FreeStyle
FreeStyle : Free Lunch for Text-guided Style Transfer using Diffusion Models
svjack/FRESCO
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
svjack/katna
Tool for automating common video key-frame extraction, video compression and Image Auto-crop/Image-resize tasks
svjack/MotionDirector
[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
svjack/MusePose
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
svjack/Perturbed-Attention-Guidance
Official implementation of "Perturbed-Attention Guidance"
svjack/Practical-RIFE
More practical frame interpolation approach.
svjack/Regional-Prompting-FLUX
Training-free Regional Prompting for Diffusion Transformers 🔥
svjack/RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
svjack/SVD_Xtend
Stable Video Diffusion Training Code and Extensions.
svjack/svjack
svjack/temporalnet-xl
svjack/ToonCrafter_with_SketchGuidance_fp16
This repository is an implementation that recreates the SketchGuidance feature of "ToonCrafter".
svjack/ToonCrafterSimple
simple tooncrafter implementation for inference
svjack/UniAnimate
Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".
svjack/UniAnimate-GradioUI
WebUI & Docker image of UniAnimate
svjack/WatermarkRemover
批量去除视频中位置固定的水印