Pinned Repositories
Glyph-ByT5
[ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering""
IC-Light
More relighting!
diffusers-layout-token-guidence
基于diffusers实现, 对sdxl模型推理过程使用cross_attn的map去计算loss,并根据loss对latent进行更新;其中loss的设计至关重要,可以实现不同的目标,比如layout控制,token增强等
diffusion_fine_tune
instant_id
models
Pre-trained and Reproduced Deep Learning Models (『飞桨』官方模型库,包含多种学术前沿和工业场景验证的深度学习模型)
pljj315.github.io
HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
CustomNet
CharacterGen
[SIGGRAPH'24] CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization
pljj315's Repositories
pljj315/instant_id
pljj315/diffusion_fine_tune
pljj315/pljj315.github.io
pljj315/diffusers-layout-token-guidence
基于diffusers实现, 对sdxl模型推理过程使用cross_attn的map去计算loss,并根据loss对latent进行更新;其中loss的设计至关重要,可以实现不同的目标,比如layout控制,token增强等
pljj315/models
Pre-trained and Reproduced Deep Learning Models (『飞桨』官方模型库,包含多种学术前沿和工业场景验证的深度学习模型)