zhLawliet's Stars
CompVis/stable-diffusion
A latent text-to-image diffusion model
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
TencentARC/GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Stability-AI/generative-models
Generative Models by Stability AI
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Sanster/IOPaint
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
sczhou/CodeFormer
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
InstantID/InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
modelscope/facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Stability-AI/StableCascade
Official Code for Stable Cascade
guofei9987/blind_watermark
Blind & Invisible Watermark: blind image watermarking; extracting the watermark does not require the original image!
Zejun-Yang/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
VAST-AI-Research/TripoSR
ali-vilab/AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
argosopentech/argos-translate
Open-source offline translation library written in Python
layerdiffusion/sd-forge-layerdiffuse
[WIP] Layer Diffusion for WebUI (via Forge)
MhLiao/DB
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
layerdiffusion/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
mini-sora/minisora
MiniSora: A community that aims to explore the implementation path and future development direction of Sora.
PINTO0309/onnx2tf
Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massive Transpose extrapolation problem in onnx-tensorflow (onnx-tf). I don't need a Star, but give me a pull request.
BobLd/DocumentLayoutAnalysis
Document Layout Analysis resources repo for development with PdfPig.
Mikoto10032/AutomaticWeightedLoss
Multi-task learning using uncertainty to weigh losses for scene geometry and semantics, Auxiliary Tasks in Multi-task Learning
ApolloScapeAuto/dataset-api
The ApolloScape Open Dataset for Autonomous Driving and its Application.
KU-CVLAB/Perturbed-Attention-Guidance
Official implementation of "Perturbed-Attention Guidance"
vkgo/OCRAutoScore
OCR-based automated exam grading project
Kamino666/watermark-tracer
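A digital media source-tracing application based on visible watermark detection and recognition. It is my course project and includes the system itself plus an open-source Large-scale Common Watermark Dataset (LCWD). Given an image or video with a visible watermark, the system detects and localizes the watermark region, extracts it, and then uses the OCR and logo recognition of the Baidu AI Open Platform together with the Bing search engine to trace the image or video back to its source.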
fh2019ustc/DocGeoNet
The official code for “Geometric Representation Learning for Document Image Rectification”, ECCV, 2022.
xiaomore/Document-Image-Dewarping
hellloxiaotian/SWCNN
A self-supervised CNN for image watermark removal (IEEE Transactions on Circuits and Systems for Video Technology, 2024)