Pinned Repositories
2020-
Coding Interviews (剑指 Offer) + LeetCode
hope_better_job
image-text-localization-recognition
A general list of resources for image text localization and recognition: a collection of papers and resources on scene text localization and recognition
machine-learning-notes
This contains my past machine learning notes
pixel_link
Implementation of our paper 'PixelLink: Detecting Scene Text via Instance Segmentation' in AAAI2018
pp_mattingv2
Real-time portrait matting
pytorch-caffe-darknet-convert
convert between pytorch, caffe prototxt/weights and darknet cfg/weights
R2CNN_FPN_Tensorflow
R2CNN: Rotational Region CNN Based on FPN (Tensorflow)
tensorflow-deeplab-v3-plus
DeepLabv3+ built in TensorFlow
voc2coco-pattern
Convert a VOC dataset to the COCO dataset format
jiachen0212's Repositories
jiachen0212/3D-VisTA
Official implementation of ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"
jiachen0212/ADA-Track
Official implementation of CVPR2024 paper ADA-Track: End-to-End Multi-Camera 3D Multi-Object Tracking with Alternating Detection and Association.
jiachen0212/CN-RMA
Official implementation of CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoor Object Detection from Multi-view Images
jiachen0212/ControlNet
Let us control diffusion models!
jiachen0212/DiffPIR
"Denoising Diffusion Models for Plug-and-Play Image Restoration", Yuanzhi Zhu, Kai Zhang, Jingyun Liang, Jiezhang Cao, Bihan Wen, Radu Timofte, Luc Van Gool.
jiachen0212/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
jiachen0212/diffusion
Denoising Diffusion Probabilistic Models
jiachen0212/EmbodiedScan
[CVPR 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
jiachen0212/External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
jiachen0212/facefusion
Industry leading face manipulation platform
jiachen0212/GaussianPro
jiachen0212/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
jiachen0212/insightface
State-of-the-art 2D and 3D Face Analysis Project
jiachen0212/Instance_NeRF
jiachen0212/labelCloud
A lightweight tool for labeling 3D bounding boxes in point clouds.
jiachen0212/lang-segment-anything
SAM with text prompt
jiachen0212/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
jiachen0212/MedSAM
The official repository for MedSAM: Segment Anything in Medical Images.
jiachen0212/paper-reading
keep reading
jiachen0212/Personalize-SAM
Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds
jiachen0212/pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
jiachen0212/Qbot
[updating ...] Automated quantitative trading bot. Qbot is an AI-oriented quantitative investment platform that aims to realize the potential of AI technologies in quantitative investment and to empower them.
jiachen0212/recognize-anything
Open-source and strong foundation image recognition models.
jiachen0212/SAM-Adapter-PyTorch
Adapting Meta AI's Segment Anything to Downstream Tasks with Adapters and Prompts
jiachen0212/sd-scripts
jiachen0212/SegAnyGAussians
The official implementation of SAGA (Segment Any 3D GAussians)
jiachen0212/Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
jiachen0212/Simple-Lora
sd-lora, controlnet-lora ~
jiachen0212/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
jiachen0212/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information