John1999-lang's Stars
black-forest-labs/flux
Official inference repo for FLUX.1 models
mcpaulgeorge/WalMaFa
[ACCV 2024] Source code of WalMaFa
zs1314/SkinMamba
【ACCVW2025】Offical Pytorch Code for "SkinMamba: A Precision Skin Lesion Segmentation Architecture with Cross-Scale Global State Modeling and Frequency Boundary Guidance"
BGU-CS-VIL/WTConv
Wavelet Convolutions for Large Receptive Fields. ECCV 2024.
KNITPhoenix/Ridgeformer
Ridgeformer submission for ICASSP 2025
opendatalab/MinerU
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
Winn1y/Awesome-Human-Motion-Video-Generation
Human Motion Video Generation: A Survey (https://www.techrxiv.org/users/836049/articles/1228135-human-motion-video-generation-a-survey)
ziyangwang007/Awesome-Medical-Image-Segmentation-Dataset
A list of publicly available medical image segmentation dataset.
facebookresearch/Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
fudan-zvg/GSS
[CVPR 2023] Official repository of Generative Semantic Segmentation
fMRIAnalysisCourse/fmri-analysis-course
Materials from fMRI data analysis course for cognitive science students.
liyidi/STNet
STNet: Deep Audio-Visual Fusion Network for Robust Speaker Tracking
gndlwch2w/msvm-unet
The official codes for the work "MSVM-UNet: Multi-Scale Vision Mamba UNet for Medical Image Segmentation".
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
public-apis/public-apis
A collective list of free APIs
zs1314/OCTAMamba
Offical Pytorch Code for "OCTAMamba: A State-Space Model Approach for Precision OCTA Vasculature Segmentation"
abi/screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
zang0902/EM-Net
Official repository for EM-Net: Efficient Channel and Frequency Learning with Mamba for 3D Medical Image Segmentation (MICCAI 2024)
joeyan/gaussian_splatting
Unofficial implementation of 3D Gaussian Splatting in PyTorch + CUDA with MIT license
zhengli97/Awesome-Prompt-Adapter-Learning-for-VLMs
A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.
FANGAreNotGnu/ControlAug
Official Implementation of WACV 2024 paper "Data Augmentation for Object Detection via Controllable Diffusion Models"
IVRL/MulT
(CVPR 2022) MulT: An End-to-End Multitask Learning Transformer
facebookresearch/ConvNeXt-V2
Code release for ConvNeXt V2 model
X-PLUG/mPLUG-2
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)
gmongaras/Cottention_Transformer
Code for the paper "Cottention: Linear Transformers With Cosine Attention"
BolinLai/GLC
[BMVC2022, IJCV2023, Best Student Paper, Spotlight] Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze Estimation".
remega/Sal-DCNN
The released code of AAAI2019 paper "Image Saliency Prediction in Transformed Domain: A Deep Complex Neural Network Method"
MinglangQiao/Sports_saliency
Code for "Saliency Prediction of Sports Videos: A Large-Scale Database and a Self-Adaptive Approach", ICASSP 2024
IVRL/DisSal
(TMLR 2022) Modeling Object Dissimilarity for Deep Saliency Prediction https://ivrl.github.io/DisSal
CZHQuality/Sal-CFS-GAN
TIP2019-GazeGAN Saliency Model