John1999-lang

John1999-lang's Stars

black-forest-labs/flux
Official inference repo for FLUX.1 models
Language: Python · 18.1k · 1.3k
mcpaulgeorge/WalMaFa
[ACCV 2024] Source code of WalMaFa
Language:Python183
zs1314/SkinMamba
[ACCVW2025] Official PyTorch Code for "SkinMamba: A Precision Skin Lesion Segmentation Architecture with Cross-Scale Global State Modeling and Frequency Boundary Guidance"
Language:Python244
BGU-CS-VIL/WTConv
Wavelet Convolutions for Large Receptive Fields. ECCV 2024.
Language:Python32412
KNITPhoenix/Ridgeformer
Ridgeformer submission for ICASSP 2025
Language:Python3
opendatalab/MinerU
A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON format.
Language: Python · 20.3k · 1.4k
Winn1y/Awesome-Human-Motion-Video-Generation
Human Motion Video Generation: A Survey (https://www.techrxiv.org/users/836049/articles/1228135-human-motion-video-generation-a-survey)
1174
ziyangwang007/Awesome-Medical-Image-Segmentation-Dataset
A list of publicly available medical image segmentation datasets.
413
facebookresearch/Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
Language: Python · 2.6k · 389
fudan-zvg/GSS
[CVPR 2023] Official repository of Generative Semantic Segmentation
Language:Python20713
fMRIAnalysisCourse/fmri-analysis-course
Materials from fMRI data analysis course for cognitive science students.
Language:Jupyter Notebook8441
liyidi/STNet
STNet: Deep Audio-Visual Fusion Network for Robust Speaker Tracking
Language:Python43
gndlwch2w/msvm-unet
The official code for the work "MSVM-UNet: Multi-Scale Vision Mamba UNet for Medical Image Segmentation".
Language:Python445
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Language: Jupyter Notebook · 34.5k · 4.2k
public-apis/public-apis
A collective list of free APIs
Language: Python · 319k · 34k
zs1314/OCTAMamba
Official PyTorch Code for "OCTAMamba: A State-Space Model Approach for Precision OCTA Vasculature Segmentation"
Language:Python242
abi/screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
Language: Python · 64.8k · 7.9k
zang0902/EM-Net
Official repository for EM-Net: Efficient Channel and Frequency Learning with Mamba for 3D Medical Image Segmentation (MICCAI 2024)
Language:Python24
joeyan/gaussian_splatting
Unofficial implementation of 3D Gaussian Splatting in PyTorch + CUDA with MIT license
Language:Python15510
zhengli97/Awesome-Prompt-Adapter-Learning-for-VLMs
A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.
32014
FANGAreNotGnu/ControlAug
Official Implementation of WACV 2024 paper "Data Augmentation for Object Detection via Controllable Diffusion Models"
Language:Python193
IVRL/MulT
(CVPR 2022) MulT: An End-to-End Multitask Learning Transformer
Language:Jupyter Notebook514
facebookresearch/ConvNeXt-V2
Code release for ConvNeXt V2 model
Language: Python · 1.5k · 120
X-PLUG/mPLUG-2
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)
Language:Python22018
gmongaras/Cottention_Transformer
Code for the paper "Cottention: Linear Transformers With Cosine Attention"
Language:Cuda13
BolinLai/GLC
[BMVC2022, IJCV2023, Best Student Paper, Spotlight] Official code for the paper "In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze Estimation".
Language:Python203
remega/Sal-DCNN
The released code of AAAI2019 paper "Image Saliency Prediction in Transformed Domain: A Deep Complex Neural Network Method"
Language:Python213
MinglangQiao/Sports_saliency
Code for "Saliency Prediction of Sports Videos: A Large-Scale Database and a Self-Adaptive Approach", ICASSP 2024
Language:Python10
IVRL/DisSal
(TMLR 2022) Modeling Object Dissimilarity for Deep Saliency Prediction https://ivrl.github.io/DisSal
3
CZHQuality/Sal-CFS-GAN
TIP2019-GazeGAN Saliency Model
Language:Python345