Ki-Zhang

Ki-Zhang's Stars

openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Language:Jupyter Notebook25.4k 323 3963.3k
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Language:Python20.1k 153 2653k
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Language:Python19.9k 114 1.3k2k
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language:Jupyter Notebook15k 114 3871.4k
labelmeai/labelme
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
Language:Python13.2k 147 7423.4k
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
7.5k 330 264903
mamba-org/mamba
The Fast Cross-Platform Package Manager
Language:C++6.9k 46 1.8k353
PyQt5/PyQt
PyQt Examples（PyQt各种测试和例子） PyQt4 PyQt5
Language:Python6.6k 195 1542k
pengsida/learning_research
本人的科研经验
5.7k 69 28342
victoresque/pytorch-template
PyTorch deep learning projects made easy.
Language:Python4.7k 55 641.1k
dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
Language:Python3.2k 28 131277
bowang-lab/MedSAM
Segment Anything in Medical Images
Language:Jupyter Notebook2.9k 20 287396
IDEA-Research/DINO
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
Language:Python2.2k 31 262247
yformer/EfficientSAM
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Language:Jupyter Notebook2.1k 25 68151
botuniverse/onebot
OneBot：统一的聊天机器人应用接口标准
Language:CSS1.7k 21 35163
FoundationVision/GLEE
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
Language:Python1.1k 47 4182
InsightSoftwareConsortium/SimpleITK-Notebooks
Jupyter notebooks for learning how to use SimpleITK
Language:Jupyter Notebook836 47 76349
bowang-lab/U-Mamba
U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation
Language:Python663 11 6160
radarFudan/Awesome-state-space-models
Collection of papers on state-space models
537 17 419
Curt-Park/segment-anything-with-clip
Segment Anything combined with CLIP
Language:Python331 1 423
kyegomez/NaViT
My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"
Language:Python177 7 49
sail-sg/ptp
[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》
Language:Python149 7 104
nowsyn/InstMatt
Official repository for Instance Human Matting via Mutual Guidance and Multi-Instance Refinement
Language:Python101 14 54
nobodyplayer1/VM-UNetV2
Language:Python80 3 1611
wenyalintw/Dicom-Viewer
An application displaying 2D/3D Dicom
Language:Python60 3 024
wenzhengzeng/MPEblink
[CVPR 2023] Real-time Multi-person Eyeblink Detection in the Wild for Untrimmed Video
Language:Python49 3 47
hustvl/ViTGaze
Language:Python34 3 24
yuechuanlin-cw/PyOCT
Image reconstruction and data processing for spectral-domain optical coherence tomography
Language:Python15 3 02
TomographicImaging/iDVC
Digital Volume Correlation user interface
Language:Python5 2 2362
Baron-sanmen/CrossGLG
The code for "CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner"
4 1 00