lingxitong's Stars
donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
ShiArthur03/ShiArthur03
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Blealtan/efficient-kan
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
microsoft/LLaVA-Med
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
open-compass/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks
Linfeng-Tang/Image-Fusion
Deep Learning-based Image Fusion: A Survey
CircleRadon/Osprey
[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"
jaredleekatzman/DeepSurv
DeepSurv is a deep learning approach to survival analysis.
sail-sg/metaformer
MetaFormer Baselines for Vision (TPAMI 2024)
YUANZHUO-BNU/metahuman_overview
数字人资料整理
wilmerwang/SLFCD
The solution to cameyon16 and camelyon17 challenge and also to your own WSI data project.
seanzhang-zhichen/llama3-chinese
Llama3-Chinese是以Meta-Llama-3-8B为底座,使用 DORA + LORA+ 的训练方法,在50w高质量中文多轮SFT数据 + 10w英文多轮SFT数据 + 2000单轮自我认知数据训练而来的大模型。
OpenGVLab/gv-benchmark
General Vision Benchmark, GV-B, a project from OpenGVLab
mahmoodlab/Patch-GCN
Context-Aware Survival Prediction using Patch-based Graph Convolutional Networks - MICCAI 2021
fedshyvana/pathology_mllm_training
liupei101/PseMix
[IEEE TMI 2024] Pseudo-Bag Mixup Augmentation for Multiple Instance Learning-Based Whole Slide Image Classification
chy-upc/ABD
[CVPR 2024] Code for "Adaptive Bidirectional Displacement for Semi-Supervised Medical Image Segmentation"
FangHeng/SAM-MIL
[ACM MM 2024] SAM-MIL: A Spatial Contextual Aware Multiple Instance Learning Approach for Whole Slide Image Classification
Dootmaan/DTFD-MIL.PyTorch
Unofficial implementation of CVPR2022 paper DTFD-MIL. Use the official CAMELYON16 dataset instead of the .pickle file used in the official DTFT-MIL repo.
marrlab/HistoGPT
A vision language model for gigapixel whole slide images in histopathology
whq-xxh/SFADA-GTV-Seg
(TMI-2024) Source-Free Active Domain Adaptation (SFADA) for GTV Segmentation across Multiple Hospitals
yang-ze-kang/SCMIL
Official implementation for "SCMIL: Sparse Context-aware Multiple Instance Learning for Predicting Cancer Survival Probability Distribution in Whole Slide Images"
CUHK-AIM-Group/MultiModal-learning
weiaicunzai/Norma
lingxitong/MM_BASELINE
A library that integrates different Multi-Modal Fusion methods into a unified framework