Pinned Repositories
clash-for-linux-backup
A Clash For Linux backup repository based on Clash Core
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Doubleunet_pytorch
Official implementation of DoubleU-Net: A Deep Convolutional Neural Network for Medical Image Segmentation (PyTorch implementation)
Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
nnUNet
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
VLMEvalKit
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting 160+ VLMs and 50+ benchmarks
GPT-SoVITS
One minute of voice data is enough to train a good TTS model (few-shot voice cloning)
CogVLM
A state-of-the-art open visual language model | multimodal pretrained model
JesseZZZZZ's Repositories
JesseZZZZZ/Doubleunet_pytorch
Official implementation of DoubleU-Net: A Deep Convolutional Neural Network for Medical Image Segmentation (PyTorch implementation)