Dawn-LX

PhD student, Zhejiang University, China.

Zhejiang University

Dawn-LX's Stars

facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language: Jupyter Notebook 48.4k 313 6805.7k
QSCTech/zju-icicles
Zhejiang University course guide sharing project
Language: HTML 37.6k 1.1k 779.5k
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language: Python 37.4k 351 1.8k4.6k
Vision-CAIR/MiniGPT-4
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Language: Python 25.5k 218 4692.9k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language: Python 21k 158 1.6k2.3k
onnx/onnx
Open standard for machine learning interoperability
Language: Python 18.2k 434 2.9k3.7k
gitalk/gitalk
Gitalk is a modern comment component based on GitHub Issues and Preact.
Language: JavaScript 7.1k 49 437616
showlab/Tune-A-Video
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Language: Python 4.3k 51 97388
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
Language: Python 3.8k 48 176288
google-research/kubric
A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow.
Language: Jupyter Notebook 2.4k 40 189234
allenai/open-instruct
Language: Python 2.3k 22 151260
open-mmlab/Multimodal-GPT
Multimodal-GPT
Language: Python 1.5k 13 20127
Computer-Vision-in-the-Wild/CVinW_Readings
A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"
1.2k 38 658
vacancy/SceneGraphParser
A Python toolkit for parsing captions (in natural language) into scene graphs (as symbolic representations).
Language: Python 560 7 1955
wutong16/DistributionBalancedLoss
[ECCV 2020 Spotlight] PyTorch implementation for "Distribution-Balanced Loss for Multi-Label Classification in Long-Tailed Datasets"
Language: Python 364 8 2147
OpenGVLab/LAMM
[NeurIPS 2023 Datasets and Benchmarks Track] LAMM: Multi-Modal Large Language Models and Applications as AI Agents
Language: Python 306 9 4417
fredzzhang/upt
[CVPR'22] Official PyTorch implementation for paper "Efficient Two-Stage Detection of Human–Object Interactions with a Novel Unary–Pairwise Transformer"
Language: Python 158 7 4525
yunqing-me/WatermarkDM
Code for the paper "A Recipe for Watermarking Diffusion Models"
Language: Jupyter Notebook 134 2 197
ChenDelong1999/polite-flamingo
🦩 Visual Instruction Tuning with Polite Flamingo - training multi-modal LLMs to be both clever and polite! (AAAI-24 Oral)
Language: Python 63 5 53
NVlabs/Bongard-LOGO
Bongard-LOGO is a Python code repository for generating synthetic Bongard problems at large scale with little human intervention.
Language: Python 51 13 711
richard-peng-xia/LMPT
[ACLW'24] LMPT: Prompt Tuning with Class-Specific Embedding Loss for Long-tailed Multi-Label Visual Recognition
Language: Python 51 2 73
Jeeseung-Park/ViPLO
[CVPR 2023] ViPLO - Official PyTorch Implementation
Language: Python 38 3 81
ZJUSCT/mirror-front
ZJU mirror front-end
Language: TypeScript 32 7 106
YAIxPOZAlabs/MuseDiffusion
YAI 11 x @POZAlabs: Music generation & modification from Unclear midi SEquence with Diffusion model
Language: Python 27 3 42
ZJUSCT/MirrorsDotNet
Mirrors.NET, the Mirror Manager for ZJU Mirror
Language: C# 16 7 50
joyhsu0504/geoclidean_framework
Language: Jupyter Notebook 12 2 01
ZJUSCT/mirror-issues
Code-unrelated issues for ZJU Mirror
6 8 340
bobwan1995/Weakly-HOI
5 2 10
MIvanovska/TomatoDIFF
Official implementation of the paper "TomatoDIFF: On-plant Tomato Segmentation with Denoising Diffusion Models"
5 3 20
shaucky/Petdoctor
An AIR-based battle animation player for the Flash browser game Seer (赛尔号)
Language: ActionScript 5 1 00
