dddraxxx
Dong, Qihua. Interested in discovering intelligence in M-LLM and building general AI!
Northeastern University, SmileLab, Boston
dddraxxx's Stars
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by the Qwen team at Alibaba Cloud.
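A minimal sketch of running a Qwen2.5 instruct model through Hugging Face transformers; the model id and generation settings below are illustrative choices, not taken from the repo.

```python
# Minimal sketch: loading a Qwen2.5 instruct model with Hugging Face transformers.
# The model id and generation settings are illustrative assumptions.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen2.5-7B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Summarize latent diffusion in one sentence."},
]
# Build the chat prompt with the model's chat template, then generate a reply.
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
output_ids = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(output_ids[0][inputs.shape[-1]:], skip_special_tokens=True))
```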
ShiArthur03/ShiArthur03
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
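One of the most used building blocks is the memory-efficient attention op; a small sketch of calling it as a drop-in for scaled dot-product attention (shapes and dtypes are illustrative, and a CUDA build of xformers is assumed).

```python
# Sketch of xformers' memory-efficient attention; tensors are [batch, seq, heads, head_dim].
import torch
import xformers.ops as xops

q = torch.randn(2, 1024, 8, 64, device="cuda", dtype=torch.float16)
k = torch.randn(2, 1024, 8, 64, device="cuda", dtype=torch.float16)
v = torch.randn(2, 1024, 8, 64, device="cuda", dtype=torch.float16)

# Equivalent to softmax(q @ k^T / sqrt(d)) @ v, computed without materializing
# the full attention matrix.
out = xops.memory_efficient_attention(q, k, v)
print(out.shape)  # torch.Size([2, 1024, 8, 64])
```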
InternLM/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by the Qwen team at Alibaba Cloud.
poloclub/transformer-explainer
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
LLaVA-VL/LLaVA-NeXT
open-compass/VLMEvalKit
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting 160+ VLMs and 50+ benchmarks.
lxtGH/OMG-Seg
OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]
pytorch/data
A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.
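One utility from this repo is a stateful DataLoader whose iteration progress can be checkpointed and resumed mid-epoch; a minimal sketch, with the class name and behavior assumed from the torchdata documentation.

```python
# Sketch of torchdata's StatefulDataLoader, a DataLoader drop-in whose iteration
# state can be saved and restored (API assumed from the torchdata docs).
import torch
from torchdata.stateful_dataloader import StatefulDataLoader

dataset = torch.arange(100)
loader = StatefulDataLoader(dataset, batch_size=10, num_workers=0)

it = iter(loader)
next(it); next(it)            # consume two batches
state = loader.state_dict()   # snapshot mid-epoch progress

# A fresh loader resumes from the saved position instead of restarting the epoch.
resumed = StatefulDataLoader(dataset, batch_size=10, num_workers=0)
resumed.load_state_dict(state)
print(next(iter(resumed)))    # expected: the third batch, tensor([20, ..., 29])
```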
cheahjs/free-llm-api-resources
A list of free LLM inference resources accessible via API.
waspinator/pycococreator
Helper functions to create COCO datasets
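pycococreator assembles annotations into the standard COCO JSON layout; a hand-rolled sketch of that target structure (the structure is the well-known COCO format, not the library's own helper API, and all values are toy placeholders).

```python
# Hand-rolled sketch of the COCO annotation layout that pycococreator helps build.
import json

coco = {
    "info": {"description": "toy dataset"},
    "categories": [{"id": 1, "name": "person", "supercategory": "object"}],
    "images": [
        {"id": 1, "file_name": "0001.jpg", "width": 640, "height": 480},
    ],
    "annotations": [
        {
            "id": 1,
            "image_id": 1,
            "category_id": 1,
            "bbox": [100, 120, 50, 80],    # [x, y, width, height]
            "area": 50 * 80,
            "segmentation": [[100, 120, 150, 120, 150, 200, 100, 200]],  # polygon
            "iscrowd": 0,
        },
    ],
}

with open("instances_train.json", "w") as f:
    json.dump(coco, f)
```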
NVlabs/GroupViT
Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.
iamhyc/Overleaf-Workshop
Open Overleaf/ShareLaTex projects in vscode, with full collaboration support.
conda/conda-pack
Package conda environments for redistribution
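Besides the CLI, conda-pack exposes a small Python API; a sketch, assuming the `pack()` entry point with `name`/`output` keyword arguments as described in its docs.

```python
# Minimal sketch of packing an environment with conda-pack's Python API
# (the name/output keyword arguments are assumed from the conda-pack docs).
import conda_pack

# Archive the environment "myenv" into a relocatable tarball; on the target
# machine it is unpacked with tar and activated via its bundled scripts.
conda_pack.pack(name="myenv", output="myenv.tar.gz")
```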
lvis-dataset/lvis-api
Python API for LVIS Dataset
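The LVIS API mirrors the pycocotools COCO API; a sketch of loading and querying annotations, with method names assumed from that COCO-style convention.

```python
# Sketch of querying LVIS annotations; the LVIS class mirrors the pycocotools
# COCO API (method names assumed from that convention).
from lvis import LVIS

lvis = LVIS("lvis_v1_val.json")                    # path to an LVIS annotation file

img_ids = lvis.get_img_ids()
ann_ids = lvis.get_ann_ids(img_ids=img_ids[:1])    # annotations of the first image
anns = lvis.load_anns(ann_ids)

for ann in anns:
    cat = lvis.load_cats([ann["category_id"]])[0]
    print(cat["name"], ann["bbox"])                # category name and [x, y, w, h] box
```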
tsb0601/MMVP
lil-lab/nlvr
Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with the truth-value of the sentence.
ZrrSkywalker/MathVerse
[ECCV 2024] Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?
sail-sg/ptp
[CVPR 2023] Code for "Position-guided Text Prompt for Vision-Language Pre-training"
ByungKwanLee/Full-Segment-Anything
A PyTorch implementation that adds new features to Segment Anything: batched input for the full-grid prompt (automatic mask generation), post-processing that removes duplicated or small regions and holes, and support for flexible input image sizes.
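For reference, stock Segment Anything already exposes the full-grid automatic mask generator that this fork extends with batching; a sketch using the upstream API (checkpoint path and image file are illustrative).

```python
# Sketch of the stock Segment Anything automatic mask generator that this fork
# extends with batched full-grid prompting; checkpoint path is illustrative.
import cv2
from segment_anything import SamAutomaticMaskGenerator, sam_model_registry

sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth").to("cuda")
mask_generator = SamAutomaticMaskGenerator(sam)

image = cv2.cvtColor(cv2.imread("example.jpg"), cv2.COLOR_BGR2RGB)
masks = mask_generator.generate(image)   # list of dicts: segmentation, bbox, area, ...
print(len(masks), masks[0]["bbox"])
```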
stevewongv/SSIS
Instance Shadow Detection with A Single-Stage Detector [SSIS & SSISv2] (CVPR 2021 Oral & TPAMI 2022)
scenarios/WeMM
PhyscalX/gradio-image-prompter
Image Prompter for Gradio
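A sketch of wiring this component into a Gradio app; the `ImagePrompter` class name and the `image`/`points` output keys are assumed from the repo's README.

```python
# Minimal sketch of using the ImagePrompter component in a Gradio interface
# (component name and output keys assumed from the repo README).
import gradio as gr
from gradio_image_prompter import ImagePrompter

def show(prompts):
    # prompts is expected to carry the uploaded image and the clicked point/box prompts
    return prompts["image"], prompts["points"]

demo = gr.Interface(
    fn=show,
    inputs=ImagePrompter(show_label=False),
    outputs=[gr.Image(show_label=False), gr.Dataframe(label="Points")],
)
demo.launch()
```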
harrytea/Detect-AnyShadow
Official PyTorch implementation for TCSVT 23 "Detect Any Shadow: Segment Anything for Video Shadow Detection"
cmprmsd/Overleaf-Image-Helper
Adds functionality to paste screenshots from your clipboard into Overleaf, both cloud and on-premise.
filipbasara0/simple-clip
A minimal, but effective implementation of CLIP (Contrastive Language-Image Pretraining) in PyTorch
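The core of any CLIP-style model is a symmetric contrastive (InfoNCE) loss over paired image/text embeddings; a generic sketch of that loss, not necessarily this repo's exact code.

```python
# Generic sketch of CLIP's symmetric contrastive (InfoNCE) loss over a batch of
# paired image/text embeddings; not necessarily this repo's exact implementation.
import torch
import torch.nn.functional as F

def clip_loss(image_emb, text_emb, logit_scale):
    # L2-normalize both modalities so the dot product is a cosine similarity.
    image_emb = F.normalize(image_emb, dim=-1)
    text_emb = F.normalize(text_emb, dim=-1)

    # Pairwise similarity matrix, scaled by the learned temperature.
    logits = logit_scale * image_emb @ text_emb.t()

    # Matching pairs lie on the diagonal; contrast in both directions.
    targets = torch.arange(logits.size(0), device=logits.device)
    loss_i2t = F.cross_entropy(logits, targets)
    loss_t2i = F.cross_entropy(logits.t(), targets)
    return (loss_i2t + loss_t2i) / 2

# Toy usage: batch of 8 paired 512-d embeddings with a fixed logit scale.
loss = clip_loss(torch.randn(8, 512), torch.randn(8, 512), logit_scale=torch.tensor(100.0))
print(loss.item())
```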
JierunChen/Ref-L4
Evaluation code for Ref-L4, a new REC benchmark in the LMM era
liujunzhuo/FineCops-Ref
Official repo for "FineCops-Ref: A new Dataset and Task for Fine-Grained Compositional Referring Expression Comprehension." EMNLP 2024