jin-s13

University of Hong Kong, Hong Kong

jin-s13's Stars

voxel51/fiftyone
Refine high-quality datasets and visual AI models
Language: Python · 8.9k stars · 567 forks
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Language: Shell · 10.1k stars · 628 forks
CVHub520/X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
Language: Python · 4.2k stars · 485 forks
chongzhou96/EdgeSAM
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"
Language:Jupyter Notebook94642
wkentaro/labelme
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
Language: Python · 13.6k stars · 3.4k forks
siyuanliii/masa
Official implementation of the CVPR24 highlight paper: Matching Anything by Segmenting Anything
Language: Python · 1k stars · 66 forks
InternLM/MindSearch
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
Language: JavaScript · 5.2k stars · 531 forks
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language: Jupyter Notebook · 12.7k stars · 1.2k forks
InternLM/lagent
A lightweight framework for building LLM-based agents
Language: Python · 1.9k stars · 199 forks
facebookresearch/MobileLLM
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
Language: Python · 1.2k stars · 65 forks
LazyAGI/LazyLLM
Easiest and laziest way to build multi-agent LLM applications.
Language: Python · 1k stars · 68 forks
jin-s13/UniFS
Language:Python3
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Language: Python · 58.2k stars · 6.2k forks
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Language: Python · 24k stars · 2.4k forks
jin-s13/GKGNet
ECCV'2024 "GKGNet: Group K-Nearest Neighbor based Graph Convolutional Network for Multi-Label Image Recognition"
Language:Python81
lishuhuai527/COCO-UniHuman
Language:Python134
facebookresearch/eft
visualization code for 3D human body annotation by EFT (Exemplar Fine-tuning)
Language:Python37634
BubblyYi/MMPedestron
[ECCV2024] Official implementation of the paper "When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset"
Language:Python483
jin-s13/MMPD-Dataset
MMPD Dataset from ECCV'2024 "When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset"
121
kennethwdk/LocLLM
Code for "LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language Model", CVPR 2024 Highlight
Language:Python313
Ber666/ToolkenGPT
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)
Language:Python23621
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
Language: TypeScript · 53.1k stars · 7.7k forks
IDEA-Research/X-Pose
[ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"
Language:Python52925
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language: Python · 27.3k stars · 3.1k forks
InternLM/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Language: Python · 4k stars · 314 forks
jy0205/LaVIT
LaVIT: Empower the Large Language Model to Understand and Generate Visual Content
Language:Jupyter Notebook54229
EvolvingLMMs-Lab/lmms-eval
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
Language: Python · 2.1k stars · 162 forks
open-compass/VLMEvalKit
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting 160+ VLMs and 50+ benchmarks
Language: Python · 1.4k stars · 195 forks
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language: Python · 14.4k stars · 1.4k forks
anishmadan23/foundational_fsod
This repository contains the implementation for the paper "Revisiting Few Shot Object Detection with Vision-Language Models"
Language:Python231
