yehengchen's Stars
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
chenfei-wu/TaskMatrix
VikParuchuri/marker
Convert PDF to markdown + JSON quickly with high accuracy
QwenLM/Qwen
The official repo of the Qwen (通义千问) chat and pretrained large language models proposed by Alibaba Cloud.
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Mooler0410/LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
facebookresearch/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
geekyutao/Inpaint-Anything
Inpaint anything using Segment Anything and inpainting models.
THUDM/CogVLM
A state-of-the-art open visual language model | multimodal pretrained model
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMA2, Qwen, GLM, Claude, etc.) over 100+ datasets.
OpenBMB/ToolBench
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.
landing-ai/vision-agent
Vision agent
modelscope/modelscope-agent
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
ttengwang/Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
textstat/textstat
:memo: Python package to calculate readability statistics of a text object - paragraphs, sentences, articles.
jf-tech/omniparser
omniparser: a native Golang ETL streaming parser and transform library for CSV, JSON, XML, EDI, text, etc.
mbzuai-oryx/GeoChat
[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing
ViTAE-Transformer/Remote-Sensing-RVSA
The official repo for [TGRS'22] "Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model"
SalesforceAIResearch/xLAM
ChenDelong1999/RemoteCLIP
🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)
orfeotoolbox/OTB
GitHub mirror of https://gitlab.orfeo-toolbox.org/orfeotoolbox/otb
hustvl/Senna
Bridging Large Vision-Language Models and End-to-End Autonomous Driving
HaonanGuo/Remote-Sensing-ChatGPT
Chat with RS-ChatGPT to get remote sensing interpretation results and responses!
ZhanYang-nwpu/Awesome-Remote-Sensing-Multimodal-Large-Language-Model
Multimodal Large Language Models for Remote Sensing (RS-MLLMs): A Survey
nv-nguyen/gigapose
[CVPR 2024] PyTorch implementation of GigaPose: Fast and Robust Novel Object Pose Estimation via One Correspondence
NJU-LHRS/LHRS-Bot
VGI-Enhanced multimodal large language model for remote sensing images.
Chen-Yang-Liu/Change-Agent
Official PyTorch implementation of "Change-Agent: Toward Interactive Comprehensive Remote Sensing Change Interpretation and Analysis"
ermongroup/TEOChat
Official code for TEOChat, the first vision-language assistant for temporal earth observation data (ICLR 2025).
LinWeizheDragon/FLMR
The Hugging Face implementation of the Fine-grained Late-interaction Multi-modal Retriever.