MhLiao

HUST, Wuhan, China

MhLiao's Stars

langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
Language:Jupyter Notebook96.5k 690 8k15.7k
chatchat-space/Langchain-Chatchat
Langchain-Chatchat (formerly Langchain-ChatGLM): a local knowledge-based RAG and Agent application built with Langchain and language models such as ChatGLM, Qwen, and Llama
Language:TypeScript32.6k 287 4k5.6k
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Language:Python25.5k 219 4672.9k
opendatalab/MinerU
A high-quality, one-stop open-source data extraction tool for converting PDF to Markdown and JSON.
Language:Python21.6k 114 7531.6k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python20.8k 157 1.6k2.3k
meta-llama/codellama
Inference code for CodeLlama models
Language:Python16.1k 187 2071.9k
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
13.2k 256 127839
cvat-ai/cvat
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Language:Python12.8k 186 4.3k3.1k
e2b-dev/awesome-ai-agents
A list of AI autonomous agents
12.3k 223 34915
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Language:Python10.9k 166 8042.4k
magic-research/magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Language:Python10.6k 104 1491.1k
mikel-brostrom/boxmot
BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models
Language:Python6.8k 59 1.1k1.7k
luban-agi/Awesome-AIGC-Tutorials
Curated tutorials and resources for Large Language Models, AI Painting, and more.
3.9k 29 2267
openvinotoolkit/anomalib
An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.
Language:Python3.9k 40 930693
rom1504/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Language:Python3.8k 31 263345
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
Language:Python3.8k 48 176287
OpenGVLab/InternGPT
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)
Language:Python3.2k 44 50232
lyuwenyu/RT-DETR
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
Language:Python2.8k 26 473325
CVCUDA/CV-CUDA
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.
Language:C++2.4k 47 173216
facebookresearch/ConvNeXt-V2
Code release for ConvNeXt V2 model
Language:Python1.6k 7 73120
keyu-tian/SparK
[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"
Language:Python1.5k 27 8886
mlfoundations/MINT-1T
MINT-1T: A one trillion token multimodal interleaved dataset.
783 25 1120
shikras/shikra
Language:Python751 8 6645
debidatta/syndata-generation
Code used to generate synthetic scenes and bounding box annotations for object detection. This was used to generate data used in the Cut, Paste and Learn paper
Language:Python289 7 1872
OpenGVLab/OmniCorpus
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
Language:Python288 12 97
buptlihang/CDLA
CDLA: A Chinese document layout analysis (CDLA) dataset
Language:Python253 3 1131
mindspore-lab/mindocr
A toolbox of ocr models and algorithms based on MindSpore
Language:Python241 14 10756
YanjingLi0202/Q-ViT
The official implementation of the NeurIPS 2022 paper Q-ViT.
Language:Python85 3 156
OpenGVLab/GUI-Odyssey
GUI Odyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUI Odyssey consists of 7,735 episodes from 6 mobile devices, spanning 6 types of cross-app tasks, 201 apps, and 1.4K app combos.
Language:Python76 3 94
yuyq96/TextHawk
Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models
Language:Python55 5 23
