DemoGit4LIANG's Stars
open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
liguodongiot/llm-action
This project aims to share the technical principles behind large models along with hands-on experience.
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
apple/ml-ferret
facebookresearch/ImageBind
ImageBind: One Embedding Space to Bind Them All
wenda-LLM/wenda
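Wenda: an LLM invocation platform. It targets efficient content generation for specific environments while accounting for the limited computing resources of individuals and small and medium-sized enterprises, as well as knowledge security and privacy concerns.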
yangjianxin1/Firefly
Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models.
vikhyat/moondream
tiny vision language model
InternLM/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
towhee-io/towhee
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
LLaVA-VL/LLaVA-NeXT
InternLM/InternLM-XComposer
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
microsoft/LLaVA-Med
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
liuhuanyong/CrimeKgAssitant
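Crime assistant offering crime type prediction and a crime consultation service based on NLP methods and a crime knowledge graph. The project includes a knowledge graph covering 856 crime types, crime prediction trained on a corpus of 2.8 million entries, and 13-category question classification plus legal question answering based on 200,000 legal Q&A pairs.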
yechens/NL2SQL
A consolidated collection of Text2SQL semantic parsing datasets, solutions, and paper resources.
mbzuai-oryx/LLaVA-pp
🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)
magic-research/PLLaVA
Official repository for the paper PLLaVA
zhuyiche/llava-phi
LinkSoul-AI/Chinese-LLaVA
An open-source, commercially usable multimodal model that supports bilingual Chinese-English vision-text dialogue.
showlab/UniVTG
[ICCV2023] UniVTG: Towards Unified Video-Language Temporal Grounding
RupertLuo/Valley
The official repository of "Video assistant towards large language model makes everything easy"
davidnvq/grit
GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)
jefferyZhan/Griffon
[ECCV2024] The official repo of the Griffon series
DemoGit4LIANG/Chat2Anything
An LLM-based tool to chat with your documents and databases, including a management system | An LLM knowledge-base question-answering system for internal enterprise environments, including a back-end management system
TRI-ML/vlm-evaluation
VLM Evaluation: Benchmark for VLMs, spanning text generation tasks from VQA to Captioning
zyayoung/Awesome-Video-LLMs
Explore VLM-Eval, a framework for evaluating Video Large Language Models, enhancing your video analysis with cutting-edge AI technology.
sanbuphy/llm-vision-datasets
Collection of image and video datasets for generative AI and multimodal visual AI
DemoGit4LIANG/AdaMOT
(IEEE Transactions on Image Processing, 2022) "A closer look at the joint training of object detection and Re-identification in multi-object tracking"