JCruan519's Stars
CosmosShadow/gptpdf
Using GPT to parse PDF
vlf-silkie/VLFeedback
princeton-nlp/SimPO
SimPO: Simple Preference Optimization with a Reference-Free Reward
datawhalechina/self-llm
《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合**宝宝的部署教程
baaivision/EVE
EVE: Encoder-Free Vision-Language Models from BAAI
Spico197/MoE-SFT
🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts
zhourax/VEGA
lartpang/OVCamo
(ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation
deepglint/RWKV-CLIP
The official code of "RWKV-CLIP: A Robust Vision-Language Representation Learner"
TideDra/VL-RLHF
A RLHF Infrastructure for Vision-Language Models
CUHK-AIM-Group/U-KAN
[ArXiv' 24] U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation
adarobustness/corruption
The code for generating natural distribution shifts on image and text datasets.
yfzhang114/LLaVA-Align
This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strategy.
opendatalab/VIGC
AAAI 2024: Visual Instruction Generation and Correction
amirhossein-kz/Awesome-Diffusion-Models-in-Medical-Imaging
Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)
mobaidoctor/polyp-ddpm
Polyp dataset generation
IAAR-Shanghai/CRUD_RAG
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models
King-HAW/GMS
Official repository of Generative Medical Segmentation
open-compass/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 50+ HF models, 20+ benchmarks
OpenBMB/MiniCPM-V
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
QuivrHQ/quivr
Open-source RAG Framework for building GenAI Second Brains 🧠 Build productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users ! Efficient retrieval augmented generation framework
KindXiaoming/pykan
Kolmogorov Arnold Networks
magic-research/PLLaVA
Official repository for the paper PLLaVA
hammoudhasan/SynthCLIP
Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.
NaiboWang/EasySpider
A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。
datadreamer-dev/DataDreamer
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
GopeedLab/gopeed
A modern download manager that supports all platforms. Built with Golang and Flutter.
OpenBMB/VisCPM
[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列
KMnO4-zx/TinyRAG
TinyRAG