jhCOR

interested in AI, Web

jhCOR's Stars

X-PLUG/mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
Language:Python2.2k170
findalexli/mllm-dpo
[ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model
Language:Jupyter Notebook18
mzbac/llama2-fine-tune
Scripts for fine-tuning Llama2 via SFT and DPO.
Language:Python17836
opendatalab/HA-DPO
Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization
Language:Python584
nyunAI/Faster-LLM-Survey
Language:Python407
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Language:Python25.3k2.9k
mytechnotalent/Gemini
Google Gemini AI model w/speech recognition and voice.
Language:Python173
tsb0601/MMVP
Language:Python2777
GuyTevet/diversity-eval
Official Github repo for the paper "Evaluating the Evaluation of Diversity in Natural Language Generation"
Language:Python192
HadiZayer/eyenerf
Language:Python204
palchenli/VL-Instruction-Tuning
833
geuk-hub/-Dacon-Multimodal-vqa
Language:Jupyter Notebook2
LLaVA-VL/LLaVA-NeXT
Language:Python2.4k164
chuangchuangtan/LLaVA-NeXT-Image-Llama3-Lora
LLaVA-NeXT-Image-Llama3-Lora, Modified from https://github.com/arielnlee/LLaVA-1.6-ft
Language:Python363
teddysum/Korean_DCS_2024
Language:Python41
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
Language:Jupyter Notebook8.8k762
mlfoundations/open_clip
An open source implementation of CLIP.
Language:Python9.8k951
RevenueCat/purchases-android
Android in-app purchases and subscriptions made easy.
Language:Kotlin24750
google/oboe
Oboe is a C++ library that makes it easy to build high-performance audio apps on Android.
Language:C++3.7k559
ThuCCSLab/FigStep
Jailbreaking Large Vision-language Models via Typographic Visual Prompts
Language:Python755
Unispac/Visual-Adversarial-Examples-Jailbreak-Large-Language-Models
Repository for the Paper (AAAI 2024, Oral) --- Visual Adversarial Examples Jailbreak Large Language Models
Language:Python15612
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language:Jupyter Notebook9.6k941
AttentionX/InstructBLIP_PEFT
Language:Jupyter Notebook274
lioryariv/idr
Language:Python69585
bluer555/CR-GAN
Yu Tian et al. "CR-GAN: Learning Complete Representations for Multi-view Generation", IJCAI 2018
Language:Python12228
HRI-UESTC/CFM-HRI-RGB-D-action-database
UESTC RGB-D Varying-view action database. This multi-view action database is captured by Kinect v2.0 with modality of RGB video, 3D skeleton sequences and depth map sequences.
Language:Python4811
Totoro97/NeuS
Code release for NeuS
Language:Python1.6k210
googlearchive/android-Camera2Raw
Migrated:
Language:Java388184
junyangwang0410/AMBER
An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation
Language:Python852
za-cheng/WildLight
official implementation of our CVPR 2023 paper "In-the-wild Inverse Rendering with a Flashlight"
Language:Python734

jhCOR

jhCOR's Stars

X-PLUG/mPLUG-Owl

findalexli/mllm-dpo

mzbac/llama2-fine-tune

opendatalab/HA-DPO

nyunAI/Faster-LLM-Survey

Vision-CAIR/MiniGPT-4

mytechnotalent/Gemini

tsb0601/MMVP

GuyTevet/diversity-eval

HadiZayer/eyenerf

palchenli/VL-Instruction-Tuning

geuk-hub/-Dacon-Multimodal-vqa

LLaVA-VL/LLaVA-NeXT

chuangchuangtan/LLaVA-NeXT-Image-Llama3-Lora

teddysum/Korean_DCS_2024

facebookresearch/dinov2

mlfoundations/open_clip

RevenueCat/purchases-android

google/oboe

ThuCCSLab/FigStep

Unispac/Visual-Adversarial-Examples-Jailbreak-Large-Language-Models

salesforce/LAVIS

AttentionX/InstructBLIP_PEFT

lioryariv/idr

bluer555/CR-GAN

HRI-UESTC/CFM-HRI-RGB-D-action-database

Totoro97/NeuS

googlearchive/android-Camera2Raw

junyangwang0410/AMBER

za-cheng/WildLight