Ji-eun-Kim's Stars
google-research/composed_image_retrieval
Walter0807/MotionBERT
[ICCV 2023] PyTorch Implementation of "MotionBERT: A Unified Perspective on Learning Human Motion Representations"
cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
mkshing/DiffFit-pytorch
Implementation of "DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning"
Code-kunkun/ZS-CIR
[BMVC 2023] Zero-shot Composed Text-Image Retrieval
TonyLianLong/LLM-groundedDiffusion
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusion: LMD, TMLR 2024)
guoyww/AnimateDiff
Official implementation of AnimateDiff.
xUhEngwAng/I2V-Adapter-Unofficial
Unofficial implementation of the paper I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models.
CyberAgentAILab/layout-dm
LayoutDM: Discrete Diffusion Model for Controllable Layout Generation [Inoue+, CVPR2023]
omer11a/bounded-attention
Yutong-Zhou-cv/Awesome-Multimodality
A Survey on multimodal learning research.
Wu-Zongyu/Medical-VLM-Paper-List
A paper list about medical vision-language models.
jianzhnie/awesome-text-to-video
A Survey on Text-to-Video Generation/Synthesis.
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles: Latest Advances on Multimodal Large Language Models
hymie122/RAG-Survey
A collection of awesome papers on RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in the paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
THU-MIG/yolov10
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
Wu-Zongyu/LLM-and-Multimodal-Paper-List
A paper list about large language models and multimodal models (Diffusion, VLM). From foundations to applications. It is only used to record papers for my personal needs.
snunlp/KR-SBERT
KoRean based SBERT pre-trained models (KR-SBERT) for PyTorch
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Yutong-Zhou-cv/Awesome-Text-to-Image
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
jihwan21/CodingTest_Study
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
FranxYao/chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
AILab-CVC/TaleCrafter
[SIGGRAPH Asia 2023] An interactive story visualization tool that supports multiple characters
jbhuang0604/awesome-computer-vision
A curated list of awesome computer vision resources
StonyBrookNLP/ircot
Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23
terryum/awesome-deep-learning-papers
The most cited deep learning papers
prometheus-eval/prometheus
[ICLR 2024 & NeurIPS 2023 WS] An open-source evaluator LM that offers reproducible evaluation and is inexpensive to use. Specifically designed for fine-grained evaluation on a customized score rubric, Prometheus is a good alternative to human evaluation and GPT-4 evaluation.
camenduru/text-to-video-synthesis-colab
Text To Video Synthesis Colab