olccihyeon's Stars
joonspk-research/generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods, covering single- and multi-node GPUs. Supports default and custom datasets for applications such as summarization and Q&A, and a number of inference solutions such as HF TGI and vLLM for local or cloud deployment. Includes demo apps showcasing Meta Llama for WhatsApp & Messenger.
stas00/ml-engineering
Machine Learning Engineering Open Book
dabeaz-course/python-mastery
Advanced Python Mastery (course by @dabeaz)
cmhungsteve/Awesome-Transformer-Attention
A comprehensive paper list on Vision Transformers/attention, including papers, code, and related websites
teddylee777/machine-learning
A repository intended to help machine learning beginners and those preparing for study groups.
cvlab-columbia/viper
Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"
likejazz/llama3.np
llama3.np is a pure NumPy implementation of the Llama 3 model.
NUS-HPC-AI-Lab/Neural-Network-Parameter-Diffusion
We introduce a novel approach to parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters.
gabriben/awesome-generative-information-retrieval
xmed-lab/CLIP_Surgery
CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks
ziplab/SN-Net
[CVPR 2023 Highlight] This is the official implementation of "Stitchable Neural Networks".
kongds/E5-V
E5-V: Universal Embeddings with Multimodal Large Language Models
haokunwen/Awesome-Composed-Image-Retrieval
Collection of Composed Image Retrieval (CIR) papers.
navervision/lincir
Official PyTorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)
TIGER-AI-Lab/UniIR
Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)
muzairkhattak/ProText
[AAAI'25, CVPRW 2024] Official repository of paper titled "Learning to Prompt with Text Only Supervision for Vision-Language Models".
umd-huang-lab/perceptionCLIP
Code for our ICLR 2024 paper "PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts"
Code-kunkun/ZS-CIR
[BMVC 2023] Zero-shot Composed Text-Image Retrieval
OpenMatch/UniVL-DR
[ICLR 2023] Code for the paper "Universal Vision-Language Dense Retrieval: Learning A Unified Representation Space for Multi-Modal Retrieval".
facebookresearch/Whac-A-Mole
Code for the paper "A Whac-A-Mole Dilemma: Shortcuts Come in Multiples Where Mitigating One Amplifies Others"
haofanwang/cropimage
A simple toolkit for detecting and cropping the main subject from pictures. Supports face and saliency detection.
lezhang7/Enhance-FineGrained
[CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding
zycheiheihei/Transferable-Visual-Prompting
[CVPR 2024 Highlight] Official implementation of the paper "Exploring the Transferability of Visual Prompting for Multimodal Large Language Models".
JUNJIE99/VISTA_Evaluation_FineTuning
Evaluation code and datasets for the ACL 2024 paper, VISTA: Visualized Text Embedding for Universal Multi-Modal Retrieval. The original code and model can be accessed at FlagEmbedding.
luomancs/ReMuQ
A multimodal retrieval dataset
suoych/KEDs
Implementation of the paper "Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval" (CVPR 2024)
levymsn/LaSCo
Official repository of the LaSCo dataset
tmlabonte/last-layer-retraining
Official codebase for the NeurIPS 2023 paper "Towards Last-layer Retraining for Group Robustness with Fewer Annotations". https://arxiv.org/abs/2309.08534
clause-bielefeld/wikiscenes_descriptions
Dataset of annotated text–image alignments for Wikiscenes (a dataset of multimodal Wikipedia articles on buildings)