priancho
Working in Multi-modal Understanding team at KakaoBrain. Interested in NLP and CV.
KakaoBrainSouth Korea
priancho's Stars
mlfoundations/open_clip
An open source implementation of CLIP.
huggingface/trl
Train transformer language models with reinforcement learning.
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
deep-floyd/IF
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
tyxsspa/AnyText
Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>
yizhongw/self-instruct
Aligning pretrained language models with instruction data generated by themselves.
kpu/kenlm
KenLM: Faster and Smaller Language Model Queries
EvolvingLMMs-Lab/lmms-eval
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
refuel-ai/autolabel
Label, clean and enrich text datasets with LLMs.
facebookresearch/multimodal
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
facebookresearch/cc_net
Tools to download and cleanup Common Crawl data
mlmed/torchxrayvision
TorchXRayVision: A library of chest X-ray datasets and models. Classifiers, segmentation, and autoencoders.
facebookresearch/fairseq2
FAIR Sequence Modeling Toolkit 2
tianyi-lab/Reflection_Tuning
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
mertyg/vision-language-models-are-bows
Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023
hendrycks/imagenet-r
ImageNet-R(endition) and DeepAugment (ICCV 2021)
facebookresearch/Shepherd
This is the repo for the paper Shepherd -- A Critic for Language Model Generation
Libr-AI/do-not-answer
Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMs
google-research/composed_image_retrieval
allenai/peS2o
Pretraining Efficiently on S2ORC!
cohere-ai/magikarp
Code for the paper "Fishing for Magikarp"
applicaai/lambert
Publicly released code for the LAMBERT model
ryanwebster90/snip-dedup
navervision/KELIP
Official PyTorch implementation of "Large-scale Bilingual Language-Image Contrastive Learning" (ICLRW 2022)
yk/litter
OpenGVLab/MMIU
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
naver-ai/tablevqabench
HY-UDBMS/UniBench
Towards Benchmarking Multi-Model DBMS
segmed/openjpeg
Port of OpenJPEG, an open-source JPEG2000 codec, to Golang