LiQiiiii's Stars
rom1504/img2dataset
Easily turn large sets of image URLs into an image dataset. Can download, resize and package 100M URLs in 20h on one machine.
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
mlfoundations/dclm
DataComp for Language Models
daochenzha/data-centric-AI
A curated, but incomplete, list of data-centric AI resources.
magpie-align/magpie
Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!
sangmichaelxie/doremi
PyTorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
WisconsinAIVision/ViP-LLaVA
[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
gstoica27/ZipIt
A framework for merging models solving different tasks with different initializations into one multi-task model without any additional training
FreedomIntelligence/ALLaVA
Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model
YJiangcm/Lion
Code for "Lion: Adversarial Distillation of Proprietary Large Language Models (EMNLP 2023)"
doujiang-zheng/Graph-Learning-Reading-List
Advances on machine learning of graphs, covering the reading list of recent top academic conferences.
JUNJIE99/MLVU
🔥🔥MLVU: Multi-task Long Video Understanding Benchmark
cure-lab/MMA-Diffusion
[CVPR2024] MMA-Diffusion: MultiModal Attack on Diffusion Models
BAAI-DCAI/SpatialBot
The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models."
yu-rp/KANbeFair
A More Fair and Comprehensive Comparison between KAN and MLP
IBM/ai-privacy-toolkit
A toolkit for tools and techniques related to the privacy and compliance of AI models.
yihedeng9/STIC
Enhancing Large Vision Language Models with Self-Training on Image Comprehension.
Adamdad/vico
Vico: Compositional Video Generation as Flow Equalization
OPTML-Group/Diffusion-MU-Attack
The official implementation of ECCV'24 paper "To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images ... For Now". This work introduces one fast and effective attack method to evaluate the harmful-content generation ability of safety-driven unlearned diffusion models.
BAAI-DCAI/Multimodal-Robustness-Benchmark
YiyangZhou/CSR
[NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models
ruchtem/cosmos
This is the official implementation for COSMOS: a method to learn Pareto fronts that scales to large datasets and deep models.
nik-dim/tall_masks
Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]
Bolin97/awesome-instruction-selector
Paper list and datasets for the paper: A Survey on Data Selection for LLM Instruction Tuning
aengusl/latent-adversarial-training
messense/fasttext-wheel
Build and upload fastText Python wheels to PyPI
NUS-HPC-AI-Lab/InfoGrowth
Efficient and Online Dataset Growth Algorithm (with cleanness and diversity awareness) to deal with growing web data
Egg-Hu/BiDf-MKD
Official PyTorch Implementation for "Learning to Learn from APIs: Black-Box Data-Free Meta-Learning" (ICML-2023)
LiQiiiii/Encapsulating-Knowledge-In-One-Prompt
[ECCV2024] The official implementation of the paper "Encapsulating Knowledge in One Prompt"
YuYang0901/CREST
Towards Sustainable Learning: Coresets for Data-efficient Deep Learning (ICML 2023)