Pinned Repositories
opendataval
OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)
beta_shapley
Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral)
dagmm-1
A Pytorch implementation of the paper `Deep Autoencoding Gaussian Mixture Model For Unsupervised Anomaly Detection` by Zong et al.
DataInf
DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)
dataoob
Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value (ICML 2023)
fast_dist_shapley
Efficient Computation and Analysis of Distributional Shapley Values (AISTATS 2021)
UQ_BNN
Uncertainty quantification using Bayesian neural networks in classification (MIDL 2018, CSDA)
variational_autoencoder
variational_autoencoder
wdro_local_perturbation
Principled learning method for Wasserstein distributionally robust optimization with local perturbations (ICML 2020)
WeightedSHAP
WeightedSHAP: analyzing and improving Shapley based feature attributions (NeurIPS 2022)
ykwon0407's Repositories
ykwon0407/WeightedSHAP
WeightedSHAP: analyzing and improving Shapley based feature attributions (NeurIPS 2022)
ykwon0407/DataInf
DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)
ykwon0407/beta_shapley
Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral)
ykwon0407/wdro_local_perturbation
Principled learning method for Wasserstein distributionally robust optimization with local perturbations (ICML 2020)
ykwon0407/fast_dist_shapley
Efficient Computation and Analysis of Distributional Shapley Values (AISTATS 2021)
ykwon0407/dataoob
Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value (ICML 2023)
ykwon0407/data_purchase_in_comp
Competition over data: how does data purchase affect users? (TMLR)
ykwon0407/STAT5206_Fall_2022
Statistical Computing and Introduction to Data Science @ Columbia Stats
ykwon0407/STAT5206_Spring_2023
Statistical Computing and Introduction to Data Science @ Columbia Stats
ykwon0407/ykwon0407.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
ykwon0407/incontext_influences
Official repository for "In-context Example Selection with Influences"
ykwon0407/influence_analysis_papers
Influence Analysis and Estimation - Survey, Papers, and Taxonomy
ykwon0407/Intro_to_ML_Safety
ykwon0407/introduction-to-github
Get started using GitHub in less than an hour.
ykwon0407/leewtai.github.io
Github Pages
ykwon0407/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
ykwon0407/LoRA-MPT
A repo for finetuning MPT using LoRA. It is currently configured to work with the Alpaca dataset from Stanford but can easily be adapted to use another.
ykwon0407/Modality-Gap
Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning
ykwon0407/pythia
The hub for EleutherAI's work on interpretability and learning dynamics
ykwon0407/sanity_checks_saliency
ykwon0407/sktime
A unified framework for machine learning with time series
ykwon0407/STAT5206_Fall_2023
ykwon0407/STAT5206_Fall_2024
STAT5206_Fall_2024
ykwon0407/STAT5206_Summer_2023
Statistical Computing and Introduction to Data Science @ Columbia Stats
ykwon0407/STAT5206_Summer_2024
Statistical Computing and Introduction to Data Science @ Columbia Stats
ykwon0407/TART
TART: A plug-and-play Transformer module for task-agnostic reasoning
ykwon0407/the-algorithm
Source code for Twitter's Recommendation Algorithm
ykwon0407/the-pile
ykwon0407/tree_influence
Influence Estimation for Gradient-Boosted Decision Trees
ykwon0407/won-j.github.io
my home page