ykwon0407's Stars
chenfei-wu/TaskMatrix
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
AI4Finance-Foundation/FinGPT
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
faridrashidi/kaggle-solutions
🏅 Collection of Kaggle Solutions and Ideas 🏅
google-research/timesfm
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
Luodian/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
Nixtla/nixtla
TimeGPT-1: production ready pre-trained Time Series Foundation Model for forecasting and anomaly detection. Generative pretrained transformer for time series trained on over 100B data points. It's capable of accurately predicting various domains such as retail, electricity, finance, and IoT with just a few lines of code 🚀.
EleutherAI/pythia
The hub for EleutherAI's work on interpretability and learning dynamics
EleutherAI/the-pile
Weixin-Liang/LLM-scientific-feedback
Can large language models provide useful feedback on research papers? A large-scale empirical analysis.
lm-sys/llm-decontaminator
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
HazyResearch/TART
TART: A plug-and-play Transformer module for task-agnostic reasoning
opendataval/opendataval
OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)
alstonlo/torch-influence
A simple PyTorch implementation of influence functions.
centerforaisafety/Intro_to_ML_Safety
ykwon0407/DataInf
DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)
wagner-d/TimeSeAD
reds-lab/LAVA
This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).
samuel-yeom/ml-privacy-csf18
Code for the CSF 2018 paper "Privacy Risk in Machine Learning: Analyzing the Connection to Overfitting"
Rowan1224/FakeNews
uvanlp/valda
A Python Data Valuation Package
jjbrophy47/tree_influence
Influence Estimation for Gradient-Boosted Decision Trees
mikeybellissimo/LoRA-MPT
A repo for finetuning MPT using LoRA. It is currently configured to work with the Alpaca dataset from Stanford but can easily be adapted to use another.
BrachioLab/incontext_influences
In-context Example Selection with Influences
yuhui-zh15/NeQA
Official Code Release for "Beyond Positive Scaling: How Negation Impacts Scaling Trends of Language Models" (ACL 2023 Findings)
yzhang511/TimeInf
Time series data contribution via influence functions
linamy85/arxiv-crawler
Arxiv crawler (only abstract)
reds-lab/2d-shapley
This is an official repository for "2D-Shapley: A Framework for Fragmented Data Valuation" (ICML2023).
opendataval/opendataval.github.io
opendataval documentation