ykwon0407's Stars

chenfei-wu/TaskMatrix
Language: Python | 34.6k 301 3553.3k
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language: Python | 29.6k 342 2684.1k
AI4Finance-Foundation/FinGPT
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
Language: Jupyter Notebook | 14.1k 257 1052k
faridrashidi/kaggle-solutions
🏅 Collection of Kaggle Solutions and Ideas 🏅
Language: HTML | 5k 87 81.9k
google-research/timesfm
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
Language: Python | 3.8k 38 119328
Luodian/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Language: Python | 3.6k 100 163243
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
2.7k 50 3171
Nixtla/nixtla
TimeGPT-1: production ready pre-trained Time Series Foundation Model for forecasting and anomaly detection. Generative pretrained transformer for time series trained on over 100B data points. It's capable of accurately predicting various domains such as retail, electricity, finance, and IoT with just a few lines of code 🚀.
Language: Jupyter Notebook | 2.4k 36 165192
EleutherAI/pythia
The hub for EleutherAI's work on interpretability and learning dynamics
Language: Jupyter Notebook | 2.3k 33 109173
EleutherAI/the-pile
Language: Python | 1.5k 30 100128
Weixin-Liang/LLM-scientific-feedback
Can large language models provide useful feedback on research papers? A large-scale empirical analysis.
Language: Python | 499 4 2648
lm-sys/llm-decontaminator
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
Language: Python | 294 3 623
HazyResearch/TART
TART: A plug-and-play Transformer module for task-agnostic reasoning
Language: Python | 190 20 215
opendataval/opendataval
OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)
Language: Python | 88 2 188
alstonlo/torch-influence
A simple PyTorch implementation of influence functions.
Language: Python | 80 3 1111
centerforaisafety/Intro_to_ML_Safety
65 4 019
ykwon0407/DataInf
DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)
Language: Jupyter Notebook | 55 1 47
wagner-d/TimeSeAD
Language: Python | 44 2 78
reds-lab/LAVA
This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).
Language: Python | 43 0 28
samuel-yeom/ml-privacy-csf18
Code for the CSF 2018 paper "Privacy Risk in Machine Learning: Analyzing the Connection to Overfitting"
Language: Python | 38 5 35
Rowan1224/FakeNews
Language: Python | 34 5 213
uvanlp/valda
A Python Data Valuation Package
Language: Python | 28 2 14
jjbrophy47/tree_influence
Influence Estimation for Gradient-Boosted Decision Trees
Language: Python | 25 4 410
mikeybellissimo/LoRA-MPT
A repo for finetuning MPT using LoRA. It is currently configured to work with the Alpaca dataset from Stanford but can easily be adapted to use another.
Language: Jupyter Notebook | 18 1 97
BrachioLab/incontext_influences
In-context Example Selection with Influences
Language: Python | 14 2 11
yuhui-zh15/NeQA
Official Code Release for "Beyond Positive Scaling: How Negation Impacts Scaling Trends of Language Models" (ACL 2023 Findings)
Language: Jupyter Notebook | 9 1 00
yzhang511/TimeInf
Time series data contribution via influence functions
Language: Python | 8 3 00
linamy85/arxiv-crawler
Arxiv crawler (only abstract)
Language: Python | 4 2 04
reds-lab/2d-shapley
This is an official repository for "2D-Shapley: A Framework for Fragmented Data Valuation" (ICML2023).
Language: Jupyter Notebook | 4 0 11
opendataval/opendataval.github.io
opendataval documentation
Language: HTML | 3 1 01