Pinned Repositories
backward_baselines
Code for "Is your model predicting the past?"
benchbench
BenchBench is a Python package to evaluate multi-task benchmarks.
causal-features
Code to reproduce the paper "Predictors from causal features do not generalize better to new domains"
error-parity
Achieve error-rate fairness between societal groups for any score-based classifier.
folktables
Datasets derived from US census data
surveying-language-models
Code to reproduce the paper "Questioning the Survey Responses of Large Language Models"
tttlm
Test-time training on nearest neighbors for large language models
whynot
A Python sandbox for decision making in dynamics
Social Foundations of Computation's Repositories
socialfoundations/whynot
A Python sandbox for decision making in dynamics
socialfoundations/folktables
Datasets derived from US census data
socialfoundations/tttlm
Test-time training on nearest neighbors for large language models
socialfoundations/error-parity
Achieve error-rate fairness between societal groups for any score-based classifier.
socialfoundations/benchbench
BenchBench is a Python package to evaluate multi-task benchmarks.
socialfoundations/surveying-language-models
Code to reproduce the paper "Questioning the Survey Responses of Large Language Models"
socialfoundations/causal-features
Code to reproduce the paper "Predictors from causal features do not generalize better to new domains"
socialfoundations/backward_baselines
Code for "Is your model predicting the past?"