Scripts used in support of this post: "Mortgages Are About Math: Open-Source Loan-Level Analysis of Fannie and Freddie"
- Make sure you have PostgreSQL installed locally. If you want to use R, install it too
- Download data from Fannie Mae and/or Freddie Mac and unzip all files into a directory with `fannie/` and `freddie/` subdirectories
- Make sure to update the proper `/path/to/` paths in `initialize_database.sh`, `create_loans_and_supporting_tables.sql`, and `load_all_loans_script.sh`
- Run `./initialize_database.sh`, which creates a Postgres database called `agency-loan-level`, creates some tables, and imports supporting data including FHFA home price data and Freddie Mac mortgage rate data
- Run `./db_scripts/load_all_loans.sh` to import the data files. This might take a very long time (~2 days), so you could consider loading the data in chunks. The total database takes up around 215 GB on disk
- The `analysis/` folder has additional SQL and R scripts used to analyze the data; see the full post for more
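
Because the import can run for days and needs roughly 215 GB, it may be worth checking the setup before starting. The following is a minimal sketch of three pre-flight helpers. The function names are hypothetical and not part of this repository; they assume only standard `grep`, `df`, and `awk`. They check that the data directory has the `fannie/` and `freddie/` subdirectories, that no `/path/to/` placeholder is left in the scripts, and that enough disk space is free.

```bash
#!/usr/bin/env bash
# Hypothetical pre-flight helpers; none of these functions ship with the repository.

# Succeeds only if the given directory contains both fannie/ and freddie/.
has_agency_subdirs() {
  local data_dir="$1"
  [ -d "$data_dir/fannie" ] && [ -d "$data_dir/freddie" ]
}

# Prints file:line:content for every line that still contains the
# /path/to/ placeholder. Exit status follows grep: 0 if any placeholder
# remains, 1 if none were found.
find_placeholder_paths() {
  grep -Hn '/path/to/' "$@"
}

# Succeeds if the filesystem holding the given directory has at least
# the requested number of gigabytes available.
has_free_space_gb() {
  local dir="$1" required_gb="$2"
  local available_kb
  # df -Pk prints one POSIX-format line per filesystem in 1024-byte blocks;
  # the fourth column of the second line is the available space.
  available_kb=$(df -Pk "$dir" | awk 'NR == 2 { print $4 }')
  [ "$available_kb" -ge $(( required_gb * 1024 * 1024 )) ]
}
```

After sourcing these functions, you could call `has_agency_subdirs` on your data directory, `find_placeholder_paths` on the three files listed above, and `has_free_space_gb` with `215` on the directory where Postgres stores its data. Where that directory lives depends on your installation, so it is left for you to fill in.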