About ✌🏽

Hello world 🌏! Are you an economist, or economics student, or just a random person like me who is interested in economics? Do you want to write a paper, a thesis, or just ramble on some stuffs but don't have any fresh ideas on what should be the topic? Worry no more! Because, this repository is for you!

Warning! ⚠️

Since this repository uses cron job from GitHub Actions to update the data, consequently the .git directory will eat up disk space. Hence, it is not advisable to clone this repository to your local machine. If you are interested to do something similar, just download this repository as a zipped file. You can do the following:

# download repository from main branch
wget https://github.com/ledwindra/nber/archive/main.zip

This won't include the .git directory and you can play around with the programs and data inside your local machine.

Download data

If you don't want to run this locally and just want to get straight to the data, just chill, relax and, download them Enjoy! 🌞 ⛱ 🥥 🌴 😎.

NBER

column_name	data_type	description
id	integer	NBER working paper ID
citation_title	string	Paper title
citation_author	string	Paper author(s). Can be more than one. Hence it is stored as an array
citation_publication_date	date	Date of paper being published
issue_date	date	Paper's issuance date
revision_date	date	Paper's revision date
topics	string	Paper topic(s). Can be more tan one. Hence it is stored as an array
program	string	Paper program(s). Can be more tan one. Hence it is stored as an array
projects	string	Paper project(s). Can be more tan one. Hence it is stored as an array
working_groups	string	Paper working group(s). Can be more tan one. Hence it is stored as an array
abstract	string	string
acknowledgement	string	Paper's acknowledgement (in paragraph)

NBER citations (from RePEc)

column_name	data_type	description
id	integer	NBER working paper ID
cites	integer	Total cites for each paper
cited_by	integer	Numbers of times each paper being cited by other researchers
reference	string	A list of references for each paper

Wikipedia

Columns are not fixed because each economist may have different completeness of information.

Use case

What can be done from this dataset? Well, let's take a look at index.ipynb. 📙

Permission

NBER Check its robots.txt. Everybody is not disallowed to get /papers/ tag.
RePEc Coming from its open API: http://citec.repec.org/api.html
Wikipedia Check robots.txt:

User-agent: *
Allow: /w/api.php?action=mobileview&
Allow: /w/load.php?
Allow: /api/rest_v1/?doc
Disallow: /w/
Disallow: /api/
Disallow: /trap/
Disallow: /wiki/Special:
Disallow: /wiki/Spezial:
Disallow: /wiki/Spesial:
Disallow: /wiki/Special%3A
Disallow: /wiki/Spezial%3A
Disallow: /wiki/Spesial%3A

We're using https://en.wikipedia.org/wiki/ so it's safe.

Closing

If you have read up to this line, thank you for bearing with me. Hope this is useful for your purpose! 😎 🍻

jin-deng/nber-tmp