unitedstates/unitedstates.github.io

Feature Suggestion: Congressional Speech Corpora Builder

Plaba opened this issue · 1 comments

Plaba commented

I've been working on a project at this repo. This downloads the congressional transcripts from congress.gov and converts them to text.

Since the UnitedStates organization's purpose is to make open data on Congress more accessible, I'm wondering if you would host this project or something like it.

@Plaba sorry for the delay in responding. We've already got a parser for the HTML versions of the Record and you can see that here: https://github.com/unitedstates/congressional-record.