ortsed's Stars
jarun/ddgr
🦆 DuckDuckGo from the terminal
SCPR/kpcc-data-team
Where we attempt to lay a foundation, document practices, and find our way to sharing the work we do and the tools we use to do it at KPCC/SCPR
piskvorky/gensim
Topic Modelling for Humans
rchowe/textsql
Run SQLite commands on text files.
associatedpress/national-caseload-data-ingest
Scripts to download the U.S. Department of Justice's National Caseload Data and load it into Amazon Athena for querying
mediacloud/date_guesser
A library to extract a publication date from a web page, along with a measure of the accuracy.
PublicI/state-lawmakers-disclosures
Data collected from the personal financial disclosure reports of 6,933 state legislators
awesomedata/awesome-public-datasets
A topic-centric list of HQ open datasets.
erikgahner/PolData
A dataset with political datasets
18F/crime-data-explorer
Moved to https://github.com/fbi-cde
DOI-ONRR/doi-extractives-data
Information on the extractive industries in the U.S. from federal data.
jsvine/weightedcalcs
Pandas-based utility to calculate weighted means, medians, distributions, standard deviations, and more.
nprapps/bernard
A Slack incoming webhook set up as a cron job that retrieves all new rules, proposed rules, and presidential documents from the Federal Register
sunlightlabs/openitup
A repository for projects that scrape data from government agencies
InternationalTradeAdministration/explorer
jsvine/waybackpack
Download the entire Wayback Machine archive for a given URL.
daleroberts/tv
Quickly view (satellite) imagery directly in your terminal using Unicode 9.0 characters and true color.
dracodoc/Geocode
Batch geocoding addresses, map to census block with PostGIS Tiger Geocoder
amaboura/panama-papers-dataset-2016
Structured data about Panama papers collected from official ICIJ website
glitchdigital/video-transcriber
Computer-assisted video/audio transcription
dannguyen/watson-word-watcher
A proof of concept using IBM's Speech-to-Text API to do quick-and-dirty transcriptions
AtomBoy/double-metaphone
Python and MySQL implementations of the double metaphone algorithm, which is useful for matching different spellings of names.
newsdev/nyt-clerk
A set of Python modules for downloading, parsing, and outputting data related to the Supreme Court.
dariusk/corpora
A collection of small corpuses of interesting data for the creation of bots and similar stuff.
cfpb/clouseau
⚠️ THIS PROJECT IS DEPRECATED ⚠️ Search your repository's git history for undesirable text patterns such as passwords, SSH keys, and other personally identifiable information
minimaxir/big-list-of-naughty-strings
The Big List of Naughty Strings is a list of strings which have a high probability of causing issues when used as user-input data.
google/deepdream
jlevy/the-art-of-command-line
Master the command line, in one page
highperformancecoder/minsky
System dynamics economics modeling software
openelections/clarify
Discover and parse results for jurisdictions that use Clarity-based election systems.