eric-epstein-5747
Fellow at Insight Data Science. Cornell Univ. Ph.D. in Philosophy. B.A. in Math & Phil., specialty in natural languages.
Insight Data Science, Ithaca, New York
Pinned Repositories
ada_sst
Using state-of-the-art speech-to-text models to transcribe interviews with my grandmother
alphabetizing_addresses_by_last_name
Contains two utilities that (1) extract addresses from a table in a .docx file into an .xlsx file, capturing the last name in the first column for easy alphabetization, and (2) export addresses from such an .xlsx file to a .docx file that is (assumed to be) pre-formatted for pretty printing.
automate_unzipping_files
Contains a Python script that unzips all .zip files in the current working directory
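A minimal sketch of what such a script might look like, using only the standard library. The function name `unzip_all` and the choice to unpack each archive into a folder named after it are assumptions made for illustration; the repository's own script may behave differently.

```python
import zipfile
from pathlib import Path


def unzip_all(directory="."):
    """Extract every .zip archive found directly inside `directory`.

    Assumption: each archive is unpacked into a sibling folder named
    after the archive (e.g. data.zip -> data/). Returns those folders.
    """
    base = Path(directory)
    extracted = []
    for archive in sorted(base.glob("*.zip")):
        target = base / archive.stem
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(target)
        extracted.append(target)
    return extracted


if __name__ == "__main__":
    # With no argument, the current working directory is scanned.
    for folder in unzip_all():
        print(f"extracted into {folder}")
```

Extracting into per-archive folders keeps files from different archives from overwriting one another.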
business_patterns
Using government data, I identify some patterns of growth and decline in different industries across New York State.
column_swapping
Contains a utility for swapping columns in a CSV file, plus a test suite and various related files, e.g., a utility that writes the test results to an HTML table that can be opened in a web browser.
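As a rough illustration of the core idea, column swapping can be sketched with the standard `csv` module. The function name `swap_columns` and its 0-based index parameters are hypothetical and not taken from the repository.

```python
import csv


def swap_columns(in_path, out_path, i, j):
    """Copy the CSV at in_path to out_path with columns i and j swapped.

    Indices are 0-based. Rows too short to contain both columns are
    written through unchanged rather than raising an error.
    """
    with open(in_path, newline="") as src, open(out_path, "w", newline="") as dst:
        writer = csv.writer(dst)
        for row in csv.reader(src):
            if max(i, j) < len(row):
                row[i], row[j] = row[j], row[i]
            writer.writerow(row)
```

For example, `swap_columns("in.csv", "out.csv", 0, 2)` would exchange the first and third columns of every row, header included.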
data-science-interviews
Data science interview questions and answers
Data_Challenge_Checklist
A checklist for making effective use of time in data challenges
email_data
I took a large flat text file of anonymized email data and used a regex (see tablemaker_better) to capture the desired information in a CSV file with three columns. This repo also contains a script for importing the CSV into R and generating histograms and further datasets from it. (The script is meant to be run from an R console; it needs to be rewritten to run from the command line.)
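The regex-to-CSV step could be sketched as below. The record layout (`from=... to=... date=...`), the three column names, and the function name `text_to_csv` are all hypothetical, chosen only to show the technique; the actual pattern lives in tablemaker_better and is not reproduced here.

```python
import csv
import re

# Hypothetical record layout, e.g. "from=u1 to=u2 date=2020-01-01".
# The real data format and regex are not known from the description.
PATTERN = re.compile(r"from=(\S+)\s+to=(\S+)\s+date=(\S+)")


def text_to_csv(text_path, csv_path):
    """Write one three-column CSV row per matching line; return the row count."""
    count = 0
    with open(text_path) as src, open(csv_path, "w", newline="") as dst:
        writer = csv.writer(dst)
        writer.writerow(["sender", "recipient", "date"])  # assumed column names
        for line in src:
            match = PATTERN.search(line)
            if match:  # lines that do not match are skipped
                writer.writerow(match.groups())
                count += 1
    return count
```

Reading the file line by line keeps memory use flat even for a large input.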
eric-epstein7.github.io
happy_hotels
In this project, I examine a large number of hotel reviews, identify the topics discussed, and surface, for each topic, an important sentence about that topic.
eric-epstein-5747's Repositories
eric-epstein-5747/keras_lstm_tutorial
This repo contains my code for a tutorial on using an LSTM for text generation. The tutorial can be found here: https://adventuresinmachinelearning.com/keras-lstm-tutorial/
eric-epstein-5747/python-explorations
Contains my solutions to the exercises in John V. Guttag's "Introduction to Computation and Programming Using Python", 2nd Edition.
eric-epstein-5747/social_distancing
Is now a good time to go to Prospect Park?
eric-epstein-5747/weather_and_energy_demand
In this project, I fit and assess a model that predicts energy demand based on weather conditions.
eric-epstein-5747/YoastSEO.js
Analyze content on a page and give SEO feedback as well as render a snippet preview.