Issues
new header metadata?
#36 opened by hzadeh17 - 0
weird msp, stacked email headings
#35 opened by hzadeh17 - 1
what to do with misfits?
#32 opened by hzadeh17 - 0
lower & upper limits of timestamps
#34 opened by hzadeh17 - 0
treasury appears not to have OCR'd
#33 opened by hzadeh17 - 2
total number of DEQ emails
#31 opened by Louise-Seamster - 1
workflow for correcting textfile text
#30 opened by Louise-Seamster - 3
Timestamps need fixing
#21 opened by hzadeh17 - 1
mismatched number of senders and UTS
#26 opened by hzadeh17 - 1
mismatched header lines
#19 opened by hzadeh17 - 6
Creating filtered datasets
#25 opened by akrinaldi - 2
evaluate email breaks in data sample
#5 opened by Louise-Seamster - 0
generating official database prototype
#29 opened by Louise-Seamster - 0
update file opening tutorial?
#27 opened by Louise-Seamster - 0
New timestamp extractor
#22 opened by hzadeh17 - 2
Data cleaning and new data formats needed
#24 opened by akrinaldi - 1
what client?
#20 opened by hzadeh17 - 1
OCR issue: skipping text?
#18 opened by hzadeh17 - 2
Create Google Doc with deq14 descriptives
#15 opened by hzadeh17 - 1
Turn sender/receiver code into module
#14 opened by hzadeh17 - 2
Write up a short analysis of different presentations of the sender line in email metadata
#12 opened by Louise-Seamster - 4
run datetime extraction on entire dataset
#9 opened by mbutler - 1
create ordered list of filenames
#8 opened by mbutler - 1
install nltk and pandas on remote AWS server
#3 opened by mbutler - 0
pandemic
#2 opened by Louise-Seamster - 1
timestamp not capturing dksfkd
#1 opened by mbutler