openelections/openelections-data-ny

2020 Queens County general file has inconsistent number of columns

warwickmm opened this issue · 1 comments

2020/counties/20201103__ny__general__queens__precinct.csv has rows with an inconsistent number of columns:

Header ['county', 'precinct', 'office', 'district', 'party', 'candidate', 'votes', 'public_counter_votes', 'emergency_votes', 'absentee_military_votes', 'federal_votes', 'affidavit_votes', 'manually_counted_emergency', 'special_presidential'] has 14 entries, but there are 29034 rows with an inconsistent number of columns:

	Row 2 (15 entries): ['Queens', '1~23', 'Borough President', '13', 'Save Our City', 'Joann Ariola', '13', '797', '', '121', '0', '3', '25', '0', '0']
	Row 3 (15 entries): ['Queens', '1~23', 'State Senate', '15', 'Save Our City', 'Thomas P. Sullivan', '24', '797', '', '121', '0', '3', '25', '0', '0']
	Row 4 (15 entries): ['Queens', '1~23', 'President', '', 'Working Families', 'Joseph R. Biden / Kamala D. Harris', '27', '797', '', '121', '0', '3', '25', '0', '0']
	Row 5 (15 entries): ['Queens', '1~23', 'Borough President', '13', 'Red Dragon', 'Dao Yin', '5', '797', '', '121', '0', '3', '25', '0', '0']
	Row 6 (15 entries): ['Queens', '1~23', 'State Senate', '15', 'Democratic', 'Joseph P. Addabbo Jr.', '154', '797', '', '121', '0', '3', '25', '0', '0']
	Row 7 (15 entries): ['Queens', '1~23', 'President', '', 'Democratic', 'Joseph R. Biden / Kamala D. Harris', '216', '797', '', '121', '0', '3', '25', '0', '0']
	Row 8 (15 entries): ['Queens', '1~23', 'U.S. House', '5', 'Democratic', 'Gregory W. Meeks', '315', '797', '', '121', '0', '3', '25', '0', '0']
	Row 9 (15 entries): ['Queens', '1~23', 'State Assembly', '23', 'Save Our City', 'Peter D. Hatzipetros', '13', '797', '', '121', '0', '3', '25', '0', '0']
	Row 10 (15 entries): ['Queens', '1~23', 'Borough President', '13', 'Republican', 'Joann Ariola', '542', '797', '', '121', '0', '3', '25', '0', '0']
	Row 11 (15 entries): ['Queens', '1~23', 'President', '', 'Conservative', 'Donald J. Trump / Michael R. Pence', '141', '797', '', '121', '0', '3', '25', '0', '0']
	[Truncated to 10 examples]
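
For anyone who wants to reproduce a check like this locally, here is a minimal sketch using Python's standard `csv` module. It is not the script that produced the report above; the function name `find_inconsistent_rows` and the sample data are made up for illustration. The header is counted as row 1, which matches the row numbering in the report.

```python
import csv
import io


def find_inconsistent_rows(csv_text):
    """Return (header, bad_rows), where bad_rows lists (row_number, row)
    for every row whose field count differs from the header's."""
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader)
    bad_rows = []
    # The header counts as row 1, so the first data row is row 2.
    for row_number, row in enumerate(reader, start=2):
        if len(row) != len(header):
            bad_rows.append((row_number, row))
    return header, bad_rows


# Hypothetical sample: the second data row has one extra field.
sample = (
    "county,precinct,office,votes\n"
    "Queens,1~23,President,27\n"
    "Queens,1~23,President,216,0\n"
)

header, bad_rows = find_inconsistent_rows(sample)
print(f"Header has {len(header)} entries, {len(bad_rows)} inconsistent rows")
for row_number, row in bad_rows:
    print(f"Row {row_number} ({len(row)} entries): {row}")
```

To run it against the real file, read the file contents with `open(path, newline="")` and pass the text in.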

There appears to be a larger issue affecting the files for all the NYC counties. NYC provides ballot type counts at the precinct level, but not broken down by candidate. The files seem to repeat these precinct-level values on every candidate row (note the identical trailing values in each row above), which is incorrect.
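
One rough way to flag this pattern is to group rows by precinct and check whether the ballot type columns are identical on every row. The sketch below assumes the columns are correctly aligned with the header, which is not currently true for the Queens file. The function name `precincts_with_repeated_counts` and the sample data are hypothetical.

```python
import csv
import io
from collections import defaultdict


def precincts_with_repeated_counts(csv_text, count_columns):
    """Return precincts with more than one row where every row carries
    the same values in the given ballot type columns."""
    reader = csv.DictReader(io.StringIO(csv_text))
    values_by_precinct = defaultdict(list)
    for row in reader:
        values_by_precinct[row["precinct"]].append(
            tuple(row[column] for column in count_columns)
        )
    return sorted(
        precinct
        for precinct, values in values_by_precinct.items()
        if len(values) > 1 and len(set(values)) == 1
    )


# Hypothetical sample: precinct 1~23 repeats the same counts on each row.
sample = (
    "precinct,candidate,votes,public_counter_votes,affidavit_votes\n"
    "1~23,A,27,797,25\n"
    "1~23,B,216,797,25\n"
    "2~23,A,10,300,4\n"
    "2~23,B,12,310,5\n"
)

flagged = precincts_with_repeated_counts(
    sample, ["public_counter_votes", "affidavit_votes"]
)
print(flagged)
```

A flagged precinct is only a hint, since identical values could in principle be legitimate; the official data is the authority on which counts are precinct-level.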

See official data here