def(prepare cases) function not working....

Question

def(prepare cases) function not working....

Opened this issue 4 years ago · 19 comments

when you run the block where posteriors are being calculated for all counties it gives you this error:
TypeError: cannot do slice indexing on <class 'pandas.core.indexes.multi.MultiIndex'> with these indexers [[29]] of <class 'numpy.ndarray'>

the indexing is not correct for smoothed

Answer 1 · 2020-05-26T13:45:50.000Z

Please take a look at https://github.com/sanzgiri/covid-19-dashboards/blob/master/_notebooks/Realtime_R0_USA_By_County.ipynb

That is a working version of the notebook that runs as a scheduled job every day.

Answer 2 · 2020-05-26T13:48:06.000Z

The smoothing function requires a minimum threshold for positive case counts. In the original KS repo (for states), it is 25 or 10, For counties, you have to lower it a bit in order to get a sufficient range of dates.

Answer 3 · 2020-05-27T05:40:37.000Z

I am not able to recreate this problem. What state are you running for?

Answer 4 · 2020-05-27T06:18:44.000Z

I am running your code only same as it is that is not working because of smoothed curve it generates....

Answer 5 · 2020-05-28T05:49:21.000Z

Also, i need a little help from your side can we execute this whole process on all counties regardless of how many cases they have or for a long time they are having stable cases?? is there any possiblity??

Answer 6 · 2020-05-28T06:27:57.000Z

No, there are several counties for which it won't run. What state are you trying to analyze?

Answer 7 · 2020-05-28T06:30:08.000Z

I am trying it for State Iowa but i need all counties.. for example Iowa state has 99 counties in total but i get result for 50 counties. what is the reason and specific logic behind that i cannot get it for all 99 counties??

Answer 8 · 2020-05-28T06:38:48.000Z

Have you looked at the data for the counties that fail? How many cases do they have? Do they have lots of days with 0 cases?

Answer 9 · 2020-05-28T06:52:50.000Z

In the starting days yes there are lots of 0's we can say as it is for all counties of all states...
Talking about daily cases in past 7 days i think that we are doing we are rolling 7 days backward to see if there are daily cases more than 1..
For example lets say i talk about state - Alabama and county - Clay it is having 27 cases as cumulative since 11th may 2020. the cases in this county for Alabama has not increased since 11th may 2020. so why this logic fails here?? i want to ask that?? if you can help me understand this?? why this logic fails when there is no rise in cases for past days???

Answer 10 · 2020-05-28T14:46:27.000Z

Yes, you are correct. It does fail for Clay even though there are several days with non-zero cases. Not sure why. Will dig into this when I get a chance.

Answer 11 · 2020-05-28T15:11:14.000Z

Iowa_Rt.xlsx
Attached is a full fledged example for State-IOWA..
The feilds marked in orange are those for which the logic fails...
Although i tried this:
for country_name, cases in countries_to_process.groupby(level='state_county'):
#clear_output(wait=True)
print(f'Processing {country_name}')
new, smoothed = prepare_cases(cases)
if (len(smoothed) < 1):
print(f"Skipping {country_name}, too few cases from smoothing algorithm")
failed_countries.append(country_name)

i have made change in len(smoothed) as less than 1 so i get the maximum counties result but still counties marked in orange fails....

i would really appreciate if you can look why this fails

Answer 12 · 2020-05-28T16:07:21.000Z

When I ran this for Iowa, with the reduced filtering, prepare cases failed only for Clay and Cleburne
HDI failed for these: ['Autauga', 'Barbour', 'Bibb', 'Blount', 'Chambers', 'Clarke', 'Coosa', 'Covington', 'Crenshaw', 'Cullman', 'Dallas', 'Escambia', 'Franklin', 'Henry', 'Jackson', 'Lauderdale', 'Limestone', 'Macon', 'Marengo', 'Marion', 'Perry', 'Pickens', 'Randolph', 'Walker', 'Washington']

Answer 13 · 2020-05-28T16:09:50.000Z

Sorry, that's Alabama

Answer 14 · 2020-05-28T21:10:45.000Z

Even though you ran for State Alabama why are the prepare cases getting failed and why we get HDI errors. And what filtering have you reduced??
I want to understand why logic fails for several counties. Help me understsnd this please ...

Answer 15 · 2020-05-28T22:12:32.000Z

Yes, I see that for Iowa, smoothing fails for about 16 counties and HDI for 29. The filters I meant were on the positive case threshold and number of rows for smoothing. With the sparse data the intervals on R_t are very wide, so the calculation is not very useful.

Answer 16 · 2020-05-28T22:14:41.000Z

Any solution to this??

Answer 17 · 2020-05-28T22:21:12.000Z

No, sorry, I can't help you with this.

Answer 18 · 2020-05-28T22:24:51.000Z

Ok. No problem. Can you just let me know the whole logic on which we are doing the smoothing.. as in what does this functio def prepare cases do??
As in i want to understand the whole logic of this code. If you can let me know that would do the needful.. please...

Answer 19 · 2020-05-28T23:28:17.000Z

Have you looked at the original notebook from which I forked? https://github.com/k-sys/covid-19/blob/master/Realtime%20R0.ipynb That has some explanations that I trimmed out from mine.

…

On Thu, May 28, 2020 at 3:25 PM kashishminocha ***@***.***> wrote: Ok. No problem. Can you just let me know the whole logic on which we are doing the smoothing.. as in what does this functio def prepare cases do?? As in i want to understand the whole logic of this code. If you can let me know that would do the needful.. please... — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#4 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AACHEKVCLPI4PIT2O3TJEBLRT3QEBANCNFSM4NKGSQBQ> .