left-censored data
kimiakarimi opened this issue · 1 comments
kimiakarimi commented
Are there any criteria for the % of left-censored data in a sample dataset? The minNumUncen argument handles the minimum number of censored values but I was wondering if having too many censored values may overpredict the estimates. Also, how is annual load calculated when the sample dataset contains left-censored data.
rmhirsch49 commented
There is no formal criteria. A lot depends on what you are trying to do. If we want to get a sense of overall average concentrations or fluxes I’d be comfortable with as much as 50% censored data. But, if there were substantial changes in the reporting level then I would be much more conservative. When looking for trends I’d be concerned when we get to more than 25% censored and even more worried if there is a trend in the reporting limits.
As to computing the annual loads, the WRTDS model makes an estimate of load for every day (regardless of whether or not there are any data on the day). The WRTDS model gives us an expected value for the load on each day and that is what it uses. The process for WRTDS-Kalman when there is censoring. I can explain that if you would like that too.
Bob Hirsch
From: kimiakarimi ***@***.***>
Date: Thursday, March 9, 2023 at 3:31 PM
To: DOI-USGS/EGRET ***@***.***>
Cc: Subscribed ***@***.***>
Subject: [EXTERNAL] [DOI-USGS/EGRET] left-censored data (Issue #366)
This email has been received from outside of DOI - Use caution before clicking on links, opening attachments, or responding.
Are there any criteria for the % of left-censored data in a sample dataset? The minNumUncen argument handles the minimum number of censored values but I was wondering if having too many censored values may overpredict the estimates. Also, how is annual load calculated when the sample dataset contains left-censored data.
—
Reply to this email directly, view it on GitHub<https://gcc02.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2FDOI-USGS%2FEGRET%2Fissues%2F366&data=05%7C01%7Crhirsch%40usgs.gov%7C2db1f3dd0d1a493b775508db20dd39d2%7C0693b5ba4b184d7b9341f32f400a5494%7C0%7C0%7C638139906753535678%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=8w3rZqcB2GlTzm5G0T712tgLmwHGSwnvDM96aoGk5W0%3D&reserved=0>, or unsubscribe<https://gcc02.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FAAI53ULUYCJLX24XPHBJVODW3I4Y3ANCNFSM6AAAAAAVVSZVHQ&data=05%7C01%7Crhirsch%40usgs.gov%7C2db1f3dd0d1a493b775508db20dd39d2%7C0693b5ba4b184d7b9341f32f400a5494%7C0%7C0%7C638139906753535678%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=fePxY207VjHY15%2FBWH7Ny7rYtyRR09wMgEhS4rUV3fk%3D&reserved=0>.
You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>