miriamkw/GluPredKit

Refactoring


Evaluator:

  • Create a base class with the mandatory functions and properties
  • Implement a class for each "penalty function" or evaluator
  • Be thoughtful about naming conventions
  • Go through the literature and look for different metrics
  • Document the different metrics with explanations of what they capture and references to literature where relevant. You could also mention some weaknesses of the specific metric.
  • Support y_pred and y_target as matrices with multiple prediction horizons, since some metrics depend on that (see the sketch below).
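
A possible shape for the evaluator module, sketched below with placeholder names (BaseMetric, RMSE) that are assumptions rather than the final API: a base class whose call raises NotImplementedError, and one subclass per metric.

```python
import numpy as np


class BaseMetric:
    """Base class that all metric implementations inherit from."""

    def __init__(self, name):
        self.name = name

    def __call__(self, y_target, y_pred):
        # y_target and y_pred may be 2D (samples x prediction horizons),
        # since some metrics depend on the horizon.
        raise NotImplementedError("Metric subclasses must implement __call__")


class RMSE(BaseMetric):
    """Root mean squared error, assuming glucose values in mg/dL."""

    def __init__(self):
        super().__init__(name="RMSE")

    def __call__(self, y_target, y_pred):
        y_target = np.asarray(y_target, dtype=float)
        y_pred = np.asarray(y_pred, dtype=float)
        return float(np.sqrt(np.mean((y_target - y_pred) ** 2)))
```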

Prediction models:

  • Create a base class with the mandatory functions and properties
  • Implement a class for each prediction approach.
    • Highest priority is the Loop algorithm.
    • Linear regression
  • Be thoughtful about naming conventions
  • Should have a fit() method and a predict() method. The output format should be clearly defined; take inspiration from scikit-learn (see the sketch after this list).
  • Document the different examples and write instructions for how people can implement their own algorithms. Write down some disclaimers about what users are expected to handle themselves (like information leakage between train and test data).
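
A hedged sketch of what the model base class and a linear regression subclass could look like, in scikit-learn style; the names (BaseModel, LinearRegressor), the pandas DataFrame input, and the "glucose" target column are assumptions, not the decided interface.

```python
import pandas as pd
from sklearn.linear_model import LinearRegression


class BaseModel:
    """Base class that all prediction models inherit from."""

    def fit(self, df_train: pd.DataFrame):
        raise NotImplementedError("Model subclasses must implement fit()")

    def predict(self, df_test: pd.DataFrame):
        raise NotImplementedError("Model subclasses must implement predict()")


class LinearRegressor(BaseModel):
    """Linear regression baseline. The user is responsible for keeping
    train and test data separate (no information-leakage checks here)."""

    def __init__(self, target_column="glucose"):
        self.target_column = target_column
        self.model = LinearRegression()

    def fit(self, df_train):
        x = df_train.drop(columns=[self.target_column])
        y = df_train[self.target_column]
        self.model.fit(x, y)
        return self

    def predict(self, df_test):
        x = df_test.drop(columns=[self.target_column])
        return self.model.predict(x)
```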

tidepool_parser.py:

  • Ensure that tidepool_parser.py and the other code contain no unused code and are written efficiently
  • Move it into a folder "parsers", as we might later add a nightscout_parser, tandem_parser, etc.
  • Any unused code that can be removed?
  • This file should not handle prediction using pyloopkit. It should solely process Tidepool data into the format that we feed into the model base class.
  • Think thoroughly through what the data input format should be. JSON? DataFrames? Which metadata will be included?
  • Run a profiler and improve the efficiency of the file
  • Document the format of the output data from the tidepool_parser (or any other potential data source), which will be the input data to each model (see the sketch after this list).
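
Illustrative only, not the decided format: one option is a time-indexed pandas DataFrame with glucose, insulin, and carb columns, roughly like this (column names and units are assumptions).

```python
import pandas as pd

sample_output = pd.DataFrame(
    {
        "glucose": [108.0, 112.0, 117.0],  # mg/dL
        "insulin": [0.0, 1.5, 0.0],        # units delivered in the interval
        "carbs": [0.0, 30.0, 0.0],         # grams
    },
    index=pd.to_datetime(
        ["2023-04-19 12:00", "2023-04-19 12:05", "2023-04-19 12:10"]
    ),
)
sample_output.index.name = "date"
```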

Example scripts:

  • Make example scripts work with the refactored code/create new examples

Tests:

  • Write tests for all metrics
  • Write tests for all prediction models
  • Add tests to the test_all.py for all implemented metrics and models
  • Document how to run the tests, and why running them is a good way for people to check whether their own implementations are correct (see the sketch after this list)
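
For example, metric tests could pin down known values. A pytest-style sketch, reusing the hypothetical RMSE class from the evaluator sketch above:

```python
import numpy as np

from metrics import RMSE  # hypothetical import path for the RMSE sketch above


def test_rmse_perfect_prediction_is_zero():
    y_target = np.array([100.0, 120.0, 140.0])
    y_pred = np.array([100.0, 120.0, 140.0])
    assert RMSE()(y_target, y_pred) == 0.0


def test_rmse_known_value():
    # Errors of 3 and 4 mg/dL give RMSE = sqrt((9 + 16) / 2) = sqrt(12.5)
    y_target = np.array([100.0, 100.0])
    y_pred = np.array([103.0, 104.0])
    assert np.isclose(RMSE()(y_target, y_pred), np.sqrt(12.5))
```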

Cleanup:

  • Delete outdated files and folders

Plots (low priority):

  • Make a module for plotting results into, for example, a surveillance error grid (SEG); see the sketch below
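
Not an actual SEG implementation, just a hedged placeholder showing how a plot class could slot into such a module (a plain predicted-vs-measured scatter):

```python
import matplotlib.pyplot as plt


class ScatterPlot:
    """Placeholder plot class: predicted vs. measured glucose."""

    def draw(self, y_target, y_pred):
        fig, ax = plt.subplots()
        ax.scatter(y_target, y_pred, s=8)
        ax.plot([40, 400], [40, 400], linestyle="--", color="gray")  # identity line
        ax.set_xlabel("Measured glucose [mg/dL]")
        ax.set_ylabel("Predicted glucose [mg/dL]")
        return fig
```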

Questions:

  • In the proposed architecture, this repo only handles calculating penalties given pairs of measured and predicted values, and is completely agnostic to which prediction approach is used. Hence, do the tidepool_parser and the scripts retrieving Loop forecasts with pyloopkit belong in this repository, or should we create an additional repository for these tasks? @PorkShoulderHolder

UML class diagram

TO DO: Add a sample of the input for calculating penalties (dataframe with measured and predicted values), and output
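
Until that sample is added, a hedged illustration of what it might look like (made-up values, hypothetical column names):

```python
import pandas as pd

# Input: paired measured and predicted glucose values
penalty_input = pd.DataFrame(
    {
        "y_target": [108.0, 112.0, 117.0],  # measured glucose, mg/dL
        "y_pred": [110.0, 115.0, 120.0],    # predicted glucose, mg/dL
    }
)

# Output: a single number per metric, e.g. {"RMSE": 2.7}
```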

Notes from meeting 19.04:

  • Inspiration from https://scikit-learn.org/stable/modules/generated/sklearn.metrics.mean_squared_error.html
  • The penalty functions should expect two arrays: predicted and measured values
  • "Extra" parameters (like squared vs. not squared) can be specific to the error function
  • Place for "all" the common error metrics to live (but they should be well documented!). Implement them. Describe them in a table, and refer to literature if necessary.
  • Predictions: .fit(), .predict(). Users have a model and some data; make it as easy as possible to use them together, using scikit-learn as a template
  • Interface for a model; preprocessing etc. is handled inside of the class. We are leaning on the user not to train on the test data (which is a downside). Return ONLY the predictions.
  • Predictions need their data inputs passed in separately; the model builds its own dataframe (or whatever it needs) internally
  • We need to clearly specify the formats of the insulin, glucose, carb data, etc. in the documentation
  • Be specific about the data that is coming in. People use only what they need.
  • Model return: one value or a list of predictions. This should be specified in the model (input: a list of offsets). But in general, we expect there to be a trajectory of predictions.
  • Model instances like in scikit-learn: create an instance and train it with the data it is fed (handled inside of the model). The fit method takes in some data; the predict method takes in some data. People must handle data leakage themselves. The model has a get_prediction_output_format function but also set_prediction_output_format.
  • tidepool_parser: we could create a dataloader object
  • tidepool_parser: separate parsing a report from running a prediction

Folders for models/evaluation: base classes implement the methods that all models/evaluators will have. Subclasses inherit from the base classes; any base-class method that a subclass has not implemented will raise a "not implemented" error.
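
Tying the hypothetical sketches above together, the intended user workflow could look roughly like this (the tiny DataFrames stand in for real parsed Tidepool data):

```python
import pandas as pd

df_train = pd.DataFrame(
    {"glucose": [110.0, 120.0, 130.0], "insulin": [0.0, 1.0, 0.0], "carbs": [0.0, 20.0, 0.0]}
)
df_test = pd.DataFrame(
    {"glucose": [115.0, 125.0], "insulin": [0.5, 0.0], "carbs": [10.0, 0.0]}
)

model = LinearRegressor()           # from the model sketch above
model.fit(df_train)                 # user is responsible for avoiding leakage
y_pred = model.predict(df_test)     # returns ONLY predictions
rmse = RMSE()(df_test["glucose"], y_pred)  # metric sketch above
print(f"RMSE: {rmse:.1f} mg/dL")
```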

Questions for next meeting:

  • Having issues with imports and ModuleNotFoundError in the tests folder
  • Assuming mg/dL as input in metrics.
  • model.predict() actually returns predictions AND true values
    • I would like to improve this output. It should include: prediction date, reference prediction date, and reference value (see the sketch after this list).
  • I have included future inputs for carbs and insulin in the predictions. If not, we would need to filter out future inputs for each prediction. It's still a good idea to filter out data more than 6 hours ahead, to avoid getting predictions "forever" into the future.
  • I can't get the retrospective correction numbers right, but I also don't know how it works, so it is hard to debug
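
Illustrative only: one way the improved predict() output could look, with made-up values and assumed column names.

```python
import pandas as pd

prediction_output = pd.DataFrame(
    {
        "reference_date": pd.to_datetime(["2023-04-19 12:00"] * 2),
        "reference_value": [108.0, 108.0],  # measured glucose at the reference date, mg/dL
        "prediction_date": pd.to_datetime(["2023-04-19 12:30", "2023-04-19 13:00"]),
        "predicted_value": [122.0, 131.0],  # mg/dL
    }
)
```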