cdfoundation/sig-mlops

Jupyter Notebooks - Roadmap Discussion

hamelsmu opened this issue · 2 comments

Thanks for the comprehensive roadmap, @tdcox! I have one question in the technical requirements section:

> Educating data science teams regarding the risks of trying to use Jupyter Notebooks in production

Can you expand on this a bit more? I have found it difficult to make an argument that data scientists should not use their most beloved tool when crafting their deliverables. There are some tractable ways that data scientists can reliably use notebooks in a production workflow:

I am not sure if this is what you meant, but I wanted to pause and have a discussion on this point. I am still reviewing the rest of the document, but I figured I should bring this up since it caught my attention.

cc: @jlewi, @aronchick

tdcox commented

Thanks for expanding on your thinking, @tdcox! It was helpful to learn about the various concerns. I certainly wish the Jupyter-to-production workflow were more mature, and some promising tools designed recently address some of these limitations.

However, my favorite tools for doing this convert notebooks to scripts behind the scenes, for reasons similar to the ones you described (to take advantage of the full suite of DevOps tools), so in that sense I agree with you.
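
To make the "convert to a script behind the scenes" idea concrete, here is a minimal sketch that uses only the Python standard library. It is not how any particular tool does it; it only assumes the nbformat 4 JSON layout, where a notebook has a `cells` list and each cell has a `cell_type` and a `source`. The function name `notebook_to_script` and the example notebook are hypothetical, and commenting out lines that start with `%` or `!` is a simplifying assumption of mine for handling IPython magics and shell escapes.

```python
import json


def notebook_to_script(notebook_json: str) -> str:
    """Concatenate the code cells of an nbformat-4 notebook into script text."""
    notebook = json.loads(notebook_json)
    chunks = []
    for cell in notebook.get("cells", []):
        if cell.get("cell_type") != "code":
            continue  # skip markdown and raw cells
        source = cell.get("source", "")
        # nbformat allows "source" to be a single string or a list of strings
        if isinstance(source, list):
            source = "".join(source)
        # Assumption: lines starting with % or ! are IPython magics or shell
        # escapes, which are not valid Python, so they are commented out
        lines = [
            "# " + line if line.lstrip().startswith(("%", "!")) else line
            for line in source.splitlines()
        ]
        chunks.append("\n".join(lines))
    return "\n\n".join(chunks) + "\n"


# Hypothetical in-memory notebook, standing in for the contents of an .ipynb file
example = json.dumps({
    "nbformat": 4,
    "nbformat_minor": 5,
    "metadata": {},
    "cells": [
        {"cell_type": "markdown", "metadata": {}, "source": ["# Title"]},
        {"cell_type": "code", "metadata": {}, "outputs": [],
         "execution_count": None,
         "source": ["%matplotlib inline\n", "x = 1 + 1"]},
        {"cell_type": "code", "metadata": {}, "outputs": [],
         "execution_count": None,
         "source": "print(x)"},
    ],
})

script = notebook_to_script(example)
print(script)
```

Once the notebook is plain script text like this, it can go through the usual linters, test runners, and code review like any other source file, which is the point about DevOps tooling above.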

Thanks again for the detailed writeup and roadmap!