- Hidden Technical Debt in Machine Learning Systems (NeurIPS'15)
- TFX: A TensorFlow-Based Production-Scale Machine Learning Platform (KDD'17)
- Towards Unified Data and Lifecycle Management for Deep Learning (ICDE'17)
- A Few Useful Things to Know About Machine Learning
- Principles of Computer System Design
- SysML: The New Frontier of Machine Learning Systems
- A Berkeley View of Systems Challenges for AI
- TensorFlow: A System for Large-Scale Machine Learning
- PyTorch: An Imperative Style, High-Performance Deep Learning Library
- MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems
- Ray: A Distributed Framework for Emerging AI Applications
- Clipper: A Low-Latency Online Prediction Serving System
- InferLine: Prediction Pipeline Provisioning and Management for Tight Latency Objectives