Submarine-SDK

A Python SDK for Apache Submarine

Primary language: Python. License: Apache License 2.0 (Apache-2.0).

What is Hadoop Submarine?

Submarine is a new subproject of Apache Hadoop.

Submarine allows infrastructure engineers and data scientists to run unmodified TensorFlow or PyTorch programs on YARN or Kubernetes.

Goals of Submarine:

  • Give jobs easy access to data and models in HDFS and other storage systems.
  • Launch services to serve TensorFlow/PyTorch models.
  • Run distributed TensorFlow jobs with simple configuration.
  • Run user-specified Docker images.
  • Specify GPU and other resources.
  • Launch TensorBoard for training jobs when the user requests it.
  • Use customized DNS names for roles (for example, tensorboard.$user.$domain:6006).
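
To make the role DNS pattern above concrete, here is a minimal sketch of how a name such as tensorboard.$user.$domain:6006 could be expanded. It uses only the Python standard library; the helper name `role_dns_name` and the sample user and domain are assumptions for illustration and are not part of Submarine.

```python
from string import Template

# Pattern modelled on the README example "tensorboard.$user.$domain:6006".
# The $role and $port placeholders are assumptions added for illustration.
ROLE_DNS_PATTERN = Template("$role.$user.$domain:$port")


def role_dns_name(role, user, domain, port):
    """Expand the pattern into a concrete host:port string (hypothetical helper)."""
    return ROLE_DNS_PATTERN.substitute(role=role, user=user, domain=domain, port=port)


# Sample values; "alice" and "example.com" are made up.
print(role_dns_name("tensorboard", "alice", "example.com", 6006))
```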

Submarine-SDK

  • Allow data scientists to track distributed ML jobs.
  • Store ML parameters and metrics in Submarine-server.
  • Store ML job output (e.g. CSV files, images).
  • Support HDFS, S3, and MySQL.
  • (Submarine-DB) Metric and param instances in the submarine-server database.
  • (Submarine-DB) REST API for the submarine server.
  • (WEB) Metric tracking UI in submarine-web.
  • (WEB) Graphical metric display in submarine-web.
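
The tracking idea in the list above can be sketched as follows. This is not the Submarine-SDK API: the class `InMemoryTracker` and its methods are hypothetical, and the records stay in process memory instead of being sent to Submarine-server. It only illustrates what tracking parameters and per-worker metrics for a distributed job might look like.

```python
import time
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class MetricPoint:
    """One recorded metric value from one worker at one training step."""
    value: float
    step: int
    worker: str
    timestamp: float


class InMemoryTracker:
    """Hypothetical stand-in for a tracking client; stores everything in memory."""

    def __init__(self, job_name):
        self.job_name = job_name
        self.params = {}                  # parameter name -> value stored as a string
        self.metrics = defaultdict(list)  # metric name -> list of MetricPoint

    def log_param(self, key, value):
        self.params[key] = str(value)

    def log_metric(self, key, value, step, worker="worker-0"):
        point = MetricPoint(float(value), step, worker, time.time())
        self.metrics[key].append(point)

    def latest(self, key, worker):
        """Return the value at the highest step for a worker, or None if absent."""
        points = [p for p in self.metrics.get(key, []) if p.worker == worker]
        if not points:
            return None
        return max(points, key=lambda p: p.step).value


# Two workers of the same (made-up) job report their loss independently.
tracker = InMemoryTracker("mnist-distributed")
tracker.log_param("learning_rate", 0.01)
for step, loss in enumerate([0.9, 0.5, 0.3]):
    tracker.log_metric("loss", loss, step, worker="worker-0")
tracker.log_metric("loss", 0.4, 0, worker="worker-1")
```

Keying each metric point by worker and step is one possible design; it lets a UI plot one curve per worker for the same metric name.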