A Day in the Life of a Data Scientist Conquer Machine Learning Lifecycle on Kubernetesmanage the deployment of training jobs • TFJob – custom resource to handle drivers and config • Tensorflow, PyTorch, MXNet, Chainer, and more • JupyterHub to create and manage interactive Jupyter notebooks learning Demo: Run TensorFlow Training with Containers Demo: Serving the Model with TF Serving • Options for serving • Wrap model in a web framework (eg – Flask) • Tensorflow Serving • Seldon Seldon Demo: Run TensorFlow Training with Kubeflow Demo: Scale and Test Experiments in Parallel using Kubernetes, TFJob, and Helm • Spin up pods for each variation of hyperparameters • One centralized0 码力 | 21 页 | 68.69 MB | 1 年前3
基于 KUBERNETES 的 容器器 + AI 平台的应⽤用 • Kubeflow 社区的联合创始⼈人 • kubeflow/tf-operator • 定义 TFJob Spec (CRD) • 跟踪 TensorFlow 任务运⾏行行状态 • ⽀支持分布式 TensorFlow 任务 KUBEFLOW 之上 • 借⼒力力容器器平台提供⽣生产级的集群资源管理理 • ⼯工作区隔离与共享 • 数据、模型、环境、应⽤用等 • 全⾯面⽀支持0 码力 | 19 页 | 3.55 MB | 1 年前3
共 2 条
- 1













