GPU Resource Management On JDOSGPU Resource Management On JDOS 梁永清 liangyongqing1@jd.com 提供的服务 1. 用于实验的 GPU 容器 2.基于 Kubeflow 的机器学习训练服务 3.模型管理和模型 Serving 服务 Experiment Training Serving 均基于容器,不对业务方直接提供 GPU 物理机 GPU 实验 JDOS 常规的容器服务0 码力 | 11 页 | 13.40 MB | 1 年前3
Secrets Management at
Scale with Vault & RancherSecrets Management at Scale with Vault & Rancher 24. June Robert de Bock Senior DevOps Engineer Adfinis robert.debock@adfinis.com Kapil Arora Senior Solution Engineer HashiCorp kapil@hashicorp.com Field Engineer SUSE bastian.hofmann@suse.com Containers are great! 2 One self-contained, portable package for your application 3 Managing a couple – no problem Containers are great……..but Containers Infrastructure Management (Run & Manage) GitOps Continuous Delivery Cluster Templates & Config Enforcement K8s Version Management Node Pool Management Cluster Provisioning & Lifecycle Management Platform0 码力 | 36 页 | 1.19 MB | 1 年前3
Node Operator: Kubernetes Node Management Made SimpleNode Operator: Kubernetes Node Management Made Simple 陈俊(Joe), Ant Financial Agenda • Background and Motivation • Introduction of Operators • Node-Operator • Advanced Topic: • Upgrade Master & Node Components reliably • Canary Rollout • Master & Node Component Versions Management Motivation: Work Order Deployment Worker Order • Upgrade Nodes Versions • Upgrade Node 10.10 Complicated architecture Work order deployment system can not meet the requirements of resource management. Operator Observe Action Analyze • Observe: watch desired resource and actual resource0 码力 | 18 页 | 11.70 MB | 1 年前3
State management - CS 591 K1: Data Stream Processing and Analytics Spring 2020Processing and Analytics Vasiliki (Vasia) Kalavri vkalavri@bu.edu Spring 2020 2/25: State Management Vasiliki Kalavri | Boston University 2020 Logic State<#Brexit, 520> <#WorldCup, 480> key of the current record so that all records with the same key access the same state State management in Apache Flink 5 Vasiliki Kalavri | Boston University 2020 Operator state Keyed state State state is stored, accessed, and maintained. State backends are responsible for: • local state management • checkpointing state to remote and persistent storage, e.g. a distributed filesystem or a database 0 码力 | 24 页 | 914.13 KB | 1 年前3
Apache Karaf Container 4.x - DocumentationPersistence (JPA) 4.17.8. EJB 4.17.9. CDI 4.17.10. HA/failover and cluster 4.18. Monitoring and Management using JMX 4.18.1. Connecting 4.18.2. Configuration 4.18.3. MBeans 4.18.4. RBAC 4.18.5. JMX-HTTP reflection) 5.2.8. Examples 5.3. Programmatically connect 5.3.1. To the console 5.3.2. To the management layer 5.4. Branding 5.4.1. Console 5.4.2. Adding a branding.properties file to etc 5.5. Adding Features" which is a way to describe your application. • Management: Apache Karaf is an enterprise-ready container, providing many management indicators and operations via JMX. • Remote: Apache Karaf0 码力 | 370 页 | 1.03 MB | 1 年前3
Apache Karaf 3.0.5 Guidesthe container. • Dynamic Configuration: Apache Karaf provides a set of command dedicated for the management of the configuration files. All configuration files are centralized in the etc folder. Any change Feature" which is a way to describe your application. • Management: Apache Karaf is an enterprise-ready container, providing a lot of management indicators and operations via JMX. • Remote: Apache Karaf Karaf embeds an SSHd server allowing you to use the console remotely. The management layer is also accessible remotely. • Security: Apache Karaf provides a complete security framework (based on JAAS), and0 码力 | 203 页 | 534.36 KB | 1 年前3
BAETYL 0.1.6 Documentationetc. The combination of Baetyl and the Cloud Management Suite of BIE [https://cloud.baidu.com/product/bie.html](Baidu IntelliEdge) will achieve cloud management and application distribution, enable applications or any machine learning framework). Simplify Application Production: Baetyl combines with Cloud Management Suite of BIE and many other productions of Baidu Cloud(such as CFC [https://cloud.baidu.com/product/cfc provides features such as underlying service management, but also provides some basic functional modules, as follows: Baetyl Master is responsible for the management of service instances, such as start, stop0 码力 | 119 页 | 11.46 MB | 1 年前3
BAETYL 0.1.6 Documentationdevice resources report etc. The combination of Baetyl and the Cloud Management Suite of BIE(Baidu IntelliEdge) will achieve cloud management and application distribution, enable applications running on edge any machine learning framework). • Simplify Application Production: Baetyl combines with Cloud Management Suite of BIE and many other productions of Baidu Cloud(such as CFC, Infinite, EasyEdge, TSDB, IoT provides features such as underlying service management, but also provides some basic functional modules, as follows: • Baetyl Master is responsible for the management of service instances, such as start, stop0 码力 | 120 页 | 7.27 MB | 1 年前3
BAETYL 1.0.0 Documentationetc. The combination of Baetyl and the Cloud Management Suite of BIE [https://cloud.baidu.com/product/bie.html](Baidu IntelliEdge) will achieve cloud management and application distribution, enable applications or any machine learning framework). Simplify Application Production: Baetyl combines with Cloud Management Suite of BIE and many other productions of Baidu Cloud(such as CFC [https://cloud.baidu.com/product/cfc provides features such as underlying service management, but also provides some basic functional modules, as follows: Baetyl Master is responsible for the management of service instances, such as start, stop0 码力 | 135 页 | 15.44 MB | 1 年前3
BAETYL 1.0.0 Documentationdevice resources report etc. The combination of Baetyl and the Cloud Management Suite of BIE(Baidu IntelliEdge) will achieve cloud management and application distribution, enable applications running on edge any machine learning framework). • Simplify Application Production: Baetyl combines with Cloud Management Suite of BIE and many other productions of Baidu Cloud(such as CFC, Infinite, EasyEdge, TSDB, IoT provides features such as underlying service management, but also provides some basic functional modules, as follows: • Baetyl Master is responsible for the management of service instances, such as start, stop0 码力 | 145 页 | 9.31 MB | 1 年前3
共 385 条
- 1
- 2
- 3
- 4
- 5
- 6
- 39













