B站统⼀监控系统的设计,演进
与实践分享获取监控数据 获取监控数据 推送告警 1. 降低编写规则的成本 2. 降低多idc维护成本 规则管理理⻚页⾯面 例例⼦子 - 业务监控 稿件 账号 Feed PAAS托管 服务树 container http server sdk 注册 获取target 采集数据 吞吐量量 响应时间 错误率 饱和度 熔断 限流 投稿数量量 订单数据 在线⼈人数 … ⻩黄⾦金金指标0 码力 | 34 页 | 650.25 KB | 1 年前3
Intro to Prometheus - With a dash of operations & observabilityIntroduction Background Operations & observability Outro Main selling points Highly dynamic, built-in service discovery No hierarchical model, n-dimensional label set PromQL: for processing, graphing, alerting Looking at a service from the outside (Does the server answer to HTTP requests?) White-box monitoring: Instrumention code from the inside (How much time does this subroutine take?) Every service should have Outro Toil ”Toil is manual, repeated work with no lasting benefit which scales linearly with your service” If teams are busy firefighting, they don’t have time to engineer Keep legacy systems working,0 码力 | 19 页 | 63.73 KB | 1 年前3
Prometheus Deep Dive - Monitoring. At scale.Introduction Intro 2.0 to 2.2.1 2.4 - 2.6 Beyond Outro Main selling points Highly dynamic, built-in service discovery No hierarchical model, n-dimensional label set PromQL: for processing, graphing, alerting0 码力 | 34 页 | 370.20 KB | 1 年前3
共 3 条
- 1













