POST方法 - IT文库_程序员IT互联网编程电子书和文档免费下载，助您码力十足！

首页文库资料文章资讯上传文档发布文章登录账户

Falcon v1.4.1-post-1 Documentation

href='http://docs.example.com/api/json') if req.method in ('POST', 'PUT'): if 'application/json' not in req.content_type: #39;, 'Falcon') resp.status = falcon.HTTP_200 @falcon.before(max_body(64 * 1024)) def on_post(self, req, resp, user_id): try: doc = req.context['doc'] except KeyError: of all achievements for the player resource with ID 45301f54”. $$ \begin{aligned}&\underbrace{POST}_{Action}\quad&\underbrace{\quad/players/45301f54/achievements}_{Resource\ Identifier}\end{aligned}

0 码力 | 229 页 | 273.39 KB | 2 年前
3
vLLM v0.6.1.post1 Documentation

support - Multi-lora support For more information, check out the following: - vLLM announcing blog post (intro to PagedAttention) - vLLM paper (SOSP 2023) - How continuous batching enables 23x throughput flash attention, you can install flash attention for ROCm Install ROCm's flash attention (v2.5.9.post1) following the instructions from ROCm/flash-attention Alternatively, wheels intended for vLLM use of the issue, your environment, and the logs. Some known issues: - In v0.5.2, v0.5.3, and v0.5.3.post1, there is a bug caused by zmq , which can cause hangs at a low probability (once in about 20 times

0 码力 | 215 页 | 1.28 MB | 5 月前
3
vLLM v0.6.1.post2 Documentation

support - Multi-lora support For more information, check out the following: - vLLM announcing blog post (intro to PagedAttention) - vLLM paper (SOSP 2023) - How continuous batching enables 23x throughput PyTorch release versions: ```bash $ # Install vLLM with CUDA 11.8. $ export VLLM_VERSION=0.6.1.post1 $ export PYTHON_VERSION=310 $ pip install https://github.com/vllm-project/vllm/releases/downlo since v0.5.3. You can download them with the following command: ```bash $ export VLLM_VERSION=0.6.1.post1 # vLLM's main branch version is currently set to latest →released tag $ pip install https://vllm-wheels

0 码力 | 215 页 | 1.29 MB | 5 月前
3
vLLM v0.4.0.post1 Documentation

(Experimental) Multi-lora support For more information, check out the following: - vLLM announcing blog post (intro to PagedAttention) - vLLM paper (SOSP 2023) - How continuous batching enables 23x throughput

0 码力 | 68 页 | 810.15 KB | 5 月前
3
vLLM v0.5.0.post1 Documentation

(Experimental) Multi-lora support For more information, check out the following: - vLLM announcing blog post (intro to PagedAttention) - vLLM paper (SOSP 2023) - How continuous batching enables 23x throughput print(LINE_UP, end=LINE_CLEAR, flush=True) ``` (continues on next page) ```python def post_http_request(prompt: str, api_url: str, n: int = 1, "max_tokens": 16, "stream": stream, } response = requests.post(api_url, headers=headers, json=pload, stream=True) return response def get_streaming_response(response:

0 码力 | 144 页 | 1.09 MB | 5 月前
3
vLLM v0.5.3.post1 Documentation

(Experimental) Multi-lora support For more information, check out the following: - vLLM announcing blog post (intro to PagedAttention) - vLLM paper (SOSP 2023) - How continuous batching enables 23x throughput flash attention, you can install flash attention for ROCm Install ROCm's flash attention (v2.5.9.post1) following the instructions from ROCm/flash-attention Alternatively, wheels intended for vLLM use '\x1b[2K' for _ in range(n): print(LINE_UP, end=LINE_CLEAR, flush=True) def post_http_request(prompt: str, api_url: str, n: int = 1,

0 码力 | 143 页 | 1.07 MB | 5 月前
3
Practices of Go Microservices on Post-Kubernetes-Wei Zheng

## GCN ## Practices of Go Microservices on Post-Kubernetes ## 郑伟石墨文档 ## Background in Shimo ## Language • Go • Node • Rust ## Background in Shimo ## Framework • Gin • Echo • gRPC … ## Background

0 码力 | 59 页 | 5.66 MB | 2 年前
3
告警OnCall事件中心建设方法白皮书

![Image](/uploads/documents/a/f/2/3/af23dd3a5d68a86ba08b082c21337120/p1_1.jpg) # 事件 ONCALL 中心建设方法一站式处理值班 OnCall，智能降噪 ![Image](/uploads/documents/a/f/2/3/af23dd3a5d68a86ba08b082c21337120/p1_2.jpg) 68a86ba08b082c21337120/p2_1.jpg) 对于告警事件的后续处理，有哪些问题和需求以及何为最佳实践？我们从思路方法和工具实践两个方面分别进行探讨，下面先行探讨思路方法，看看要解决这些问题和需求，我们有哪些可能的解法。 ## 思路方法篇告警事件的后续处理：多渠道分级通知、告警静默、抑制、收敛聚合、降噪、排班、认领升级、协同闭环处理等等。看起来需求很多，最核心的痛点有两个：能加人了，或者明确说明在架构调整好之前，不负责 SLA，反推业务改造。上面介绍的两个告警规则优化原则，是最重要的两个原则。照做的话，可以搞定大部分无效告警。除了原则方面，另一个应对过多告警的方法就是靠产品工具了，比如告警事件在哪些时间段发送、如何过滤、如何屏蔽、如何抑制等等，通常，监控系统和统一的 OnCall 中心（PagerDuty FlashDuty 这种产品）在这些功能上会有一定的

0 码力 | 23 页 | 1.75 MB | 2 年前
3
Java EE 企业应用系统设计 - HTTP 请求处理编程

学习目标 1. 理解 Web 的工作模式，掌握 HTTP 协议的特点以及 HTTP 请求中包含哪些信息。 2. 理解 Java HTTP 请求对象的类型及其生命周期，掌握请求对象的功能，学习部分请求对象方法的用法。 HTTP 请求内容 ## 大纲 HTTP 请求内容 Java EE 请求对象 HTTP 请求内容 ## 接下来… HTTP 请求内容 Java EE 请求对象 ## Web 工作模式 Host 浏览器访问的主机名 Referer 浏览器是从哪个页面来的 Cookie 浏览器保存的 cookie 对象 Java EE Web 组件 Servlet 和 JSP 中可以使用请求对象的方法读取这些请求内容，进而进行相应的处理。 ## HTTP 请求中包含的信息 ## ✿ 请求体每次 HTTP 请求时，在请求头之后会有一个空行，接下来是请求中包含的提交数据，即请求体。 ## HTTP 请求时数据会出现在 URL 中，保密性差，实际编程中要尽量避免。 ## HTTP 请求中包含的信息 ## ② POST 请求 ▶ 请求体数据单独打包为数据块，通过 Socket 直接传递到 Web 服务器端，数据不会在地址栏出现。可以提交大的数据，包括二进制文件，实现文件上传功能。原则上 POST 请求对提交的数据没有大小限制。 HTTP 请求内容 ## 接下来… HTTP 请求内容 Java

0 码力 | 27 页 | 565.27 KB | 2 年前
3
在大规模Kubernetes集群上实现高SLO的方法

0 码力 | 11 页 | 4.01 MB | 2 年前
3

共 1000 条前往

页

分类

语言

格式

Falcon v1.4.1-post-1 Documentation

vLLM v0.6.1.post1 Documentation

vLLM v0.6.1.post2 Documentation

vLLM v0.4.0.post1 Documentation

vLLM v0.5.0.post1 Documentation

vLLM v0.5.3.post1 Documentation

Practices of Go Microservices on Post-Kubernetes-Wei Zheng

告警OnCall事件中心建设方法白皮书

Java EE 企业应用系统设计 - HTTP 请求处理编程

在大规模Kubernetes集群上实现高SLO的方法

搜索

分类

语言

格式