阿里云上深度学习建模实践-程孟力
标准化模型库 标准化解决方案 1.方案复杂 图像 搜索 推荐 语音 视频理解 NLP 广告 CNN RNN GNN MLP Tensorflow PyTorch Parameter Server MPI TreeModel SQL MapReduce Blink 场景丰富: 图像/视频/推荐/搜索 大数据+大模型: Model Zoo 跨场景+跨模态 开箱即用: EAS Web App Mobile App On-prem System 3 1 2 证件扫描 活体检测 人脸比对 • 卡证OCR • 人脸检测 • 活体检测 •人脸比对 Mobile SDK API + customer 示例: e-Know Your Customer eKYC eKYC Server eKYC SDK/API 多语言、国际化 0 码力 | 40 页 | 8.51 MB | 1 年前3AI大模型千问 qwen 中文文档
install vllm 运行以下代码以构建 vllm 服务。此处我们以 Qwen1.5-7B-Chat 为例: python -m vllm.entrypoints.openai.api_server --model Qwen/Qwen1.5-7B-Chat 然后,您可以使用 create chat interface 来与 Qwen 进行交流: curl http://localhos 包中的 Python 客户端: from openai import OpenAI # Set OpenAI's API key and API base to use vLLM's API server. openai_api_key = "EMPTY" openai_api_base = "http://localhost:8000/v1" client = OpenAI( (续下页) safetensor.index.json │ │ ├── merges.txt │ │ ├── tokenizer_config.json │ │ └── vocab.json 随后你需要运行 python server.py 来启动你的网页服务。请点击进入 `http://localhost:7860/?__theme=dark` 然后享受使用 Qwen 的 Web UI 吧! 1.6.2 下一步 TGW0 码力 | 56 页 | 835.78 KB | 1 年前3PyTorch Release Notes
4.6.1 ‣ Jupyter Notebook 6.0.3 ‣ JupyterLab 2.3.2, including Jupyter-TensorBoard ‣ JupyterLab Server 1.0.6 ‣ Jupyter-TensorBoard Driver Requirements Release 22.08 is based on CUDA 11.7.1, which requires 4.6.1 ‣ Jupyter Notebook 6.0.3 ‣ JupyterLab 2.3.2, including Jupyter-TensorBoard ‣ JupyterLab Server 1.0.6 ‣ Jupyter-TensorBoard Driver Requirements Release 22.07 is based on CUDA 11.7 Update 1 Preview 4.6.1 ‣ Jupyter Notebook 6.0.3 ‣ JupyterLab 2.3.2, including Jupyter-TensorBoard ‣ JupyterLab Server 1.0.6 ‣ Jupyter-TensorBoard Driver Requirements Release 22.06 is based on CUDA 11.7 Update 1 Preview0 码力 | 365 页 | 2.94 MB | 1 年前3微博在线机器学习和深度学习实践-黄波
serving server server server worker Model Serving System Serving PS Traing PS Traing Model System Predict Score Sample Data worker worker worker 3 在线机器学习-参数服务器 serving serving serving server server server server server server worker worker worker PSscheduler PSserver PSserver PSserver PSagent PSagent zookeeper PSproxy PSproxy PSsubmit File System checkpoint Model Training System Model Status set/get Model delete Model Save Model Load HA Fault tolerance checkpoint Local HDFS Param Server System Model Serving System 3 在线机器学习-参数服务器 • 参数规模 • 支持百亿特征维度,千亿参数 • 模型版本 • 多模型多版本:多组实验并行执行,提高实验迭代效率0 码力 | 36 页 | 16.69 MB | 1 年前3搜狗深度学习技术在广告推荐领域的应用
查询特征 广告特征 匹配特征 线性模型 非线性模型 Data Feature Model 线上Server CTR预估 Rank Online 特征抽取 CTR预估涉及技术 CTR预估 数据 模型 平台 MPI XgBoost Parameter Server 线性(LR) 非线性(GBDT) 深度(DNN) 实时(FTRL) 特征 训练数据 融合模型 Feature Maker One Case ALL One Hot 特征 Final CTR Bidding Server OFFLINE ONLINE OneHot Float LR Model DNN Model Retriever Server CTR Table DNN Model Feature LR Model Feature 特 征 池0 码力 | 22 页 | 1.60 MB | 1 年前3从推荐模型的基础特点看大规模推荐类深度学习系统的设计 袁镱
内存成为主要资源瓶颈。由于需要等待全部参数 就绪,Parameter Server难以利⽤速度慢的存储 介质 样本读取 样本解析 参数拉 取 训练 参数更新 查询Sparse Table 查询Dense Tensor Reader Learner Worker 返回参数 Request Handler Parameter Server 查询Sparse Table 查询Dense Tensor 参数更新 查询Sparse Table 查询Dense Tensor Reader Learner Worker 返回参数 Request Handler Parameter Server 更新参数 � 异步参数处理流⽔线 参数 预准备 Batch⼊队列 Batch⼊队列 � 效果: � 在不影响训练效果的情况下,降低参数准备与更新耗时,提 ⾼训练速度。训练耗时下降超50%0 码力 | 22 页 | 6.76 MB | 1 年前3《Efficient Deep Learning Book》[EDL] Chapter 1 - Introduction
Having such a toolbox to make our models pareto-optimal has the following benefits: Sustainable Server-Side Scaling Training and deploying large deep learning models is costly. While training is a one-time inference can be run completely on the user’s device without the need to send the input data to the server-side. New Applications Efficiency would also enable applications that couldn’t have otherwise been device. This further reduces the available resources for a single model. This could happen on the server-side where multiple models are co-located on the same machine, or could be in an app where different0 码力 | 21 页 | 3.17 MB | 1 年前3【PyTorch深度学习-龙龙老师】-测试版202112
的梯度信息并不直接用于更新 Worker 的 Actor-Critic 网络,而是提 交到 Global Network 更新。具体地,在 Worker 类初始化阶段,获得 Global Network 传入的 server 对象和 opt 对象,分别代表了 Global Network 模型和优化器;并创建私有的 ActorCritic 网络类 client 和交互环境 env。代码如下: class Worker(threading def __init__(self, server, opt, result_queue, idx): super(Worker, self).__init__() self.result_queue = result_queue # 共享队列 self.server = server # 中央模型 self.opt # 梯度提交到 server,在 server 上更新梯度 self.opt.apply_gradients(zip(grads, self.server.trainable_weights))0 码力 | 439 页 | 29.91 MB | 1 年前3深度学习与PyTorch入门实战 - 30. Visdom可视化
install tensorboardX TensorboardX Visdom from Facebook Step 1. install Step2. run server damon Step2. run server damon install from source lines: single trace lines: multi-traces visual X 下一课时0 码力 | 17 页 | 1.47 MB | 1 年前3《TensorFlow 快速入门与实战》3-TensorFlow基础概念解析
��������������� �������������� TensorFlow ���� Client Server (local machine) Worker /cpu:0 Worker /gpu:0 TensorFlow ���� Client Server (local machine) RunStep() Worker /cpu:0 Worker /gpu:00 码力 | 50 页 | 25.17 MB | 1 年前3
共 16 条
- 1
- 2