Kubernetes Day 2 Operation - IT文库_程序员IT互联网编程电子书和文档免费下载，助您码力十足！

首页文库资料文章资讯上传文档发布文章登录账户

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model DeepSeek-AI research@deepseek.com Abstract We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees strong models at an economical cost through sparse computation. Compared with DeepSeek 67B, DeepSeek-V2 achieves significantly stronger performance, and meanwhile saves 42.5% of training costs, reduces the

0 码力 | 52 页 | 1.23 MB | 1 年前
3
Trends Artificial Intelligence

Artificial Intelligence (AI) May 30, 2025 Mary Meeker / Jay Simons / Daegwon Chae / Alexander Krey2 Context We set out to compile foundational trends related to AI. A starting collection of several from Get-Go = Growth We Have Not Seen Likes of Before • AI & Work Evolution = Real + Rapid 3 1 2 3 4 5 6 7 8 9-51 52-128 129-152 153-247 248-298 299-307 308-322 # 323-336 OutlineWeekly Details on Page 56 AI User + Usage + CapEx Growth = Unprecedented Leading USA-Based LLM Users 2 Source: Company disclosures Details on Page 55 6MM 2005 2025 Number of Developers, MM 0% 50%

0 码力 | 340 页 | 12.14 MB | 5 月前
3
OpenAI - AI in the Enterprise

experts 16 Unblock your developers 18 Set bold automation goals 21 Conclusion 22 More resources 24 2 AI in the EnterpriseA new way   to work As an AI research and deployment company, OpenAI prioritizes production. 6 AI in the EnterpriseHow it’s going Today, 98% of Morgan Stanley advisors use OpenAI every day; access to documents has jumped from 20% to 80%, with dramatically reduced search time; and advisors Your key metrics will depend on what matters most for each   use case. 8 AI in the EnterpriseLesson 2 Embed AI into your products How Indeed humanizes job matching When AI is used to automate and accelerate

0 码力 | 25 页 | 9.48 MB | 6 月前
3
TVM@Alibaba AI Labs

PART 2 : HIFI4 DSP PART 3 : _ PowervVR GPU [和| Alibaba AL.Labs 阿里巴巴人工智能实验室 ARM 32 CPU Resolution Quantization Orize Kernel ALIOS TVM Alibaba Al.Labs 阿里巴巴人工智能实验室 7= 590一I) rm 一下Er (mm) =肪+2mxaM0 =肪+2mxaM0 [5 (全-2)+o| current plan 1 = int16 * int16 erflow-aware int16 = int8 xint8 ent pl 1=int8 int8 * int8 int32 = int16 1 + int16 x int8 Alibaba Al.Labs 阿里巴巴人工智能实验室 CPU : MTK8167S (ARM32 A35 1.5GHz) 5GHz) Model : MobileNetV2_ 1.0_ 224 400 336 350 3丈 300 250 230 214 0 200 0

0 码力 | 12 页 | 1.94 MB | 6 月前
3
Dynamic Model in TVM

ty=int32 */; %2 = @sum_up(%1) /* ty=int32 */; add(%2, %i1) /* ty=int32 */ } } sum_up: alloc_storage 1 1 64 bool alloc_tensor $2 $1 [] uint1 invoke_packed PackedFunc[0] (in: $0, out: $2) load_consti load_consti $3 1 if $2 $3 1 2 goto 9 alloc_storage 4 4 64 int32 alloc_tensor $5 $4 [] int32 invoke_packed PackedFunc[1] (in: $0, out: $5) invoke $6 VMFunc[0]($5) alloc_storage 7 4 64 int32 alloc_tensor $8 $7 [] int32 int32 invoke_packed PackedFunc[2] (in: $6, $0, out: $8) move $0 $8 ret $0 main: invoke $1 VMFunc[0]($0) ret $1© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Dynamic codegen:

0 码力 | 24 页 | 417.46 KB | 6 月前
3
XDNN TVM - Nov 2019

Elliott Delaye FPGA CNN Accelerator and TVM© Copyright 2018 Xilinx TVM Target devices and models >> 2 HW Platforms ZCU102 ZCU104 Ultra96 PYNQ Face detection Pose estimation Video analytics Queue Instruction Buffer Cross Bar Pooling/ EWA© Copyright 2018 Xilinx Xilinx Edge DPU IP (DPUv2) Source: Published results from Huawei 18% 13% 14% 40% 24% 23% 85% 51% 52% 0% 20% 40% 60% inputs] out_shapes = [[int(i) for i in outputs[0].shape]] # EXTERNAL FUNCTION TO RUN THE FUSED OPERATION out = tvm.extern(outputs[0].shape, inputs, lambda ins, outs: tvm.call_packed('tvm.accel.accel_fused'

0 码力 | 16 页 | 3.35 MB | 6 月前
3
清华大学第二弹：DeepSeek赋能职场

(Identity) •角色属性 •专业背景 •交互特征执行层: 2. 能力矩阵 (Capability Matrix) •功能范围 •专业技能 •决策权限约束层: 3. 边界系统 (Boundary System) •伦理规范 •安全限制 •资源约束操作层: 4. 工作引擎 (Operation Engine) •输入处理 •执行流程 •输出规范如何使用DeepSeek制作可视化图表？少关键信息，不少于30页内容，内容一定要完整。 •确保所有信息的准确性和来源可靠性，尤其是行业数据和市场分析。工作流程: 1.确认主题：询问用户的PPT主题，并了解用户的具体需求和内容重点。 2.收集资料：快速查找相关的研究报告、市场分析数据等，确保信息的最新性和权威性，并将关键资料整理形成表格。 3.生成PPT大纲：根据用户需求和收集到的资料，构建PPT大纲，明确每一页的内容和结构。要现。不少于30页内容。 4. 生成关键页面的流程图，针对部分核心页面内容，生成流程图。输出内容及格式: 1、研究资料摘要，表格格式，包含报告主题、关键摘要、报告地址，不少于5份； 2、PPT大纲，Markdown格式，不少于30页； 3、核心内容的流程图，Mermaid格式。当用户输入特定主题时，请首

0 码力 | 35 页 | 9.78 MB | 8 月前
3
Google 《Prompt Engineering v7》

Prompt Engineering Author: Lee Boonstra Prompt Engineering February 2025 2 Acknowledgements Content contributors Michael Sherman Yuan Cao Erick Armbrust Anant Nawalgaria Antonio Gulli Simone Cammel meet the top-P criteria, and none are selected out. As a general starting point, a temperature of .2, top-P of .95, and top-K of 30 will give you relatively coherent results that can be creative but not complex tasks, or you may need to use fewer due to the input length limitation of your model. Table 2 shows a few-shot prompt example, let’s use the same gemini-pro model configuration settings as before

0 码力 | 68 页 | 6.50 MB | 6 月前
3
PAI & TVM Meetup - Shanghai 20191116

groundbreaking AI performance. 。 Performs /mxeo-Drecsion matrix multiply and accumulate in a single operation. Background 全于由。TensorCore 。 Poograrm171aple matrix-multiply-and-accumulate

0 码力 | 26 页 | 5.82 MB | 6 月前
3
DeepSeek从入门到精通(20250204)

提示语策略差异 1 2 推理模型通用模型 • 提示语更简洁，只需明确任务目标和需求（因其已内化推理逻辑）。 • 无需逐步指导，模型自动生成结构化推理过程（若强行拆解步骤，反而可能限制其能力）。 • 需显式引导推理步骤（如通过CoT提示），否则可能跳过关键逻辑。 • 依赖提示语补偿能力短板（如要求分步思考、提供示例）。关键原则 3 2 1 模型选择 • 特点需求表达公式推理模型适配策略通用模型适配策略 1. 决策需求需权衡选项、评估风险、选择最优解目标 + 选项 + 评估标准要求逻辑推演和量化分析直接建议，依赖模型经验归纳 2. 分析需求需深度理解数据/信息、发现模式或因果关系问题 + 数据/信息 + 分析方法触发因果链推导与假设验证表层总结或分类 3. 创造性需求需生成新颖内容（文本/ 地理解和执行任务。 ▪ 期望（Expectation)：明确或隐含地表达你对AI输出的要求和预期。提示语类型提示语的本质 1. 指令型提示语：直接告诉AI需要执行的任务。 2. 问答型提示语：向AI提出问题，期望得到相应的答案。 3. 角色扮演型提示语：要求AI扮演特定角色，模拟特定场景。 4. 创意型提示语：引导AI进行创意写作或内容生成。 5.

0 码力 | 104 页 | 5.37 MB | 8 月前
3

共 34 条前往

页

分类

语言

格式