IT文库
Search completed in 0.017 seconds; about 6 matching results found.
  • PDF document: XDNN TVM - Nov 2019

    we track: Latency & Throughput. ML pipeline contains multiple stages, performance limited by slowest one. Performance results based on Xilinx own runtime pipeline available in github (https://github es/mp_classify.py). Streamlined multi-process pipeline using shared memory. Usually need >4 Pre-Process cores running to keep up with FPGA. TVM pipeline needed. CPU/FPGA partitions ideally run in parallel. Post-Process (fc/softmax/nms), FPGA Acceleration, Pre-Process (resize). FPGA Pipeline report in MLSuite 1.5 (animated gif of ResNet-50, view in slideshow mode). © Copyright 2018 Xilinx
    0 credits | 16 pages | 3.35 MB | 5 months ago
  • PDF document: DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

    training. We set the maximum sequence length to 4K, and train DeepSeek-V2 on 8.1T tokens. We leverage pipeline parallelism to deploy different layers of a model on different devices, and for each layer, the light-weight training framework developed internally by our engineers. It employs a 16-way zero-bubble pipeline parallelism (Qi et al., 2023), an 8-way expert parallelism (Lepikhin et al., 2021), and ZeRO-1 data models. arXiv preprint arXiv:2309.00071, 2023. P. Qi, X. Wan, G. Huang, and M. Lin. Zero bubble pipeline parallelism. arXiv preprint arXiv:2401.10241, 2023. S. Rajbhandari, J. Rasley, O. Ruwase, and
    0 credits | 52 pages | 1.23 MB | 1 year ago
  • PDF document: TVM Meetup: Quantization

    new/tuned TVM schedules using fast integer operations like Intel VNNI, ARM Dot, Nvidia DP4A • Full pipeline is available. Please try it and give suggestions. • Open-source discussions formed the foundations
    0 credits | 19 pages | 489.50 KB | 5 months ago
  • PDF document: TVM@AliOS

    libtvm_hexagon_runtime.so. AliOS TVM @ Hexagon DSP. Compute kernel offload to DSP, loop nests marked as pipeline. Implement complete Hexagon runtime based on community PR. ADSPRPC Framework, Applications Processor
    0 credits | 27 pages | 4.86 MB | 5 months ago
  • PDF document: TVM@Alibaba AI Labs

    Cooperative Fetching: lets threads (work items) in the same thread block (work group) cooperatively fetch dependent data. https/www khronos.org/
    0 credits | 12 pages | 1.94 MB | 5 months ago
  • PDF document: Trends Artificial Intelligence

    data center design. In 2019, AI was a research feature; by 2023, it was a capital expenditure line item. Microsoft Vice Chair and President Brad Smith put it well in a 4/25 blog post: Like electricity
    0 credits | 340 pages | 12.14 MB | 4 months ago