Trends Artificial Intelligence
Vint Cerf, one of the ‘Founders of the Internet,’ said in 1999, ‘…they say a year in the Internet business is like a dog year – equivalent to seven years in a regular person's life.’ At the time, the pace universally accessible and useful.’ Alibaba’s founding mission (1999) was to ‘make it easy to do business anywhere.’ Facebook’s founding mission (2004) was ‘to give people the power to share and make the logarithmic scale. GK$ (Gross Knowledge Dollars) is an informal term used to estimate the potential business value of a specific insight, idea, or proprietary knowledge. It reflects how much that knowledge0 码力 | 340 页 | 12.14 MB | 4 月前3OpenAI 《A practical guide to building agents》
issue, booking a restaurant reservation, committing a code change, or generating a report. Applications that integrate LLMs but don’t use them to control workflow execution—think simple chatbots, single-turn single-turn LLMs, or sentiment classifiers—are not agents. More concretely, an agent possesses core characteristics that allow it to act reliably and consistently on behalf of a user: 01 It leverages building agents Agent design foundations In its most fundamental form, an agent consists of three core components: 01 Model The LLM powering the agent’s reasoning and decision-making 02 Tools External0 码力 | 34 页 | 7.00 MB | 5 月前3OpenAI - AI in the Enterprise
leader in financial services, Morgan Stanley is a relationship business. Not surprisingly, there were some questions across the business about how AI could add value to the highly personal and sensitive testing the outputs that your models produce. Rigorous evals lead to more stable, reliable applications that are resilient to change. Evals are built around tasks that measure the quality of the the new, customized context. The performance uplift was significant: A 20% increase in job applications started A 13% uplift in downstream success—not only were more candidates likely to apply,0 码力 | 25 页 | 9.48 MB | 5 月前3DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
limits the maximum batch size and sequence length. 2.1.2. Low-Rank Key-Value Joint Compression The core of MLA is the low-rank joint compression for keys and values to reduce KV cache: c?? ? = ? ???h? Thirty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2020, The Thirty-Second Innovative Applications of Artificial Intelligence Conference, IAAI 2020, The Tenth AAAI Symposium on Educational Advances Yanhong Xu Yanping Huang Yaohui Li Yi Zheng Yuchen Zhu Yunxian Ma Zhen Huang Zhipeng Xu Zhongyu Zhang Business & Compliance Bin Wang Dongjie Ji Jian Liang Jin Chen Leyi Xia Miaojun Wang Mingming Li Peng Zhang0 码力 | 52 页 | 1.23 MB | 1 年前3TVM@AliOS
MobileNetv2 LaneNet 图TFLite1core 图TFLite4core 国QNNPACK 1core 四QNNPACK4core 四TVM1core 四TVM4core AiOS 1驱动万物智能 Alios TVM @ ARM CPU FP32 。,NHWC layout 。 For pointwise as pipeline 。, Implement complete Hexagon runtime based on community PR. ADSPRPC Framework Applications Processor | | DSP Processor /NiiOS ! 驱动万物智能 Alios TVM Q@ Hexagon0 码力 | 27 页 | 4.86 MB | 5 月前3Google 《Prompt Engineering v7》
that variable in each prompt. This makes a lot of sense when integrating prompts into your own applications. Prompt VARIABLES {city} = "Amsterdam" PROMPT You are a travel guide. Tell me a fact about the without its drawbacks. The structured nature of JSON, while beneficial for parsing and use in applications, requires significantly more tokens than plain text, leading to increased processing time and especially valuable when working with large volumes of data or when integrating LLMs into complex applications. Experiment together with other prompt engineers If you are in a situation where you have to try0 码力 | 68 页 | 6.50 MB | 6 月前3Facebook -- TVM AWS Meetup Talk
Sparse Transformers, etc - Reduce precision with int8/float16 - very helpful to maintain model in core-private L1 dcaches - Use rational approximations for transcendentals (exp, tanh, erf, etc) - very lines of Relay IR) - A few days of work - TVM sampling model running in 30us on single server CPU core - Beat hand-written, highly optimized baselines (https://github.com/mozilla/LPCNet) by ~40% - Bonus:0 码力 | 11 页 | 3.08 MB | 5 月前3OctoML OSS 2019 11 8
multiple employees to contribute to TVML. ee Today we'ltouch on a few of those contribution areas: o Core Infrastructure Improvements to TVM o_uTVM: support for microcontrollers in TVM o_ Virtual Machine dynamic NNs support (w/ AWS folks) o_ Improved NLP support, with focus on transformers QQ octoML Core Infrastructure Refactors ee New Integer Analysis Infrastructure o_ Supports the ability to handle0 码力 | 16 页 | 1.77 MB | 5 月前3TVM Meetup: Quantization
Amazon Web Services, Inc. or its Affiliates. All rights reserved. Evaluation • Intel Cascade Lake 12-core Server • TFLite Pre-quantized Hosted Models© 2019, Amazon Web Services, Inc. or its Affiliates. All0 码力 | 19 页 | 489.50 KB | 5 月前3
共 9 条
- 1