《Efficient Deep Learning Book》[EDL] Chapter 5 - Advanced Compression Techniquespruning technique because of its simplicity and effectiveness. Later on in this chapter, we have a project that relies on it for sparsifying a deep learning model. The authors of the Optimal Brain Damage trained a model to predict masks for pets to build snapchat like filters. Let’s continue on the same project to demonstrate how we can create a pruned network without significant drop in accuracy in the next 09723v1 3 https://github.com/google/XNNPACK Project: Lightweight model for pet filters application Recall that our regular CNN model in the pet filters project consisted of thirteen convolution blocks and0 码力 | 34 页 | 3.18 MB | 1 年前3
 keras tutorialenvironment while developing Python applications. Linux/Mac OS Linux or mac OS users, go to your project root directory and type the below command to create virtual environment, python3 -m venv kerasenv install numpy you could see the following response, Collecting numpy Downloading https://files.pythonhosted.org/packages/cf/a4/d5387a74204542a60ad1baa84cd2d3353 c330e59be8cf2d47c0b11d3cde8/ install pandas We could see the following response: Collecting pandas Downloading https://files.pythonhosted.org/packages/cf/a4/d5387a74204542a60ad1baa84cd2d3353 c330e59be8cf2d47c0b11d3cde8/0 码力 | 98 页 | 1.57 MB | 1 年前3
 《Efficient Deep Learning Book》[EDL] Chapter 6 - Advanced Learning Techniques - Technical Reviewsuch models, the high costs of pre-training get spread over the number of applications using it. Project: Using Pre-trained Language Models for News Classification That was a lot of talk without any code we need, and the number of training epochs we need to achieve our desired model quality. In this project we will demonstrate that self-supervised models provide both those efficiency gains. We will work quality and faster convergence than a BERT model that is trained from scratch. The code for this project is available here as a Jupyter notebook. We will not be explicitly demonstrating pre-training BERT0 码力 | 31 页 | 4.03 MB | 1 年前3
 《Efficient Deep Learning Book》[EDL] Chapter 2 - Compression Techniquesquantization section delves into the implementation details using code samples. We finish with a hands-on project that will walk you through the process of applying quantization in practical situations using popular precision trade off! It’s time to put our understanding of quantization into practice with a hands-on project. We will apply the learnings from weight and activation quantizations to a real world deep learning learning model and demonstrate the size reduction and inference efficiency improvements. The project will use the famous MNIST dataset! Figure 2-10: Latency v/s accuracy trade off for unoptimized representation0 码力 | 33 页 | 1.96 MB | 1 年前3
 AI大模型千问 qwen 中文文档./main -h 以了解它们。 1.4.3 生成你的 GGUF 文件 We introduce the method of creating and quantizing GGUF files in quantization/llama.cpp. You can refer to that document for more information. 1.4.4 PPL 评测 llama built-in tool␣ �→in Qwen-Agent bot = Assistant(llm=llm_cfg, system_message=system, function_list=tools, files=[os.path.abspath('doc.pdf')]) messages = [] while True: query = input('user question: ') messages establish a knowledge base Q&A solution. 1.16.1 基础用法 The implementation process of this project includes loading files -> reading text -> segmenting text -> vectorizing text -> vectorizing questions -> matching0 码力 | 56 页 | 835.78 KB | 1 年前3
 PyTorch Release Notesabout PyTorch, including tutorials, documentation, and examples, see: ‣ PyTorch website ‣ PyTorch project This document provides information about the key features, software enhancements and improvements Facebook's Fairseq NLP Toolkit and is built on top of PyTorch. The original version in the Fairseq project was developed using Tensor Cores, which provides significant training speedup. Our implementation The original version PyTorch Release 19.12 PyTorch RN-08516-001_v23.07 | 282 in the Fairseq project was developed using Tensor Cores, which provides significant training speedup. Our implementation0 码力 | 365 页 | 2.94 MB | 1 年前3
 《Efficient Deep Learning Book》[EDL] Chapter 7 - Automationoxford_flowers102 dataset. In the next section, we will retrain the same model but with a twist! Project: Oxford Flower Classification With Hyperparameter Tuning Recall that in chapter 3, we trained a dropout rate was 0.2. The model reached the top accuracy of 70% after training for 100 epochs. In this project, we will let the HyperBand choose the best values for these hyperparameters and see if we can do alse) ) Let's resize the dataset splits to the same size. The target size is identical to the project in chapter 3. # Dataset image size IMG_SIZE = 264 def resize_image(image, label): image = tf.image0 码力 | 33 页 | 2.48 MB | 1 年前3
 《Efficient Deep Learning Book》[EDL] Chapter 3 - Learning TechniquesSmaller, Faster, and Better." arXiv preprint arXiv:2106.08962 (2021). It’s time for a hands-on project to apply our recent learnings and measure their impact. We will use the oxford_flowers102 dataset performances with and without data augmentation to measure the benefits of the techniques we just learnt. Project: Oxford Flowers Classification The oxford_flowers102 dataset contains 1020 labeled examples each good quality model. So, we use a pre-trained ResNet50 model and fine tune it. The code for this project is available as a Jupyter notebook here. Tensorflow provides easy access to this dataset through0 码力 | 56 页 | 18.93 MB | 1 年前3
 Experiment 1: Linear Regressionstart a very simple case where n = 1. Download data1.zip, and extract the files (ex1x.dat and ex1y.dat) from the zip file. The files contain some example measurements of heights for various boys between the complicated case where each training data contains mul- tiple features. Download data1.zip, and extract the files (ex2x.dat and ex2y.dat) from the zip file. This is a training set of housing prices in Portland, Oregon0 码力 | 7 页 | 428.11 KB | 1 年前3
 《Efficient Deep Learning Book》[EDL] Chapter 4 - Efficient Architecturesyour embeddings similar to Word2Vec, GloVe, and other embedding methods. How about we jump into a project now to demonstrate how embeddings can be used to achieve a high performing model while optimizing the recent literature. Using pre-trained embeddings to improve accuracy of a NLP task. In this project we will work with the DBPedia dataset, which has snippets of text from Wikipedia. Our goal is to attention parameters, the next component to attack is the softmax computation. The Low Rank methods project the keys and the values to a smaller dimension k to reduce the computation and memory complexity0 码力 | 53 页 | 3.92 MB | 1 年前3
共 19 条
- 1
 - 2
 













