Artificial intelligence is one of the hottest topics in today's hottest industry, and deep learning is its hottest branch. For traditional IT practitioners, however, AI technology seems to be all models, algorithms, and vectors, and looks too difficult to understand. The goal of this article is to help IT practitioners grasp the characteristics of deep learning technology; I hope readers will benefit from it.
First, the right time and place for artificial intelligence

The maturity of an industry depends on the efforts of its practitioners (the human factor), but also on the environment and the course of history (the right time and place).
The explosion of artificial intelligence technology is not a simple technological advance; it is the combined result of software, hardware, and data. Deep learning is the hottest branch of AI, and it too is constrained by these three conditions.
The algorithms that AI software relies on have been around for many years. Neural networks were proposed more than 50 years ago, and algorithms such as CNNs and RNNs are older than most readers. AI was shelved for so long because it lacked hardware power and massive data. With successive generations of CPUs, GPUs, and FPGAs, computing power has grown roughly ten-thousand-fold over recent decades, and that power has gradually been unleashed. With the falling price of storage and bandwidth, the data side has changed too: 20 years ago all of humanity had only a handful of high-definition photos, while today a single company's data volume can reach the EB scale. Big data technology can only read and write structured logs; to understand video and images you must use AI, because no human team can watch that many cameras.
Only by sincerely taking AI off its pedestal can we use it as a handy tool. AI theory looks deep mainly because the industry has only just sprouted and has not yet stratified into layers, just as the skills an IT engineer had to master 20 years ago are things newcomers today no longer need to care about.
Second, the correlation-based model

Using deep learning takes two steps: you must first train to generate a model, and then use that model to make inferences on the task at hand.
For example, suppose I label one million images as either cat or dog. The AI extracts features from each part of the images and generates a cat-and-dog recognition model. We then put this model behind an interface to make a cat-and-dog detection program: every time you give the program a photo, it tells you the probability that it is a cat and the probability that it is a dog.
This recognition model is the most critical part of the whole program, and you can loosely think of it as a sealed black-box recognition function. In the past we wrote programs full of if-then-else causal judgments, but image features carry no causal relationship; the model only measures degrees of correlation. Old working habits become a new cognitive obstacle here, so it is better simply to treat the model as a black box and use it.
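To make the "train first, then infer" idea concrete, here is a minimal sketch in Python. It is an illustration only, not the tooling used later in this article: the feature vectors and cat/dog labels are randomly generated stand-ins, and scikit-learn is used purely for brevity.

```python
# Minimal sketch of the two-step "train, then infer" workflow described above.
# Illustrative only: features and labels are random stand-ins for real image data.
from sklearn.linear_model import LogisticRegression
import numpy as np

# Step 1: train a model from labeled samples.
rng = np.random.default_rng(0)
X_train = rng.normal(size=(1000, 64))        # pretend image feature vectors
y_train = rng.integers(0, 2, size=1000)      # 0 = cat, 1 = dog (toy labels)
model = LogisticRegression(max_iter=1000).fit(X_train, y_train)

# Step 2: inference -- the "black box" returns correlation-style probabilities,
# not an if-then-else causal judgment.
photo_features = rng.normal(size=(1, 64))
cat_prob, dog_prob = model.predict_proba(photo_features)[0]
print(f"cat: {cat_prob:.2%}, dog: {dog_prob:.2%}")
```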
Next, I will walk through screenshots of the training and inference steps of an experiment to illustrate two points:
A model must be trained on the customer's own field data. Training a model is not a software outsourcing job billed by the man-day, and it is hard to commit in advance to what the training results will be.
Training a model is tedious and time-consuming, but it is not hard to learn; the pressure is far lower than a DBA tuning SQL on a live system. IT engineers still have a place in the AI era.
Third, a hands-on experiment

This section is long. If you are not interested in the experimental steps and results and just want my conclusions, you can skip ahead.
The lab is an introductory training course offered by NVIDIA: "Image Classification with DIGITS - Training a model".
The experiment is very simple: we use 6,000 images to train the AI to recognize the digits 0-9.
The sample data consists of 6,000 small pictures of the digits 0-9, of which 4,500 are used for training and 1,500 for validation.
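For readers who want to see what a 4,500/1,500 split looks like in code, here is a minimal sketch. It is purely illustrative: DIGITS performs this split for you, and the file names below are hypothetical.

```python
# Sketch of a 75/25 train/validation split like the 4,500/1,500 split above.
# Illustrative only; DIGITS handles this itself, and the file names are made up.
import random

all_images = [f"digit_{i:04d}.png" for i in range(6000)]   # hypothetical file names
random.seed(42)
random.shuffle(all_images)

split = int(len(all_images) * 0.75)
train_set, val_set = all_images[:split], all_images[split:]
print(len(train_set), "training images,", len(val_set), "validation images")
```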
Experimental data preparation
The training pictures are very small and very simple; the preview below shows a batch of digits:
-- Figure 01: sample pictures --
The picture used for testing is provided by the official tutorial: a red "2" on a white background.
-- Figure 02: test picture --
Creating the dataset
First we need to make a dataset for image recognition. The dataset files are placed in the "/data/train_small" directory, the image type is "Grayscale", the size is 28x28, everything else is left at its default, and the dataset is named "minidata".
-- Figure 03: new dataset --
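The essence of this step is that every image gets normalized to the same format. Here is a minimal sketch of that preprocessing, assuming the directory layout above; it is illustrative only, since DIGITS does this for you when it builds the dataset, and the file name is hypothetical.

```python
# Sketch of the preprocessing applied when building this dataset:
# every image is converted to grayscale and resized to 28x28 pixels.
# Illustrative only; the file path below is hypothetical.
from PIL import Image
import numpy as np

img = Image.open("/data/train_small/2/some_digit.png")   # hypothetical path
img = img.convert("L").resize((28, 28))                   # grayscale, 28x28
pixels = np.asarray(img, dtype=np.float32) / 255.0        # normalize to [0, 1]
print(pixels.shape)                                        # (28, 28)
```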
Below is the dataset creation in progress. Because our files are tiny, it finishes very quickly; with tens of millions of high-definition pictures it would be very slow, and you might even need a distributed system to spread the IO across multiple machines.
-- Figure 04: dataset creation in progress --
This is the bar chart shown for the completed dataset; with the mouse resting on one of the bars, it shows that 466 of the images are currently labeled "9".
-- Figure 05: completed dataset --
Start creating a model
With the dataset in place we can create a model. We choose to create an Image Classification Model, select the "minidata" dataset created before, set the number of training epochs to 30, and keep the other options at their defaults for now.
-- Figure 06: new model --
The second half of model creation is choosing the network configuration; we choose LeNet and name the model TestA.
-- Figure 07: selecting LeNet --
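For readers who have never seen what "choosing LeNet" actually selects, here is a sketch of a LeNet-style network written in PyTorch for illustration. This is an assumption about layer sizes in the classic LeNet recipe, not the Caffe prototxt that DIGITS itself uses.

```python
# Sketch of a LeNet-style network for 28x28 grayscale digits, in PyTorch.
# Illustrative only; DIGITS actually drives a Caffe definition of LeNet.
import torch
import torch.nn as nn

class LeNet(nn.Module):
    def __init__(self, num_classes: int = 10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 20, kernel_size=5),   # 28x28 -> 24x24
            nn.MaxPool2d(2),                   # -> 12x12
            nn.Conv2d(20, 50, kernel_size=5),  # -> 8x8
            nn.MaxPool2d(2),                   # -> 4x4
        )
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(50 * 4 * 4, 500),
            nn.ReLU(),
            nn.Linear(500, num_classes),
        )

    def forward(self, x):
        return self.classifier(self.features(x))

model = LeNet()
dummy = torch.zeros(1, 1, 28, 28)      # one 28x28 grayscale image
print(model(dummy).shape)              # torch.Size([1, 10]) - one score per digit
```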
We do not fine-tune anything in this demo, but in a production environment you may have to modify the network configuration file frequently.
-- Figure 08: fine-tuning LeNet --
The next step is to generate the model. For a simple task on a small dataset this is very fast and the validation accuracy is very high, but a big task with a big model may take days.
-- Figure 09: model generation starting --
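What "generating the model" means under the hood is repeated epochs of forward pass, loss, backward pass, and weight update. The sketch below shows that loop in PyTorch with random noise standing in for the digit images; it is an illustration only, since DIGITS runs this training for you.

```python
# Sketch of a training loop: 30 epochs of forward/backward/update.
# Illustrative only; the data here is random noise standing in for digit images.
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

net = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 10))   # stand-in for LeNet
images = torch.rand(600, 1, 28, 28)                          # fake digit images
labels = torch.randint(0, 10, (600,))                        # fake 0-9 labels
loader = DataLoader(TensorDataset(images, labels), batch_size=64, shuffle=True)

criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.SGD(net.parameters(), lr=0.01, momentum=0.9)

for epoch in range(30):                    # the 30 epochs chosen above
    for batch_images, batch_labels in loader:
        optimizer.zero_grad()
        loss = criterion(net(batch_images), batch_labels)
        loss.backward()
        optimizer.step()
```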
Model generation is complete; let's look at the validation accuracy. If the accuracy is too low for your production environment, you may want to fine-tune the model's parameters.
-- Figure 10: training accuracy --
Debugging the model
Scroll down on the model page to find the buttons to download the model, test the model, and so on. We choose to test the model and submit the red "2" on a white background.
-- Figure 11: testing the model --
The default is to test with the Epoch #30 snapshot, but let's try Epoch #10 first; I originally hoped to save on the server's electricity bill. The result: the image was identified as a "2" with only 20.3% confidence.
-- Figure 12: TestA result at epoch 10 --
We then tested with the Epoch #25 snapshot, and the confidence in the correct digit rose from 20.3% to 21.9%.
-- Figure 13: TestA result at epoch 25 --
The model tops out at 30 epochs, and even there the correct digit is recognized with only 21.92% confidence. The recognition failed at this point because my training data was 28x28 black-and-white images, while the test image I supplied matched neither the size nor the color.
-- Figure 14: TestA result at epoch 30 --
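Testing different epoch snapshots simply means loading a different saved checkpoint and asking it for class probabilities on the same image. Here is a minimal sketch of that outside the DIGITS web UI; the snapshot and image file names are hypothetical, and the stand-in network is an assumption, not the trained TestA model.

```python
# Sketch of testing a saved training snapshot on one image.
# Illustrative only; file names are hypothetical and the network is a stand-in.
import torch
import torch.nn as nn
import torch.nn.functional as F
from PIL import Image
from torchvision import transforms

preprocess = transforms.Compose([
    transforms.Grayscale(),
    transforms.Resize((28, 28)),
    transforms.ToTensor(),
])

net = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 10))   # stand-in architecture
net.load_state_dict(torch.load("snapshots/epoch_10.pt"))    # hypothetical snapshot
net.eval()

image = preprocess(Image.open("red_2_on_white.png")).unsqueeze(0)  # hypothetical file
with torch.no_grad():
    probs = F.softmax(net(image), dim=1)[0]
print(f"P(digit == 2) = {probs[2]:.1%}")
```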
Swapping models and continuing to debug
On the TestA model you can click "Clone Job", which creates a model with exactly the same configuration and runs the training again. This button is interesting: when code fails to compile, retrying 100,000 times will not make it pass, so why is cloning a job common enough to deserve its own button on the panel?
-- Figure 15: cloning model TestA --
Then something fun happened: the "TestA-Clone" I made identified the digit 2 with 94.81% confidence.
-- Figure 16: TestA-Clone result --
We cloned the original model once more, and this clone identified the digit 2 with 63.4% confidence.
-- Figure 17: second TestA clone result --
Next we create a new model, TestB, and let it continue training on the basis of TestA.
-- Figure 18: new model TestB --
TestB's result is worse than the original model: only 20.69% confidence.
-- Figure 19: TestB training result --
There is no worst, only worse; look at my newly trained model TestC.
-- Figure 20: TestC training failure --
In this test the best model is TestA-Clone, followed by the second clone.
-- Figure 21: summary of model results --
But does that mean we have found a suitable model? I also handwrote a digit 2 myself, deliberately drawn white-on-black at 28x28, yet none of the models identified it correctly; recognition failed across the board.
-- Figure 22: recognition of the new image failed --
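The failure illustrates how sensitive such a model is to mismatches between training and test inputs. The sketch below shows the kind of normalization you would check first: resizing to 28x28, converting to grayscale, and matching the polarity (dark-on-light versus light-on-dark). It is illustrative only; the file name is hypothetical, and the assumption that the training digits were light-on-dark is labeled as such in the comments.

```python
# Sketch of normalizing a test image to match the training distribution.
# Illustrative only; the file name is hypothetical.
from PIL import Image, ImageOps

img = Image.open("my_handwritten_2.png").convert("L").resize((28, 28))

# If the digit is dark on a light background, invert it so it matches training
# data that used light digits on a dark background (an assumption here).
if sum(img.getdata()) / (28 * 28) > 127:      # image is mostly light -> invert
    img = ImageOps.invert(img)
img.save("my_handwritten_2_normalized.png")
```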
Fourth, models come out of real-world practice

In this experiment, the model that reached 94.81% confidence was an unexpected surprise, and the failure of every model on the other test image was also unexpected. The initial sample for this experiment was only a few thousand images; with enough samples, the chance of overfitting (noise features getting baked into the model) would be lower. I used all the default tuning options; adding further feature tuning might reduce the chance of underfitting (failing to extract the main features). And I never explicitly defined the usage scenario for the model, that is, I never made sure the training data, the test files, and the production files were consistent.
We saw models with exactly the same configuration give wildly different recognition results for the same picture, simply because the training jobs were launched at different times. Let me emphasize again: this is not causal judgment but correlation computation. The conclusion of the experiment matches my claim above: a model needs to be trained on real field data, and we can only estimate a model's behavior, never predict it exactly. I ran this experiment to show everyone that training an AI model is not software outsourcing; you cannot quote a man-day price and promise a given prediction quality.
The simple business for an AI vendor is selling off-the-shelf models, such as face recognition models or OCR models. But if the customer has custom requirements, say identifying acne on a face, or telling whether a signature was written left-handed, then you first have to define the technical scenario and then do the heavy work of preparing the data. As for whether training the model takes one day or one month, AI model training is like materials testing: it may take half a year of trial and error to hit the target.
Fifth, new work for IT engineers

I made two points earlier; the second was that training a model is not difficult. IT engineers can learn to train models without much trouble, keep expanding the scope of our profession, and get a share of the big AI wave.
First of all, the technology is not a barrier. Take an example IT engineers will understand: an Oracle DBA who has never read the database source code, has never seen this particular business scenario, and is even weak on theory can still do routine operations; as long as the project can go live gradually, let him tune SQL offline and save the configuration once he finds the best performance point. Likewise, when tuning an AI model, understanding the principles and algorithms makes the work more purposeful, but being more purposeful only gets you closer to the target; it cannot guarantee you hit it.
From the experiment above, we can see that the following work is required:
Based on the customer's requirements, specify what raw data is needed. This calls for business-domain thinking: to find out who is prone to obesity, you would naturally think of everyone's diet and exercise habits, but a professional doctor will tell you to pull data such as transaminase and cholesterol levels.
The raw data has to be cleaned and labeled; samples without labels yield no correlations. The 6,000 images in the experiment above could all be labeled with the digits 0-9, and testing the model was just looking for the correlation with the group of images labeled "2". Cleaning, organizing, and labeling data may be automatic or manual: automatic means we write scripts or run big data jobs (a minimal sketch follows this paragraph); manual means writing up the requirement and hiring 1,500 labelers to draw boxes around inappropriate images, while the engineer oversees the whole process. There is also a shortcut: when a competitor's model is too expensive or simply not for sale, you can call that competitor's public cloud API directly, or buy the logs of its big customers, and let the competitor do the data screening for you.
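Here is the kind of "automatic" labeling script mentioned above: it walks a directory tree where each sub-folder name is the label, as in the digit dataset, and writes a label file. The directory layout and output file name are assumptions for illustration.

```python
# Sketch of automatic labeling: folder names (0-9) become the labels.
# Illustrative only; the directory layout and output file name are assumptions.
import csv
from pathlib import Path

root = Path("/data/train_small")
with open("labels.csv", "w", newline="") as f:
    writer = csv.writer(f)
    for label_dir in sorted(root.iterdir()):
        if label_dir.is_dir():
            for image_path in label_dir.glob("*.png"):
                writer.writerow([str(image_path), label_dir.name])
```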
The test above used only an image classification dataset, yet there were already plenty of adjustable options. Production environments contain not only images but also sound, text, motion features, and other kinds of datasets. Judging whether a dataset is reasonable may require rebuilding it several times, repeated tuning, and long-term observation.
In the experiment, generating a model with no parameter tuning took only a minute; in production, model generation parameters need frequent adjustment, and generating a single model may take hours or even days.
When verifying the accuracy of the results: for a soft requirement, eyeballing a handful of test results may be enough to put the model online; for a hard business requirement, you may have to organize more than 100,000 samples for test verification (a sketch of that batch check follows this list). Incidentally, the hardware used to train a model is not necessarily suitable for verification or for running production; a high-load scenario may require deploying on different hardware.
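The batch verification step is conceptually simple: run the model over a labeled test set and count hits. Below is a minimal sketch; `predict_digit` is a hypothetical stand-in for the deployed model's prediction call, and labels.csv is the label file format sketched earlier.

```python
# Sketch of batch verification: accuracy over a labeled test set.
# Illustrative only; predict_digit is a hypothetical stand-in for the real model.
import csv

def predict_digit(image_path: str) -> str:
    # Hypothetical stand-in: replace with a call to the deployed model.
    return "0"

correct = total = 0
with open("labels.csv") as f:
    for image_path, true_label in csv.reader(f):
        total += 1
        if predict_digit(image_path) == true_label:
            correct += 1
print(f"accuracy: {correct / total:.2%} over {total} samples")
```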
Models also need day-to-day maintenance: they may be retrained regularly as the dataset is updated, and a model may turn out to have a fatal misjudgment that threatens the business and requires prompt handling.
Sixth, a few small opinions as a bonus

Before closing, let me attach some personal opinions; I will only state the claims without the supporting arguments:
Setting up and using an AI environment is hard today, but the software will improve and solve this. Three years ago, cloud computing platforms were hard to deploy and maintain; now there are cloud platform solutions with one-click deployment and UI-based maintenance.
Deep learning as a field is far too hungry for data and computing power; the human brain is nowhere near as wasteful as AI. New techniques may well appear in the future and take over deep learning's position in the AI field.
Because of this hunger for data and computing power, running an AI company is harder than other startups; today's well-known AI startups have each been immersed in a single field for more than three years, getting users to supply data and building only a single typical model. It is not easy for giant companies either: even if they poach the people, an AI project still needs time for a cold start, and cleaning data consumes not only effort but also time.
The computation process of deep learning is not under our control, and its results need human verification, so they cannot serve as hard evidence on their own. When an AI flags a suspect the police will act immediately, but even its creators cannot explain how the AI will make its next move in Go. A baby might happen to pee out a map of the world, someone might guess a bank card PIN on the first try, and an AI may tell you the stock market is 99.99% sure to soar, but none of these can serve as evidence that bears responsibility on its own.
Doing AI requires a lot of data, and here China has a particular advantage over the United States: many people can do data annotation, and they are cheap. On the other hand, when it comes to putting models into practice, China's labor cost is so low that it actually limits the commercial adoption of AI.
Don't panic that AI will destroy humanity; an AI that threatens humanity would certainly be a flawed AI, but then humans have also elected leaders as flawed as Hitler. And don't preach that AI will throw humanity out of work and into social turmoil; that is about as reliable as reading horoscopes, and you will notice I am not worried about losing my own job.
In some scenarios AI's accuracy looks low but is already good enough. For example, if two people in conversation catch 80% of each other's words, that is not bad; an AI that understands 85% of the words is already more capable than a human. Notice that even though words have been trimmed out of this article, it does not affect your reading.