This article focus on the deep learning in computer vision areas organial by me. If you want to copy something, please referring this. Thank you.
The main challenges of deep learning are to solve the tasks which can be solved easily by human beings but hardly for computers.
At begining, features be got by traditional way, just like getting the color and texture of the object picture to make the computer can get the ROI. It’s called engineered handcraft feature by model pattern.
By the ability of the computation growing up and the GPU get more powerful use, there is a possibility to make sample units to compose the most complex model with ANN.
So let’s talk about the most excited part of the most basic function of the deep learning as the basic unit of the net.
y
=
f
(
w
x
+
b
)
y=f(wx+b)
y=f(wx+b)
x is the input of the layer, y is output of the layer. w is called the weight which can be trained by the obtained result. the wx+b is easily regarded as the liner relation, and the result of it is unlimited.If we want make it limited, we can use the activation function. The f(.) is a nonlinear function named activation function which commonly uses sigmoid function and ReLU function.
Autoencoder (AE) is a kind of basic networks
Convolution is a kind of method to find the relation of the pixels from the input images.using a kernels can get the features from the image.
GANs is a kind of model like zero-sum two players game, one is named generative model which can generate the data from the orginal data distribution.