Welcome to our deep learning lecture. We are now in part four of the introduction.
Now in this fourth part we want to talk about machine learning and pattern recognition.
And first of all we have to introduce a bit of terminology and notation.
So throughout this entire lecture series we will use the following notation.
So matrices are bold and uppercase, so examples here are M and A.
And vectors are bold and lowercase, examples V, X.
Scalars are italic and lowercase, Y, W, alpha.
And for the gradient of a function we use the gradient simple.
For partial derivatives we use the partial notation.
Furthermore, we have some specifics about deep learning.
So the trainable weights will generally be called W.
Features inputs are X, they're typically vectors.
Then we have the ground truth label, which is Y.
We have some estimated output, that is Y hat.
And if we have some iterations going on, we typically do that in a superscript and put it into brackets.
This is an iteration index here, iteration i for the variable X.
Of course, this is a very coarse notation and we will further refine it throughout the lecture.
If you have attended previous lectures of our group, then you should know the classical image processing pipeline, pattern recognition pipeline.
We do the recording with the sampling and the analog to digital conversion takes place.
Then you have the pre-processing, feature extraction, followed by classification.
And of course, in the classification step you have to do the training.
The first part of the pattern recognition pipeline is covered in our lecture introduction pattern recognition.
The main part of classification is covered in pattern recognition.
Now what you see in this image is a classical image recognition problem.
Let's say you want to differentiate apples from pears.
Now one idea that you could do is you could draw a circle around them and then measure the length of the major axis.
So you will recognize that apples are round and pears are longer.
So their ellipses have a difference in the major and minor axis.
Now you could take those two numbers and represent them as a vector value.
Then you enter a two-dimensional space in which you will find that all of the apples are located on the diagonal through the x-axis.
Because if their diameter in one direction increases, also the diameter in the other direction increases.
While your pears are off this straight line because they have a difference in their two minor and major axis.
Now you can find a line that separates those two and there you have your first classification system.
Now what many people think how the big data processing works is shown in this small figure.
So is this your machine learning system?
Yep. Pour the data into this big pile of linear algebra.
Then collect the answers on the other side.
And what if the answers are wrong?
Just stir the pile until they start looking right.
So what you can see in this picture is that of course this is how many people think that they approach deep learning.
And you just pour the data in and in the end you just stir a bit and then you get the right results.
But that's not actually how it works.
Reminder what you want to do is you want to build a system that runs a classification.
Which means that from your measurement you first have to do some pre-processing like reduce noise.
You have to get a meaningful image then do a feature extraction and from that you can then do a classification.
Now the difference to deep learning is that you put everything into a single kind of engine.
So this does the pre-processing, the feature extraction and the classification just in a single step.
And you just use the training data and the measurement in order to reproduce those systems.
Now this has been shown to work in a lot of applications but as we've already talked about in the last video.
You have to have the right data and you cannot just pour some data in and then stir until it starts looking right.
Presenters
Zugänglich über
Offener Zugang
Dauer
00:13:30 Min
Aufnahmedatum
2020-05-27
Hochgeladen am
2020-05-27 20:26:34
Sprache
en-US