How does a FC layer work in a typical CNN
I am new to CNNs and NNs. I am reading this blog: CNN, and one part confuses me: what operation will be performed on the input vector/matrix? Will we be using the typical ANN equation $O = W^T \cdot \text{input}$, and then a sigmoid on top of it?
neural-network
edited Aug 11 at 12:09
Djib2011
asked Aug 11 at 11:05
user57521
2 Answers
Yes. Essentially, a typical CNN consists of two parts:
- The convolution and pooling layers, whose goal is to extract features from the images. These are the first layers in the network.
- The final layer(s), which are usually fully connected (FC) layers, whose goal is to classify those features.
The latter follow the typical equation $f(W^T \cdot X + b)$, where $f$ is an activation function. In the context of CNNs, $f$ is usually a ReLU, except for the activation function of the final layer, which is selected according to the nature of the problem. The most common cases are (sketched below):
- Sigmoid activation functions work for binary classification problems.
- Softmax activation functions work for both binary and multi-class classification problems.
- For regression problems, the final layer has no activation.
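For concreteness, here is a minimal sketch of the two classification activations in plain NumPy (purely illustrative; the example values are my own, not from the answer):

```python
import numpy as np

def sigmoid(z):
    # Binary classification: maps a single score to a probability in (0, 1).
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    # Multi-class classification: maps a score vector to probabilities
    # that sum to 1. Subtracting the max keeps the exponentials stable.
    e = np.exp(z - np.max(z))
    return e / e.sum()

print(sigmoid(0.5))                        # ~0.622
print(softmax(np.array([2.0, 1.0, 0.1])))  # three probabilities summing to 1
```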
One final note I'd like to make is that before entering the first FC layer, the output of the previous layer is flattened. By this I mean that the (typically 3) dimensions of that tensor are laid out into one large dimension.
For example, a tensor with a shape of $(5, 5, 32)$, when flattened, would become $(5 \cdot 5 \cdot 32) = (800)$.
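As a minimal sketch of that flatten-then-FC step (plain NumPy; the feature-map shape and the 10-unit layer are illustrative assumptions):

```python
import numpy as np

# A (5, 5, 32) feature map from the last conv/pooling layer,
# flattened into a vector of 5 * 5 * 32 = 800 values.
feature_map = np.random.randn(5, 5, 32)
x = feature_map.reshape(-1)          # shape: (800,)

# One fully connected layer computing f(W^T X + b), here with 10 units.
W = np.random.randn(800, 10) * 0.01  # weight matrix (hypothetical size)
b = np.zeros(10)                     # bias vector

z = W.T @ x + b                      # the "typical ANN equation"
a = np.maximum(z, 0)                 # ReLU, the usual hidden-layer f in CNNs

print(a.shape)                       # (10,)
```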
answered Aug 11 at 12:03
Djib2011
Basically, yes. But in order to pass the input from a convolutional or max-pooling layer to a fully connected one, you need to "flatten" the input tensor. That is, either flatten the tensor/multi-dimensional array coming out of the convolutional layer, or use something like Global Average Pooling, which reduces the tensor to a vector.
You can check code snippets in different frameworks; that will help you understand the process.
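For example, here is a minimal sketch of both options in Keras (the input shape, filter counts, and 10-class head are illustrative assumptions, not from the answer):

```python
import tensorflow as tf

# Option 1: flatten the conv output, then classify with an FC layer.
flatten_model = tf.keras.Sequential([
    tf.keras.Input(shape=(32, 32, 3)),
    tf.keras.layers.Conv2D(32, 3, activation='relu'),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Flatten(),                 # (H, W, C) -> (H * W * C,)
    tf.keras.layers.Dense(10, activation='softmax'),
])

# Option 2: Global Average Pooling averages each channel,
# reducing the conv output to a vector of length C directly.
gap_model = tf.keras.Sequential([
    tf.keras.Input(shape=(32, 32, 3)),
    tf.keras.layers.Conv2D(32, 3, activation='relu'),
    tf.keras.layers.GlobalAveragePooling2D(),  # (H, W, C) -> (C,)
    tf.keras.layers.Dense(10, activation='softmax'),
])

flatten_model.summary()
gap_model.summary()
```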
It should also be noted that fully connected layers are used not only as the last layer that outputs class probabilities in a CNN; check, for example, VGG networks, which have 2-3 fully connected layers at the end.
One last remark: to get class scores you usually (not always!) use softmax, not a simple sigmoid. Softmax ensures that the values in your output vector sum to 1.
answered Aug 11 at 12:03
Alexandru Burlacu