Loss function for ordinal target on SoftMax over Logistic Regression

3k Views Asked by Run2 At 18 December 2014 at 08:00

I am using Pylearn2 OR Caffe to build a deep network. My target is ordered nominal. I am trying to find a proper loss function but cannot find any in Pylearn2 or Caffe.

I read a paper "Loss Functions for Preference Levels: Regression with Discrete Ordered Labels" . I get the general idea - but I am not sure I understand what will the thresholds be, if my final layer is a SoftMax over Logistic Regression (outputting probabilities).

Can some help me by pointing to any implementation of such a loss function ?

Thanks Regards

Original Q&A

There are 2 best solutions below

user1269942 On 24 January 2015 at 23:25

For both pylearn2 and caffe, your labels will need to be 0-4 instead of 1-5...it's just the way they work. The output layer will be 5 units, each is a essentially a logistic unit...and the softmax can be thought of as an adaptor that normalizes the final outputs. But "softmax" is commonly used as an output type. When training, the value of any individual unit is rarely ever exactly 0.0 or 1.0...it's always a distribution across your units - which log-loss can be calculated on. This loss is used to compare against the "perfect" case and the error is back-propped to update your network weights. Note that a raw output from PL2 or Caffe is not a specific digit 0,1,2,3, or 5...it's 5 number, each associated to the likelihood of each of the 5 classes. When classifying, one just takes the class with the highest value as the 'winner'.

I'll try to give an example... say I have a 3 class problem, I train a network with a 3 unit softmax. the first unit represents the first class, second the second and third, third.

Say I feed a test case through and get...

0.25, 0.5, 0.25 ...0.5 is the highest, so a classifier would say "2". this is the softmax output...it makes sure the sum of the output units is one.

Georg M. Goerg On 22 February 2022 at 03:20

You should have a look at ordinal (logistic) regression. This is the formal solution to the problem setup you describe ( do not use plain regression as the distance measures of errors are wrong).

https://stats.stackexchange.com/questions/140061/how-to-set-up-neural-network-to-output-ordinal-data

In particular I recommend looking at Coral ordinal regression implementation at https://github.com/ck37/coral-ordinal/issues.

Loss function for ordinal target on SoftMax over Logistic Regression

There are 2 best solutions below

Related Questions in DEEP-LEARNING

Related Questions in THEANO

Related Questions in CAFFE

Related Questions in LOGISTIC-REGRESSION

Related Questions in SOFTMAX

Trending Questions

Popular # Hahtags

Popular Questions