In temperature scaling code,negative log likelihood is considered not considered as loss function where as only Log likelihood is considered.Is there any reason or is it an error
Actually, it is the same thing, the log loss is implemented as negative log-likelihood of a logistic model. More information in documentation of sklearn in here.