Confusion Matrix

What is Confusion Matrix and Advanced Classification Metrics?

After data preparation and model training, there is model evaluation phase which I mentioned in my earlier article Simple Picture of Machine Learning Modelling Process.

Once model is developed, the next phase is to calculate the performance of the developed model using some evaluation metrics. In this article, you will just discover about confusion matrix though there are many classification metrics out there.

Mainly, it focuses on below points:

What is confusion matrix?
Four outputs in confusion matrix
Advanced classification metrics

Table 1. Confusion matrix with advanced classification metrics

Confusion Matrix is a tool to determine the performance of classifier. It contains information about actual and predicted classifications. The below table shows confusion matrix of two-class, spam and non-spam classifier.

Table 2. Confusion matrix of email classification

Let’s understand four outputs in confusion matrix.

1. True Positive (TP) is the number of correct predictions that an example is positive which means positive class correctly identified as positive.

Example: Given class is spam and the classifier has been correctly predicted it as spam.

2. False Negative (FN) is the number of incorrect predictions that an example is negative which means positive class incorrectly identified as negative.
Example: Given class is spam however, the classifier has been incorrectly predicted it as non-spam.

3. False positive (FP) is the number of incorrect predictions that an example is positive which means negative class incorrectly identified as positive.
Example: Given class is non-spam however, the classifier has been incorrectly predicted it as spam.

4. True Negative (TN) is the number of correct predictions that an example is negative which means negative class correctly identified as negative.
Example: Given class is spam and the classifier has been correctly predicted it as negative.

Now, let’s see some advanced classification metrics based on confusion matrix. These metrics are mathematically expressed in Table 1 with example of email classification, shown in Table 2. Classification problem has spam and non-spam classes and dataset contains 100 examples, 65 are Spams and 35 are non-spams.

Sensitivity is also referred as True Positive Rate or Recall. It is measure of positive examples labeled as positive by classifier. It should be higher. For instance, proportion of emails which are spam among all spam emails.

Table 3. Sensitivity in confusion matrix

Sensitivity = 45/(45+20) = 69.23% .

The 69.23% spam emails are correctly classified and excluded from all non-spam emails.

Specificity is also know as True Negative Rate. It is measure of negative examples labeled as negative by classifier. There should be high specificity. For instance, proportion of emails which are non-spam among all non-spam emails.

Table 4. Specificity in confusion matrix

specificity = 30/(30+5) = 85.71% .

The 85.71% non-spam emails are accurately classified and excluded from all spam emails.

Precision is ratio of total number of correctly classified positive examples and the total number of predicted positive examples. It shows correctness achieved in positive prediction.

Table 5. Precision in confusion matrix

Precision = 45/(45+5)= 90%

The 90% of examples are classified as spam are actually spam.

Accuracy is the proportion of the total number of predictions that are correct.

Table 6. Accuracy in confusion matrix

Accuracy = (45+30)/(45+20+5+30) = 75%

The 75% of examples are correctly classified by the classifier.

F1 score is a weighted average of the recall (sensitivity) and precision. F1 score might be good choice when you seek to balance between Precision and Recall.

It helps to compute recall and precision in one equation so that the problem to distinguish the models with low recall and high precision or vice versa could be solved.

Kindly follow my blog by email and stay tuned for more advanced post on regression measures.

Thank you!

What is Confusion Matrix and Advanced Classification Metrics?

After data preparation and model training, there is model evaluation phase which I mentioned in my earlier article Simple Picture of Machine Learning Modelling Process.

Mainly, it focuses on below points:

What is confusion matrix?
Four outputs in confusion matrix
Advanced classification metrics

Table 1. Confusion matrix with advanced classification metrics

Table 2. Confusion matrix of email classification

Let’s understand four outputs in confusion matrix.

1. True Positive (TP) is the number of correct predictions that an example is positive which means positive class correctly identified as positive.

Example: Given class is spam and the classifier has been correctly predicted it as spam.

Table 3. Sensitivity in confusion matrix

Sensitivity = 45/(45+20) = 69.23% .

The 69.23% spam emails are correctly classified and excluded from all non-spam emails.

Table 4. Specificity in confusion matrix

specificity = 30/(30+5) = 85.71% .

The 85.71% non-spam emails are accurately classified and excluded from all spam emails.

Precision is ratio of total number of correctly classified positive examples and the total number of predicted positive examples. It shows correctness achieved in positive prediction.

Table 5. Precision in confusion matrix

Precision = 45/(45+5)= 90%

The 90% of examples are classified as spam are actually spam.

Accuracy is the proportion of the total number of predictions that are correct.

Table 6. Accuracy in confusion matrix

Accuracy = (45+30)/(45+20+5+30) = 75%

The 75% of examples are correctly classified by the classifier.

F1 score is a weighted average of the recall (sensitivity) and precision. F1 score might be good choice when you seek to balance between Precision and Recall.

What is Confusion Matrix and Advanced Classification Metrics?

After data preparation and model training, there is model evaluation phase which I mentioned in my earlier article Simple Picture of Machine Learning Modelling Process.

Mainly, it focuses on below points:

What is confusion matrix?
Four outputs in confusion matrix
Advanced classification metrics

Table 1. Confusion matrix with advanced classification metrics

Table 2. Confusion matrix of email classification

Let’s understand four outputs in confusion matrix.

1. True Positive (TP) is the number of correct predictions that an example is positive which means positive class correctly identified as positive.

Example: Given class is spam and the classifier has been correctly predicted it as spam.

Table 3. Sensitivity in confusion matrix

Sensitivity = 45/(45+20) = 69.23% .

The 69.23% spam emails are correctly classified and excluded from all non-spam emails.

Table 4. Specificity in confusion matrix

specificity = 30/(30+5) = 85.71% .

The 85.71% non-spam emails are accurately classified and excluded from all spam emails.

Precision is ratio of total number of correctly classified positive examples and the total number of predicted positive examples. It shows correctness achieved in positive prediction.

Table 5. Precision in confusion matrix

Precision = 45/(45+5)= 90%

The 90% of examples are classified as spam are actually spam.

Accuracy is the proportion of the total number of predictions that are correct.

Table 6. Accuracy in confusion matrix

Accuracy = (45+30)/(45+20+5+30) = 75%

The 75% of examples are correctly classified by the classifier.

F1 score is a weighted average of the recall (sensitivity) and precision. F1 score might be good choice when you seek to balance between Precision and Recall.

Advertisement

Ad code

Confusion Matrix

What is Confusion Matrix and Advanced Classification Metrics?

What is Confusion Matrix and Advanced Classification Metrics?

What is Confusion Matrix and Advanced Classification Metrics?

Posted by Rakesh Jatav

Post a Comment

0 Comments

Popular Posts

Free Fire Next Update Frenzy: Unveiling the Mysteries of OB45

PUBG Mobile 3.2 Update: New Mecha Fusion Mode and Exciting Features Revealed!

Ethics in AI Research: Discussing Responsible Conduct in AI Research in Financial Services

Comments

Search This Blog

Categories

Popular Posts

Unlock Your Potential with Coursera&#39;s Artificial Intelligence Courses: Master the Future of Technology

Real Estate Appellate Tribunal, Telangana State Real Estate Regulatory Authority (TS-RERA) Recruitment 2024 - Apply for Member Posts

Backlinks Checker Tool for SEO Success Analysis

Get Posts in your inbox

Random Posts

Subscribe Us

Facebook

Menu Footer Widget

Contact form

Advertisement

Ad code

Confusion Matrix

What is Confusion Matrix and Advanced Classification Metrics?

What is Confusion Matrix and Advanced Classification Metrics?

What is Confusion Matrix and Advanced Classification Metrics?

Posted by Rakesh Jatav

You may like these posts

Post a Comment

0 Comments

Popular Posts

Free Fire Next Update Frenzy: Unveiling the Mysteries of OB45

PUBG Mobile 3.2 Update: New Mecha Fusion Mode and Exciting Features Revealed!

Ethics in AI Research: Discussing Responsible Conduct in AI Research in Financial Services

Comments

Search This Blog

Categories

Popular Posts

Unlock Your Potential with Coursera&amp;#39;s Artificial Intelligence Courses: Master the Future of Technology

Real Estate Appellate Tribunal, Telangana State Real Estate Regulatory Authority (TS-RERA) Recruitment 2024 - Apply for Member Posts

Backlinks Checker Tool for SEO Success Analysis

Social Plugin

Get Posts in your inbox

Random Posts

Subscribe Us

Facebook

Menu Footer Widget

Contact form

Unlock Your Potential with Coursera's Artificial Intelligence Courses: Master the Future of Technology