Data-Based Economics, ESCP, 2024-2025
2025-03-04
There are many:
Examples:
Independently of how the classification is made, its validity can be assessed with a similar procedure as in the regression.
Separate training set and test set
Compute score: number of correctly identified categories
Predicted | (0) Actual | (1) Actual |
---|---|---|
0 | true negatives (TN) | false negatives (FN) |
1 | false positives (FP) | true positives (TP) |
We can then define different measures:
Which one to favour depends on the use case
Police cameras in London
According to London Police the cameras in London have
But reporters found a 81% are misidentication rate
Interpretation? Is it working?
In-sample confusion matrix
Based on consumer data, an algorithm tries to predict the credit score from various client characeristics. The predictions are reported in the confusion matrix.
Can you calculate: FPR, TPR and overall accuracy?