Description
In this episode, I talked with Curtis Northcutt about his application Cleanlab, with which you can find label errors in your dataset. Cleanlab computes cross-validated probabilities, the confident joint, and the statistics used in uncertainty estimation for dataset labels, and it ranks and sorts the labels by the probabilities of error, so you can easily find them in your dataset.
Show notes: http://podcast.machinelearningcafe.org/finding-the-label-errors-with-cleanlab-with-curtis-northcutt-006