DivExplorer enables to analyze subgroup performance in datasets, efficiently identifying the data subgroups that are anomalous.
Given a dataset, DivExplorer can find subgroups where specified attributes have higher or lower average value compared to the overall dataset. As an example, this can be used to find subgroups in a census dataset that have higher than average income.
In machine learning, DivExplorer enables the idenfitication of data subgroups for which classifiers have higher false-positive or false-negative rates than the average, or the identification of subgroups that are ranked higher or lower than the average.
Here, you can find the papers and videos related to the project, as well a Python package you can use to analyze your datasets.