In this work we deal with the problem of detecting and ex-plaining exceptional behaving values in categorical datasets by perceiv-ing an attribute value as anomalous if its frequency occurrence is ex-ceptionally typical or un-typical within the distribution of frequencies occurrences of any other attribute value. The notion of frequency occur-rence is provided by specialising the Kernel Density Estimation method to the domain of frequency values and an outlierness measure is de fined by leveraging the cdf of such a density. This measure is able to simulta-neously identify two kinds of anomalies called lower outliers and upper outliers, namely exceptionally low or high frequent values. Moreover, data values labeled as outliers come with an interpretable explanations for their abnormality, which is a desirable feature of any knowledge discovery technique.
Scheda prodotto non validato
Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo
|Titolo:||Detecting and Explaining Exceptional Values in Categorical Data|
|Data di pubblicazione:||2020|
|Appare nelle tipologie:||4.1 Contributo in Atti di convegno|