Decomposing and pruning primary key violations from large data sets