Integrity
Conflicting labels
Definition
The conflicting labels test checks if there are rows with identical feature values but differing labels. These are rows that are identical to the model (from an input perspective) but are labeled differently.
Taxonomy
- Category: Integrity.
- Task types: Tabular classification, text classification.
- Availability: and .
Why it matters
- Conflicting rows can be a sign of a lack of standardization in the data labeling process.
- Rows with conflicting labels hinder model learning. These are rows that are identical to the model (from an input perspective) but are labeled differently.
Test configuration examples
If you are writing a tests.json
, here are a few valid configurations for the character length test:
Was this page helpful?