Definition

The feature values test allows you to define the expected range of values for a feature. For categorical features, you can define the expected categories.

Taxonomy

  • Category: Integrity.
  • Task types: Tabular classification, tabular regression.
  • Availability: and .

Why it matters

  • Ensuring that the values of a feature are always within a defined range is important to validate the hypotheses around the data. For example, for a feature such as Age, negative values would be invalid and signal an issue with the data collection/ingestion process.
  • Values outside the expected range can also be a sign of data drift.
  • For some categorical features, it is important to ensure that the values are always within the expected categories.

Test configuration examples

If you are writing a tests.json, here are a few valid configurations for the character length test: