Definition

The size ratio test allows you to set a threshold on the the ratio between the number of rows in the validation and training datasets.

Taxonomy

  • Category: Consistency.
  • Task types: LLM, tabular classification, tabular regression, text classification.
  • Availability: .

Why it matters

  • It is important that your model is validated on a sufficient amount of unseen data. If the validation set is too small compared to the training set, it may not adequately represent the variety of data the model will encounter in the real world, leading to overfitting.
  • The size ratio can also helps ensure the statistical significance of the validation results.

Test configuration examples

If you are writing a tests.json, here are a few valid configurations for the character length test: