Definition

The is code test allows you to check if a specified column contains executable code. Currently, Python and JS code are supported.

Taxonomy

  • Category: Integrity.
  • Task types: LLM.
  • Availability: and .

Why it matters

  • If the LLM is used for code generation or understanding, it is important to ensure that the code is valid and executable.
  • For code generation with LLMs, it is particularly important to ensure that the generated code is valid, and not a hallucination.

Test configuration examples

If you are writing a tests.json, here are a few valid configurations for the character length test:

[
  {
    "name": "Outputs have valid Python code",
    "description": "Asserts that the output column contains valid Python code",
    "type": "integrity",
    "subtype": "isCode",
    "thresholds": [
      {
        "insightName": "isCode",
        "insightParameters": [
          { "name": "column_name", "value": "output" }, // Selects the column `output`
          { "name": "language", "value": "python" } // Checks for valid Python code
        ],
        "measurement": "isCodeRowPercentage",
        "operator": ">=",
        "value": 1.0
      }
    ],
    "subpopulationFilters": null,
    "mode": "development",
    "usesValidationDataset": true, // Apply test to the validation set
    "usesTrainingDataset": false,
    "usesMlModel": false,
    "syncId": "b4dee7dc-4f15-48ca-a282-63e2c04e0689" // Some unique id
  }
]