{{ report_title }}

{{ report_subtitle }}

Base Score {{ "%.4f"|format(base_score) }}
Robustness Score {{ "%.4f"|format(robustness_score) }}
Avg Impact {{ "%.4f"|format(avg_impact) }}
Perturbation Levels {{ total_levels }}
Features Tested {{ total_features }}
{% if has_weakspot_analysis %} {% endif %} {% if has_overfitting_analysis %} {% endif %}

Robustness Overview

This chart shows how the model performs under different perturbation levels.

Score Distribution

Distribution of scores across iterations for each perturbation level.

Impact by Perturbation Level

Performance degradation at each perturbation level.

Detailed Results

Level Mean Score Worst Score Std Dev Impact

Feature Importance

Top features ranked by their importance to the model.

Feature Details

Feature Importance Robustness Impact
{% if has_weakspot_analysis %}

WeakSpot Detection

Identifies regions in the feature space where the model's performance significantly degrades.

Total WeakSpots -
Critical WeakSpots -
Max Severity -
Features Analyzed -

Top WeakSpots

Feature Range Samples Mean Residual Severity
{% endif %} {% if has_overfitting_analysis %}

Localized Overfitting Analysis

Detects regions where the model shows significant train-test performance gaps.

Features Analyzed -
Features with Overfitting -
Max Gap -
Worst Feature -

Features with Overfitting

Feature Max Gap Overfit Slices Total Slices
{% endif %}