Dataset Catalog

This page is generated from the dataset registry. It enumerates the classes returned by all_datasets, calls create_test_instances_and_names(), and renders each returned example with plot_marginal_curves.

Datasets expose a single sample. Any train/test or cross-fitting holdout should be created outside the dataset layer with a split object.

Use the dropdown to switch between datasets without leaving the page.

ExampleCategorical

Three-level categorical treatment dataset with observed confounding.

ExampleCategorical-0

Rows: 64. Treatment columns: treatment. Parameters: control_effect=0.0, covariate_effect=0.8, n=64, outcome_noise=0.2, placebo_effect=-0.5, random_state=7, score_x1_weight=-0.5, treated_effect=1.2, treatment_noise=0.4, treatment_threshold=0.6.

Treatment versus response plot for ExampleCategorical-0

Three-level categorical treatment dataset with observed confounding.

The observed covariates satisfy

$$ X_0, X_1 \stackrel{\mathrm{iid}}{\sim} \mathcal{N}(0, 1). $$

A latent treatment score is generated as

$$ S = X_0 + w_1 X_1 + \varepsilon_T, \qquad \varepsilon_T \sim \mathcal{N}(0, \sigma_T^2), $$

where \(w_1 =\) score_x1_weight and \(\sigma_T =\) treatment_noise. The observed treatment is then the three-level threshold rule

$$ A = \begin{cases} \mathrm{treated}, & S > \tau, \\ \mathrm{placebo}, & S < -\tau, \\ \mathrm{control}, & \mathrm{otherwise}, \end{cases} $$

with \(\tau =\) treatment_threshold.

The noiseless response surface is

$$ m(X, A) = \beta X_0 + \alpha(A), $$

where \(\beta =\) covariate_effect and \(\alpha(A)\) is the category-specific shift given by control_effect, placebo_effect, and treated_effect. Observed outcomes satisfy

$$ Y \mid X, A \sim \mathcal{N}(m(X, A), \sigma_Y^2), $$

with \(\sigma_Y =\) outcome_noise.