Managing grouping options for symbolic predictors manually
Combine values or ranges based on the similarity of symbolic predictor behavior.
- In the Data analysis step, click a symbolic predictor.
- In the predictor workspace, click the Grouping tab.
-
In the
Granularity
field, enter the maximum probability of
spurious difference in behavior.
Reducing this probability reduces interval or category numbers.
-
In the
Minimum size (% of samples)
field, enter the minimum
number of cases in each interval or category. Click
Apply.
Use it to ensure that the behavior evidence is sufficient to be used in grouping. Intervals with few cases are combined with their nearest neighbor. Categories with few cases are combined into a residual bin.
- Optional: To read the data in a graph or a table, click the Graphical view tab or the Tabular view tab
- Optional:
Disable the
Ignore ordering
option.
When this option is enabled, categories are combined with any other category most similar in behavior. After you disable it, the order of the symbolic categories is assumed to have some meaning and only the neighboring categories are grouped.
- Optional:
Disable the
Merge bins below minimum size
option.
When this option is enabled, the categories below the minimum size are grouped in a residual bin on the assumption of insufficient cases for the behavior to be the basis for grouping.
- Confirm your updates by clicking Submit.
Previous topic Managing grouping options for numeric predictors manually Next topic Outcome inferencing