Statistics: fitcgam

Function Reference: `fitcgam`

statistics: Mdl = fitcgam (X, Y)
statistics: Mdl = fitcgam (…, name, value)

Fit a Generalized Additive Model (GAM) for binary classification.

Mdl = fitcgam (X, Y) returns a a GAM classification model, Mdl, with X being the predictor data, and Y the binary class labels of observations in X.

X must be a $N×P$ numeric matrix of predictor data where rows correspond to observations and columns correspond to features or variables.
Y is $N×1$ numeric vector containing binary class labels, typically 0 or 1.

Mdl = fitcgam (…, name, value) returns a GAM classification model with additional options specified by Name-Value pair arguments listed below.

Model Parameters

`Name`		`Value`
`"PredictorNames"`		A cell array of character vectors specifying the names of the predictors. The length of this array must match the number of columns in `X`.
`"ResponseName"`		A character vector specifying the name of the response variable.
`"ClassNames"`		Names of the classes in the class labels, `Y`, used for fitting the Discriminant model. `ClassNames` are of the same type as the class labels in `Y`.
`"Cost"`		A $N×R$ numeric matrix containing misclassification cost for the corresponding instances in `X` where $R$ is the number of unique categories in `Y`. If an instance is correctly classified into its category the cost is calculated to be 1, otherwise 0. cost matrix can be altered use `Mdl.cost = somecost`. default value `cost = ones(rows(X),numel(unique(Y)))`.
`"Formula"`		A model specification given as a string in the form `"Y ~ terms"` where `Y` represents the reponse variable and `terms` the predictor variables. The formula can be used to specify a subset of variables for training model. For example: `"Y ~ x1 + x2 + x3 + x4 + x1:x2 + x2:x3"` specifies four linear terms for the first four columns of for predictor data, and `x1:x2` and `x2:x3` specify the two interaction terms for 1st-2nd and 3rd-4th columns respectively. Only these terms will be used for training the model, but `X` must have at least as many columns as referenced in the formula. If Predictor Variable names have been defined, then the terms in the formula must reference to those. When `"formula"` is specified, all terms used for training the model are referenced in the `IntMatrix` field of the `obj` class object as a matrix containing the column indexes for each term including both the predictors and the interactions used.
`"Interactions"`		A logical matrix, a positive integer scalar, or the string `"all"` for defining the interactions between predictor variables. When given a logical matrix, it must have the same number of columns as `X` and each row corresponds to a different interaction term combining the predictors indexed as `true`. Each interaction term is appended as a column vector after the available predictor column in `X`. When `"all"` is defined, then all possible combinations of interactions are appended in `X` before training. At the moment, parsing a positive integer has the same effect as the `"all"` option. When `"interactions"` is specified, only the interaction terms appended to `X` are referenced in the `IntMatrix` field of the `obj` class object.
`"Knots"`		A scalar or a row vector with the same columns as `X`. It defines the knots for fitting a polynomial when training the GAM. As a scalar, it is expanded to a row vector. The default value is 5, hence expanded to `ones (1, columns (X)) * 5`. You can parse a row vector with different number of knots for each predictor variable to be fitted with, although not recommended.
`"Order"`		A scalar or a row vector with the same columns as `X`. It defines the order of the polynomial when training the GAM. As a scalar, it is expanded to a row vector. The default values is 3, hence expanded to `ones (1, columns (X)) * 3`. You can parse a row vector with different number of polynomial order for each predictor variable to be fitted with, although not recommended.
`"DoF"`		A scalar or a row vector with the same columns as `X`. It defines the degrees of freedom for fitting a polynomial when training the GAM. As a scalar, it is expanded to a row vector. The default value is 8, hence expanded to `ones (1, columns (X)) * 8`. You can parse a row vector with different degrees of freedom for each predictor variable to be fitted with, although not recommended.

You can parse either a "Formula" or an "Interactions" optional parameter. Parsing both parameters will result an error. Accordingly, you can only pass up to two parameters among "Knots", "Order", and "DoF" to define the required polynomial for training the GAM model.

See also: ClassificationGAM

Source Code: fitcgam

Example: 1


 ## Train a GAM classifier for binary classification
 ## using specific data and plot the decision boundaries.

 ## Define specific data
 X = [1, 2; 2, 3; 3, 3; 4, 5; 5, 5; ...
     6, 7; 7, 8; 8, 8; 9, 9; 10, 10];
 Y = [0; 0; 0; 0; 0; ...
     1; 1; 1; 1; 1];

 ## Train the GAM model
 obj = fitcgam (X, Y, "Interactions", "all");

 ## Create a grid of values for prediction
 x1 = [min(X(:,1)):0.1:max(X(:,1))];
 x2 = [min(X(:,2)):0.1:max(X(:,2))];
 [x1G, x2G] = meshgrid (x1, x2);
 XGrid = [x1G(:), x2G(:)];
 pred = predict (obj, XGrid);

 ## Plot decision boundaries and data points
 predNumeric = str2double (pred);
 gidx = predNumeric > 0.5;

 figure
 scatter(XGrid(gidx,1), XGrid(gidx,2), "markerfacecolor", "magenta");
 hold on
 scatter(XGrid(!gidx,1), XGrid(!gidx,2), "markerfacecolor", "red");
 plot(X(Y == 0, 1), X(Y == 0, 2), "ko", X(Y == 1, 1), X(Y == 1, 2), "kx");
 xlabel("Feature 1");
 ylabel("Feature 2");
 title("Generalized Additive Model (GAM) Decision Boundary");
 legend({"Class 1 Region", "Class 0 Region", ...
       "Class 1 Samples", "Class 0 Samples"}, ...
       "location", "northwest")
 axis tight
 hold off

Categories &

Functions List

Clustering

Clustering

Classification Classes

Classification Classes

Clustering Classes

Clustering Classes

Regression Classes

Regression Classes

Data Manipulation

Data Manipulation

Descriptive Statistics

Descriptive Statistics

Distribution Classes

Distribution Classes

Distribution Fitting

Distribution Fitting

Distribution Functions

Distribution Functions

Distribution Statistics

Distribution Statistics

Distribution Wrappers

Distribution Wrappers

Experimental Design

Experimental Design

Machine Learning

Machine Learning

Model Fitting

Model Fitting

Hypothesis Testing

Hypothesis Testing

I/O

I/O

Plotting

Plotting

Regression

Regression

Transforms

Transforms

Function Reference: fitcgam

Model Parameters

Example: 1

Function Reference: `fitcgam`