Detecting Relevant Variables and Interactions in Supervised Classification

Emilio Carrizosa, Belén Martín-Barragán, Dolores Romero Morales

Research output: Contribution to journalJournal articleResearchpeer-review

Abstract

The widely used Support Vector Machine (SVM) method has shown to yield good results in Supervised Classification problems. When the interpretability is an important issue, then classification methods such as Classification and Regression Trees (CART) might be more attractive, since they are designed to detect the important predictor variables and, for each predictor variable, the critical values which are most relevant for classification. However, when interactions between variables strongly affect the class membership, CART may yield misleading information. Extending previous work of the authors, in this paper an SVM-based method is introduced. The numerical experiments reported show that our method is competitive against SVM and CART in terms of misclassification rates, and, at the same time, is able to detect critical values and variables interactions which are relevant for classification.
The widely used Support Vector Machine (SVM) method has shown to yield good results in Supervised Classification problems. When the interpretability is an important issue, then classification methods such as Classification and Regression Trees (CART) might be more attractive, since they are designed to detect the important predictor variables and, for each predictor variable, the critical values which are most relevant for classification. However, when interactions between variables strongly affect the class membership, CART may yield misleading information. Extending previous work of the authors, in this paper an SVM-based method is introduced. The numerical experiments reported show that our method is competitive against SVM and CART in terms of misclassification rates, and, at the same time, is able to detect critical values and variables interactions which are relevant for classification.
LanguageEnglish
JournalEuropean Journal of Operational Research
Volume213
Issue number1
Pages260–269
ISSN0377-2217
DOIs
StatePublished - 2011
Externally publishedYes

Keywords

    Cite this

    Carrizosa, Emilio ; Martín-Barragán, Belén ; Morales, Dolores Romero. / Detecting Relevant Variables and Interactions in Supervised Classification. In: European Journal of Operational Research. 2011 ; Vol. 213, No. 1. pp. 260–269
    @article{c7a42c5e036f4dc18e2bbd8bd24aa2d0,
    title = "Detecting Relevant Variables and Interactions in Supervised Classification",
    abstract = "The widely used Support Vector Machine (SVM) method has shown to yield good results in Supervised Classification problems. When the interpretability is an important issue, then classification methods such as Classification and Regression Trees (CART) might be more attractive, since they are designed to detect the important predictor variables and, for each predictor variable, the critical values which are most relevant for classification. However, when interactions between variables strongly affect the class membership, CART may yield misleading information. Extending previous work of the authors, in this paper an SVM-based method is introduced. The numerical experiments reported show that our method is competitive against SVM and CART in terms of misclassification rates, and, at the same time, is able to detect critical values and variables interactions which are relevant for classification.",
    keywords = "Supervised classification, Interactions, Support vector machines, Binarization",
    author = "Emilio Carrizosa and Bel{\'e}n Mart{\'i}n-Barrag{\'a}n and Morales, {Dolores Romero}",
    year = "2011",
    doi = "10.1016/j.ejor.2010.03.020",
    language = "English",
    volume = "213",
    pages = "260–269",
    journal = "European Journal of Operational Research",
    issn = "0377-2217",
    publisher = "Elsevier",
    number = "1",

    }

    Detecting Relevant Variables and Interactions in Supervised Classification. / Carrizosa, Emilio; Martín-Barragán, Belén; Morales, Dolores Romero.

    In: European Journal of Operational Research, Vol. 213, No. 1, 2011, p. 260–269.

    Research output: Contribution to journalJournal articleResearchpeer-review

    TY - JOUR

    T1 - Detecting Relevant Variables and Interactions in Supervised Classification

    AU - Carrizosa,Emilio

    AU - Martín-Barragán,Belén

    AU - Morales,Dolores Romero

    PY - 2011

    Y1 - 2011

    N2 - The widely used Support Vector Machine (SVM) method has shown to yield good results in Supervised Classification problems. When the interpretability is an important issue, then classification methods such as Classification and Regression Trees (CART) might be more attractive, since they are designed to detect the important predictor variables and, for each predictor variable, the critical values which are most relevant for classification. However, when interactions between variables strongly affect the class membership, CART may yield misleading information. Extending previous work of the authors, in this paper an SVM-based method is introduced. The numerical experiments reported show that our method is competitive against SVM and CART in terms of misclassification rates, and, at the same time, is able to detect critical values and variables interactions which are relevant for classification.

    AB - The widely used Support Vector Machine (SVM) method has shown to yield good results in Supervised Classification problems. When the interpretability is an important issue, then classification methods such as Classification and Regression Trees (CART) might be more attractive, since they are designed to detect the important predictor variables and, for each predictor variable, the critical values which are most relevant for classification. However, when interactions between variables strongly affect the class membership, CART may yield misleading information. Extending previous work of the authors, in this paper an SVM-based method is introduced. The numerical experiments reported show that our method is competitive against SVM and CART in terms of misclassification rates, and, at the same time, is able to detect critical values and variables interactions which are relevant for classification.

    KW - Supervised classification

    KW - Interactions

    KW - Support vector machines

    KW - Binarization

    U2 - 10.1016/j.ejor.2010.03.020

    DO - 10.1016/j.ejor.2010.03.020

    M3 - Journal article

    VL - 213

    SP - 260

    EP - 269

    JO - European Journal of Operational Research

    T2 - European Journal of Operational Research

    JF - European Journal of Operational Research

    SN - 0377-2217

    IS - 1

    ER -