Detecting Relevant Variables and Interactions in Supervised Classification

Emilio Carrizosa, Belén Martín-Barragán, Dolores Romero Morales

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Resumé

The widely used Support Vector Machine (SVM) method has shown to yield good results in Supervised Classification problems. When the interpretability is an important issue, then classification methods such as Classification and Regression Trees (CART) might be more attractive, since they are designed to detect the important predictor variables and, for each predictor variable, the critical values which are most relevant for classification. However, when interactions between variables strongly affect the class membership, CART may yield misleading information. Extending previous work of the authors, in this paper an SVM-based method is introduced. The numerical experiments reported show that our method is competitive against SVM and CART in terms of misclassification rates, and, at the same time, is able to detect critical values and variables interactions which are relevant for classification.
OriginalsprogEngelsk
TidsskriftEuropean Journal of Operational Research
Vol/bind213
Udgave nummer1
Sider (fra-til)260–269
ISSN0377-2217
DOI
StatusUdgivet - 2011
Udgivet eksterntJa

Emneord

  • Supervised classification
  • Interactions
  • Support vector machines
  • Binarization

Citer dette

@article{c7a42c5e036f4dc18e2bbd8bd24aa2d0,
title = "Detecting Relevant Variables and Interactions in Supervised Classification",
abstract = "The widely used Support Vector Machine (SVM) method has shown to yield good results in Supervised Classification problems. When the interpretability is an important issue, then classification methods such as Classification and Regression Trees (CART) might be more attractive, since they are designed to detect the important predictor variables and, for each predictor variable, the critical values which are most relevant for classification. However, when interactions between variables strongly affect the class membership, CART may yield misleading information. Extending previous work of the authors, in this paper an SVM-based method is introduced. The numerical experiments reported show that our method is competitive against SVM and CART in terms of misclassification rates, and, at the same time, is able to detect critical values and variables interactions which are relevant for classification.",
keywords = "Supervised classification, Interactions, Support vector machines, Binarization",
author = "Emilio Carrizosa and Bel{\'e}n Mart{\'i}n-Barrag{\'a}n and {Romero Morales}, Dolores",
year = "2011",
doi = "10.1016/j.ejor.2010.03.020",
language = "English",
volume = "213",
pages = "260–269",
journal = "European Journal of Operational Research",
issn = "0377-2217",
publisher = "Elsevier",
number = "1",

}

Detecting Relevant Variables and Interactions in Supervised Classification. / Carrizosa, Emilio; Martín-Barragán, Belén; Romero Morales, Dolores .

I: European Journal of Operational Research, Bind 213, Nr. 1, 2011, s. 260–269.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

TY - JOUR

T1 - Detecting Relevant Variables and Interactions in Supervised Classification

AU - Carrizosa, Emilio

AU - Martín-Barragán, Belén

AU - Romero Morales, Dolores

PY - 2011

Y1 - 2011

N2 - The widely used Support Vector Machine (SVM) method has shown to yield good results in Supervised Classification problems. When the interpretability is an important issue, then classification methods such as Classification and Regression Trees (CART) might be more attractive, since they are designed to detect the important predictor variables and, for each predictor variable, the critical values which are most relevant for classification. However, when interactions between variables strongly affect the class membership, CART may yield misleading information. Extending previous work of the authors, in this paper an SVM-based method is introduced. The numerical experiments reported show that our method is competitive against SVM and CART in terms of misclassification rates, and, at the same time, is able to detect critical values and variables interactions which are relevant for classification.

AB - The widely used Support Vector Machine (SVM) method has shown to yield good results in Supervised Classification problems. When the interpretability is an important issue, then classification methods such as Classification and Regression Trees (CART) might be more attractive, since they are designed to detect the important predictor variables and, for each predictor variable, the critical values which are most relevant for classification. However, when interactions between variables strongly affect the class membership, CART may yield misleading information. Extending previous work of the authors, in this paper an SVM-based method is introduced. The numerical experiments reported show that our method is competitive against SVM and CART in terms of misclassification rates, and, at the same time, is able to detect critical values and variables interactions which are relevant for classification.

KW - Supervised classification

KW - Interactions

KW - Support vector machines

KW - Binarization

U2 - 10.1016/j.ejor.2010.03.020

DO - 10.1016/j.ejor.2010.03.020

M3 - Journal article

VL - 213

SP - 260

EP - 269

JO - European Journal of Operational Research

JF - European Journal of Operational Research

SN - 0377-2217

IS - 1

ER -