TY - JOUR
T1 - Clustering Categories in Support Vector Machines
AU - Carrizosa, Emilio
AU - Nogales-Gómez, Amaya
AU - Romero Morales, Dolores
PY - 2017/1
Y1 - 2017/1
N2 - The support vector machine (SVM) is a state-of-the-art method in supervised classification. In this paper the Cluster Support Vector Machine (CLSVM) methodology is proposed with the aim to increase the sparsity of the SVM classifier in the presence of categorical features, leading to a gain in interpretability. The CLSVM methodology clusters categories and builds the SVM classifier in the clustered feature space. Four strategies for building the CLSVM classifier are presented based on solving: the SVM formulation in the original feature space, a quadratically constrained quadratic programming formulation, and a mixed integer quadratic programming formulation as well as its continuous relaxation. The computational study illustrates the performance of the CLSVM classifier using two clusters. In the tested datasets our methodology achieves comparable accuracy to that of the SVM in the original feature space, with a dramatic increase in sparsity.
AB - The support vector machine (SVM) is a state-of-the-art method in supervised classification. In this paper the Cluster Support Vector Machine (CLSVM) methodology is proposed with the aim to increase the sparsity of the SVM classifier in the presence of categorical features, leading to a gain in interpretability. The CLSVM methodology clusters categories and builds the SVM classifier in the clustered feature space. Four strategies for building the CLSVM classifier are presented based on solving: the SVM formulation in the original feature space, a quadratically constrained quadratic programming formulation, and a mixed integer quadratic programming formulation as well as its continuous relaxation. The computational study illustrates the performance of the CLSVM classifier using two clusters. In the tested datasets our methodology achieves comparable accuracy to that of the SVM in the original feature space, with a dramatic increase in sparsity.
KW - Support vector machine
KW - Categorical features
KW - Classifier sparsity
KW - Clustering
KW - Quadratically constrained programming
KW - 0-1 programming
KW - Support vector machine
KW - Categorical features
KW - Classifier sparsity
KW - Clustering
KW - Quadratically constrained programming
KW - 0-1 programming
U2 - 10.1016/j.omega.2016.01.008
DO - 10.1016/j.omega.2016.01.008
M3 - Journal article
SN - 0305-0483
VL - 66
SP - 28
EP - 37
JO - Omega
JF - Omega
IS - Part A
ER -