Convex Optimization for Group Feature Selection in Networked Data

Daehan Won
Corresponding Author
Daehan Won
Systems Science and Industrial Engineering Department, Binghamton University, the State University of New York, New York, New York 13902;
Search for more papers by this author
,
Hasan Manzour
Corresponding Author
Hasan Manzour
Department of Industrial and Systems Engineering, University of Washington, Seattle, Washington 98195;
Search for more papers by this author
,
Wanpracha Chaovalitwongse
Wanpracha Chaovalitwongse
http://orcid.org/0000-0002-8051-5981
Institute for Advanced Data Analytics, Department of Industrial Engineering, University of Arkansas, Fayetteville, Arkansas 72701
Search for more papers by this author

Corresponding Author

Daehan Won

Systems Science and Industrial Engineering Department, Binghamton University, the State University of New York, New York, New York 13902;

Search for more papers by this author

Hasan Manzour

Corresponding Author

Hasan Manzour

Department of Industrial and Systems Engineering, University of Washington, Seattle, Washington 98195;

Search for more papers by this author

Wanpracha Chaovalitwongse

http://orcid.org/0000-0002-8051-5981

Institute for Advanced Data Analytics, Department of Industrial Engineering, University of Arkansas, Fayetteville, Arkansas 72701

Search for more papers by this author

Published Online:18 Jul 2019https://doi.org/10.1287/ijoc.2018.0868

Abstract

Feature selection is at the heart of machine learning, and it is effective at facilitating data interpretability and improving prediction performance by defying the curse of dimensionality. Group feature selection is often used to reveal relationships in structured data and provide better predictive power compared with the standard feature selection methods without consideration of the grouped structure. We study a group feature selection problem in networked data in which edge weights are considered as features, while each node in the network is regarded as a group feature. This problem is particularly useful in feature selection for neuroimaging data, where the data are high dimensional and the intrinsic networked structure among the features (i.e., connectivities between regions) in brain data has to be captured properly. We propose a mathematical model based on the support vector machines (SVM), which entails the ℓ₀ norm regularization to restrict the number of nodes (i.e., groups). To cope with the computational challenge of the ℓ₀ norm regularization, we develop a convex relaxation reformulation of the proposed model as a convex semiinfinite programming (SIP). We then introduce a new iterative algorithm that achieves an optimal solution for this convex SIP. Experimental results for synthetic and real brain network data sets show that our approach gives better predictive performance compared with the state-of-the-art group feature selection and the standard feature selection methods. Our technique additionally yields a sparse subnetwork solution that is easier to interpret than those obtained by other methods.

cover image INFORMS Journal on Computing

Volume 32, Issue 1

Winter 2020

Pages 1-198, C2

Article Information

Supplemental Material

Metrics

Information

Received:October 26, 2016
Accepted:August 24, 2018
Published Online:July 18, 2019

Cite as

Daehan Won, Hasan Manzour, Wanpracha Chaovalitwongse (2019) Convex Optimization for Group Feature Selection in Networked Data. INFORMS Journal on Computing 32(1):182-198.

https://doi.org/10.1287/ijoc.2018.0868

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Convex Optimization for Group Feature Selection in Networked Data

Abstract

Volume 32, Issue 1

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News