Training Experimentally Robust and Interpretable Binarized Regression Models Using Mixed-Integer Programming

12/01/2021
by   Sanjana Tule, et al.
0

In this paper, we explore model-based approach to training robust and interpretable binarized regression models for multiclass classification tasks using Mixed-Integer Programming (MIP). Our MIP model balances the optimization of prediction margin and model size by using a weighted objective that: minimizes the total margin of incorrectly classified training instances, maximizes the total margin of correctly classified training instances, and maximizes the overall model regularization. We conduct two sets of experiments to test the classification accuracy of our MIP model over standard and corrupted versions of multiple classification datasets, respectively. In the first set of experiments, we show that our MIP model outperforms an equivalent Pseudo-Boolean Optimization (PBO) model and achieves competitive results to Logistic Regression (LR) and Gradient Descent (GD) in terms of classification accuracy over the standard datasets. In the second set of experiments, we show that our MIP model outperforms the other models (i.e., GD and LR) in terms of classification accuracy over majority of the corrupted datasets. Finally, we visually demonstrate the interpretability of our MIP model in terms of its learned parameters over the MNIST dataset. Overall, we show the effectiveness of training robust and interpretable binarized regression models using MIP.

READ FULL TEXT

page 10

page 13

research
10/19/2022

Margin Optimal Classification Trees

In recent years there has been growing attention to interpretable machin...
research
02/20/2023

Model-based feature selection for neural networks: A mixed-integer programming approach

In this work, we develop a novel input feature selection framework for R...
research
01/10/2017

Efficient Image Set Classification using Linear Regression based Image Reconstruction

We propose a novel image set classification technique using linear regre...
research
11/18/2022

A Mathematical Programming Approach to Optimal Classification Forests

In this paper we propose a novel mathematical optimization based methodo...
research
02/07/2018

Cadre Modeling: Simultaneously Discovering Subpopulations and Predictive Models

We consider the problem in regression analysis of identifying subpopulat...
research
06/27/2013

Supersparse Linear Integer Models for Interpretable Classification

Scoring systems are classification models that only require users to add...
research
12/07/2022

The BeMi Stardust: a Structured Ensemble of Binarized Neural Networks

Binarized Neural Networks (BNNs) are receiving increasing attention due ...

Please sign up or login with your details

Forgot password? Click here to reset