A Study of Machine Learning Models in Predicting the Intention of Adolescents to Smoke Cigarettes

by   Seung Joon Nam, et al.

The use of electronic cigarette (e-cigarette) is increasing among adolescents. This is problematic since consuming nicotine at an early age can cause harmful effects in developing teenager's brain and health. Additionally, the use of e-cigarette has a possibility of leading to the use of cigarettes, which is more severe. There were many researches about e-cigarette and cigarette that mostly focused on finding and analyzing causes of smoking using conventional statistics. However, there is a lack of research on developing prediction models, which is more applicable to anti-smoking campaign, about e-cigarette and cigarette. In this paper, we research the prediction models that can be used to predict an individual e-cigarette user's (including non-e-cigarette users) intention to smoke cigarettes, so that one can be early informed about the risk of going down the path of smoking cigarettes. To construct the prediction models, five machine learning (ML) algorithms are exploited and tested for their accuracy in predicting the intention to smoke cigarettes among never smokers using data from the 2018 National Youth Tobacco Survey (NYTS). In our investigation, the Gradient Boosting Classifier, one of the prediction models, shows the highest accuracy out of all the other models. Also, with the best prediction model, we made a public website that enables users to input information to predict their intentions of smoking cigarettes.


page 1

page 2

page 3

page 4


The Study of Machine Learning Models in Predicting the Intention of Adolescents to Smoke Cigarettes

The use of electronic cigarette (e-cigarette) is increasing among adoles...

The Probabilistic Bounds on the Feasibility of the Defect Prediction Models in Real-World Testing Environments

The research on developing software defect prediction (SDP) models is ta...

Efficient Click-Through Rate Prediction for Developing Countries via Tabular Learning

Despite the rapid growth of online advertisement in developing countries...

An Interpretable Prediction Model for Obesity Prediction using EHR Data

Childhood obesity is a major public health challenge. Obesity in early c...

Modeling Household Online Shopping Demand in the U.S.: A Machine Learning Approach and Comparative Investigation between 2009 and 2017

Despite the rapid growth of online shopping and research interest in the...

Capturing and incorporating expert knowledge into machine learning models for quality prediction in manufacturing

Increasing digitalization enables the use of machine learning methods fo...

Please sign up or login with your details

Forgot password? Click here to reset