Hostname: page-component-7bb8b95d7b-nptnm Total loading time: 0 Render date: 2024-09-25T21:28:08.950Z Has data issue: false hasContentIssue false

4288 Identifying Predictive Variables of High-Intensity Binge Drinking Through the Use of a Machine Learning Algorithm

Published online by Cambridge University Press:  29 July 2020

James Keoni Morris
Affiliation:
National Institutes of Health
Josh L. Gowin
Affiliation:
University of Colorado Denver
Melanie L. Schwandt
Affiliation:
National Insititute on Alcohol Abuse and Alcoholism
Nancy Diazgranados
Affiliation:
National Institute on Alcohol Abuse and Alcoholism
Vijay A. Ramchandani
Affiliation:
National Institute on Alcohol Abuse and Alcoholism
Rights & Permissions [Opens in a new window]

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.

OBJECTIVES/GOALS: To test if a machine learning algorithm could predict a person’s capacity to binge drink and explore what measures might be important for identifying individuals at risk for high-intensity binge drinking behaviors. METHODS/STUDY POPULATION: The sample included 1177 (474 female) non-treatment-seeking drinkers (age: 18-91 years), that were assigned to a group based on their heaviest drinking day reported in a 90-Day Alcohol Timeline Followback questionnaire. The groups were Non-Bingers (female: 12 drinks, male:>15 drinks). The sample was divided into a training sample (N = 884) and a testing sample (N = 293). A machine learning algorithm called random forest was then used to generate a predictive model based on measures of substance use, personality traits, and trauma. The model was applied to the testing sample to determine accuracy. RESULTS/ANTICIPATED RESULTS: The first model correctly assigned 190 out of 293 subjects, giving it a total error rate of 0.35, with lowest rates for non-binge (0.19) and high-intensity (0.18), while medium-intensity had the highest error rate (0.86). The most important variables for the accuracy of the model included: total score on the Alcohol Use Disorder Identification Test, first five sub-score of the Self-Reported Effects of Alcohol, Compulsive Drinking subscale, and presence of a current psychiatric diagnosis. As a follow-up analysis, we built and tested another random forest model without the use of drinking dependence measures. This model had a total error rate of 0.39, and introduced other important variables such as smoking behaviors, perceived stress, IQ, and number of negative life events. DISCUSSION/SIGNIFICANCE OF IMPACT: Our study showed that it was possible for a machine learning algorithm to predict binge drinking intensity better than chance. Drinking patterns were the most robust predictors, and stress, IQ, and psychiatric diagnoses were also useful in predicting binge drinking intensity.

Type
Translational Science, Policy, & Health Outcomes Science
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Association for Clinical and Translational Science 2020