||5R21CA218231-02 Interpret this number
||Kavuluru, Venkata Naga Ramakanth
||University Of Kentucky
||Toward Fine-Grained E-Cigarette Surveillance on Social Media: Evolving Themes, Popularity Predictors, and Demographic Variations
Electronic cigarettes (or e-cigarettes) are currently a popular emerging tobacco product. Because e-cigarettes do
not generate toxic tobacco combustion products produced when smoking regular cigarettes, they are perceived
and sometimes promoted as a less harmful alternative to smoking and also as a means to quit smoking. Although
they may be less harmful, the ef?cacy of using them for smoking cessation has not been demonstrated conclu-
sively with studies indicating evidence both favoring and opposing such an application for them. Furthermore,
owing to their recent introduction, there are also safety concerns given reported adverse events. The US Federal
Drug Administration (FDA) has introduced regulations that went into effect on 8/8/2016 requiring FDA review for
e-cigarette products, banning sales to minors and free samples, and requiring warning labels on certain prod-
ucts. In this context, surveillance of evolving themes and factors contributing to message popularity for e-cigarette
chatter on social media platforms is an important activity. Twitter has become the favorite network for teenagers
and young adults owing to the short message size and associated ease of use on smart phones. For an emerg-
ing product like e-cigarettes, the asymmetric follower-friend connections and hashtag functionality in Twitter offer
a convenient way to propagate information and facilitate discussion. Among online forums, Reddit allows for
longer messages from users inviting speci?c feedback from other users. Within Reddit, the e-cigarette subreddit
facilitates focused discussions on e-cigarette use and products. In this project, we propose to computationally
analyze the contents and user pro?les available in the dataset of all e-cigarette tweets generated during 7/2016?
6/2017 and all e-cigarette subreddit posts/comments generated since 9/2016. We will continue such analyses
with data collected through free but rate limited API throughout the duration of the project. Our ?rst aim is to sur-
face speci?c themes of interest directly from e-cigarette messages using phrase based online and binned topic
models. We expect these themes to complement familiar broad themes that researchers currently consider when
analyzing online messages. Next, we will identify factors (involving message content and pro?le characteristics)
that contribute to different notions of popularity (#retweets, #replies, #up-votes) of e-cigarette tweets/messages.
We expect these results will help health agencies, the FDA, and researchers gain insights into observed viral
nature of certain messages and designing effective strategies to maximize diffusion of their messages. Finally,
we will conduct these analyses along the dimensions of gender, race, and age to grasp variations in themes and
popularity factors speci?c to different vulnerable demographic segments.
Data and systems for medication-related text classification and concept normalization from Twitter: insights from the Social Media Mining for Health (SMM4H)-2017 shared task.
, Belousov M.
, Friedrichs J.
, Hakala K.
, Kiritchenko S.
, Mehryary F.
, Han S.
, Tran T.
, Rios A.
, Kavuluru R.
, et al.
Journal Of The American Medical Informatics Association : Jamia, 2018-10-01 00:00:00.0; 25(10), p. 1274-1283.
On the popularity of the USB flash drive-shaped electronic cigarette Juul.
, Han S.
, Hahn E.J.
Tobacco Control, 2018-04-13 00:00:00.0; , .