About
515
Publications
238,947
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
21,136
Citations
Introduction
Jaideep Srivastava currently works at the Department of Computer Science and Engineering, University of Minnesota Twin Cities. Jaideep does research in Databases, Computing in Social science, Arts and Humanities and Data Mining. Their current project is 'Data Search'.
Skills and Expertise
Current institution
Additional affiliations
January 1984 - September 1988
October 1988 - present
Publications
Publications (515)
The scientific community increasingly relies on open data sharing, yet existing metrics inadequately capture the true impact of datasets as research outputs. Traditional measures, such as the h-index, focus on publications and citations but fail to account for dataset accessibility, reuse, and cross-disciplinary influence. We propose the X-index, a...
Misinformation remains one of the most significant issues in the digital age. While automated fact-checking has emerged as a viable solution, most current systems are limited to evaluating factual accuracy. However, the detrimental effect of misinformation transcends simple falsehoods; it takes advantage of how individuals perceive, interpret, and...
Background
Obstructive sleep apnea (OSA) is a prevalent and potentially severe sleep disorder characterized by repeated interruptions in breathing during sleep. Machine learning models have been increasingly applied in various aspects of OSA research, including diagnosis, treatment optimization, and developing biomarkers for endotypes and disease m...
Background: Heart failure with reduced ejection fraction is a complex condition that necessitates adaptive, patient-specific management strategies. This study aimed to evaluate the effectiveness of a time-adaptive machine learning model, the Passive-Aggressive classifier, in predicting heart failure with reduced ejection fraction severity and captu...
Background: Obstructive sleep apnea (OSA) is a prevalent and potentially severe sleep disorder characterized by repeated interruptions in breathing during sleep. Machine learning models have been increasingly applied in various aspects of OSA research, including diagnosis, treatment optimization, and developing biomarkers for endotypes and disease...
We propose a novel approach to assess the public's coping behavior during the COVID-19 outbreak by examining the emotions. Specifically, we explore (1) changes in the public's emotions with the COVID-19 crisis progression and (2) the impacts of the public's emotions on their information-seeking, information-sharing behaviors, and compliance with st...
Phase retrieval (PR) is fundamentally important in scientific imaging and is crucial for nanoscale techniques like coherent diffractive imaging (CDI). Low radiation dose imaging is essential for applications involving radiation-sensitive samples. However, most PR methods struggle in low-dose scenarios due to high shot noise. Recent advancements in...
For corporations engaged in corporate social advocacy (CSA), establishing legitimacy among publics is challenging when they take stands along clear ideological lines on controversial issues. This study examined two questions: (1) how would the congruence between individuals' political ideologies and corporations' CSA stances influence perceived CSA...
Widespread public crises often give rise to the proliferation of sensationalized rumors and conspiracy theories, which can evoke a variety of public emotions. Despite the growing importance of research on the relationship between emotions and coping behaviors in crisis, a dearth of natural observation-based investigation has been limiting theory de...
Prior to the COVID-19 pandemic, the World Health Organization named vaccine hesitancy as one of the top 10 threats to global health. The impact of hesitancy on the uptake of human papillomavirus (HPV) vaccines was of particular concern, given the markedly lower uptake compared to other adolescent vaccines in some countries, notably the United State...
With the ever-increasing spread of misinformation on online social networks, it has become very important to identify the spreaders of misinformation (unintentional), disinformation (intentional), and misinformation refutation. It can help in educating the first, stopping the second, and soliciting the help of the third category, respectively, in t...
Labeling time series data is an expensive task because of domain expertise and dynamic nature of the data. Hence, we often have to deal with limited labeled data settings. Data augmentation techniques have been successfully deployed in domains like computer vision to exploit the use of existing labeled data. We adapt one of the most commonly used t...
Missing data in time series is a challenging issue affecting time series analysis. Missing data occurs due to problems like data drops or sensor malfunctioning. Imputation methods are used to fill in these values, with quality of imputation having a significant impact on downstream tasks like classification. In this work, we propose a semi-supervis...
Prior to the COVID-19 pandemic, the World Health Organization named vaccine hesitancy as one of the top 10 threats to global health. The impact of hesitancy on uptake of human papillomavirus (HPV) vaccines was of particular concern, given the markedly lower uptake compared to other adolescent vaccines in some countries, notably the United States. W...
Appropriately handling the scalability of clustering is a long-standing challenge for the study of clustering techniques and is of fundamental interest to researchers in the community of data mining and knowledge discovery. In comparison to other clustering methods, hierarchical clustering demonstrates better interpretability of clustering results...
An important aspect of preventing fake news spreading in social networks is to proactively detect the users that are likely going to spread such news. Research in the domain of spreader detection is at a nascent stage compared to fake news detection. In this paper, we propose a graph neural network-based framework to identify nodes that are likely...
Introduction
Upper airway stimulation (UAS) therapy is effective for a subset of obstructive sleep apnea (OSA) patients with CPAP intolerance. While overall adherence is high, some patients have suboptimal adherence to UAS, which limits effectiveness. Our goal was to identify UAS therapy usage patterns during the first three months of therapy that...
Computational models for the detection and prevention of false information spreading (popularly called fake news) has gained a lot of attention over the last decade, with most proposed models identifying the veracity of information. In this chapter we propose a framework based on a complementary approach to false information mitigation inspired fro...
Rumors about brands and products are a growing problem on social media, but
systematic research on rumor-refutation strategies for various brands is limited. To advance
research on rumor-refutation effects in the context of commercial rumor outbreaks, and to help brand managers better fight detrimental rumors against their brands, this study examin...
Study objectives:
Upper airway stimulation (UAS) therapy is effective for a subset of obstructive sleep apnea (OSA) patients with continuous positive airway pressure (CPAP) intolerance. While overall adherence is high, some patients have suboptimal adherence, which limits efficacy. Our goal was to identify therapy usage patterns during the first t...
Measuring influence of one person on another has applications in advertising and marketing and across the sciences. Most approaches involve inferring influence based on speech and social media. In contrast, this paper takes existing spending data and attributes influence on to the spenders and those likely to have caused their spending. The resulti...
Modeling infection spread during pandemics is not new, with models using past data to tune simulation parameters for predictions. These help in understanding of the healthcare burden posed by a pandemic and responding accordingly. However, the problem of how college/university campuses should function during a pandemic is new for the following reas...
Recombinant human growth hormone (r-hGH) is an established therapy for growth hormone deficiency (GHD); yet, some patients fail to achieve their full height potential, with poor adherence and persistence with the prescribed regimen often a contributing factor. A data-driven clinical decision support system based on “traffic light” visualizations fo...
One of the main challenges for hierarchical clustering is how to appropriately identify the representative points in the lower level of the cluster tree, which are going to be utilized as the roots in the higher level of the cluster tree for further aggregation. However, conventional hierarchical clustering approaches have adopted some simple trick...
The quality of sleep has a deep impact on people's physical and mental health. People with insufficient sleep are more likely to report physical and mental distress, activity limitation, anxiety, and pain. Moreover, in the past few years, there has been an explosion of applications and devices for activity monitoring and health tracking. Signals co...
This study examined the influence of consumers’ temporary affective states during ad exposure on their engagement with different types of ads that are categorized based on theoretically derived attention-grabbing characteristics. A computational research approach was used, cross-analyzing proxy measures of real-time affective fluctuation of viewers...
Simulation models for infection spread can help understand what factors play a major role in infection spread. Health agencies like the Center for Disease Control (CDC) can accordingly mandate effective guidelines to curb the spread. We built an infection spread model to simulate disease propagation through airborne transmission to study the impact...
Gold farming and real money trade refer to a set of illicit practices in massively multiplayer online games (MMOGs) whereby players accumulate virtual resources to sell for “real world” money. Prior work has examined trade relationships formed by gold farmers but not the trust relationships which exist between members of these organizations. We ado...
Many machine learning models have been built to tackle information overload issues on Massive Open Online Courses (MOOC) platforms. These models rely on learning powerful representations of MOOC entities. However, they suffer from the problem of scarce expert label data. To overcome this problem, we propose to learn pre-trained representations of M...
The problem of consistent therapy adherence is a current challenge for health informatics, and its solution can increase the success rate of treatments. Here we show a methodology to predict, at individual-level, future therapy adherence for patients receiving daily injections of growth hormone (GH) therapy for GH deficiency. Our proposed model is...
False information and true information fact checking it, often co-exist in social networks, each competing to influence people in their spread paths. An efficient strategy here to contain false information is to proactively identify if nodes in the spread path are likely to endorse false information (i.e. further spread it) or refutation informatio...
Recommending relevant items to users is a crucial task on online communities such as Reddit and Twitter. For recommendation system, representation learning presents a powerful technique that learns embeddings to represent user behaviors and capture item properties. However, learning embeddings on online communities is a challenging task because the...
Recommending relevant items to users is a crucial task on online communities such as Reddit and Twitter. For recommendation system, representation learning presents a powerful technique that learns embeddings to represent user behaviors and capture item properties. However, learning embeddings on online communities is a challenging task because the...
False information and true information fact checking it, often co-exist in social networks, each competing to influence people in their spread paths. An efficient strategy here to contain false information is to proactively identify if nodes in the spread path are likely to endorse false information (i.e. further spread it) or refutation informatio...
This study explores the extent to which U.S. newspaper organizations incorporate images captured by private citizens into their news articles as an audience engagement strategy, and examines the relationship between the extent to which citizen-eyewitness images are incorporated in the news and audience news engagement. By conducting a machine-codin...
False information and true information fact checking it, often co-exist in social networks, each competing to influence people in their spread paths. An efficient strategy here to contain false information is to proactively identify if nodes in the spread path are likely to endorse false information (i.e. further spread it) or refutation informatio...
The plague of false information, popularly called fake news has affected lives of news consumers ever since the prevalence of social media. Thus understanding the spread of false information in social networks has gained a lot of attention in the literature. While most proposed models do content analysis of the information, no much work has been do...
Leading up to August 2020, COVID-19 has spread to almost every country in the world, causing millions of infected and hundreds of thousands of deaths. In this paper, we first verify the assumption that clinical variables could have time-varying effects on COVID-19 outcomes. Then, we develop a temporal stratification approach to make daily predictio...
Knowledge tracing (KT) is the problem of modeling each student's mastery of knowledge concepts (KCs) as (s)he engages with a sequence of learning activities. It is an active research area to help provide learners with personalized feedback and materials. Various deep learning techniques have been proposed for solving KT. Recent release of large-sca...
In recent years, Massive Open Online Courses (MOOCs) have witnessed immense growth in popularity. Now, due to the recent Covid19 pandemic situation, it is important to push the limits of online education. Discussion forums are primary means of interaction among learners and instructors. However, with growing class size, students face the challenge...
The 3-volume set LNAI 12712-12714 constitutes the proceedings of the 25th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, PAKDD 2021, which was held during May 11-14, 2021.
The 157 papers included in the proceedings were carefully reviewed and selected from a total of 628 submissions. They were organized in topical secti...
The 3-volume set LNAI 12712-12714 constitutes the proceedings of the 25th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, PAKDD 2021, which was held during May 11-14, 2021.
The 157 papers included in the proceedings were carefully reviewed and selected from a total of 628 submissions. They were organized in topical secti...
The 3-volume set LNAI 12712-12714 constitutes the proceedings of the 25th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, PAKDD 2021, which was held during May 11-14, 2021.
The 157 papers included in the proceedings were carefully reviewed and selected from a total of 628 submissions. They were organized in topical secti...
An important aspect of preventing fake news dissemination is to proactively detect the likelihood of its spreading. Research in the domain of fake news spreader detection has not been explored much from a network analysis perspective. In this paper, we propose a graph neural network based approach to identify nodes that are likely to become spreade...
Research in fake news detection and prevention has gained a lot of attention over the past decade, with most models using features generated from content and propagation paths. Complementary to these approaches, in this position paper we outline a framework inspired from the domain of epidemiology that proposes to identify people who are likely to...
An important aspect of preventing fake news dissemination is to proactively detect the likelihood of its spreading. Research in the domain of fake news spreader detection has not been explored much from a network analysis perspective. In this paper, we propose a graph neural network based approach to identify nodes that are likely to become spreade...
Modeling infection spread during pandemics is not new, with models using past data to tune simulation parameters for predictions. These help understand the healthcare burden posed by a pandemic and respond accordingly. However, the problem of how college/university campuses should function during a pandemic is new for the following reasons:(i) soci...
The world has transitioned into a new phase of online learning in response to the recent Covid19 pandemic. Now more than ever, it has become paramount to push the limits of online learning in every manner to keep flourishing the education system. One crucial component of online learning is Knowledge Tracing (KT). The aim of KT is to model student's...
Over the past 40 years, we have witnessed seismic shifts in advertising planning and buying processes. Due in no small part to the emergence of digital media, consumer choices have mushroomed, while advertisers understand much more about target audiences. Advertising activities have been drastically transformed by the possibilities that technology...
Upper-Airway Stimulation (UAS) therapy is an innovative alternative to Continuous Positive Airway Pressure (CPAP) treatment for patients with obstructive sleep apnea (OSA) and CPAP intolerance. Patients who have implanted a UAS device are responsible for activating and managing the therapy at home before sleep. Consistent nightly use is required fo...
This study examines the role of source trust in viral ad diffusion, specifically the impact of source trust on the reach and speed of ad diffusion. It tests the feasibility of using computer-algorithm-generated social media metrics, indicating the degree to which each person is trusted by others within a social network, for trust-based viral ad see...
The use of social media platforms such as Twitter by affected people during crises is considered a vital source of information for crisis response. However, rapid crisis response requires real-time analysis of online information. When a disaster happens, among other data processing techniques, supervised machine learning can help classify online in...
Negative rumors about brands and products, which can cause serious damages to the involved brands, are a growing problem on social media. However, systematic research on rumor-refutation strategies is extremely limited. To advance research on commercial rumor spread and suppression, and to help brand managers better fight detrimental rumors against...
Intelligent transportation systems are a key component in smart cities, and the estimation and prediction of the spatiotemporal traffic state is critical to capture the dynamics of traffic congestion, i.e., its generation, propagation and mitigation, in order to increase operational efficiency and improve livability within smart cities. And while s...
Monitoring patients in ICU is a challenging and high-cost task. Hence, predicting the condition of patients during their ICU stay can help provide better acute care and plan the hospital's resources. There has been continuous progress in machine learning research for ICU management, and most of this work has focused on using time series signals rec...
Understanding the spread of false information in social networks has gained a lot of recent attention. In this paper, we explore the role community structures play in determining how people get exposed to fake news. Inspired by approaches in epidemiology, we propose a novel Community Health Assessment model, whose goal is to understand the vulnerab...
Sufficient physical activity and restful sleep play a major role in the prevention and cure of many chronic conditions. Being able to proactively screen and monitor such chronic conditions would be a big step forward for overall health. The rapid increase in the popularity of wearable devices pro-vides a significant new source, making it possible t...
A multi-modal transportation system of a city can be modeled as a multiplex network with different layers corresponding to different transportation modes. These layers include, but are not limited to, bus network, metro network, and road network. Formally, a multiplex network is a multilayer graph in which the same set of nodes are connected by dif...
Sufficient physical activity and restful sleep play a major role in the prevention and cure of many chronic conditions. Being able to proactively screen and monitor such chronic conditions would be a big step forward for overall health. The rapid increase in the popularity of wearable devices provides a significant new source, making it possible to...
Ubiquitous use of social media such as microblogging platforms opens unprecedented chances for false information to diffuse online. Facing the challenges in such a so-called “post-fact” era, it is very important for intelligent systems to not only check the veracity of information but also verify the authenticity of the users who spread the informa...
News organizations are increasingly using social media to reach out to their audience aimed at raising their attention and engagement with news. Given the continuous decrease in subscription rates and audience trust in news media, it is imperative for news organizations to understand factors contributing to their relationships with the audience. Us...
Sleep plays a vital role in human health, both mental and physical. Sleep disorders like sleep apnea are increasing in prevalence, with the rapid increase in factors like obesity. Sleep apnea is most commonly treated with Continuous Positive Air Pressure (CPAP) therapy, which maintains the appropriate pressure to ensure continuous airflow. It is wi...
Introduction
CPAP is the optimal treatment for obstructive sleep apnea, but is limited by low adherence. Fairview’s sleep program actively tracks PAP usage and outcomes and employs tele-health coaching to improve adherence. This approach has achieved 6-month adherence rates of 71%. However, this protocol is applied uniformly and is labor-intensive,...
Sleep apnea is a growing problem in the country, with over 200,000 new cases being identified each year. Continuous positive airway pressure (CPAP) is the best treatment for obstructive sleep apnea (OSA), but is limited by low adherence to treatment. Fairview's Sleep program actively tracks CPAP usage and outcomes and employs tele-health coaching t...
Obesity is one of the major health risk factors behind the rise of non-communicable conditions. Understanding the factors influencing obesity is very complex since there are many variables that can affect the health behaviors leading to it. Nowadays, multiple data sources can be used to study health behaviors, such as wearable sensors for physical...
Physical activity and sleep play a major role in the prevention and management of many chronic conditions. It is not a trivial task to understand their impact on chronic conditions. Currently, data from electronic health records (EHRs), sleep lab studies, and activity/sleep logs are used. The rapid increase in the popularity of wearable health devi...
Multilayer networks have been the subject of intense research in the recent years in different applications. However, in urban mobility, the multi-layer nature of transportation systems has been generally ignored, even though most large cities are spanned by more than one transportation system. These different modes of transport have usually been s...
In an effort to curb air pollution, the city of Delhi (India), known to be one of the most populated, polluted, and congested cities in the world has run a trial experiment in two phases of 15 days intervals. During the experiment, most of four-wheeled vehicles were constrained to move on alternate days based on whether their plate numbers ended wi...
Ubiquitous use of social media such as microblog-ging platforms brings about ample opportunities for the false information to diffuse online. It is very important not just to determine the veracity of information but also the authenticity of the users who spread the information, especially in time-critical situations like real-world emergencies, wh...
Assigning relevant keywords to documents is very important for efficient retrieval, clustering and management of the documents. Especially with the web corpus deluged with digital documents, automation of this task is of prime importance. Keyword assignment is a broad topic of research which refers to tagging of document with keywords, key-phrases...
Sleep and physical activity are human behaviors that play a major role in our health. Poor sleep or lack of physical activity have been found to increase health risks and reduce quality of life. The rapid adoption and evolution of pervasive computing systems, both in the health and wellness domain, are creating a new data-intensive context in which...
Obesity is one of the major health risk factors be- hind the rise of non-communicable conditions. Understanding the factors influencing obesity is very complex since there are many variables that can affect the health behaviors leading to it. Nowadays, multiple data sources can be used to study health behaviors, such as wearable sensors for physica...
Scientific datasets play a crucial role in data-driven research. While there are several repositories that curate public datasets, several more datasets and their usage is hidden in the research publications. Hence, discovering a relevant dataset for a research topic requires in-depth investigation of several publications, tracking dataset usage an...
Background
The explosion of consumer electronics and social media are facilitating the rise of the Quantified Self (QS) movement where millions of users are tracking various aspects of their daily life using social media, mobile technology, and wearable devices. Data from mobile phones, wearables and social media can facilitate a better understandi...
Sleep is an important human behavior with a deep impact on quality of life. Inadequate sleep quality negatively affects both mental and physical well-being, and exacerbates many health problems such as diabetes, depression, cancer and obesity. Alarmingly, poor sleep is becoming a growing concern in our society. Increased efforts toward the developm...
The precision matrix is the inverse of the covariance matrix. Estimating large sparse precision matrices is an interesting and a challenging problem in many fields of sciences, engineering, humanities and machine learning problems in general. Recent applications often encounter high dimensionality with a limited number of data points leading to a n...
The study of small collaborations or teams is an important endeavor both in industry and academia. The social phenomena responsible for formation or evolution of such small groups is quite different from those for dyadic relations like friendship or large size guilds (or communities). In small groups when social actors collaborate for various tasks...
This paper reports a detailed empirical study of interpersonal trust in a multi-relational online social network. This study addresses two main aspects of interpersonal trust: formation and reciprocation. Computational models developed, using multi-relational networks, for these processes provide interesting insights about online social interaction...
Trust is an important factor, particularly in viral/social advertising, and computing trust scores for individual users of a social network is crucial for several applications in the advertising research and practice. However, research on trust in the advertising field has been limited, and the application of computational trust to advertising rese...








































































































































![Figure 2: Transition diagram for OSA patients [22] with nonREM states 1...](profile/Jaideep-Srivastava/publication/326586529/figure/fig1/AS:652141052039168@1532493951300/Transition-diagram-for-OSA-patients-22-with-nonREM-states-1-and-2-combined-as-Light-L_Q320.jpg)





























































