Jaideep Srivastava

Jaideep Srivastava
  • PhD, University of California, Berkeley
  • Professor (Full) at University of Minnesota

About

515
Publications
238,947
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
21,136
Citations
Introduction
Jaideep Srivastava currently works at the Department of Computer Science and Engineering, University of Minnesota Twin Cities. Jaideep does research in Databases, Computing in Social science, Arts and Humanities and Data Mining. Their current project is 'Data Search'.
Current institution
University of Minnesota
Current position
  • Professor (Full)
Additional affiliations
January 1984 - September 1988
University of California, Berkeley
Position
  • Research Associate
October 1988 - present
University of Minnesota
Position
  • Professor (Full)

Publications

Publications (515)
Preprint
The scientific community increasingly relies on open data sharing, yet existing metrics inadequately capture the true impact of datasets as research outputs. Traditional measures, such as the h-index, focus on publications and citations but fail to account for dataset accessibility, reuse, and cross-disciplinary influence. We propose the X-index, a...
Preprint
Full-text available
Misinformation remains one of the most significant issues in the digital age. While automated fact-checking has emerged as a viable solution, most current systems are limited to evaluating factual accuracy. However, the detrimental effect of misinformation transcends simple falsehoods; it takes advantage of how individuals perceive, interpret, and...
Article
Background Obstructive sleep apnea (OSA) is a prevalent and potentially severe sleep disorder characterized by repeated interruptions in breathing during sleep. Machine learning models have been increasingly applied in various aspects of OSA research, including diagnosis, treatment optimization, and developing biomarkers for endotypes and disease m...
Article
Full-text available
Background: Heart failure with reduced ejection fraction is a complex condition that necessitates adaptive, patient-specific management strategies. This study aimed to evaluate the effectiveness of a time-adaptive machine learning model, the Passive-Aggressive classifier, in predicting heart failure with reduced ejection fraction severity and captu...
Preprint
Full-text available
Background: Obstructive sleep apnea (OSA) is a prevalent and potentially severe sleep disorder characterized by repeated interruptions in breathing during sleep. Machine learning models have been increasingly applied in various aspects of OSA research, including diagnosis, treatment optimization, and developing biomarkers for endotypes and disease...
Preprint
We propose a novel approach to assess the public's coping behavior during the COVID-19 outbreak by examining the emotions. Specifically, we explore (1) changes in the public's emotions with the COVID-19 crisis progression and (2) the impacts of the public's emotions on their information-seeking, information-sharing behaviors, and compliance with st...
Article
Phase retrieval (PR) is fundamentally important in scientific imaging and is crucial for nanoscale techniques like coherent diffractive imaging (CDI). Low radiation dose imaging is essential for applications involving radiation-sensitive samples. However, most PR methods struggle in low-dose scenarios due to high shot noise. Recent advancements in...
Article
Full-text available
For corporations engaged in corporate social advocacy (CSA), establishing legitimacy among publics is challenging when they take stands along clear ideological lines on controversial issues. This study examined two questions: (1) how would the congruence between individuals' political ideologies and corporations' CSA stances influence perceived CSA...
Article
Full-text available
Widespread public crises often give rise to the proliferation of sensationalized rumors and conspiracy theories, which can evoke a variety of public emotions. Despite the growing importance of research on the relationship between emotions and coping behaviors in crisis, a dearth of natural observation-based investigation has been limiting theory de...
Article
Full-text available
Prior to the COVID-19 pandemic, the World Health Organization named vaccine hesitancy as one of the top 10 threats to global health. The impact of hesitancy on the uptake of human papillomavirus (HPV) vaccines was of particular concern, given the markedly lower uptake compared to other adolescent vaccines in some countries, notably the United State...
Preprint
Full-text available
With the ever-increasing spread of misinformation on online social networks, it has become very important to identify the spreaders of misinformation (unintentional), disinformation (intentional), and misinformation refutation. It can help in educating the first, stopping the second, and soliciting the help of the third category, respectively, in t...
Preprint
Full-text available
Labeling time series data is an expensive task because of domain expertise and dynamic nature of the data. Hence, we often have to deal with limited labeled data settings. Data augmentation techniques have been successfully deployed in domains like computer vision to exploit the use of existing labeled data. We adapt one of the most commonly used t...
Preprint
Full-text available
Missing data in time series is a challenging issue affecting time series analysis. Missing data occurs due to problems like data drops or sensor malfunctioning. Imputation methods are used to fill in these values, with quality of imputation having a significant impact on downstream tasks like classification. In this work, we propose a semi-supervis...
Preprint
Full-text available
Prior to the COVID-19 pandemic, the World Health Organization named vaccine hesitancy as one of the top 10 threats to global health. The impact of hesitancy on uptake of human papillomavirus (HPV) vaccines was of particular concern, given the markedly lower uptake compared to other adolescent vaccines in some countries, notably the United States. W...
Article
Appropriately handling the scalability of clustering is a long-standing challenge for the study of clustering techniques and is of fundamental interest to researchers in the community of data mining and knowledge discovery. In comparison to other clustering methods, hierarchical clustering demonstrates better interpretability of clustering results...
Article
Full-text available
An important aspect of preventing fake news spreading in social networks is to proactively detect the users that are likely going to spread such news. Research in the domain of spreader detection is at a nascent stage compared to fake news detection. In this paper, we propose a graph neural network-based framework to identify nodes that are likely...
Article
Introduction Upper airway stimulation (UAS) therapy is effective for a subset of obstructive sleep apnea (OSA) patients with CPAP intolerance. While overall adherence is high, some patients have suboptimal adherence to UAS, which limits effectiveness. Our goal was to identify UAS therapy usage patterns during the first three months of therapy that...
Chapter
Computational models for the detection and prevention of false information spreading (popularly called fake news) has gained a lot of attention over the last decade, with most proposed models identifying the veracity of information. In this chapter we propose a framework based on a complementary approach to false information mitigation inspired fro...
Article
Rumors about brands and products are a growing problem on social media, but systematic research on rumor-refutation strategies for various brands is limited. To advance research on rumor-refutation effects in the context of commercial rumor outbreaks, and to help brand managers better fight detrimental rumors against their brands, this study examin...
Article
Full-text available
Study objectives: Upper airway stimulation (UAS) therapy is effective for a subset of obstructive sleep apnea (OSA) patients with continuous positive airway pressure (CPAP) intolerance. While overall adherence is high, some patients have suboptimal adherence, which limits efficacy. Our goal was to identify therapy usage patterns during the first t...
Article
Full-text available
Measuring influence of one person on another has applications in advertising and marketing and across the sciences. Most approaches involve inferring influence based on speech and social media. In contrast, this paper takes existing spending data and attributes influence on to the spenders and those likely to have caused their spending. The resulti...
Article
Modeling infection spread during pandemics is not new, with models using past data to tune simulation parameters for predictions. These help in understanding of the healthcare burden posed by a pandemic and responding accordingly. However, the problem of how college/university campuses should function during a pandemic is new for the following reas...
Chapter
Full-text available
Recombinant human growth hormone (r-hGH) is an established therapy for growth hormone deficiency (GHD); yet, some patients fail to achieve their full height potential, with poor adherence and persistence with the prescribed regimen often a contributing factor. A data-driven clinical decision support system based on “traffic light” visualizations fo...
Preprint
One of the main challenges for hierarchical clustering is how to appropriately identify the representative points in the lower level of the cluster tree, which are going to be utilized as the roots in the higher level of the cluster tree for further aggregation. However, conventional hierarchical clustering approaches have adopted some simple trick...
Preprint
Full-text available
The quality of sleep has a deep impact on people's physical and mental health. People with insufficient sleep are more likely to report physical and mental distress, activity limitation, anxiety, and pain. Moreover, in the past few years, there has been an explosion of applications and devices for activity monitoring and health tracking. Signals co...
Article
This study examined the influence of consumers’ temporary affective states during ad exposure on their engagement with different types of ads that are categorized based on theoretically derived attention-grabbing characteristics. A computational research approach was used, cross-analyzing proxy measures of real-time affective fluctuation of viewers...
Conference Paper
Full-text available
Simulation models for infection spread can help understand what factors play a major role in infection spread. Health agencies like the Center for Disease Control (CDC) can accordingly mandate effective guidelines to curb the spread. We built an infection spread model to simulate disease propagation through airborne transmission to study the impact...
Article
Gold farming and real money trade refer to a set of illicit practices in massively multiplayer online games (MMOGs) whereby players accumulate virtual resources to sell for “real world” money. Prior work has examined trade relationships formed by gold farmers but not the trust relationships which exist between members of these organizations. We ado...
Preprint
Many machine learning models have been built to tackle information overload issues on Massive Open Online Courses (MOOC) platforms. These models rely on learning powerful representations of MOOC entities. However, they suffer from the problem of scarce expert label data. To overcome this problem, we propose to learn pre-trained representations of M...
Chapter
Full-text available
The problem of consistent therapy adherence is a current challenge for health informatics, and its solution can increase the success rate of treatments. Here we show a methodology to predict, at individual-level, future therapy adherence for patients receiving daily injections of growth hormone (GH) therapy for GH deficiency. Our proposed model is...
Chapter
False information and true information fact checking it, often co-exist in social networks, each competing to influence people in their spread paths. An efficient strategy here to contain false information is to proactively identify if nodes in the spread path are likely to endorse false information (i.e. further spread it) or refutation informatio...
Chapter
Recommending relevant items to users is a crucial task on online communities such as Reddit and Twitter. For recommendation system, representation learning presents a powerful technique that learns embeddings to represent user behaviors and capture item properties. However, learning embeddings on online communities is a challenging task because the...
Preprint
Full-text available
Recommending relevant items to users is a crucial task on online communities such as Reddit and Twitter. For recommendation system, representation learning presents a powerful technique that learns embeddings to represent user behaviors and capture item properties. However, learning embeddings on online communities is a challenging task because the...
Conference Paper
Full-text available
False information and true information fact checking it, often co-exist in social networks, each competing to influence people in their spread paths. An efficient strategy here to contain false information is to proactively identify if nodes in the spread path are likely to endorse false information (i.e. further spread it) or refutation informatio...
Article
This study explores the extent to which U.S. newspaper organizations incorporate images captured by private citizens into their news articles as an audience engagement strategy, and examines the relationship between the extent to which citizen-eyewitness images are incorporated in the news and audience news engagement. By conducting a machine-codin...
Preprint
Full-text available
False information and true information fact checking it, often co-exist in social networks, each competing to influence people in their spread paths. An efficient strategy here to contain false information is to proactively identify if nodes in the spread path are likely to endorse false information (i.e. further spread it) or refutation informatio...
Preprint
Full-text available
The plague of false information, popularly called fake news has affected lives of news consumers ever since the prevalence of social media. Thus understanding the spread of false information in social networks has gained a lot of attention in the literature. While most proposed models do content analysis of the information, no much work has been do...
Preprint
Full-text available
Leading up to August 2020, COVID-19 has spread to almost every country in the world, causing millions of infected and hundreds of thousands of deaths. In this paper, we first verify the assumption that clinical variables could have time-varying effects on COVID-19 outcomes. Then, we develop a temporal stratification approach to make daily predictio...
Preprint
Full-text available
Knowledge tracing (KT) is the problem of modeling each student's mastery of knowledge concepts (KCs) as (s)he engages with a sequence of learning activities. It is an active research area to help provide learners with personalized feedback and materials. Various deep learning techniques have been proposed for solving KT. Recent release of large-sca...
Preprint
In recent years, Massive Open Online Courses (MOOCs) have witnessed immense growth in popularity. Now, due to the recent Covid19 pandemic situation, it is important to push the limits of online education. Discussion forums are primary means of interaction among learners and instructors. However, with growing class size, students face the challenge...
Book
The 3-volume set LNAI 12712-12714 constitutes the proceedings of the 25th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, PAKDD 2021, which was held during May 11-14, 2021. The 157 papers included in the proceedings were carefully reviewed and selected from a total of 628 submissions. They were organized in topical secti...
Book
The 3-volume set LNAI 12712-12714 constitutes the proceedings of the 25th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, PAKDD 2021, which was held during May 11-14, 2021. The 157 papers included in the proceedings were carefully reviewed and selected from a total of 628 submissions. They were organized in topical secti...
Book
The 3-volume set LNAI 12712-12714 constitutes the proceedings of the 25th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, PAKDD 2021, which was held during May 11-14, 2021. The 157 papers included in the proceedings were carefully reviewed and selected from a total of 628 submissions. They were organized in topical secti...
Conference Paper
Full-text available
An important aspect of preventing fake news dissemination is to proactively detect the likelihood of its spreading. Research in the domain of fake news spreader detection has not been explored much from a network analysis perspective. In this paper, we propose a graph neural network based approach to identify nodes that are likely to become spreade...
Conference Paper
Full-text available
Research in fake news detection and prevention has gained a lot of attention over the past decade, with most models using features generated from content and propagation paths. Complementary to these approaches, in this position paper we outline a framework inspired from the domain of epidemiology that proposes to identify people who are likely to...
Preprint
An important aspect of preventing fake news dissemination is to proactively detect the likelihood of its spreading. Research in the domain of fake news spreader detection has not been explored much from a network analysis perspective. In this paper, we propose a graph neural network based approach to identify nodes that are likely to become spreade...
Preprint
Modeling infection spread during pandemics is not new, with models using past data to tune simulation parameters for predictions. These help understand the healthcare burden posed by a pandemic and respond accordingly. However, the problem of how college/university campuses should function during a pandemic is new for the following reasons:(i) soci...
Preprint
The world has transitioned into a new phase of online learning in response to the recent Covid19 pandemic. Now more than ever, it has become paramount to push the limits of online learning in every manner to keep flourishing the education system. One crucial component of online learning is Knowledge Tracing (KT). The aim of KT is to model student's...
Article
Full-text available
Over the past 40 years, we have witnessed seismic shifts in advertising planning and buying processes. Due in no small part to the emergence of digital media, consumer choices have mushroomed, while advertisers understand much more about target audiences. Advertising activities have been drastically transformed by the possibilities that technology...
Conference Paper
Upper-Airway Stimulation (UAS) therapy is an innovative alternative to Continuous Positive Airway Pressure (CPAP) treatment for patients with obstructive sleep apnea (OSA) and CPAP intolerance. Patients who have implanted a UAS device are responsible for activating and managing the therapy at home before sleep. Consistent nightly use is required fo...
Article
Full-text available
This study examines the role of source trust in viral ad diffusion, specifically the impact of source trust on the reach and speed of ad diffusion. It tests the feasibility of using computer-algorithm-generated social media metrics, indicating the degree to which each person is trusted by others within a social network, for trust-based viral ad see...
Chapter
The use of social media platforms such as Twitter by affected people during crises is considered a vital source of information for crisis response. However, rapid crisis response requires real-time analysis of online information. When a disaster happens, among other data processing techniques, supervised machine learning can help classify online in...
Preprint
Negative rumors about brands and products, which can cause serious damages to the involved brands, are a growing problem on social media. However, systematic research on rumor-refutation strategies is extremely limited. To advance research on commercial rumor spread and suppression, and to help brand managers better fight detrimental rumors against...
Article
Full-text available
Intelligent transportation systems are a key component in smart cities, and the estimation and prediction of the spatiotemporal traffic state is critical to capture the dynamics of traffic congestion, i.e., its generation, propagation and mitigation, in order to increase operational efficiency and improve livability within smart cities. And while s...
Preprint
Monitoring patients in ICU is a challenging and high-cost task. Hence, predicting the condition of patients during their ICU stay can help provide better acute care and plan the hospital's resources. There has been continuous progress in machine learning research for ICU management, and most of this work has focused on using time series signals rec...
Conference Paper
Full-text available
Understanding the spread of false information in social networks has gained a lot of recent attention. In this paper, we explore the role community structures play in determining how people get exposed to fake news. Inspired by approaches in epidemiology, we propose a novel Community Health Assessment model, whose goal is to understand the vulnerab...
Article
Sufficient physical activity and restful sleep play a major role in the prevention and cure of many chronic conditions. Being able to proactively screen and monitor such chronic conditions would be a big step forward for overall health. The rapid increase in the popularity of wearable devices pro-vides a significant new source, making it possible t...
Article
Full-text available
A multi-modal transportation system of a city can be modeled as a multiplex network with different layers corresponding to different transportation modes. These layers include, but are not limited to, bus network, metro network, and road network. Formally, a multiplex network is a multilayer graph in which the same set of nodes are connected by dif...
Preprint
Full-text available
Sufficient physical activity and restful sleep play a major role in the prevention and cure of many chronic conditions. Being able to proactively screen and monitor such chronic conditions would be a big step forward for overall health. The rapid increase in the popularity of wearable devices provides a significant new source, making it possible to...
Article
Full-text available
Ubiquitous use of social media such as microblogging platforms opens unprecedented chances for false information to diffuse online. Facing the challenges in such a so-called “post-fact” era, it is very important for intelligent systems to not only check the veracity of information but also verify the authenticity of the users who spread the informa...
Preprint
Full-text available
News organizations are increasingly using social media to reach out to their audience aimed at raising their attention and engagement with news. Given the continuous decrease in subscription rates and audience trust in news media, it is imperative for news organizations to understand factors contributing to their relationships with the audience. Us...
Preprint
Full-text available
Sleep plays a vital role in human health, both mental and physical. Sleep disorders like sleep apnea are increasing in prevalence, with the rapid increase in factors like obesity. Sleep apnea is most commonly treated with Continuous Positive Air Pressure (CPAP) therapy, which maintains the appropriate pressure to ensure continuous airflow. It is wi...
Article
Full-text available
Introduction CPAP is the optimal treatment for obstructive sleep apnea, but is limited by low adherence. Fairview’s sleep program actively tracks PAP usage and outcomes and employs tele-health coaching to improve adherence. This approach has achieved 6-month adherence rates of 71%. However, this protocol is applied uniformly and is labor-intensive,...
Conference Paper
Sleep apnea is a growing problem in the country, with over 200,000 new cases being identified each year. Continuous positive airway pressure (CPAP) is the best treatment for obstructive sleep apnea (OSA), but is limited by low adherence to treatment. Fairview's Sleep program actively tracks CPAP usage and outcomes and employs tele-health coaching t...
Conference Paper
Obesity is one of the major health risk factors behind the rise of non-communicable conditions. Understanding the factors influencing obesity is very complex since there are many variables that can affect the health behaviors leading to it. Nowadays, multiple data sources can be used to study health behaviors, such as wearable sensors for physical...
Article
Full-text available
Physical activity and sleep play a major role in the prevention and management of many chronic conditions. It is not a trivial task to understand their impact on chronic conditions. Currently, data from electronic health records (EHRs), sleep lab studies, and activity/sleep logs are used. The rapid increase in the popularity of wearable health devi...
Conference Paper
Full-text available
Multilayer networks have been the subject of intense research in the recent years in different applications. However, in urban mobility, the multi-layer nature of transportation systems has been generally ignored, even though most large cities are spanned by more than one transportation system. These different modes of transport have usually been s...
Conference Paper
Full-text available
In an effort to curb air pollution, the city of Delhi (India), known to be one of the most populated, polluted, and congested cities in the world has run a trial experiment in two phases of 15 days intervals. During the experiment, most of four-wheeled vehicles were constrained to move on alternate days based on whether their plate numbers ended wi...
Conference Paper
Full-text available
Ubiquitous use of social media such as microblog-ging platforms brings about ample opportunities for the false information to diffuse online. It is very important not just to determine the veracity of information but also the authenticity of the users who spread the information, especially in time-critical situations like real-world emergencies, wh...
Article
Full-text available
Assigning relevant keywords to documents is very important for efficient retrieval, clustering and management of the documents. Especially with the web corpus deluged with digital documents, automation of this task is of prime importance. Keyword assignment is a broad topic of research which refers to tagging of document with keywords, key-phrases...
Article
Full-text available
Sleep and physical activity are human behaviors that play a major role in our health. Poor sleep or lack of physical activity have been found to increase health risks and reduce quality of life. The rapid adoption and evolution of pervasive computing systems, both in the health and wellness domain, are creating a new data-intensive context in which...
Article
Full-text available
Obesity is one of the major health risk factors be- hind the rise of non-communicable conditions. Understanding the factors influencing obesity is very complex since there are many variables that can affect the health behaviors leading to it. Nowadays, multiple data sources can be used to study health behaviors, such as wearable sensors for physica...
Article
Full-text available
Scientific datasets play a crucial role in data-driven research. While there are several repositories that curate public datasets, several more datasets and their usage is hidden in the research publications. Hence, discovering a relevant dataset for a research topic requires in-depth investigation of several publications, tracking dataset usage an...
Article
Full-text available
Background The explosion of consumer electronics and social media are facilitating the rise of the Quantified Self (QS) movement where millions of users are tracking various aspects of their daily life using social media, mobile technology, and wearable devices. Data from mobile phones, wearables and social media can facilitate a better understandi...
Article
Full-text available
Sleep is an important human behavior with a deep impact on quality of life. Inadequate sleep quality negatively affects both mental and physical well-being, and exacerbates many health problems such as diabetes, depression, cancer and obesity. Alarmingly, poor sleep is becoming a growing concern in our society. Increased efforts toward the developm...
Conference Paper
Full-text available
The precision matrix is the inverse of the covariance matrix. Estimating large sparse precision matrices is an interesting and a challenging problem in many fields of sciences, engineering, humanities and machine learning problems in general. Recent applications often encounter high dimensionality with a limited number of data points leading to a n...
Conference Paper
Full-text available
The study of small collaborations or teams is an important endeavor both in industry and academia. The social phenomena responsible for formation or evolution of such small groups is quite different from those for dyadic relations like friendship or large size guilds (or communities). In small groups when social actors collaborate for various tasks...
Article
Full-text available
This paper reports a detailed empirical study of interpersonal trust in a multi-relational online social network. This study addresses two main aspects of interpersonal trust: formation and reciprocation. Computational models developed, using multi-relational networks, for these processes provide interesting insights about online social interaction...
Article
Full-text available
Trust is an important factor, particularly in viral/social advertising, and computing trust scores for individual users of a social network is crucial for several applications in the advertising research and practice. However, research on trust in the advertising field has been limited, and the application of computational trust to advertising rese...

Network

Cited By