-
Epidemic paradox induced by awareness driven network dynamics
Authors:
Csegő Balázs Kolok,
Gergely Ódor,
Dániel Keliger,
Márton Karsai
Abstract:
We study stationary epidemic processes in scale-free networks with local awareness behavior adopted by only susceptible, only infected, or all nodes. We find that while the epidemic size in the susceptible-aware and the all-aware scenarios scales linearly with the network size, the scaling becomes sublinear in the infected-aware scenario, suggesting that fewer aware nodes may reduce the epidemic s…
▽ More
We study stationary epidemic processes in scale-free networks with local awareness behavior adopted by only susceptible, only infected, or all nodes. We find that while the epidemic size in the susceptible-aware and the all-aware scenarios scales linearly with the network size, the scaling becomes sublinear in the infected-aware scenario, suggesting that fewer aware nodes may reduce the epidemic size more effectively. We explain this paradox via numerical and theoretical analysis, and highlight the role of influential nodes and their disassortativity to raise awareness in epidemic scenarios.
△ Less
Submitted 2 September, 2024;
originally announced September 2024.
-
A Comparative Analysis of Wealth Index Predictions in Africa between three Multi-Source Inference Models
Authors:
Márton Karsai,
János Kertész,
Lisette Espín-Noboa
Abstract:
Poverty map inference is a critical area of research, with growing interest in both traditional and modern techniques, ranging from regression models to convolutional neural networks applied to tabular data, images, and networks. Despite extensive focus on the validation of training phases, the scrutiny of final predictions remains limited. Here, we compare the Relative Wealth Index (RWI) inferred…
▽ More
Poverty map inference is a critical area of research, with growing interest in both traditional and modern techniques, ranging from regression models to convolutional neural networks applied to tabular data, images, and networks. Despite extensive focus on the validation of training phases, the scrutiny of final predictions remains limited. Here, we compare the Relative Wealth Index (RWI) inferred by Chi et al. (2022) with the International Wealth Index (IWI) inferred by Lee and Braithwaite (2022) and Espín-Noboa et al. (2023) across six Sub-Saharan African countries. Our analysis focuses on identifying trends and discrepancies in wealth predictions over time. Our results show that the predictions by Chi et al. and Espín-Noboa et al. align with general GDP trends, with differences expected due to the distinct time-frames of the training sets. However, predictions by Lee and Braithwaite diverge significantly, indicating potential issues with the validity of the model. These discrepancies highlight the need for policymakers and stakeholders in Africa to rigorously audit models that predict wealth, especially those used for decision-making on the ground. These and other techniques require continuous verification and refinement to enhance their reliability and ensure that poverty alleviation strategies are well-founded.
△ Less
Submitted 4 September, 2024; v1 submitted 2 August, 2024;
originally announced August 2024.
-
Distinguishing mechanisms of social contagion from local network view
Authors:
Elsa Andres,
Gergely Ódor,
Iacopo Iacopini,
Márton Karsai
Abstract:
The adoption of individual behavioural patterns is largely determined by stimuli arriving from peers via social interactions or from external sources. Based on these influences, individuals are commonly assumed to follow simple or complex adoption rules, inducing social contagion processes. In reality, multiple adoption rules may coexist even within the same social contagion process, introducing a…
▽ More
The adoption of individual behavioural patterns is largely determined by stimuli arriving from peers via social interactions or from external sources. Based on these influences, individuals are commonly assumed to follow simple or complex adoption rules, inducing social contagion processes. In reality, multiple adoption rules may coexist even within the same social contagion process, introducing additional complexity into the spreading phenomena. Our goal is to understand whether coexisting adoption mechanisms can be distinguished from a microscopic view, at the egocentric network level, without requiring global information about the underlying network, or the unfolding spreading process. We formulate this question as a classification problem, and study it through a Bayesian likelihood approach and with random forest classifiers in various synthetic and data-driven experiments. This study offers a novel perspective on the observations of propagation processes at the egocentric level and a better understanding of landmark contagion mechanisms from a local view.
△ Less
Submitted 27 June, 2024; v1 submitted 26 June, 2024;
originally announced June 2024.
-
Epidemic-induced local awareness behavior inferred from surveys and genetic sequence data
Authors:
Gergely Ódor,
Márton Karsai
Abstract:
Behavior-disease models suggest that if individuals are aware and take preventive actions when the prevalence of the disease increases among their close contacts, then the pandemic can be contained in a cost-effective way. To measure the true impact of local awareness behavior on epidemic spreading, we propose an efficient approach to identify superspreading events and assign corresponding Event C…
▽ More
Behavior-disease models suggest that if individuals are aware and take preventive actions when the prevalence of the disease increases among their close contacts, then the pandemic can be contained in a cost-effective way. To measure the true impact of local awareness behavior on epidemic spreading, we propose an efficient approach to identify superspreading events and assign corresponding Event Containment Scores (ECSs) in clinical genetic sequence data. We validate ECS as a measure of local awareness in simulation experiments, and we find that ECS was correlated positively with policy stringency during the COVID-19 pandemic. Finally, we observe a temporary drop in ECS during the Omicron wave in most European countries, matching a survey experiment we carried out at the same time. Our findings bring important insight into the field of awareness modeling through the analysis of large-scale genetic sequence data, one of the most promising data sources in epidemics research.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Human Mobility in the Metaverse
Authors:
Kishore Vasan,
Marton Karsai,
Albert-Laszlo Barabasi
Abstract:
The metaverse promises a shift in the way humans interact with each other, and with their digital and physical environments. The lack of geographical boundaries and travel costs in the metaverse prompts us to ask if the fundamental laws that govern human mobility in the physical world apply. We collected data on avatar movements, along with their network mobility extracted from NFT purchases. We f…
▽ More
The metaverse promises a shift in the way humans interact with each other, and with their digital and physical environments. The lack of geographical boundaries and travel costs in the metaverse prompts us to ask if the fundamental laws that govern human mobility in the physical world apply. We collected data on avatar movements, along with their network mobility extracted from NFT purchases. We find that despite the absence of commuting costs, an individuals inclination to explore new locations diminishes over time, limiting movement to a small fraction of the metaverse. We also find a lack of correlation between land prices and visitation, a deviation from the patterns characterizing the physical world. Finally, we identify the scaling laws that characterize meta mobility and show that we need to add preferential selection to the existing models to explain quantitative patterns of metaverse mobility. Our ability to predict the characteristics of the emerging meta mobility network implies that the laws governing human mobility are rooted in fundamental patterns of human dynamics, rather than the nature of space and cost of movement.
△ Less
Submitted 3 April, 2024;
originally announced April 2024.
-
Initialisation and Topology Effects in Decentralised Federated Learning
Authors:
Arash Badie-Modiri,
Chiara Boldrini,
Lorenzo Valerio,
János Kertész,
Márton Karsai
Abstract:
Fully decentralised federated learning enables collaborative training of individual machine learning models on distributed devices on a communication network while keeping the training data localised. This approach enhances data privacy and eliminates both the single point of failure and the necessity for central coordination. Our research highlights that the effectiveness of decentralised federat…
▽ More
Fully decentralised federated learning enables collaborative training of individual machine learning models on distributed devices on a communication network while keeping the training data localised. This approach enhances data privacy and eliminates both the single point of failure and the necessity for central coordination. Our research highlights that the effectiveness of decentralised federated learning is significantly influenced by the network topology of connected devices. We propose a strategy for uncoordinated initialisation of the artificial neural networks, which leverages the distribution of eigenvector centralities of the nodes of the underlying communication network, leading to a radically improved training efficiency. Additionally, our study explores the scaling behaviour and choice of environmental parameters under our proposed initialisation strategy. This work paves the way for more efficient and scalable artificial neural network training in a distributed and uncoordinated environment, offering a deeper understanding of the intertwining roles of network structure and learning dynamics.
△ Less
Submitted 22 May, 2024; v1 submitted 23 March, 2024;
originally announced March 2024.
-
Socioeconomic reorganization of communication and mobility networks in response to external shocks
Authors:
Ludovico Napoli,
Vedran Sekara,
Manuel García-Herranz,
Márton Karsai
Abstract:
Socioeconomic segregation patterns in networks usually evolve gradually, yet they can change abruptly in response to external shocks. The recent COVID-19 pandemic and the subsequent government policies induced several interruptions in societies, potentially disadvantaging the socioeconomically most vulnerable groups. Using large-scale digital behavioral observations as a natural laboratory, here w…
▽ More
Socioeconomic segregation patterns in networks usually evolve gradually, yet they can change abruptly in response to external shocks. The recent COVID-19 pandemic and the subsequent government policies induced several interruptions in societies, potentially disadvantaging the socioeconomically most vulnerable groups. Using large-scale digital behavioral observations as a natural laboratory, here we analyze how lockdown interventions lead to the reorganization of socioeconomic segregation patterns simultaneously in communication and mobility networks in Sierra Leone. We find that while segregation in mobility clearly increased during lockdown, the social communication network reorganized into a less segregated configuration as compared to reference periods. Moreover, due to differences in adaption capacities, the effects of lockdown policies varied across socioeconomic groups, leading to different or even opposite segregation patterns between the lower and higher socioeconomic classes. Such secondary effects of interventions need to be considered for better and more equitable policies.
△ Less
Submitted 22 December, 2023;
originally announced December 2023.
-
Coordination-free Decentralised Federated Learning on Complex Networks: Overcoming Heterogeneity
Authors:
Lorenzo Valerio,
Chiara Boldrini,
Andrea Passarella,
János Kertész,
Márton Karsai,
Gerardo Iñiguez
Abstract:
Federated Learning (FL) is a well-known framework for successfully performing a learning task in an edge computing scenario where the devices involved have limited resources and incomplete data representation. The basic assumption of FL is that the devices communicate directly or indirectly with a parameter server that centrally coordinates the whole process, overcoming several challenges associat…
▽ More
Federated Learning (FL) is a well-known framework for successfully performing a learning task in an edge computing scenario where the devices involved have limited resources and incomplete data representation. The basic assumption of FL is that the devices communicate directly or indirectly with a parameter server that centrally coordinates the whole process, overcoming several challenges associated with it. However, in highly pervasive edge scenarios, the presence of a central controller that oversees the process cannot always be guaranteed, and the interactions (i.e., the connectivity graph) between devices might not be predetermined, resulting in a complex network structure. Moreover, the heterogeneity of data and devices further complicates the learning process. This poses new challenges from a learning standpoint that we address by proposing a communication-efficient Decentralised Federated Learning (DFL) algorithm able to cope with them. Our solution allows devices communicating only with their direct neighbours to train an accurate model, overcoming the heterogeneity induced by data and different training histories. Our results show that the resulting local models generalise better than those trained with competing approaches, and do so in a more communication-efficient way.
△ Less
Submitted 7 December, 2023;
originally announced December 2023.
-
Mobility Segregation Dynamics and Residual Isolation During Pandemic Interventions
Authors:
Rafiazka Millanida Hilman,
Manuel García-Herranz,
Vedran Sekara,
Márton Karsai
Abstract:
External shocks embody an unexpected and disruptive impact on the regular life of people. This was the case during the COVID-19 outbreak that rapidly led to changes in the typical mobility patterns in urban areas. In response, people reorganised their daily errands throughout space. However, these changes might not have been the same across socioeconomic classes leading to possibile additional det…
▽ More
External shocks embody an unexpected and disruptive impact on the regular life of people. This was the case during the COVID-19 outbreak that rapidly led to changes in the typical mobility patterns in urban areas. In response, people reorganised their daily errands throughout space. However, these changes might not have been the same across socioeconomic classes leading to possibile additional detrimental effects on inequality due to the pandemic. In this paper we study the reorganisation of mobility segregation networks due to external shocks and show that the diversity of visited places in terms of locations and socioeconomic status is affected by the enforcement of mobility restriction during pandemic. We use the case of COVID-19 as a natural experiment in several cities to observe not only the effect of external shocks but also its mid-term consequences and residual effects. We build on anonymised and privacy-preserved mobility data in four cities: Bogota, Jakarta, London, and New York. We couple mobility data with socioeconomic information to capture inequalities in mobility among different socioeconomic groups and see how it changes dynamically before, during, and after different lockdown periods. We find that the first lockdowns induced considerable increases in mobility segregation in each city, while loosening mobility restrictions did not necessarily diminished isolation between different socioeconomic groups, as mobility mixing has not recovered fully to its pre-pandemic level even weeks after the interruption of interventions. Our results suggest that a one fits-all policy does not equally affect the way people adjust their mobility, which calls for socioeconomically informed intervention policies in the future.
△ Less
Submitted 5 October, 2023;
originally announced October 2023.
-
When Dialects Collide: How Socioeconomic Mixing Affects Language Use
Authors:
Thomas Louf,
José J. Ramasco,
David Sánchez,
Márton Karsai
Abstract:
The socioeconomic background of people and how they use standard forms of language are not independent, as demonstrated in various sociolinguistic studies. However, the extent to which these correlations may be influenced by the mixing of people from different socioeconomic classes remains relatively unexplored from a quantitative perspective. In this work we leverage geotagged tweets and transfer…
▽ More
The socioeconomic background of people and how they use standard forms of language are not independent, as demonstrated in various sociolinguistic studies. However, the extent to which these correlations may be influenced by the mixing of people from different socioeconomic classes remains relatively unexplored from a quantitative perspective. In this work we leverage geotagged tweets and transferable computational methods to map deviations from standard English on a large scale, in seven thousand administrative areas of England and Wales. We combine these data with high-resolution income maps to assign a proxy socioeconomic indicator to home-located users. Strikingly, across eight metropolitan areas we find a consistent pattern suggesting that the more different socioeconomic classes mix, the less interdependent the frequency of their departures from standard grammar and their income become. Further, we propose an agent-based model of linguistic variety adoption that sheds light on the mechanisms that produce the observations seen in the data.
△ Less
Submitted 19 July, 2023;
originally announced July 2023.
-
Temporal network compression via network hashing
Authors:
Rémi Vaudaine,
Pierre Borgnat,
Paulo Goncalves,
Rémi Gribonval,
Márton Karsai
Abstract:
Pairwise temporal interactions between entities can be represented as temporal networks, which code the propagation of processes such as epidemic spreading or information cascades, evolving on top of them. The largest outcome of these processes is directly linked to the structure of the underlying network. Indeed, a node of a network at given time cannot affect more nodes in the future than it can…
▽ More
Pairwise temporal interactions between entities can be represented as temporal networks, which code the propagation of processes such as epidemic spreading or information cascades, evolving on top of them. The largest outcome of these processes is directly linked to the structure of the underlying network. Indeed, a node of a network at given time cannot affect more nodes in the future than it can reach via time-respecting paths. This set of nodes reachable from a source defines an out-component, which identification is costly. In this paper, we propose an efficient matrix algorithm to tackle this issue and show that it outperforms other state-of-the-art methods. Secondly, we propose a hashing framework to coarsen large temporal networks into smaller proxies on which out-components are easier to estimate, and then recombined to obtain the initial components. Our graph hashing solution has implications in privacy respecting representation of temporal networks.
△ Less
Submitted 10 July, 2023;
originally announced July 2023.
-
Social inequalities that matter for contact patterns, vaccination, and the spread of epidemics
Authors:
Adriana Manna,
Júlia Koltai,
Márton Karsai
Abstract:
Individuals socio-demographic and economic characteristics crucially shape the spread of an epidemic by largely determining the exposure level to the virus and the severity of the disease for those who got infected. While the complex interplay between individual characteristics and epidemic dynamics is widely recognized, traditional mathematical models often overlook these factors. In this study,…
▽ More
Individuals socio-demographic and economic characteristics crucially shape the spread of an epidemic by largely determining the exposure level to the virus and the severity of the disease for those who got infected. While the complex interplay between individual characteristics and epidemic dynamics is widely recognized, traditional mathematical models often overlook these factors. In this study, we examine two important aspects of human behavior relevant to epidemics: contact patterns and vaccination uptake. Using data collected during the Covid-19 pandemic in Hungary, we first identify the dimensions along which individuals exhibit the greatest variation in their contact patterns and vaccination attitudes. We find that generally privileged groups of the population have higher number of contact and a higher vaccination uptake with respect to disadvantaged groups. Subsequently, we propose a data-driven epidemiological model that incorporates these behavioral differences. Finally, we apply our model to analyze the fourth wave of Covid-19 in Hungary, providing valuable insights into real-world scenarios. By bridging the gap between individual characteristics and epidemic spread, our research contributes to a more comprehensive understanding of disease dynamics and informs effective public health strategies.
△ Less
Submitted 10 July, 2023;
originally announced July 2023.
-
Detecting periodic time scales in temporal networks
Authors:
Elsa Andres,
Alain Barrat,
Márton Karsai
Abstract:
Temporal networks are commonly used to represent dynamical complex systems like social networks, simultaneous firing of neurons, human mobility or public transportation. Their dynamics may evolve on multiple time scales characterising for instance periodic activity patterns or structural changes. The detection of these time scales can be challenging from the direct observation of simple dynamical…
▽ More
Temporal networks are commonly used to represent dynamical complex systems like social networks, simultaneous firing of neurons, human mobility or public transportation. Their dynamics may evolve on multiple time scales characterising for instance periodic activity patterns or structural changes. The detection of these time scales can be challenging from the direct observation of simple dynamical network properties like the activity of nodes or the density of links. Here we propose two new methods, which rely on already established static representations of temporal networks, namely supra-adjacency matrices and temporal event graphs. We define dissimilarity metrics extracted from these representations and compute their Fourier Transform to effectively identify dominant periodic time scales characterising the original temporal network. We demonstrate our methods using synthetic and real-world data sets describing various kinds of temporal networks. We find that while in all cases the two methods outperform the reference measures, the supra-adjacency based method identifies more easily periodic changes in network density, while the temporal event graph based method is better suited to detect periodic changes in the group structure of the network. Our methodology may provide insights into different phenomena occurring at multiple time-scales in systems represented by temporal networks.
△ Less
Submitted 7 July, 2023;
originally announced July 2023.
-
Are machine learning technologies ready to be used for humanitarian work and development?
Authors:
Vedran Sekara,
Márton Karsai,
Esteban Moro,
Dohyung Kim,
Enrique Delamonica,
Manuel Cebrian,
Miguel Luengo-Oroz,
Rebeca Moreno Jiménez,
Manuel Garcia-Herranz
Abstract:
Novel digital data sources and tools like machine learning (ML) and artificial intelligence (AI) have the potential to revolutionize data about development and can contribute to monitoring and mitigating humanitarian problems. The potential of applying novel technologies to solving some of humanity's most pressing issues has garnered interest outside the traditional disciplines studying and workin…
▽ More
Novel digital data sources and tools like machine learning (ML) and artificial intelligence (AI) have the potential to revolutionize data about development and can contribute to monitoring and mitigating humanitarian problems. The potential of applying novel technologies to solving some of humanity's most pressing issues has garnered interest outside the traditional disciplines studying and working on international development. Today, scientific communities in fields like Computational Social Science, Network Science, Complex Systems, Human Computer Interaction, Machine Learning, and the broader AI field are increasingly starting to pay attention to these pressing issues. However, are sophisticated data driven tools ready to be used for solving real-world problems with imperfect data and of staggering complexity? We outline the current state-of-the-art and identify barriers, which need to be surmounted in order for data-driven technologies to become useful in humanitarian and development contexts. We argue that, without organized and purposeful efforts, these new technologies risk at best falling short of promised goals, at worst they can increase inequality, amplify discrimination, and infringe upon human rights.
△ Less
Submitted 4 July, 2023;
originally announced July 2023.
-
Generalized contact matrices for epidemic modeling
Authors:
Adriana Manna,
Lorenzo Dall'Amico,
Michele Tizzoni,
Marton Karsai,
Nicola Perra
Abstract:
Contact matrices have become a key ingredient of modern epidemic models. They account for the stratification of contacts for the age of individuals and, in some cases, the context of their interactions. However, age and context are not the only factors shaping contact structures and affecting the spreading of infectious diseases. Socio-economic status (SES) variables such as wealth, ethnicity, and…
▽ More
Contact matrices have become a key ingredient of modern epidemic models. They account for the stratification of contacts for the age of individuals and, in some cases, the context of their interactions. However, age and context are not the only factors shaping contact structures and affecting the spreading of infectious diseases. Socio-economic status (SES) variables such as wealth, ethnicity, and education play a major role as well. Here, we introduce generalized contact matrices capable of stratifying contacts across any number of dimensions including any SES variable. We derive an analytical expression for the basic reproductive number of an infectious disease unfolding on a population characterized by such generalized contact matrices. Our results, on both synthetic and real data, show that disregarding higher levels of stratification might lead to the under-estimation of the reproductive number and to a mis-estimation of the global epidemic dynamics. Furthermore, including generalized contact matrices allows for more expressive epidemic models able to capture heterogeneities in behaviours such as different levels of adoption of non-pharmaceutical interventions across different groups. Overall, our work contributes to the literature attempting to bring socio-economic, as well as other dimensions, to the forefront of epidemic modeling. Tackling this issue is crucial for developing more precise descriptions of epidemics, and thus to design better strategies to contain them.
△ Less
Submitted 29 June, 2023;
originally announced June 2023.
-
The temporal dynamics of group interactions in higher-order social networks
Authors:
Iacopo Iacopini,
Márton Karsai,
Alain Barrat
Abstract:
Representing social systems as networks, starting from the interactions between individuals, sheds light on the mechanisms governing their dynamics. However, networks encode only pairwise interactions, while most social interactions occur among groups of individuals, requiring higher-order network representations. Despite the recent interest in higher-order networks, little is known about the mech…
▽ More
Representing social systems as networks, starting from the interactions between individuals, sheds light on the mechanisms governing their dynamics. However, networks encode only pairwise interactions, while most social interactions occur among groups of individuals, requiring higher-order network representations. Despite the recent interest in higher-order networks, little is known about the mechanisms that govern the formation and evolution of groups, and how people move between groups. Here, we leverage empirical data on social interactions among children and university students to study their temporal dynamics at both individual and group levels, characterising how individuals navigate groups and how groups form and disaggregate. We find robust patterns across contexts and propose a dynamical model that closely reproduces empirical observations. These results represent a further step in understanding social systems, and open up research directions to study the impact of group dynamics on dynamical processes that evolve on top of them.
△ Less
Submitted 9 July, 2024; v1 submitted 16 June, 2023;
originally announced June 2023.
-
Interpreting wealth distribution via poverty map inference using multimodal data
Authors:
Lisette Espín-Noboa,
János Kertész,
Márton Karsai
Abstract:
Poverty maps are essential tools for governments and NGOs to track socioeconomic changes and adequately allocate infrastructure and services in places in need. Sensor and online crowd-sourced data combined with machine learning methods have provided a recent breakthrough in poverty map inference. However, these methods do not capture local wealth fluctuations, and are not optimized to produce acco…
▽ More
Poverty maps are essential tools for governments and NGOs to track socioeconomic changes and adequately allocate infrastructure and services in places in need. Sensor and online crowd-sourced data combined with machine learning methods have provided a recent breakthrough in poverty map inference. However, these methods do not capture local wealth fluctuations, and are not optimized to produce accountable results that guarantee accurate predictions to all sub-populations. Here, we propose a pipeline of machine learning models to infer the mean and standard deviation of wealth across multiple geographically clustered populated places, and illustrate their performance in Sierra Leone and Uganda. These models leverage seven independent and freely available feature sources based on satellite images, and metadata collected via online crowd-sourcing and social media. Our models show that combined metadata features are the best predictors of wealth in rural areas, outperforming image-based models, which are the best for predicting the highest wealth quintiles. Our results recover the local mean and variation of wealth, and correctly capture the positive yet non-monotonous correlation between them. We further demonstrate the capabilities and limitations of model transfer across countries and the effects of data recency and other biases. Our methodology provides open tools to build towards more transparent and interpretable models to help governments and NGOs to make informed decisions based on data availability, urbanization level, and poverty thresholds.
△ Less
Submitted 6 April, 2023; v1 submitted 17 February, 2023;
originally announced February 2023.
-
Real-time estimation of the effective reproduction number of COVID-19 from behavioral data
Authors:
Eszter Bokányi,
Zsolt Vizi,
Júlia Koltai,
Gergely Röst,
Márton Karsai
Abstract:
Near-real time estimations of the effective reproduction number are among the most important tools to track the progression of a pandemic and to inform policy makers and the general public. However, these estimations rely on reported case numbers, commonly recorded with significant biases. The epidemic outcome is strongly influenced by the dynamics of social contacts, which are neglected in conven…
▽ More
Near-real time estimations of the effective reproduction number are among the most important tools to track the progression of a pandemic and to inform policy makers and the general public. However, these estimations rely on reported case numbers, commonly recorded with significant biases. The epidemic outcome is strongly influenced by the dynamics of social contacts, which are neglected in conventional surveillance systems as their real-time observation is challenging. Here, we propose a concept using online and offline behavioral data, recording age-stratified contact matrices at a daily rate. Modeling the epidemic using the reconstructed matrices we dynamically estimate the effective reproduction number during the two first waves of the COVID-19 pandemic in Hungary. Our results demonstrate how behavioral data can be used to build alternative monitoring systems complementing the established public health surveillance. They can identify and provide better signals during periods when official estimates appear unreliable due to observational biases.
△ Less
Submitted 21 July, 2022;
originally announced July 2022.
-
Understanding hesitancy with revealed preferences across COVID-19 vaccine types
Authors:
Kristóf Kutasi,
Júlia Koltai,
Ágnes Szabó-Morvai,
Gergely Röst,
Márton Karsai,
Péter Biró,
Balázs Lengyel
Abstract:
Many countries have secured larger quantities of COVID-19 vaccines than their populace is willing to take. This abundance and variety of vaccines created a historical moment to understand vaccine hesitancy better. Never before were more types of vaccines available for an illness and the intensity of vaccine-related public discourse is unprecedented. Yet, the heterogeneity of hesitancy by vaccine t…
▽ More
Many countries have secured larger quantities of COVID-19 vaccines than their populace is willing to take. This abundance and variety of vaccines created a historical moment to understand vaccine hesitancy better. Never before were more types of vaccines available for an illness and the intensity of vaccine-related public discourse is unprecedented. Yet, the heterogeneity of hesitancy by vaccine types has been neglected so far, even though factual or believed vaccine characteristics and patient attributes are known to influence acceptance. We address this problem by analysing acceptance and assessment of five vaccine types using information collected with a nationally representative survey at the end of the third wave of the COVID-19 pandemic in Hungary, where a unique portfolio of vaccines were available to the public in large quantities. Our special case enables us to quantify revealed preferences across vaccine types since one could evaluate a vaccine unacceptable and even could reject an assigned vaccine to wait for another type. We find that the source of information that respondents trust characterizes their attitudes towards vaccine types differently and leads to divergent vaccine hesitancy. Believers of conspiracy theories were significantly more likely to evaluate the mRNA vaccines (Pfizer and Moderna) unacceptable while those who follow the advice of politicians evaluate vector-based (AstraZeneca and Sputnik) or whole-virus vaccines (Sinopharm) acceptable with higher likelihood. We illustrate that the rejection of non-desired and re-selection of preferred vaccines fragments the population by the mRNA versus other type of vaccines while it generally improves the assessment of the received vaccine. These results highlight that greater variance of available vaccine types and individual free choice are desirable conditions that can widen the acceptance of vaccines in societies.
△ Less
Submitted 4 January, 2022; v1 submitted 11 November, 2021;
originally announced November 2021.
-
Directed Percolation in Random Temporal Network Models with Heterogeneities
Authors:
Arash Badie-Modiri,
Abbas K. Rizi,
Márton Karsai,
Mikko Kivelä
Abstract:
The event graph representation of temporal networks suggests that the connectivity of temporal structures can be mapped to a directed percolation problem. However, similar to percolation theory on static networks, this mapping is valid under the approximation that the structure and interaction dynamics of the temporal network are determined by its local properties, and otherwise, it is maximally r…
▽ More
The event graph representation of temporal networks suggests that the connectivity of temporal structures can be mapped to a directed percolation problem. However, similar to percolation theory on static networks, this mapping is valid under the approximation that the structure and interaction dynamics of the temporal network are determined by its local properties, and otherwise, it is maximally random. We challenge these conditions and demonstrate the robustness of this mapping in case of more complicated systems. We systematically analyze random and regular network topologies and heterogeneous link-activation processes driven by bursty renewal or self-exciting processes using numerical simulation and finite-size scaling methods. We find that the critical percolation exponents characterizing the temporal network are not sensitive to many structural and dynamical network heterogeneities, while they recover known scaling exponents characterizing directed percolation on low dimensional lattices. While it is not possible to demonstrate the validity of this mapping for all temporal network models, our results establish the first batch of evidence supporting the robustness of the scaling relationships in the limited-time reachability of temporal networks.
△ Less
Submitted 11 June, 2023; v1 submitted 14 October, 2021;
originally announced October 2021.
-
Socioeconomic biases in urban mixing patterns of US metropolitan areas
Authors:
Rafiazka Millanida Hilman,
Gerardo Iñiguez,
Márton Karsai
Abstract:
Urban areas serve as melting pots of people with diverse socioeconomic backgrounds, who may not only be segregated but have characteristic mobility patterns in the city. While mobility is driven by individual needs and preferences, the specific choice of venues to visit is usually constrained by the socioeconomic status of people. The complex interplay between people and places they visit, given t…
▽ More
Urban areas serve as melting pots of people with diverse socioeconomic backgrounds, who may not only be segregated but have characteristic mobility patterns in the city. While mobility is driven by individual needs and preferences, the specific choice of venues to visit is usually constrained by the socioeconomic status of people. The complex interplay between people and places they visit, given their personal attributes and homophily leaning, is a key mechanism behind the emergence of socioeconomic stratification patterns ultimately leading to urban segregation at large. Here we investigate mixing patterns of mobility in the twenty largest cities of the United States by coupling individual check-in data from the social location platform Foursquare with census information from the American Community Survey. We find strong signs of stratification indicating that people mostly visit places in their own socioeconomic class, occasionally visiting locations from higher classes. The intensity of this `upwards bias' increases with socioeconomic status and correlates with standard measures of racial residential segregation. Our results indicate an even stronger socioeconomic segregation in individual mobility than one would expect from system-level distributions, shedding further light on uneven mobility mixing patterns in cities.
△ Less
Submitted 8 October, 2021;
originally announced October 2021.
-
Directed Percolation in Temporal Networks
Authors:
Arash Badie-Modiri,
Abbas K. Rizi,
Márton Karsai,
Mikko Kivelä
Abstract:
Connectivity and reachability on temporal networks, which can describe the spreading of a disease, decimation of information or the accessibility of a public transport system over time, have been among the main contemporary areas of study in complex systems for the last decade. However, while isotropic percolation theory successfully describes connectivity in static networks, a similar description…
▽ More
Connectivity and reachability on temporal networks, which can describe the spreading of a disease, decimation of information or the accessibility of a public transport system over time, have been among the main contemporary areas of study in complex systems for the last decade. However, while isotropic percolation theory successfully describes connectivity in static networks, a similar description has not been yet developed for temporal networks. Here address this problem and formalize a mapping of the concept of temporal network reachability to percolation theory. We show that the limited-waiting-time reachability, a generic notion of constrained connectivity in temporal networks, displays directed percolation phase transition in connectivity. Consequently, the critical percolation properties of spreading processes on temporal networks can be estimated by a set of known exponents characterising the directed percolation universality class. This result is robust across a diverse set of temporal network models with different temporal and topological heterogeneities, while by using our methodology we uncover similar reachability phase transitions in real temporal networks too. These findings open up an avenue to apply theory, concepts and methodology from the well-developed directed percolation literature to temporal networks.
△ Less
Submitted 11 June, 2023; v1 submitted 3 July, 2021;
originally announced July 2021.
-
Switchover phenomenon induced by epidemic seeding on geometric networks
Authors:
Gergely Ódor,
Domonkos Czifra,
Júlia Komjáthy,
László Lovász,
Márton Karsai
Abstract:
It is a fundamental question in disease modelling how the initial seeding of an epidemic, spreading over a network, determines its final outcome. Research in this topic has primarily concentrated on finding the seed configuration which infects the most individuals. Although these optimal configurations give insight into how the initial state affects the outcome of an epidemic, they are unlikely to…
▽ More
It is a fundamental question in disease modelling how the initial seeding of an epidemic, spreading over a network, determines its final outcome. Research in this topic has primarily concentrated on finding the seed configuration which infects the most individuals. Although these optimal configurations give insight into how the initial state affects the outcome of an epidemic, they are unlikely to occur in real life. In this paper we identify two important seeding scenarios, both motivated by historical data, that reveal a new complex phenomenon. In one scenario, the seeds are concentrated on the central nodes of a network, while in the second, they are spread uniformly in the population. Comparing the final size of the epidemic started from these two initial conditions through data-driven and synthetic simulations on real and modelled geometric metapopulation networks, we find evidence for a switchover phenomenon: When the basic reproduction number $R_0$ is close to its critical value, more individuals become infected in the first seeding scenario, but for larger values of $R_0$, the second scenario is more dangerous. We find that the switchover phenomenon is amplified by the geometric nature of the underlying network, and confirm our results via mathematically rigorous proofs, by mapping the network epidemic processes to bond percolation. Our results expand on the previous finding that in case of a single seed, the first scenario is always more dangerous, and further our understanding why the sizes of consecutive waves can differ even if their epidemic characters are similar.
△ Less
Submitted 30 June, 2021;
originally announced June 2021.
-
Mapping urban socioeconomic inequalities in developing countries through Facebook advertising data
Authors:
Serena Giurgola,
Simone Piaggesi,
Márton Karsai,
Yelena Mejova,
André Panisson,
Michele Tizzoni
Abstract:
Ending poverty in all its forms everywhere is the number one Sustainable Development Goal of the UN 2030 Agenda. To monitor the progress towards such an ambitious target, reliable, up-to-date and fine-grained measurements of socioeconomic indicators are necessary. When it comes to socioeconomic development, novel digital traces can provide a complementary data source to overcome the limits of trad…
▽ More
Ending poverty in all its forms everywhere is the number one Sustainable Development Goal of the UN 2030 Agenda. To monitor the progress towards such an ambitious target, reliable, up-to-date and fine-grained measurements of socioeconomic indicators are necessary. When it comes to socioeconomic development, novel digital traces can provide a complementary data source to overcome the limits of traditional data collection methods, which are often not regularly updated and lack adequate spatial resolution. In this study, we collect publicly available and anonymous advertising audience estimates from Facebook to predict socioeconomic conditions of urban residents, at a fine spatial granularity, in four large urban areas: Atlanta (USA), Bogotá (Colombia), Santiago (Chile), and Casablanca (Morocco). We find that behavioral attributes inferred from the Facebook marketing platform can accurately map the socioeconomic status of residential areas within cities, and that predictive performance is comparable in both high and low-resource settings. We also show that training a model on attributes of adult Facebook users, aged more than 25, leads to a more accurate mapping of socioeconomic conditions in all cities. Our work provides additional evidence of the value of social advertising media data to measure human development.
△ Less
Submitted 28 May, 2021;
originally announced May 2021.
-
Universal role of commuting in the reduction of social assortativity in cities
Authors:
Eszter Bokányi,
Sándor Juhász,
Márton Karsai,
Balázs Lengyel
Abstract:
Millions commute to work every day in cities and interact with colleagues, customers, providers, friends, and strangers. Commuting facilitates the mixing of people from distant and diverse neighborhoods, but whether this has an imprint on social inclusion or instead, connections remain assortative is less explored. In this paper, we aim to better understand income sorting in social networks inside…
▽ More
Millions commute to work every day in cities and interact with colleagues, customers, providers, friends, and strangers. Commuting facilitates the mixing of people from distant and diverse neighborhoods, but whether this has an imprint on social inclusion or instead, connections remain assortative is less explored. In this paper, we aim to better understand income sorting in social networks inside cities and investigate how commuting distance conditions the online social ties of Twitter users in the 50 largest metropolitan areas of the United States. Home and work locations are identified from geolocated tweets that enable us to infer the socio-economic status of individuals. Our results show that an above-median commuting distance in cities is associated with more diverse individual networks in terms of connected peers and their income. The degree that distant commutes link neighborhoods of different socio-economic backgrounds greatly varies by city size and structure. However, we find that above-median commutes are associated with a nearly uniform, moderate reduction of social tie assortativity across the top 50 US cities suggesting a universal role of commuting in integrating disparate social networks in cities. Our results inform policy that facilitating access across distant neighborhoods can advance the social inclusion of low-income groups.
△ Less
Submitted 14 October, 2021; v1 submitted 4 May, 2021;
originally announced May 2021.
-
Monitoring behavioural responses during pandemic via reconstructed contact matrices from online and representative surveys
Authors:
Júlia Koltai,
Orsolya Vásárhelyi,
Gergely Röst,
Márton Karsai
Abstract:
The unprecedented behavioural responses of societies have been evidently shaping the COVID-19 pandemic, yet it is a significant challenge to accurately monitor the continuously changing social mixing patterns in real-time. Contact matrices, usually stratified by age, summarise interaction motifs efficiently, but their collection relies on conventional representative survey techniques, which are ex…
▽ More
The unprecedented behavioural responses of societies have been evidently shaping the COVID-19 pandemic, yet it is a significant challenge to accurately monitor the continuously changing social mixing patterns in real-time. Contact matrices, usually stratified by age, summarise interaction motifs efficiently, but their collection relies on conventional representative survey techniques, which are expensive and slow to obtain. Here we report a data collection effort involving over $2.3\%$ of the Hungarian population to simultaneously record contact matrices through a longitudinal online and sequence of representative phone surveys. To correct non-representative biases characterising the online data, by using census data and the representative samples we develop a reconstruction method to provide a scalable, cheap, and flexible way to dynamically obtain closer-to-representative contact matrices. Our results demonstrate the potential of combined online-offline data collections to understand the changing behavioural responses determining the future evolution of the outbreak, and inform epidemic models with crucial data.
△ Less
Submitted 22 February, 2021; v1 submitted 17 February, 2021;
originally announced February 2021.
-
Temporal properties of higher-order interactions in social networks
Authors:
Giulia Cencetti,
Federico Battiston,
Bruno Lepri,
Márton Karsai
Abstract:
Human social interactions in local settings can be experimentally detected by recording the physical proximity and orientation of people. Such interactions, approximating face-to-face communications, can be effectively represented as time varying social networks with links being unceasingly created and destroyed over time. Traditional analyses of temporal networks have addressed mostly pairwise in…
▽ More
Human social interactions in local settings can be experimentally detected by recording the physical proximity and orientation of people. Such interactions, approximating face-to-face communications, can be effectively represented as time varying social networks with links being unceasingly created and destroyed over time. Traditional analyses of temporal networks have addressed mostly pairwise interactions, where links describe dyadic connections among individuals. However, many network dynamics are hardly ascribable to pairwise settings but often comprise larger groups, which are better described by higher-order interactions. Here we investigate the higher-order organizations of temporal social networks by analyzing three publicly available datasets collected in different social settings. We find that higher-order interactions are ubiquitous and, similarly to their pairwise counterparts, characterized by heterogeneous dynamics, with bursty trains of rapidly recurring higher-order events separated by long periods of inactivity. We investigate the evolution and formation of groups by looking at the transition rates between different higher-order structures. We find that in more spontaneous social settings, group are characterized by slower formation and disaggregation, while in work settings these phenomena are more abrupt, possibly reflecting pre-organized social dynamics. Finally, we observe temporal reinforcement suggesting that the longer a group stays together the higher the probability that the same interaction pattern persist in the future. Our findings suggest the importance of considering the higher-order structure of social interactions when investigating human temporal dynamics.
△ Less
Submitted 7 October, 2020;
originally announced October 2020.
-
Dynamics of cascades on burstiness-controlled temporal networks
Authors:
Samuel Unicomb,
Gerardo Iñiguez,
James P. Gleeson,
Márton Karsai
Abstract:
Burstiness, the tendency of interaction events to be heterogeneously distributed in time, is critical to information diffusion in physical and social systems. However, an analytical framework capturing the effect of burstiness on generic dynamics is lacking. We develop a master equation formalism to study cascades on temporal networks with burstiness modelled by renewal processes. Supported by num…
▽ More
Burstiness, the tendency of interaction events to be heterogeneously distributed in time, is critical to information diffusion in physical and social systems. However, an analytical framework capturing the effect of burstiness on generic dynamics is lacking. We develop a master equation formalism to study cascades on temporal networks with burstiness modelled by renewal processes. Supported by numerical and data-driven simulations, we describe the interplay between heterogeneous temporal interactions and models of threshold-driven and epidemic spreading. We find that increasing interevent time variance can both accelerate and decelerate spreading for threshold models, but can only decelerate epidemic spreading. When accounting for the skewness of different interevent time distributions, spreading times collapse onto a universal curve. Our framework uncovers a deep yet subtle connection between generic diffusion mechanisms and underlying temporal network structures that impacts on a broad class of networked phenomena, from spin interactions to epidemic contagion and language dynamics.
△ Less
Submitted 13 July, 2020;
originally announced July 2020.
-
Socioeconomic correlations of urban patterns inferred from aerial images: interpreting activation maps of Convolutional Neural Networks
Authors:
Jacob Levy Abitbol,
Márton Karsai
Abstract:
Urbanisation is a great challenge for modern societies, promising better access to economic opportunities while widening socioeconomic inequalities. Accurately tracking how this process unfolds has been challenging for traditional data collection methods, while remote sensing information offers an alternative to gather a more complete view on these societal changes. By feeding a neural network wit…
▽ More
Urbanisation is a great challenge for modern societies, promising better access to economic opportunities while widening socioeconomic inequalities. Accurately tracking how this process unfolds has been challenging for traditional data collection methods, while remote sensing information offers an alternative to gather a more complete view on these societal changes. By feeding a neural network with satellite images one may recover the socioeconomic information associated to that area, however these models lack to explain how visual features contained in a sample, trigger a given prediction. Here we close this gap by predicting socioeconomic status across France from aerial images and interpreting class activation mappings in terms of urban topology. We show that the model disregards the spatial correlations existing between urban class and socioeconomic status to derive its predictions. These results pave the way to build interpretable models, which may help to better track and understand urbanisation and its consequences.
△ Less
Submitted 10 April, 2020;
originally announced April 2020.
-
Bridging the gap between graphs and networks
Authors:
Gerardo Iñiguez,
Federico Battiston,
Márton Karsai
Abstract:
Network science has become a powerful tool to describe the structure and dynamics of real-world complex physical, biological, social, and technological systems. Largely built on empirical observations to tackle heterogeneous, temporal, and adaptive patterns of interactions, its intuitive and flexible nature has contributed to the popularity of the field. With pioneering work on the evolution of ra…
▽ More
Network science has become a powerful tool to describe the structure and dynamics of real-world complex physical, biological, social, and technological systems. Largely built on empirical observations to tackle heterogeneous, temporal, and adaptive patterns of interactions, its intuitive and flexible nature has contributed to the popularity of the field. With pioneering work on the evolution of random graphs, graph theory is often cited as the mathematical foundation of network science. Despite this narrative, the two research communities are still largely disconnected. In this Commentary we discuss the need for further cross-pollination between fields -- bridging the gap between graphs and networks -- and how network science can benefit from such influence. A more mathematical network science may clarify the role of randomness in modeling, hint at underlying laws of behavior, and predict yet unobserved complex networked phenomena in nature.
△ Less
Submitted 3 April, 2020;
originally announced April 2020.
-
Weighted temporal event graphs
Authors:
Jari Saramäki,
Mikko Kivelä,
Márton Karsai
Abstract:
The times of temporal-network events and their correlations contain information on the function of the network and they influence dynamical processes taking place on it. To extract information out of correlated event times, techniques such as the analysis of temporal motifs have been developed. We discuss a recently-introduced, more general framework that maps temporal-network structure into stati…
▽ More
The times of temporal-network events and their correlations contain information on the function of the network and they influence dynamical processes taking place on it. To extract information out of correlated event times, techniques such as the analysis of temporal motifs have been developed. We discuss a recently-introduced, more general framework that maps temporal-network structure into static graphs while retaining information on time-respecting paths and the time differences between their consequent events. This framework builds on weighted temporal event graphs: directed, acyclic graphs (DAGs) that contain a superposition of all temporal paths. We introduce the reader to the temporal event-graph mapping and associated computational methods and illustrate its use by applying the framework to temporal-network percolation.
△ Less
Submitted 9 December, 2019;
originally announced December 2019.
-
weg2vec: Event embedding for temporal networks
Authors:
Maddalena Torricelli,
Márton Karsai,
Laetitia Gauvin
Abstract:
Network embedding techniques are powerful to capture structural regularities in networks and to identify similarities between their local fabrics. However, conventional network embedding models are developed for static structures, commonly consider nodes only and they are seriously challenged when the network is varying in time. Temporal networks may provide an advantage in the description of real…
▽ More
Network embedding techniques are powerful to capture structural regularities in networks and to identify similarities between their local fabrics. However, conventional network embedding models are developed for static structures, commonly consider nodes only and they are seriously challenged when the network is varying in time. Temporal networks may provide an advantage in the description of real systems, but they code more complex information, which could be effectively represented only by a handful of methods so far. Here, we propose a new method of event embedding of temporal networks, called weg2vec, which builds on temporal and structural similarities of events to learn a low dimensional representation of a temporal network. This projection successfully captures latent structures and similarities between events involving different nodes at different times and provides ways to predict the final outcome of spreading processes unfolding on the temporal structure.
△ Less
Submitted 6 November, 2019;
originally announced November 2019.
-
Efficient limited-time reachability estimation in temporal networks
Authors:
Arash Badie-Modiri,
Márton Karsai,
Mikko Kivelä
Abstract:
Time-limited states characterise many dynamical processes on networks: disease infected individuals recover after some time, people forget news spreading on social networks, or passengers may not wait forever for a connection. These dynamics can be described as limited waiting-time processes, and they are particularly important for systems modelled as temporal networks. These processes have been s…
▽ More
Time-limited states characterise many dynamical processes on networks: disease infected individuals recover after some time, people forget news spreading on social networks, or passengers may not wait forever for a connection. These dynamics can be described as limited waiting-time processes, and they are particularly important for systems modelled as temporal networks. These processes have been studied via simulations, which is equivalent to repeatedly finding all limited-waiting time temporal paths from a source node and time. We propose a method yielding orders of magnitude more efficient way of tracking the reachability of such temporal paths. Our method gives simultaneous estimates of the in- or out-reachability (with any chosen waiting-time limit) from every possible starting point and time. It works on very large temporal networks with hundreds of millions of events on current commodity computing hardware. This opens up the possibility to analyse reachability and dynamics of spreading processes on large temporal networks in completely new ways. For example, one can now compute centralities based on global reachability for all events or can find with high probability the infected node and time, which would lead to the largest epidemic outbreak.
△ Less
Submitted 11 June, 2023; v1 submitted 30 August, 2019;
originally announced August 2019.
-
Interactional and Informational Attention on Twitter
Authors:
Agathe Baltzer,
Márton Karsai,
Camille Roth
Abstract:
Twitter may be considered as a decentralized social information processing platform whose users constantly receive their followees' information feeds, which they may in turn dispatch to their followers. This decentralization is not devoid of hierarchy and heterogeneity, both in terms of activity and attention. In particular, we appraise the distribution of attention at the collective and individua…
▽ More
Twitter may be considered as a decentralized social information processing platform whose users constantly receive their followees' information feeds, which they may in turn dispatch to their followers. This decentralization is not devoid of hierarchy and heterogeneity, both in terms of activity and attention. In particular, we appraise the distribution of attention at the collective and individual level, which exhibits the existence of attentional constraints and focus effects. We observe that most users usually concentrate their attention on a limited core of peers and topics, and discuss the relationship between interactional and informational attention processes -- all of which, we suggest, may be useful to refine influence models by enabling the consideration of differential attention likelihood depending on users, their activity levels and peers' positions.
△ Less
Submitted 18 July, 2019;
originally announced July 2019.
-
Computational Human Dynamics
Authors:
Márton Karsai
Abstract:
This thesis summarises my scientific contributions in the domain of network science, human dynamics and computational social science. These contributions are associated to computer science, physics, statistics, and applied mathematics. The goal of this thesis is twofold, on one hand to write a concise summary of my most interesting scientific contributions, and on the other hand to provide an up-t…
▽ More
This thesis summarises my scientific contributions in the domain of network science, human dynamics and computational social science. These contributions are associated to computer science, physics, statistics, and applied mathematics. The goal of this thesis is twofold, on one hand to write a concise summary of my most interesting scientific contributions, and on the other hand to provide an up-to-date view and perspective about my field. I start my dissertation with an introduction to position the reader on the landscape of my field and to put in perspective my contributions. In the second chapter I concentrate on my works on bursty human dynamics, addressing heterogeneous temporal characters of human actions and interactions. Next, I discuss my contributions to the field of temporal networks and give a synthesises of my works on various methods of the representation, characterisation, and modelling of time-varying structures. Finally, I discuss my works on the data-driven observations and modelling of collective social phenomena. There, I summarise studies on the static observations of emergent patterns of socioeconomic inequalities and their correlations with social-communication networks, and with linguistic patterns. I also discuss dynamic observations and modelling of social contagion processes.
△ Less
Submitted 18 July, 2019; v1 submitted 17 July, 2019;
originally announced July 2019.
-
Joint embedding of structure and features via graph convolutional networks
Authors:
Sébastien Lerique,
Jacob Levy Abitbol,
Márton Karsai
Abstract:
The creation of social ties is largely determined by the entangled effects of people's similarities in terms of individual characters and friends. However, feature and structural characters of people usually appear to be correlated, making it difficult to determine which has greater responsibility in the formation of the emergent network structure. We propose \emph{AN2VEC}, a node embedding method…
▽ More
The creation of social ties is largely determined by the entangled effects of people's similarities in terms of individual characters and friends. However, feature and structural characters of people usually appear to be correlated, making it difficult to determine which has greater responsibility in the formation of the emergent network structure. We propose \emph{AN2VEC}, a node embedding method which ultimately aims at disentangling the information shared by the structure of a network and the features of its nodes. Building on the recent developments of Graph Convolutional Networks (GCN), we develop a multitask GCN Variational Autoencoder where different dimensions of the generated embeddings can be dedicated to encoding feature information, network structure, and shared feature-network information. We explore the interaction between these disentangled characters by comparing the embedding reconstruction performance to a baseline case where no shared information is extracted. We use synthetic datasets with different levels of interdependency between feature and network characters and show (i) that shallow embeddings relying on shared information perform better than the corresponding reference with unshared information, (ii) that this performance gap increases with the correlation between network and feature structure, and (iii) that our embedding is able to capture joint information of structure and features. Our method can be relevant for the analysis and prediction of any featured network structure ranging from online social systems to network medicine.
△ Less
Submitted 29 October, 2019; v1 submitted 21 May, 2019;
originally announced May 2019.
-
Reentrant phase transitions in threshold driven contagion on multiplex networks
Authors:
Samuel Unicomb,
Gerardo Iñiguez,
János Kertész,
Márton Karsai
Abstract:
Models of threshold driven contagion explain the cascading spread of information, behavior, systemic risk, and epidemics on social, financial and biological networks. At odds with empirical observation, these models predict that single-layer unweighted networks become resistant to global cascades after reaching sufficient connectivity. We investigate threshold driven contagion on weight heterogene…
▽ More
Models of threshold driven contagion explain the cascading spread of information, behavior, systemic risk, and epidemics on social, financial and biological networks. At odds with empirical observation, these models predict that single-layer unweighted networks become resistant to global cascades after reaching sufficient connectivity. We investigate threshold driven contagion on weight heterogeneous multiplex networks and show that they can remain susceptible to global cascades at any level of connectivity, and with increasing edge density pass through alternating phases of stability and instability in the form of reentrant phase transitions of contagion. Our results provide a novel theoretical explanation for the observation of large scale contagion in highly connected but heterogeneous networks.
△ Less
Submitted 28 May, 2019; v1 submitted 24 January, 2019;
originally announced January 2019.
-
Location, Occupation, and Semantics based Socioeconomic Status Inference on Twitter
Authors:
Jacobo Levy Abitbol,
Márton Karsai,
Eric Fleury
Abstract:
The socioeconomic status of people depends on a combination of individual characteristics and environmental variables, thus its inference from online behavioral data is a difficult task. Attributes like user semantics in communication, habitat, occupation, or social network are all known to be determinant predictors of this feature. In this paper we propose three different data collection and comb…
▽ More
The socioeconomic status of people depends on a combination of individual characteristics and environmental variables, thus its inference from online behavioral data is a difficult task. Attributes like user semantics in communication, habitat, occupation, or social network are all known to be determinant predictors of this feature. In this paper we propose three different data collection and combination methods to first estimate and, in turn, infer the socioeconomic status of French Twitter users from their online semantics. Our methods are based on open census data, crawled professional profiles, and remotely sensed, expert annotated information on living environment. Our inference models reach similar performance of earlier results with the advantage of relying on broadly available datasets and of providing a generalizable framework to estimate socioeconomic status of large numbers of Twitter users. These results may contribute to the scientific discussion on social stratification and inequalities, and may fuel several applications.
△ Less
Submitted 16 January, 2019;
originally announced January 2019.
-
Randomized reference models for temporal networks
Authors:
Laetitia Gauvin,
Mathieu Génois,
Márton Karsai,
Mikko Kivelä,
Taro Takaguchi,
Eugenio Valdano,
Christian L. Vestergaard
Abstract:
Many dynamical systems can be successfully analyzed by representing them as networks. Empirically measured networks and dynamic processes that take place in these situations show heterogeneous, non-Markovian, and intrinsically correlated topologies and dynamics. This makes their analysis particularly challenging. Randomized reference models (RRMs) have emerged as a general and versatile toolbox fo…
▽ More
Many dynamical systems can be successfully analyzed by representing them as networks. Empirically measured networks and dynamic processes that take place in these situations show heterogeneous, non-Markovian, and intrinsically correlated topologies and dynamics. This makes their analysis particularly challenging. Randomized reference models (RRMs) have emerged as a general and versatile toolbox for studying such systems. Defined as random networks with given features constrained to match those of an input (empirical) network, they may, for example, be used to identify important features of empirical networks and their effects on dynamical processes unfolding in the network. RRMs are typically implemented as procedures that reshuffle an empirical network, making them very generally applicable. However, the effects of most shuffling procedures on network features remain poorly understood, rendering their use nontrivial and susceptible to misinterpretation. Here we propose a unified framework for classifying and understanding microcanonical RRMs (MRRMs) that sample networks with uniform probability. Focusing on temporal networks, we survey applications of MRRMs found in the literature, and we use this framework to build a taxonomy of MRRMs that proposes a canonical naming convention, classifies them, and deduces their effects on a range of important network features. We furthermore show that certain classes of MRRMs may be applied in sequential composition to generate new MRRMs from the existing ones surveyed in this article. We finally provide a tutorial showing how to apply a series of MRRMs to analyze how different network features affect a dynamic process in an empirical temporal network.
△ Less
Submitted 15 December, 2022; v1 submitted 11 June, 2018;
originally announced June 2018.
-
Socioeconomic Dependencies of Linguistic Patterns in Twitter: A Multivariate Analysis
Authors:
Jacob Levy Abitbol,
Márton Karsai,
Jean-Philippe Magué,
Jean-Pierre Chevrot,
Eric Fleury
Abstract:
Our usage of language is not solely reliant on cognition but is arguably determined by myriad external factors leading to a global variability of linguistic patterns. This issue, which lies at the core of sociolinguistics and is backed by many small-scale studies on face-to-face communication, is addressed here by constructing a dataset combining the largest French Twitter corpus to date with deta…
▽ More
Our usage of language is not solely reliant on cognition but is arguably determined by myriad external factors leading to a global variability of linguistic patterns. This issue, which lies at the core of sociolinguistics and is backed by many small-scale studies on face-to-face communication, is addressed here by constructing a dataset combining the largest French Twitter corpus to date with detailed socioeconomic maps obtained from national census in France. We show how key linguistic variables measured in individual Twitter streams depend on factors like socioeconomic status, location, time, and the social network of individuals. We found that (i) people of higher socioeconomic status, active to a greater degree during the daytime, use a more standard language; (ii) the southern part of the country is more prone to use more standard language than the northern one, while locally the used variety or dialect is determined by the spatial distribution of socioeconomic status; and (iii) individuals connected in the social network are closer linguistically than disconnected ones, even after the effects of status homophily have been removed. Our results inform sociolinguistic theory and may inspire novel learning methods for the inference of socioeconomic status of people from the way they tweet.
△ Less
Submitted 3 April, 2018;
originally announced April 2018.
-
Bursty Human Dynamics
Authors:
Márton Karsai,
Hang-Hyun Jo,
Kimmo Kaski
Abstract:
Bursty dynamics is a common temporal property of various complex systems in Nature but it also characterises the dynamics of human actions and interactions. At the phenomenological level it is a feature of all systems that evolve heterogeneously over time by alternating between periods of low and high event frequencies. In such systems, bursts are identified as periods in which the events occur wi…
▽ More
Bursty dynamics is a common temporal property of various complex systems in Nature but it also characterises the dynamics of human actions and interactions. At the phenomenological level it is a feature of all systems that evolve heterogeneously over time by alternating between periods of low and high event frequencies. In such systems, bursts are identified as periods in which the events occur with a rapid pace within a short time-interval while these periods are separated by long periods of time with low frequency of events. As such dynamical patterns occur in a wide range of natural phenomena, their observation, characterisation, and modelling have been a long standing challenge in several fields of research. However, due to some recent developments in communication and data collection techniques it has become possible to follow digital traces of actions and interactions of humans from the individual up to the societal level. This led to several new observations of bursty phenomena in the new but largely unexplored area of human dynamics, which called for the renaissance to study these systems using research concepts and methodologies, including data analytics and modelling. As a result, a large amount of new insight and knowledge as well as innovations have been accumulated in the field, which provided us a timely opportunity to write this brief monograph to make an up-to-date review and summary of the observations, appropriate measures, modelling, and applications of heterogeneous bursty patterns occurring in the dynamics of human behaviour.
△ Less
Submitted 7 March, 2018;
originally announced March 2018.
-
Link transmission centrality in large-scale social networks
Authors:
Qian Zhang,
Márton Karsai,
Alessandro Vespignani
Abstract:
Understanding the importance of links in transmitting information in a network can provide ways to hinder or postpone ongoing dynamical phenomena like the spreading of epidemic or the diffusion of information. In this work, we propose a new measure based on stochastic diffusion processes, the \textit{transmission centrality}, that captures the importance of links by estimating the average number o…
▽ More
Understanding the importance of links in transmitting information in a network can provide ways to hinder or postpone ongoing dynamical phenomena like the spreading of epidemic or the diffusion of information. In this work, we propose a new measure based on stochastic diffusion processes, the \textit{transmission centrality}, that captures the importance of links by estimating the average number of nodes to whom they transfer information during a global spreading diffusion process. We propose a simple algorithmic solution to compute transmission centrality and to approximate it in very large networks at low computational cost. Finally we apply transmission centrality in the identification of weak ties in three large empirical social networks, showing that this metric outperforms other centrality measures in identifying links that drive spreading processes in a social network.
△ Less
Submitted 14 February, 2018;
originally announced February 2018.
-
Correlations and dynamics of consumption patterns in social-economic networks
Authors:
Yannick Leo,
Márton Karsai,
Carlos Sarraute,
Eric Fleury
Abstract:
We analyse a coupled dataset collecting the mobile phone communications and bank transactions history of a large number of individuals living in a Latin American country. After mapping the social structure and introducing indicators of socioeconomic status, demographic features, and purchasing habits of individuals we show that typical consumption patterns are strongly correlated with identified s…
▽ More
We analyse a coupled dataset collecting the mobile phone communications and bank transactions history of a large number of individuals living in a Latin American country. After mapping the social structure and introducing indicators of socioeconomic status, demographic features, and purchasing habits of individuals we show that typical consumption patterns are strongly correlated with identified socioeconomic classes leading to patterns of stratification in the social structure. In addition we measure correlations between merchant categories and introduce a correlation network, which emerges with a meaningful community structure. We detect multivariate relations between merchant categories and show correlations in purchasing habits of individuals. Finally, by analysing individual consumption histories, we detect dynamical patterns in purchase behaviour and their correlations with the socioeconomic status, demographic characters and the egocentric social network of individuals. Our work provides novel and detailed insight into the relations between social and consuming behaviour with potential applications in resource allocation, marketing, and recommendation system design.
△ Less
Submitted 23 January, 2018;
originally announced January 2018.
-
Mapping temporal-network percolation to weighted, static event graphs
Authors:
Mikko Kivelä,
Jordan Cambe,
Jari Saramäki,
Márton Karsai
Abstract:
Many processes of spreading and diffusion take place on temporal networks, and their outcomes are influenced by correlations in the times of contact. These correlations have a particularly strong influence on processes where the spreading agent has a limited lifetime at nodes: disease spreading (recovery time), diffusion of rumors (lifetime of information), and passenger routing (maximum acceptabl…
▽ More
Many processes of spreading and diffusion take place on temporal networks, and their outcomes are influenced by correlations in the times of contact. These correlations have a particularly strong influence on processes where the spreading agent has a limited lifetime at nodes: disease spreading (recovery time), diffusion of rumors (lifetime of information), and passenger routing (maximum acceptable time between transfers). Here, we introduce weighted event graphs as a powerful and fast framework for studying connectivity determined by time-respecting paths where the allowed waiting times between contacts have an upper limit. We study percolation on the weighted event graphs and in the underlying temporal networks, with simulated and real-world networks. We show that this type of temporal-network percolation is analogous to directed percolation, and that it can be characterized by multiple order parameters.
△ Less
Submitted 17 September, 2017;
originally announced September 2017.
-
Threshold driven contagion on weighted networks
Authors:
Samuel Unicomb,
Gerardo Iñiguez,
Márton Karsai
Abstract:
Weighted networks capture the structure of complex systems where interaction strength is meaningful. This information is essential to a large number of processes, such as threshold dynamics, where link weights reflect the amount of influence that neighbours have in determining a node's behaviour. Despite describing numerous cascading phenomena, such as neural firing or social contagion, threshold…
▽ More
Weighted networks capture the structure of complex systems where interaction strength is meaningful. This information is essential to a large number of processes, such as threshold dynamics, where link weights reflect the amount of influence that neighbours have in determining a node's behaviour. Despite describing numerous cascading phenomena, such as neural firing or social contagion, threshold models have never been explicitly addressed on weighted networks. We fill this gap by studying a dynamical threshold model over synthetic and real weighted networks with numerical and analytical tools. We show that the time of cascade emergence depends non-monotonously on weight heterogeneities, which accelerate or decelerate the dynamics, and lead to non-trivial parameter spaces for various networks and weight distributions. Our methodology applies to arbitrary binary state processes and link properties, and may prove instrumental in understanding the role of edge heterogeneities in various natural and social phenomena.
△ Less
Submitted 7 July, 2017;
originally announced July 2017.
-
Prepaid or Postpaid? That is the question. Novel Methods of Subscription Type Prediction in Mobile Phone Services
Authors:
Yongjun Liao,
Wei Du,
Márton Karsai,
Carlos Sarraute,
Martin Minnoni,
Eric Fleury
Abstract:
In this paper we investigate the behavioural differences between mobile phone customers with prepaid and postpaid subscriptions. Our study reveals that (a) postpaid customers are more active in terms of service usage and (b) there are strong structural correlations in the mobile phone call network as connections between customers of the same subscription type are much more frequent than those betw…
▽ More
In this paper we investigate the behavioural differences between mobile phone customers with prepaid and postpaid subscriptions. Our study reveals that (a) postpaid customers are more active in terms of service usage and (b) there are strong structural correlations in the mobile phone call network as connections between customers of the same subscription type are much more frequent than those between customers of different subscription types. Based on these observations we provide methods to detect the subscription type of customers by using information about their personal call statistics, and also their egocentric networks simultaneously. The key of our first approach is to cast this classification problem as a problem of graph labelling, which can be solved by max-flow min-cut algorithms. Our experiments show that, by using both user attributes and relationships, the proposed graph labelling approach is able to achieve a classification accuracy of $\sim 87\%$, which outperforms by $\sim 7\%$ supervised learning methods using only user attributes. In our second problem we aim to infer the subscription type of customers of external operators. We propose via approximate methods to solve this problem by using node attributes, and a two-ways indirect inference method based on observed homophilic structural correlations. Our results have straightforward applications in behavioural prediction and personal marketing.
△ Less
Submitted 30 June, 2017;
originally announced June 2017.
-
Service adoption spreading in online social networks
Authors:
Gerardo Iñiguez,
Zhongyuan Ruan,
Kimmo Kaski,
János Kertész,
Márton Karsai
Abstract:
The collective behaviour of people adopting an innovation, product or online service is commonly interpreted as a spreading phenomenon throughout the fabric of society. This process is arguably driven by social influence, social learning and by external effects like media. Observations of such processes date back to the seminal studies by Rogers and Bass, and their mathematical modelling has taken…
▽ More
The collective behaviour of people adopting an innovation, product or online service is commonly interpreted as a spreading phenomenon throughout the fabric of society. This process is arguably driven by social influence, social learning and by external effects like media. Observations of such processes date back to the seminal studies by Rogers and Bass, and their mathematical modelling has taken two directions: One paradigm, called simple contagion, identifies adoption spreading with an epidemic process. The other one, named complex contagion, is concerned with behavioural thresholds and successfully explains the emergence of large cascades of adoption resulting in a rapid spreading often seen in empirical data. The observation of real world adoption processes has become easier lately due to the availability of large digital social network and behavioural datasets. This has allowed simultaneous study of network structures and dynamics of online service adoption, shedding light on the mechanisms and external effects that influence the temporal evolution of behavioural or innovation adoption. These advancements have induced the development of more realistic models of social spreading phenomena, which in turn have provided remarkably good predictions of various empirical adoption processes. In this chapter we review recent data-driven studies addressing real-world service adoption processes. Our studies provide the first detailed empirical evidence of a heterogeneous threshold distribution in adoption. We also describe the modelling of such phenomena with formal methods and data-driven simulations. Our objective is to understand the effects of identified social mechanisms on service adoption spreading, and to provide potential new directions and open questions for future research.
△ Less
Submitted 29 June, 2017;
originally announced June 2017.
-
Socioeconomic correlations and stratification in social-communication networks
Authors:
Yannick Leo,
Eric Fleury,
J. Ignacio Alvarez-Hamelin,
Carlos Sarraute,
Márton Karsai
Abstract:
The uneven distribution of wealth and individual economic capacities are among the main forces which shape modern societies and arguably bias the emerging social structures. However, the study of correlations between the social network and economic status of individuals is difficult due to the lack of large-scale multimodal data disclosing both the social ties and economic indicators of the same p…
▽ More
The uneven distribution of wealth and individual economic capacities are among the main forces which shape modern societies and arguably bias the emerging social structures. However, the study of correlations between the social network and economic status of individuals is difficult due to the lack of large-scale multimodal data disclosing both the social ties and economic indicators of the same population. Here, we close this gap through the analysis of coupled datasets recording the mobile phone communications and bank transaction history of one million anonymised individuals living in a Latin American country. We show that wealth and debt are unevenly distributed among people in agreement with the Pareto principle; the observed social structure is strongly stratified, with people being better connected to others of their own socioeconomic class rather than to others of different classes; the social network appears with assortative socioeconomic correlations and tightly connected "rich clubs"; and that egos from the same class live closer to each other but commute further if they are wealthier. These results are based on a representative, society-large population, and empirically demonstrate some long-lasting hypotheses on socioeconomic correlations which potentially lay behind social segregation, and induce differences in human mobility.
△ Less
Submitted 14 December, 2016;
originally announced December 2016.
-
Correlations of consumption patterns in social-economic networks
Authors:
Yannick Leo,
Márton Karsai,
Carlos Sarraute,
Eric Fleury
Abstract:
We analyze a coupled anonymized dataset collecting the mobile phone communication and bank transactions history of a large number of individuals. After mapping the social structure and introducing indicators of socioeconomic status, demographic features, and purchasing habits of individuals we show that typical consumption patterns are strongly correlated with identified socioeconomic classes lead…
▽ More
We analyze a coupled anonymized dataset collecting the mobile phone communication and bank transactions history of a large number of individuals. After mapping the social structure and introducing indicators of socioeconomic status, demographic features, and purchasing habits of individuals we show that typical consumption patterns are strongly correlated with identified socioeconomic classes leading to patterns of stratification in the social structure. In addition we measure correlations between merchant categories and introduce a correlation network, which emerges with a meaningful community structure. We detect multivariate relations between merchant categories and show correlations in purchasing habits of individuals. Our work provides novel and detailed insight into the relations between social and consuming behaviour with potential applications in recommendation system design.
△ Less
Submitted 21 December, 2017; v1 submitted 13 September, 2016;
originally announced September 2016.
-
Burstiness and tie reinforcement in time varying social networks
Authors:
Enrico Ubaldi,
Alessandro Vezzani,
Marton Karsai,
Nicola Perra,
Raffaella Burioni
Abstract:
We introduce a time-varying network model accounting for burstiness and tie reinforcement observed in social networks. The analytical solution indicates a non-trivial phase diagram determined by the competition of the leading terms of the two processes. We test our results against numerical simulations, and compare the analytical predictions with an empirical dataset finding good agreements betwee…
▽ More
We introduce a time-varying network model accounting for burstiness and tie reinforcement observed in social networks. The analytical solution indicates a non-trivial phase diagram determined by the competition of the leading terms of the two processes. We test our results against numerical simulations, and compare the analytical predictions with an empirical dataset finding good agreements between them. The presented framework can be used to classify the dynamical features of real social networks and to gather new insights about the effects of social dynamics on ongoing spreading processes.
△ Less
Submitted 29 July, 2016;
originally announced July 2016.