# Nature of Complex Network of Dengue Epidemic as a Scale-Free Network

## Article information

## Abstract

### Objectives

Dengue epidemic is a dynamic and complex phenomenon that has gained considerable attention due to its injurious effects. The focus of this study is to statically analyze the nature of the dengue epidemic network in terms of whether it follows the features of a scale-free network or a random network.

### Methods

A multifarious network of *Aedes aegypti* is addressed keeping the viewpoint of a complex system and modelled as a network. The dengue network has been transformed into a one-mode network from a two-mode network by utilizing projection methods. Furthermore, three network features have been analyzed, the power-law, clustering coefficient, and network visualization. In addition, five methods have been applied to calculate the global clustering coefficient.

### Results

It has been observed that dengue epidemic follows a power-law, with the value of its exponent γ = −2.1. The value of the clustering coefficient is high for dengue cases, as weight of links. The minimum method showed the highest value among the methods used to calculate the coefficient. Network visualization showed the main areas. Moreover, the dengue situation did not remain the same throughout the observed period.

### Conclusions

The results showed that the network topology exhibits the features of a scale-free network instead of a random network. Focal hubs are highlighted and the critical period is found. Outcomes are important for the researchers, health officials, and policy makers who deal with arbovirus epidemic diseases. *Zika* virus and *Chikungunya* virus can also be modelled and analyzed in this manner.

**Keywords:**Dengue Virus; Arboviruse; Epidemics; Big Data; Network Meta-Analysis

## I. Introduction

Various kinds of real world complex systems including physical, biological, and social systems can be represented in terms of complex networks. Some important examples are the World Wide Web, electric power grids, scientific collaboration networks, airport networks, the Hajj network, and social networks of friendship [1234]. In the fields of medicine and biology, the complex networks of diseases such as HIV/AIDS, smallpox, and dengue virus have also been investigated to analyze the spreading phenomenon [56]. The dengue virus network includes the mosquito ‘*Aedes aegypti*’. Dengue is an arbovirus that spreads infections through mosquitoes to humans and infected humans to mosquitoes (other than *Aedes* species), constituting a complex network. The network of arbovirus epidemic has become a complex phenomenon. In [7], the robustness of the dengue complex network under targeted versus random attack was observed and it was found that targeted attack gives better outcomes in scale-free networks compared to random networks.

While modeling and analyzing the different complex networks, researchers have observed numerous structural characteristics in different real-world organisms [8]. Specifically, in the last few years the research trend has increasingly moved toward analyzing complex systems by creating networks in the form of nodes and links. This facilitates comprehension of both structural and dynamical features of tangible world complex networks. The research and study of these linkages has played a significant role for immunization in epidemics and network tolerance attacks [9]. The authors observed some scale-free features in the dengue epidemic in Singapore [10]. In addition, they analyzed the dengue spreading situation from the perspective of complex networks and modelled the dataset of dengue affected cases in Selangor as a two-mode network, and then projected it to a one-mode network [511]. A dataset of dengue patients has been obtained from the Ministry of Health (MoH) Malaysia, and is provided in Appendix 1. The data are anonymous to protect patients' privacy. The network projection is performed by three methods (weighted Newman, sum, and binary) and the power-law exponent (γ) is calculated, which is an important step to see the deportment of a scale-free network [11].

The existence of a power-law posture is perceived in numerous varieties of complex networks, including metabolic networks, systems of lung inflation, sun motion, and the light from galaxies and the water flow through river [12]. Thus, the power-law has been utilized for discovering complex environments by many researchers in various research areas and contexts.

The power-law is quite different from a bell-shaped normal distribution. A scale-free graph [8] follows a power-law form as given below:

*k*) ~

*k*

^{−γ}

where *k* represents the degree of a node, the probability of node degree distribution is represented by P(*k*), and γ (gamma) is a scaling exponent, which is a statistical parameter that is called a connectivity distribution exponent. In reality, γ does not depend on a specific scale of network that is why; it is called a scale-free parameter. Also, the value of γ has been confirmed in many research studies to approximately range from 2 to 3. R-project has been utilized to find γ and other graphical visualizations in this research.

Scale-free network comprises two main characteristics based on the Barabási-Albert model: growth and preferential attachment. The remainder of this paper is structured as follows: Section I-1 provides the background of the study. In Sections II and III, the methodology is presented and the results are discussed. Finally, Section IV gives conclusions of this research and directions for future work.

### 1. Background

Dengue fever is a major disease in tropical regions of the world. Approximately 25 million people are in danger due to mosquito borne dengue fever [1314]. It is caused by an arbovirus, *Aedes aegypti*, which is the primary vector of dengue virus [14]. DENV-1, DENV-2, DENV-3, and DENV-4 are four serotypes that have been found in this disease [1415161718].

#### 1) Dengue in Malaysia

MoH Malaysia published a report in 2015 noting 107,079 dengue cases with 293 deaths while there were 43,000 dengue cases with 92 deaths in 2013 [192021]. According to the MoH Malaysia, the presence of dengue has been growing rapidly since 2012 [2223]. The rapid spread of dengue virus has become more harmful and tackling this matter should be considered urgent. Dengue epidemic is an important field of research and many researchers are investigating this phenomenon from different viewpoints [182425]. In light of the importance of this issue and with the aim of working toward a remedy, the authors have modelled the dengue problem in Malaysia by utilizing the two-mode network technique [11].

#### 2) Modeling of dengue epidemic network

Researchers have been utilizing the two-mode network technique in many fields, such as collaborative work and movie-actor networks [262728]. The authors of this paper have utilized the two-mode network technique in the dengue mosquito network [11]. In this network primary nodes are represented by L1, L2, L3, and L4 and secondary set of nodes are represented by W1, W2, W3, and W4. A two-mode network is usually converted into a one-mode network by projection for better analysis [28]. The current work is an enhanced form and continuation of our previous work [11]. Analysis of the network is based on the power-law behavior, clustering coefficient, and network visualization, and most importantly we observed the characteristics of the *Aedes aegypti* network from the viewpoint of a scale-free network and random network. In general, there are various types of networks and different ways to destroy those networks. It is very important that before attacking any network, the topology of the network should be understood. Similarly, to break down the dengue network, it must be clarified whether it should be treated as a scale-free network or a random network.

## II. Methods

### 1. Network Analysis

This section presents an analysis of the weighted two-mode dengue network, which has been performed from the perspective of dengue cases that appeared in different locations in Selangor. After utilizing the projection ways (binary, sum, and weighted Newman) and on the basis of their outcomes, the weighted Newman method has been observed to provide the best fit for this dataset. By this method, the dataset lost the minimum weighted information [11]. From the given dataset, the power-law exponent (γ) is calculated and the results showed the trend towards a power-law. It should be noted that a scale-free network follows a power-law [8]. In this research, the results show the characteristics of a scale-free network and particularly the topology of this complex dengue network is in a power-law form.

To see the graphical trends of the power-law, graphs are formed utilizing the dataset (bipartite) of Gombak, Hulu Langat, and Petaling, where numbers of dengue cases are shown in effected localities, respectively. Furthermore, the clustering coefficient is calculated to observe the deviation of the network.

## III. Results

### 1. Power-Law Behavior

In Figure 1, the x-axis shows the weight of links and the y-axis shows the number of links. Dengue cases in different localities of Gombak that occurred during the observed years are represented on a linear scale. The figure displays power-law behavior. It clearly indicates that the human populations in a few localities were highly affected by the dengue virus in Selangor, Malaysia, during the given period.

There are 58 localities in Gombak that recorded 1487 dengue cases in total in the given period. The highest number of dengue cases (=128) was registered in location no. 18 whereas only four cases were observed in location no. 2. In Figure 2, Hulu Langat, 185 localities were affected compared to Petaling, where 243 dengue affected localities were registered. It is interesting that in these three areas only a few localities were attacked repeatedly by *Aedes aegypti*, whereas in other localities this mosquito attacked once or twice in a year. This showed the dengue has effected within this particular area. Similarly, there are a few areas that were highly affected by the dengue virus. In very few places the dengue appearance is high whereas the majority of localities have a small number of dengue cases. There is consequently a need to place greater focus on these few areas to control this disease. On the other side, identification of focal nodes in a complex networks is an important issue for researchers and scientists. Specifically, if the central node is identified, it will potentially make it possible to control the flow of other nodes. Moreover, via that node, other nodes can be captured very quickly. Hence, targeting the areas in which dengue virus is appearing repeatedly and affecting large populations may be a more helpful means to find and control the central node.

In Figures 1, 2, 3, R^{2} is the coefficient of determination. This statistical measure shows how good the regression line estimates the real data points. R^{2} provides information on the goodness of fit of a model. Here, R^{2} = 0.9213, 0.923, and 0.9102 specify that the regression line perfectly fits the data, respectively.

The coefficient of determination is calculated as:

where *SS _{tot}* represents the total sum of squares and

*SS*is the residual sum of squares.

_{res}In Figure 2, the x-axis shows the weight of links and the y-axis shows the number of links. The number of dengue cases in different localities of Hulu Langat that were included in this network during the given period. The weight of links as dengue cases in different localities is shown on a linear scale. Out of 185 dengue affected localities, 7,854 dengue cases were registered. The highest number of cases (= 729) was recorded in the location HL130, whereas the smallest number of cases (= 4) was recorded in location HL177.

In Figure 3, the x-axis shows the weight of links and the y-axis shows the number of links. The numbers of dengue cases in different localities of Petaling are displayed on a linear scale during the period 2013–2014. It can be grasped that there are few localities that registered an extraordinary number of cases. Out of 243 affected localities of Petaling, 15,261 dengue cases were recorded throughout the given period. The highest number of dengue cases (= 3,107) was registered in Petaling (location no. 126). The smallest number, four cases, was observed in Petaling (location no. 240 and 243).

In the literature, some examples of scale-free network are presented, where the power-law exponent has different values. Researchers considered these networks to be scale-free [18]. Investigations of the topology of complex systems from different domains of life have shown interesting results. For example, Barabasi and Bonabeau [8] modelled World Wide Web pages and their hyper-links and brought the idea of scale-free network with power-law exponents γ_{in} = 2.1 and γ_{out} = 2.7, where γ_{in} and γ_{out} are in-degree and out-degree, respectively. In 2001, Liljeros et al. [29] modelled and investigated human sexual connections as a network. Researchers found this societal occurrence to be scale-free and showed that it follows the power-law form (where γ_{f} = 2.54 for females and γ_{m} = 2.31 for males). Newman formed a scientific association network as a two-mode network, where he modelled nodes as scientists and their collaborated papers. Two scientists are linked if they worked on a joint article as primary nodes. He observed the degree distribution of this network in the case of a high energy physics databank, which follows a power-law with the exponent γ = 1.2 [1].

Figure 4 shows the probability distribution of node strength (number of dengue cases in different areas of Selangor) on a log scale, for cases recorded in a given period. R-project has been utilized to calculate the power-law exponent (γ). The broken line is the slope of the declining curve and it represents that this network is geographically structured as a scale-free network. γ is close to the lower bound of its limit. Here, γ = −2.1, which indicates a decreasing slope of the distribution. This probability distribution indicates that the distribution follows a power-law; it has geographically organized itself during the given period. The power-law is an important indicator of scale-free network. Moreover, important links are few in number and should be focused on as they have huge weight compared to other links. A scale-free network is very important in solving the epidemic issue. Epidemic diseases can be better controlled by this type of network topology by focusing on main hubs (nodes), compared to a random distribution.

Link's density is shown in Figure 5, where the x-axis represents the number of links and the y-axis shows the weight of links on a linear scale. It specifies that some links in this two-mode network highly affect the whole network. The majority of nodes have weight below 200, whereas a minority of nodes has high weight in terms of dengue cases that appeared in these nodes.

In Figure 6, a weekly comparison is shown among 6 dengue affected districts in Selangor. This graph indicates the most crucial time period when the attack of *Aedes aegypti* was at its peak. It can be observed from the graph that Petaling is the most affected area followed by Hulu Langat. The human populations of Gombak, Sepang, Hulu Selangor, and Klang have also been victims of *Aedes aegypti*. The dataset showed the peak activity was from December 2013 till the end of February 2014. These twelve weeks were the most critical in these two highly affected districts. These districts had high infection rates in these 9 weeks compared to the other 3 weeks in the year. For the remaining four districts, the time series suggests activity without a clear, sustained epidemic burst between October 20, 2013 and October 18, 2014. Sepang, for instance, appears to have higher activity from October 2013 to the end of December 2013, without any significant activity in 2014. Gombak represented an isolated peak in the 25th week of 2014 and Hulu Selangor showed slightly elevated activity by the end of 2014 (41st week onwards). It is observed that out of 12 months, these 3 months showed the highest rate of dengue infections. It can be concluded that, apart from the importance of focal nodes, time duration is also important, as 3 months showed the highest rate, and also showed power-law resemblance. This feature also indicates a scale-free network.

### 2. Clustering Coefficient

The global clustering coefficient has been generalized to weighted two-mode networks with weighted links [28]. The value can be constructed on the weights of links, and can be defined utilizing the four methods as a one-mode weighted clustering coefficient [27]. For the weighted two-mode network, global clustering coefficient is defined as follows:

where τ^{*}w represents the values of 4-paths and τ^{*}Δw shows the value of these 4-paths that are closed by being part of at least one 6-cycle (i.e., a loop of six links with five nodes).

Here, the global clustering coefficient is computed for the weighted two-mode network using five methods, i.e. binary (Bi), arithmetic mean (AM), geometric mean (GM), maximum (Max) and minimum (Min).

In Figure 7, the global clustering coefficient is shown in two-mode networks by utilizing the above-mentioned five methods. The results showed that the network of localities is clustered, where dengue cases are considered as the weight of the links. It can be observed that a few links have very high weight and, because of this, the minimum method showed the highest value (= 0.85), which also showed resemblance to a scale-free network. The results of the binary method do not include the weight of links to complete 4-paths. On the other hand, the maximum method showed a lower value compared to all other weighted methods (excluding binary) because the majority of links have smaller weight. There are few links having high weight in the network, and for this reason the minimum method represented high values of the clustering coefficient compared to the other methods. In addition, AM displayed an average value compared to other methods. GM meanwhile produced higher values than Max and AM because this method also closes the 4-paths based on GM. Therefore, in terms of the global clustering coefficient, the minimum method is the most appropriate for use in this network. The results obtained by this method indicate that the weighted distribution is very inhomogeneous.

### 3. Network Visualization from Localities Perspective

Figure 8 presents a graphical view of the dengue network that is plotted in igraph package of R-project. The node ID numbers 1, 5, 15, 30, 36, 38, 39, and 40 are very significant and should be focused on for treatment. This is due to their degree and their role as bridges between different clusters of nodes. This graphic visualization showed that localities are not properly associated in Gombak with each other by the co-occurrence of weeks [11], also gave the output that not all localities were affected in the observed time frame. Further, few nodes (localities) were working as main hubs. An actual map of Gombak is shown in Figure 9. This provides a geographical representation of dengue affected nodes (localities) in Gombak, Malaysia [11]. It has been observed that there were different clusters, such as Batu Caves, where people suffered greatly due to dengue disease. Grey color nodes represent the critical focal nodes that should be treated first. These nodes also lead this network to be scale-free.

## IV. Discussion

Complex networks have become a rich field of study. Many real-world phenomena are modelled and analyzed as complex networks. The locale of this study is Selangor, a state of Malaysia. Here, the dengue epidemic issue has been modelled by considering a two-mode network and the given dataset of dengue affected cases was investigated. A power-law exponent (γ) has been calculated and discussed on the basis of output obtained from the projection methods. The results of network metrics, clustering coefficient, and gamma exhibit the topology of the dengue epidemic as a scale-free network. Furthermore, the dengue situation did not remain the same throughout the year, and it was found that a 12-week period was more crucial and showed a power-law form. The global clustering coefficient of localities network revealed that this network is clustered in terms of dengue cases as the weight of the links. The findings of this study showed the overall trend of this network as a scale-free instead of random network. These outcomes can help health official policy makers to deal with the dengue virus by keeping in view its scale-free nature. The outcomes highlight focal hubs that can be inspected in terms of cleanliness, immunization, and how the dengue virus can be avoided or controlled. In the future, the impact of the genetically modified mosquito (GMM) technique can be introduced in focal nodes as an external factor for the treatment of harmful effects of *Aedes Aegypti*. Furthermore, the GMM technique would be less costly and more effective when applied to a scale-free network compared to a random network. The methods and results of this research are also important for researchers and scientists who deal with arbovirus epidemics, such as the *Zika* and *Chikungunya* viruses.

## Acknowledgments

The authors would like to express their gratitude to AMA International University Bahrain, for providing administrative and technical support.

## Notes

**Conflict of Interest:** No potential conflict of interest relevant to this article was reported.