banner

Assessment of first-phase COVID-19 pandemic in Europe using hierarchical clustering based on principal components analysis

Sanjay Kumar, Evrim Oral

Abstract


It is of great interest for researchers to assess the COVID-19 pandemic in Europe. Grouping of COVID-19-affected regions is an effective way to monitor and optimize planning to combat the disease. This paper applied hierarchical clustering based on principal components analysis (HCPCA) to COVID-19 data from affected European countries. Considering several attribute indices, we obtained a new set of indicators using principal components analysis to aggregate and reduce the dimension of attribute indices of affected countries. Further, we obtained groups of affected countries subject to their similarity using hierarchical clustering to the reduced observations of new attributes indices of these countries. This study aims to group European countries with similar epidemic severity using some presumed attribute indices. The study is limited up to 24 May 2020, to assess if the outputs of the study could help governments, administrators, World Health Organization (WHO), healthcare service professionals, and other decision-makers to optimize their policies and plan their regulations in the country level requirements so that transmission of infections, deaths, critical conditions of patients could be minimized. For this purpose, we used hierarchical clustering using principal components analysis to obtain better clusters of countries with similar epidemic severity.


Keywords


principal components analysis; dendrogram; hierarchical clustering; data science; data mining

Full Text:

PDF

References


1. Liu Y, Gu Z, Xia S, et al. What are the underlying transmission patterns of COVID-19 outbreak? An age-specific social contact characterization. eClinicalMedicine 2020; 22: 100354. doi: 10.1016/j.eclinm.2020.100354

2. Olson DL, Shi Y. Introduction to Business Data Mining. McGraw-Hill/Irwin; 2007.

3. Shi Y, Tian YJ, Kou G, et al. Optimization Based Data Mining: Theory and Applications. Springer; 2011.

4. Kumar S. Monitoring novel corona virus (COVID-19) infections in India by cluster analysis. Annals of Data Science 2020; 7(3): 417–425. doi: 10.1007/s40745-020-00289-7

5. Available online: https://www.worldometers.info/coronavirus (accessed on 24 May 2020).

6. Ratner B. The correlation coefficient: Its values range between +1/−1, or do they? Journal of Targeting, Measurement and Analysis for Marketing 2009; 17(2): 139–142. doi: 10.1057/jt.2009.5

7. Johnson RA, Wichern DW. Applied Multivariate Statistical Analysis, 6th ed. Pearson Education, Inc.; 2007.

8. Husson F, Josse J, Pages J. Principal component methods-hierarchical clustering-partitional clustering: Why would we need to choose for visualizing data? Applied Mathematics Department 2010; 17.

9. Maugeri A, Barchitta M, Basile G, Agodi A. Applying a hierarchical clustering on principal components approach to identify different patterns of the SARS-CoV-2 epidemic across Italian regions. Scientific Reports 2021; 11(1): 7082. doi: 10.1038/s41598-021-86703-3

10. Kaiser HF. An index of factorial simplicity. Psychometrika 1974; 39: 31–36. doi: 10.1007/BF02291575

11. Hair JF Jr, Black WC, Babin BJ, Anderson RE. Multivariate Data Analysis, 7th ed. Prentice Hall; 2010.

12. Bartlett MS. A note on the multiplying factors for various χ2 approximations. Journal of the Royal Statistical Society: Series B (Methodological) 1954; 16(2): 296–298. doi: 10.1111/j.2517-6161.1954.tb00174.x

13. Romesburg HC. Cluster Analysis for Researchers. Lifetime Learning Publications; 1984.

14. Ward JH Jr. Hierarchical grouping to optimize an objective function. Journal of the American Statistical Association 1963; 58(301): 236–244. doi: 10.1080/01621459.1963.10500845

15. Goon AM, Gupta MK, Dasgupta B. Fundamentals of Statistics, 8th ed. The World Press, Kolkata; 2002.




DOI: https://doi.org/10.32629/jai.v7i1.648

Refbacks

  • There are currently no refbacks.


Copyright (c) 2023 Sanjay Kumar, Evrim Oral

License URL: https://creativecommons.org/licenses/by-nc/4.0/