Assessment of first-phase COVID-19 pandemic in Europe using hierarchical clustering based on principal components analysis

Sanjay Kumar, Evrim Oral

Abstract

Assessing the course of the COVID-19 pandemic in Europe is of great interest to researchers. Grouping COVID-19-affected regions is an effective way to monitor the disease and to plan the response to it. This paper applies hierarchical clustering based on principal components analysis (HCPCA) to COVID-19 data from affected European countries. Starting from several attribute indices for each country, we use principal components analysis to aggregate the indices and reduce their dimension, which yields a new set of indicators. We then apply hierarchical clustering to the reduced observations on these new indicators to group the affected countries by similarity. The aim is to group European countries with similar epidemic severity on the basis of a presumed set of attribute indices. The study covers data up to 24 May 2020. Its outputs are intended to help governments, administrators, the World Health Organization (WHO), healthcare service professionals, and other decision-makers adapt their policies and regulations to country-level requirements, so that transmission of infection, deaths, and the number of patients in critical condition can be minimized.
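The two-stage procedure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the data are synthetic, the function name `hcpca` is hypothetical, and the choices of standardized inputs, two retained components, Ward linkage, and three clusters are assumptions made only for illustration.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster


def hcpca(X, n_components=2, n_clusters=3):
    """Sketch of HCPCA: PCA on standardized columns, then hierarchical
    clustering (Ward linkage, an assumption) of the component scores."""
    X = np.asarray(X, dtype=float)
    # Standardize each attribute index so that scale does not dominate.
    Z = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
    # PCA via singular value decomposition of the standardized data.
    _, s, Vt = np.linalg.svd(Z, full_matrices=False)
    explained = s**2 / np.sum(s**2)      # share of variance per component
    scores = Z @ Vt[:n_components].T     # reduced observations
    # Agglomerative clustering on the reduced observations.
    tree = linkage(scores, method="ward")
    labels = fcluster(tree, t=n_clusters, criterion="maxclust")
    return scores, explained[:n_components], labels


# Synthetic stand-in for country-level indices (15 "countries", 4 indices).
rng = np.random.default_rng(0)
centers = np.array([[0, 0, 0, 0], [6, 6, 6, 6], [-6, 6, -6, 6]], dtype=float)
X = np.vstack([c + rng.normal(scale=0.5, size=(5, 4)) for c in centers])

scores, explained, labels = hcpca(X)
print("variance explained by retained components:", explained.round(3))
print("cluster labels:", labels)
```

In practice the number of retained components and the number of clusters would be chosen from the data, for example from the explained-variance shares and from the dendrogram, rather than fixed in advance as they are here.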

Keywords

principal components analysis; dendrogram; hierarchical clustering; data science; data mining

DOI: https://doi.org/10.32629/jai.v7i1.648
