Biomedical named entity recognition using TCN approaches and bio tagging
Abstract
Biomedical named entity recognition (BNER) is to identify instances in biomedical field such as chemical compounds, drugs, genes, RNA, DNA and proteins used in extracting information. It extracts relation between various drugs and their usage, profiles of similar and related drugs with help of machine learning approach. The efficiency in biomedical field is still in research for further improvement even many supervised methods are applied. The proposed method combines two algorithms and improve performance based on features used. It uses conditional random field (CRF) for entity identification and classification of temporal conventional network (TCN) to detect and recognize subtypes in BNER. Datasets such as GENIA and CHEMDNER corpus are used for evaluation with different entity types. Results shows that proposed methods performed better compared to other machine learning approach. The detailed study of TCN has been discussed. The classification of BNER is mapped with various classification methods to enhance result of high recognition.
Keywords
Full Text:
PDFReferences
1. Yang H, Dong Y. Recognizing hierarchically related biomedical entities using MeSH-based mapping. Tsinghua Science and Technology 2012; 17(6): 609–618. doi: 10.1109/TST.2012.6374362
2. Hsu YY, Kao HY. Curatable named entity recognition using semantic relations. IEEE/ACM Transactions on Computational Biology and Bioinformatics 2015;12(4): 785–792. doi: 10.1109/TCBB.2014.2366770
3. Gajendran S, Manjula D, Sugumaran V. Character level and word level embedding with bidirectional LSTM-Dynamic recurrent neural network for biomedical named entity recognition from literature. Journal of Biomedical Informatics 2020; 112: 1036096. doi: 10.1016/j.jbi.2020.103609
4. Cho M, Ha J, Park C, Park S. Combinatorial feature embedding based on CNN and LSTM for biomedical named entity recognition. Journal of Biomedical Informatics 2020; 103: 103381. doi: 10.1016/j.jbi.2020.103381
5. Kim D, Lee J, So CH, et al. A neural named entity recognition and multi-type normalization tool for biomedical text mining. IEEE Access 2019; 7: 73729–73740. doi: 10.1109/ACCESS.2019.2920708
6. Cabot S, Darmoni S, Soualmia LF. Cimind: A phonetic-based tool for multilingual named entity recognition in biomedical texts. Journal of Biomedical Informatics 2019; 94: 103176. doi: 10.1016/j.jbi.2019.103176
7. Kumar CUO, Gajendran S, Bhavadharini RM, et al. EHR privacy preservation using federated learning with DQRE-Scnet for healthcare application domains. Knowledge-Based Systems 275; 275: 110638. doi: 10.1016/j.knosys.2023.110638
8. Kumar CUO, Gajendran S, Balaji V, et al. Securing health care data through blockchain enabled collaborative machine learning. Soft Computing 2023; 27(14): 9941–9954. doi: 10.1007/s00500-023-08330-6
9. Chennam KK, Maheshwari VU, Aluvalu R. Maintaining IoT healthcare records using cloud storage. In: Nath Sur S, Balas VE, Bhoi AK, et al. (editors). IoT and IoE Driven Smart Cities. Springer, Cham; 2021. pp. 215–233.
10. Siarry P, Jabbar MA, Aluvalu R, et al. The Fusion of Internet of Things, Artificial Intelligence, and Cloud Computing in Health Care, 1st ed. Springer Cham; 2021. pp. 1–23.
11. Śniegula A, Poniszewska-Maranda A, Chomatek Ł. Study of named entity recognition methods in biomedical field. Procedia Computer Science 2019; 160: 260–265. doi: 10.1016/j.procs.2019.09.466
12. Mehmood T, Gerevirini AE, Lavelli A, Serina I. Combining multi-task learning with transfer learning for biomedical named entity recognition. Procedia Computer Science 2020; 176: 848–857. doi: 10.1016/j.procs.2020.09.080
13. Nozza D, Manchanda P, Fersini E, et al. LearningToAdapt with word embeddings: Domain adaptation of named entity recognition systems. Information Processing and Management 2021; 58(3): 102537. doi: 10.1016/j.ipm.2021.102537
14. Santoso J, Setiawan EI, Purwanto CN, et al. Named entity recognition for extracting concept in ontology building on Indonesian language using end-to-end bidirectional long short-term memory. Expert Systems with Applications 2021; 176: 114856. doi: 10.1016/j.eswa.2021.114856
15. Gridach M. Character-level neural network for biomedical named entity recognition. Journal of Biomedical Informatics 2017; 70: 85–91. doi: 10.1016/j.jbi.2017.05.002
16. Li K, Tang Z, Zhang F, et al. Hadoop recognition of biomedical named entity using conditional random fields. IEEE Transactions on Parallel and Distributed Systems 2015; 26(11): 3040–3051. doi: 10.1109/TPDS.2014.2368568
17. Islamaj R, Wei CH, Cissel D, et al. NLM-Gene, a richly annotated gold standard dataset for gene entities that addresses ambiguity and multi-species gene recognition. Journal of Biomedical Informatics 2021; 118: 103779. doi: 10.1016/j.jbi.2021.103779
18. Wang D, Fan H, Liu J. Learning with joint cross-document information via multi-task learning for named entity recognition. Information Sciences 2021; 579: 454–467. doi: 10.1016/j.ins.2021.08.015
19. ElDin HG, AbdulRazek M, Abdelshafi M, Saglol AT. Med-Flair: Medical named entity recognition for diseases and medications based on flair embedding. Procedia Computer Science 2021; 189: 67–75. doi: 10.1016/j.procs.2021.05.078
20. Liu J, Gao L, Guo S, et al. A hybrid deep-learning approach for complex biochemical named entity recognition. Knowledge-Based Systems 2021; 221: 106958. doi: 10.1016/j.knosys.2021.106958
21. Chen Y, Hu Y, Li Y, et al. A boundary assembling method for nested biomedical named entity recognition. IEEE Access 2020; 8: 214141–214152. doi: 10.1109/ACCESS.2020.3040182
22. Zhang Q, Sun Y, Zhang L, et al. Named entity recognition method in health preserving field based on BERT. Procedia Computer Science 2021; 183: 212–220. doi: 10.1016/j.procs.2021.03.010
23. Bhasuran B, Murugesan G, Abdulkadhar S, Natarajan J. Stacked ensemble combined with fuzzy matching for biomedical named entity recognition of diseases. Journal of Biomedical Informatics 2016; 64: 1–9. doi: 10.1016/j.jbi.2016.09.009
24. Zhang J, Shen D, Zhu G, et al. Enhancing HMM-based biomedical named entity recognition by studying special phenomena. Journal of Biomedical Informatics 2004; 37(6): 411–422. doi: 10.1016/j.jbi.2004.08.005
25. Kim J, Ko Y, Seo J. A bootstrapping approach with CRF and deep learning models for improving the biomedical named entity recognition in multi-domains. IEEE Access 2019; 7: 70308–70318. doi: 10.1109/ACCESS.2019.2914168
26. Chukwuocha C, Mathu T, Raimond K. Design of an interactive biomedical text mining framework to recognize real-time drug entities using machine learning algorithms. Procedia Computer Science 2018; 143: 181–188. doi: 10.1016/j.procs.2018.10.374
27. Zhang S, Elhadad N. Unsupervised biomedical named entity recognition: Experiments with clinical and biological texts. Journal of Biomedical Informatics 2013; 46(6): 1088–1098. doi: 10.1016/j.jbi.2013.08.004
28. Catelli R, Gragiulo F, Casola V, et al. Crosslingual named entity recognition for clinical de-identification applied to a COVID-19 Italian data set. Applied Soft Computing 2020; 97: 106779. doi: 10.1016/j.asoc.2020.106779
29. Li L, Jiang Y. Integrating language model and reading control gate in BLSTM-CRF for Biomedical named entity recognition. IEEE/ACM Transactions on Computational Biology and Bioinformatics 2020; 17(3): 841–846. doi: 10.1109/TCBB.2018.2868346
30. Saha SK, Sarkar S, Mitra P. Feature selection techniques for maximum entropy based biomedical named entity recognition. Journal of Biomedical Informatics 2009; 42(5): 905–911. doi: 10.1016/j.jbi.2008.12.012
31. Wei H, Gao M, Zhou A, et al. Named entity recognition from biomedical texts using a fusion attention-based BiLSTM-CRF. IEEE Access 2019; 7: 736237–73636. doi: 10.1109/ACCESS.2019.2920734
32. Li L, Fan W, Huang D. A two-phase bio-NER system based on integrated classifiers and multiagent strategy. IEEE/ACM Transactions on Computational Biology and Bioinformatics 2013; 10(4): 897–904. doi: 10.1109/TCBB.2013.106
DOI: https://doi.org/10.32629/jai.v6i3.1123
Refbacks
- There are currently no refbacks.
Copyright (c) 2023 Thiyagu Thavittupalayam Meenachisundaram, Sangeetha Ramachandran, Sudhakaran Gajendran, Om Kumar Chandra Umakantham, Sathish Kuppani
License URL: https://creativecommons.org/licenses/by-nc/4.0/