Quick Search:

Ensemble Learning for Medical Image Character Recognition based on Enhanced Lenet-5

Osagie, Efosa ORCID logoORCID: https://orcid.org/0009-0004-3462-7175, Ji, Wei and Helian, Na (2023) Ensemble Learning for Medical Image Character Recognition based on Enhanced Lenet-5. In: IEEE Symposium on Computational Intelligence and Bioinformatics and Computational Biology (CIBCB), 29-31 August 2023, Eindhoven, Netherlands.

[thumbnail of Ensemble_Learning Manuscript.pdf]
Preview
Text
Ensemble_Learning Manuscript.pdf - Accepted Version

| Preview

Abstract

Generally, Medical Imaging Modalities (MIM) have a distinctive nature of low contrast, complex background, and low resolution, containing burned-in textual data of patients. The conventional OCRs hardly recognise these burned-in textual data under these conditions, as they are designed for mainly bi-level text with a minimum resolution of 300 dpi. With a focus on solving these challenges, an enhanced CNN model for medical image character recognition (MICR) is proposed in this paper. The Lenet-5 architecture inspires this proposed Model. To further enhance this new technique to recognise visually similar characters, this paper proposes an ensemble classifier of CNN base learners. Intensive experiments are done using an open-source medical imaging dataset. The problem of low resolution at 96dpi and background interference is targeted by using small 3 X 3 CNN filters to extract local features and changing the pooling layer to a learning layer by replacing it with 5 X 5 filters with a stride of 2 and training on a low-resolution character dataset. The final prediction is based on a majority voting algorithm. The consensus of the base learners improves the model’s stability in recognising visually similar characters. Finally, our proposed models and the Lenet-5 are compared using the Medpix medical image collection. Further investigation shows that our proposed model shows a 10% increase in accuracy compared with the base model and other past algorithms in recognising burned-in textual data on medical imaging modalities.

Item Type: Conference or Workshop Item (Paper)
Status: Published
Subjects: Q Science > Q Science (General)
Q Science > QA Mathematics
Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Q Science > QA Mathematics > QA76.9.H85 Human-Computer Interaction; Virtual Reality; Mixed Reality; Augmented Reality ; Extended Reality
School/Department: London Campus
URI: https://ray.yorksj.ac.uk/id/eprint/12833

University Staff: Request a correction | RaY Editors: Update this record