Introducing a novel dataset for testing OCR Engines
We study the performance of the top OCR engines on the Bangla CrossHair Dataset, a Bangla character dataset introduced by our team. The dataset is available on Kaggle. The paper has been submitted to the ICCIT 2024 conference.
EasyOCR is a modern, deep learning-based OCR library developed by the Jaided AI team. It is built on top of deep neural networks and uses pre-trained models for detecting and recognizing text in various scripts and languages.
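Tesseract is one of the most widely used traditional OCR engines. It was initially developed by HP and later released as open source, with Google sponsoring its development for many years. Tesseract combines a classic pattern-matching engine with machine learning techniques for recognizing text; since version 4 it also includes an LSTM-based recognizer.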
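To make the comparison concrete, here is a minimal Python sketch of how the two engines could be run on the same images and scored. It is an illustration under assumptions, not the method used in our paper: the character error rate (CER) metric, the helper names (`edit_distance`, `character_error_rate`, `read_with_easyocr`, `read_with_tesseract`, `compare_engines`), and the sample layout of (image path, ground-truth text) pairs are all hypothetical. It assumes the `easyocr`, `pytesseract`, and `Pillow` packages are installed, along with Tesseract's Bengali (`ben`) language data.

```python
from typing import Callable, Dict, Iterable, Tuple


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: insertions, deletions, substitutions cost 1 each."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(prev[j] + 1,                # delete ca
                            curr[j - 1] + 1,            # insert cb
                            prev[j - 1] + (ca != cb)))  # substitute
        prev = curr
    return prev[-1]


def character_error_rate(predicted: str, truth: str) -> float:
    """Edit distance divided by ground-truth length.

    Note: Python strings are compared per Unicode code point, so a Bangla
    conjunct or vowel sign counts as several units (an assumption of this sketch).
    """
    if not truth:
        return 0.0 if not predicted else 1.0
    return edit_distance(predicted, truth) / len(truth)


def read_with_easyocr(image_path: str) -> str:
    """Recognize Bangla text with EasyOCR ('bn' is its Bengali language code)."""
    import easyocr  # imported lazily so the scoring code works without it
    reader = easyocr.Reader(["bn"])  # in practice, build the reader once and reuse it
    return "".join(reader.readtext(image_path, detail=0))


def read_with_tesseract(image_path: str) -> str:
    """Recognize Bangla text with Tesseract ('ben' is its Bengali language code)."""
    import pytesseract
    from PIL import Image
    return pytesseract.image_to_string(Image.open(image_path), lang="ben").strip()


def compare_engines(
    engines: Dict[str, Callable[[str], str]],
    samples: Iterable[Tuple[str, str]],
) -> Dict[str, float]:
    """Return the mean CER per engine over (image_path, ground_truth) samples."""
    samples = list(samples)
    if not samples:
        raise ValueError("at least one sample is required")
    return {
        name: sum(character_error_rate(read(path), truth)
                  for path, truth in samples) / len(samples)
        for name, read in engines.items()
    }
```

A call such as `compare_engines({"EasyOCR": read_with_easyocr, "Tesseract": read_with_tesseract}, samples)` would then return one mean CER per engine, where a lower value means fewer character-level mistakes. Passing the engines as plain callables keeps the scoring independent of any one library, so further engines can be added without changing the metric code.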
While there is a lot of work on OCR in Bangladesh, to our knowledge no comparative study of OCR engines had been carried out before this project, a gap we attribute to a limited understanding of the subject matter and one that motivated this work. To test the OCR engines, our team introduced a novel dataset titled Bangla-CrossHair. This research project was done in collaboration with InteX Research Lab.
We each currently serve in different roles. You can learn more about our team below.
The team was led by A. N. Chowdhury, who at the time of this work was collaborating as a Research Assistant with M. S. R. Kohinoor, the founder of InteX Research Lab. A. A. Sami contributed to every aspect of the process, while S. Absar, S. M. P. Mamun, and F. R. Biswas contributed to the dataset.
We strive for perfection and thrive in competition.
Name: Abdulla Nasir Chowdhury
Current Position: CEO of EARL Research
University: Leading University
Department: Electrical and Electronic Engineering
Affiliation: Research Assistant (InteX Research Lab)
Name: Aftar Ahmad Sami
Current Position: Software Engineer
Affiliation: SJ Innovation
University: Leading University
Department: Computer Science Engineering
Name: Shakib Absar
Current Position: Software Engineering Fellow
Affiliation: HeadStarter
University: Leading University
Department: Computer Science Engineering
Name: Fuad Rahman Biswas
Current Position: Student
University: Taylor's University
Department: Computer Science Engineering
Affiliation: Taylor's University
Name: Shah Masud Parvej Mamun
Current Position: Assistant Teacher
Affiliation: Bidyaniketon
University: Leading University
Department: Electrical and Electronic Engineering
Name: Md. Saidur Rahman Kohinoor
Current Position: Master's Student
University: King Fahd University of Petroleum and Minerals
Affiliation: InteX Research Lab
Department: Information and Computer Science
We are currently looking to grow our team. Contact us to collaborate and see if we have an open position for a research project!