Latest Issue
Empowering Education with Online Khmer Handwritten Text Recognition for Teaching and Learning Assistance
Published: August 30,2025Undergraduate Student Dropout Prediction with Class Balancing Techniques
Published: August 30,2025Status of Seawater Quality at Koh Rong Island, Sihanoukville, Cambodia
Published: August 30,2025Low-Complexity Detection of Primary Synchronization Signal for 5G New Radio Terrestrial Cellular System
Published: August 30,2025Word Spotting on Khmer Printed Documents
Published: August 30,2025Tuning Hyperparameters Learning Rate and Gamma in Gym Environment Inverted Pendulum
Published: August 30,2025Examining Passenger Loyalty in Phnom Penh Public Bus System: A Structural Equation Modelling Approach
Published: August 30,2025Prediction on Load model for future load profile of Electric Vehicle charging demand in Phnom Penh
Published: August 30,2025Economic Study on Integrating PV-DG with Grid-Tie: Case Study in Cambodia
Published: August 30,2025Word Spotting on Khmer Printed Documents
-
1. Department of Information and Communication Engineering,, Institute of Technology of Cambodia, Russian Federation Blvd., P.O. Box 86, Phnom Penh, Cambodia
Received: August 08,2024 / Revised: September 10,2024 / / Accepted: September 17,2024 / Available online: August 30,2025
Word spotting in Khmer printed documents presents a unique challenge due to the complexities of the Khmer script and the vast array of font styles employed. The scarcity of large, publicly available datasets further complicates this task. This work proposes a two-module approach for achieving accurate and efficient word spotting in Khmer documents. Separate datasets are utilized for text detection and recognition. The first module employs the state-of-the-art YOLOv8 model on a dataset of 10,050 text samples. The model's performance is evaluated using the F1 score, a metric that balances precision and recall in locating text. The second module leverages the fine-tuned Transformer-based TrOCR model for recognition, trained on 22,567 labeled words, with recognition accuracy measured by the Character Error Rate (CER). The first module achieves an impressive F1 score of 0.987 in locating Khmer words within documents. The second module's TrOCR model results in a CER of 8.41%. By overcoming script and font challenges through focused datasets and advanced models, this approach demonstrates potential for improving document processing and information retrieval for the Khmer language.