32nd IEEE Conference on Signal Processing and Communications Applications, SIU 2024, Mersin, Türkiye, 15 - 18 Mayıs 2024, (Tam Metin Bildiri)
Text is considered one of the most powerful sources for high-level information extraction. Scene text recognition, a hot field of study in computer vision, is based on first detecting text regions and then analyzing the text in these regions. We target multiple choice questions and question paper analysis, which are the most frequently used methods for measurement and evaluation at all levels in our country, as the target of our research. Three main goals were achieved while conducting experiments on the dataset created with different people and marking styles. These objectives realized on the question paper are respectively; Determining the question with YOLOv8, determining the option with YOLOv8, and determining the question number with the Permutated Autoregressive Sequence Model. The model accuracy obtained from real data shows the feasibility of the study as a result of the measurements.