Using a Modified U-Net: A Research Project at IUTT Enhances the Future of Arabic Document Layout Analysis

iutt-arabic-document-layout-analysis-ai-project

As part of the ongoing graduation project defenses for the AI and Data Science program (2025–2026), IUTT presented a distinguished research project titled: Arabic Document Layout Analysis Across Hierarchical Levels: Paragraphs, Lines, and Words using a Modified U-Net.

The project aims to develop an intelligent system for segmenting Arabic documents across three hierarchical levels (paragraphs, lines, and words) to enhance the efficiency of Optical Character Recognition (OCR) systems. The team built a modified U-Net model, integrating a hybrid loss function (Dice Loss & Binary Cross-Entropy) to address challenges like overlapping text and character connectivity in Arabic script.

The model achieved strong IoU results of 0.896 for lines and 0.900 for words, outperforming various previous works. The project also provided an original scientific contribution by manually annotating a new word dataset containing 7,881 images, paving the way for more robust digital document processing solutions.

Project Members: Hisham Al-Dhabhani, Al-Qassam Al-Saidi, Ali Al-Shahari, Anas Al-Aghbari, Nawar Al-Azazi.
Supervision: Dr. Amin Shayae, Mr. Mohammed Al-Qumasi.

Internal Defense Committee: Dr. Hamzah Jamel, Dr. Ayman Al-Sabri, Prof. Dr. Fadhl Ba-Alawi.
External Defense Committee: Prof. Dr. Ahmed Sultan Al-Hajami, Assoc. Prof. Dr. Malik Al-Jabri.

671273027_18076107410638811_1732070644033949108_n (1)
671192975_18076107470638811_1507906469032899061_n
672985732_18076107419638811_5972365759465670238_n
671191107_18076107431638811_7045770287054267616_n
672398372_18076107479638811_8155933507392677660_n
670798384_18076107533638811_3196081621129638681_n
672357377_18076107491638811_5111149981302734859_n
672349333_18076107512638811_2895501597179672765_n
670883334_18076107566638811_1760238679605844560_n
671166391_18076107500638811_3555746472850467410_n