Text Annotations for more than 300K lines in Arabic for OCR

Overview

The Egyptian Ministry of Communications and Information Technology (MCIT) is the government body responsible for information and communications technology (ICT) issues in the Arab Republic of Egypt. Established in 1999, MCIT is responsible for the planning, implementation and operation of government ICT plans and strategies.

MCIT endeavors to build “Digital Egypt” and to forge an Egyptian digital society that adopts and integrates technology in almost every aspect of life. To that end, MCIT seeks to promote the development of ICT infrastructure and improve digital services in government agencies.

In November 2019, the Egyptian government formed the National Council for Artificial Intelligence as a partnership between governmental institutions, prominent academics and practitioners from leading businesses in the field of AI.

MCIT needed to annotate more than 300,000 text lines for an OCR project. It was our pleasure to support our government's “Digital Transformation” and “Artificial Intelligence Projects” for a new age of Egypt.

 

Client:

The Egyptian Ministry of Communications and Information Technology

Industry:

Information Technology

Services:

Data Annotation

 

Challenge

How can more than 300,000 lines of Arabic text in images of old Egyptian newspapers be annotated under quality control?

 

Solution

Using our in-house team and our data annotation platform, we mobilized more than 30 people to work on the project and annotated all lines within the estimated time.

 

Results

The required number of lines was annotated within the specified period, with quality control reaching 100%.