Identifying Subscripts and Superscripts in Mathematical Documents

Walaa Aly; Seiichi Uchida; Masakazu Suzuki; Saleh Aly

doi:10.1007/s11786-008-0051-9

Journal article

Peer reviewed

Mathematics in Computer Science, Vol.2(2), pp.195-209

01/12/2008

DOI: https://doi.org/10.1007/s11786-008-0051-9

Subjects: Mathematics; Mathematics, Applied; Physical Sciences; Science & Technology

Abstract

In mathematical OCR, it is necessary to analyze two-dimensional structures of the component characters and symbols in mathematical expressions printed in scientific documents. In this paper, we analyze the positional relationships between adjacent characters for the purpose of automatic discrimination between baseline characters, subscripts, and superscripts, which is one of the most important and delicate parts of structure analysis. It has been proven through a large-scale experiment that this discrimination can be carried out almost perfectly (approximately 99.89%) by using the relative size and position of adjacent characters.
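The idea of discriminating by relative size and position can be illustrated with a minimal sketch. This is not the authors' method: the Box type, the two features, and the thresholds size_max and shift_min are hypothetical choices made only for illustration, and the sketch ignores complications such as ascenders and descenders that a real system would have to normalize.

```python
from dataclasses import dataclass


@dataclass
class Box:
    """Vertical extent of a character's bounding box (hypothetical type).

    Image coordinates are assumed: y grows downward, so top < bottom.
    """
    top: float
    bottom: float

    @property
    def height(self) -> float:
        return self.bottom - self.top

    @property
    def center(self) -> float:
        return (self.top + self.bottom) / 2


def relative_features(parent: Box, child: Box) -> tuple[float, float]:
    """Return (height ratio, normalized vertical shift) of child relative to parent."""
    h = child.height / parent.height
    # Positive d means the child's center lies above the parent's center.
    d = (parent.center - child.center) / parent.height
    return h, d


def classify(parent: Box, child: Box,
             size_max: float = 0.8, shift_min: float = 0.2) -> str:
    """Label the child as 'baseline', 'subscript', or 'superscript'.

    size_max and shift_min are illustrative thresholds, not values from the paper.
    """
    h, d = relative_features(parent, child)
    if h <= size_max and d >= shift_min:
        return "superscript"
    if h <= size_max and d <= -shift_min:
        return "subscript"
    return "baseline"


parent = Box(top=0, bottom=20)
raised = Box(top=-4, bottom=6)     # smaller and shifted upward
lowered = Box(top=14, bottom=24)   # smaller and shifted downward
same_line = Box(top=0, bottom=20)  # same size, same vertical position
labels = [classify(parent, c) for c in (raised, lowered, same_line)]
```

In practice the decision boundary would be learned from labeled character pairs rather than fixed by hand, which is where a large-scale experiment like the one described above comes in.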

Details

Title: Identifying Subscripts and Superscripts in Mathematical Documents
Creators: Walaa Aly (Kyushu University); Seiichi Uchida (Kyushu University); Masakazu Suzuki (Kyushu University); Saleh Aly (Majmaah University)
Publication Details: Mathematics in Computer Science, Vol.2(2), pp.195-209
Publisher: Springer Nature
Number of pages: 15
Identifiers: 9918471708331
Academic Unit: Majmaah University
Language: English
Resource Type: Journal article