Busca avançada
Ano de início
Entree


A Machine Learning approach for Graph-based Page Segmentation

Texto completo
Autor(es):
Maia, Ana L. L. M. ; Julca-Aguilar, Frank D. ; Hirata, Nina S. T. ; IEEE
Número total de Autores: 4
Tipo de documento: Artigo Científico
Fonte: PROCEEDINGS 2018 31ST SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI); v. N/A, p. 8-pg., 2018-01-01.
Resumo

We propose a new approach for segmenting a document image into its page components (e.g. text, graphics and tables). Our approach consists of two main steps. In the first step, a set of scores corresponding to the output of a convolutional neural network, one for each of the possible page component categories, is assigned to each connected component in the document. The labeled connected components define a fuzzy over-segmentation of the page. In the second step, spatially close connected components that are likely to belong to a same page component are grouped together. This is done by building an attributed region adjacency graph of the connected components and modeling the problem as an edge removal problem. Edges are then kept or removed based on a pre-trained classifier. The resulting groups, defined by the connected subgraphs, correspond to the detected page components. We evaluate our method on the ICDAR2009 dataset. Results show that our method effectively segments pages, being able to detect the nine types of page components. Furthermore, as our approach is based on simple machine learning models and graph-based techniques, it should be easily adapted to the segmentation of a variety of document types. (AU)

Processo FAPESP: 15/17741-9 - Combinação de características locais e globais em aprendizagem de operadores de imagens
Beneficiário:Nina Sumiko Tomita Hirata
Modalidade de apoio: Auxílio à Pesquisa - Regular