Advanced search
Start date
Betweenand


Generating Knowledge Networks from Phenotypic Descriptions

Full text
Author(s):
Pantoja, Fagner Leal ; Cavoto, Patricia ; dos Reis, Julio Cesar ; Santanche, Andre ; IEEE
Total Authors: 5
Document type: Journal article
Source: PROCEEDINGS OF THE 2016 IEEE 12TH INTERNATIONAL CONFERENCE ON E-SCIENCE (E-SCIENCE); v. N/A, p. 10-pg., 2016-01-01.
Abstract

Several computing systems rely on information about living beings, such as Identification Keys - artifacts created by biologists to identify specimens following a flow of questions about their observable characters (phenotype). These questions are described in a free-text format, e.g., "big and black eye". Free-texts hamper the automatic information interpretation by machines, limiting their ability to perform search and comparison of terms, as well as integration tasks. This paper proposes a method to extract phenotypic information from natural language texts from biology legacy information systems, transforming them in an Entity-Quality formalism - a format to represent each phenotype character (Entity) and its state (Quality). Our approach aligns automatically recognized Entities and Qualities with domain concepts described in ontologies. It adopts existing Natural Language Processing techniques, adding an extra original step, which exploits intrinsic characteristics of phenotypic descriptions and of the organizational structure of Identification Keys. The approach was validated over the FishBase data. We conducted extensive experiments based on a manually annotated Gold Standard set to assess the precision and applicability of the proposed extraction method. The obtained results reveal the feasibility of our technique, its benefits and possibilities of scientific studies using the extracted knowledge network. (AU)

FAPESP's process: 14/14890-0 - IMEanT: methods for dealing with meanings and intentions on interactive collaborative systems
Grantee:Julio Cesar dos Reis
Support Opportunities: Scholarships in Brazil - Post-Doctoral
FAPESP's process: 13/08293-7 - CCES - Center for Computational Engineering and Sciences
Grantee:Munir Salomao Skaf
Support Opportunities: Research Grants - Research, Innovation and Dissemination Centers - RIDC