Journal of Proteomics & Bioinformatics

Journal of Proteomics & Bioinformatics
Open Access

ISSN: 0974-276X

ETC: A toolkit for converting phenotype descriptions into computable data


International Conference on Computational Biology and Bioinformatics

September 05-06, 2018 Tokyo, Japan

Hong Cui, Thomas Rodenhausen, Bertram Ludascher, James Macklin and Nico Franz

University of Arizona, USA
University of Illinois at Urbana-Champaign, USA
Agriculture and Agri-Food, Canada
Arizona State University, USA

Posters & Accepted Abstracts: J Proteomics Bioinform

Abstract :

The explorer of taxon concepts project has produced a web application that consists a set of five tools unlocking phenotype data from text narratives often found as taxonomic descriptions or character descriptions. Aside from the tools described below, the site supports division of labor by allowing users share their tasks. Text capture tool parses textual taxonomic descriptions and marks up anatomical entities and characters. The tool is powered by CharaParser and MicroPIE (for microbial taxonomic descriptions). Ontology building tool enables experts without ontology knowledge to organize a set of phenotypic terms (e.g. those discovered by using is a part of or synonym relationships. Resulting ontology can be used in the other tools to improve data quality. Matrix generation tool takes the output and assembles a raw taxon-character matrix for the user to edit and refine, with or without ontology. Key generation tool employs a novel algorithm that directly takes a taxon-character matrix with polymorphic characters as input and computes the information entropy scores for each character. The result is a multiaccess key that order the characters based on their discrimination power within the pool of the taxa. Taxonomy comparison tool supports the task of taxon concept analysis by supporting both expert asserted RCC-5 (congruent, narrower/broader than, disjoint and overlap) relationships among taxa, as well as providing character data related to the taxa in question to aid expert decisions. These five tools can be used to construct several pipelines that generate a taxon-by-character matrix, create a multiaccess identification tool or facilitate taxon concept comparisons. A related software application, matrix converter is public available for users to convert a raw matrix to a scored matrix for phylogenetic analysis.

Biography :

E-mail: hongcui@email.arizona.edu

 

Top