ISSN: 2169-0138
Barry Robson
Accepted Abstracts: Drug Design
In their previous publications, our researchers have described the data mining of gene and protein sequences, of 667,000 medical records from the State of Virginia, and of 6.7 million chemical records from the automatic reading of all US patents by IBM?s Blue Gene supercomputer, and also described the secure transmission and use of clinical laboratory data, clinical genomic data, and medical images. We are collaborating to develop a Universal Exchange Language to integrate and enhance the security and utility of such information. This facilitates secure storage and transmission, accommodates fine grained consent mechanisms to permit data mining, expresses statistical summary rules in the manner of probabilistic semantic statements, and allows applications to draw inference from large numbers of such rules. This was motivated by the President?s Council of Advisors on Science and Technology in their recent call for such a language in regard to healthcare. The approach is based on an extension of XML to semantic (including RDF and OWL type constructs) in format < logical expression A| operators:=RDF refs | logical expression B >. An important distinction from XML is these objects (and vectors into which they can be decomposed) also have algebraic meaning. They conform to the Dirac algebra and notation of quantum mechanics and particle physics, building on aspects of the Clifford-Dirac calculus. An inference network is a large algebraic expression based on them, an idea that reflects Dirac?s statement that the principles of quantum mechanics are extensible to all aspects of human thought that are objective and quantitative.