Advanced Techniques in Biology & Medicine

Advanced Techniques in Biology & Medicine
Open Access

ISSN: 2379-1764

Abstract

Data Set Analysis for the Calculation of the QSAR Models Predictive Efficiency Based on Activity Cliffs

Fatima Adilova and Alisher Ikramov

The activity cliff concept is of high relevance for medicinal chemistry. Herein, we explore a concept of “data set modelability”, i.e., a priori estimate of the feasibility to obtain externally predictive QSAR models for a data set of bioactive compounds. This concept has emerged from analyzing the effect of so-called “activity cliffs” on the overall performance of QSAR models. Some indexes of “modelability” (SALI, ISAC, and MODI) are known already. We extended the version of MODI to data sets of compounds with real activity values. The predictive efficiency of QSAR models is expressed as the correct classification rate by SVM algorithm, which compared with the results of the other two algorithms: algorithm MODI and Voronin’s algorithm modified by the authors. Comparative analysis of the results performed using Pearson’s correlation coefficient square. Our study showed an extreme lack of evaluation of predictive efficiency of data set only based on “activity cliffs”. In the development of more accurate methods that allow to evaluate the possibility of building of effective models on the data samples, it is necessary to take into account other properties of the sample, and not only the presence (and number) of “activity cliffs”.

Top