Background
Data deluges drug design disciplines, yet information and insights are inadequate.
Computational models for life science range from empirical data mining relationships among key variables to mathematical simulations of known physical processes. Ultimately, the value of computational models can be assessed simply: can they be used for actual prediction of relevant biological activity of new chemical entitities that exist only in a computer? One very simple, and surprisingly useful, formulation of the question is the idea of neighborhood behavior: if two samples are similar enough in terms of computational model values, are they highly likely to also be similar in actual biological activity?
Similarity studies are often essential as part of turning the data deluge into new knowledge. Which measured, or computed, variables are in fact important? Can we leap from one chemical series, or from one phenotypical expression pattern, to another with any confidence?
These abilities are crucial in chemical structure design, both de novo and for searching databases of structures with measured activity. They are also crucial for deconvolution of gene and protein expression patterns.
The projects undertaken at Vistamont Consultancy focus on defining the crucial elements of similarity, including:
computational molecular descriptors such as shape, pharmacophores, hydrophobicity
computational biological descriptors