dc.date.accessioned |
2019-01-22T20:50:35Z |
|
dc.date.available |
2019-01-22T20:50:35Z |
|
dc.date.issued |
2018 |
|
dc.identifier.issn |
1860-7179 |
|
dc.identifier.uri |
https://dspace.kpfu.ru/xmlui/handle/net/149061 |
|
dc.description.abstract |
© 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim This is, to our knowledge, the most comprehensive analysis to date based on generative topographic mapping (GTM) of fragment-like chemical space (40 million molecules with no more than 17 heavy atoms, both from the theoretically enumerated GDB-17 and real-world PubChem/ChEMBL databases). The challenge was to prove that a robust map of fragment-like chemical space can actually be built, in spite of a limited (≪105) maximal number of compounds (“frame set”) usable for fitting the GTM manifold. An evolutionary map building strategy has been updated with a “coverage check” step, which discards manifolds failing to accommodate compounds outside the frame set. The evolved map has a good propensity to separate actives from inactives for more than 20 external structure–activity sets. It was proven to properly accommodate the entire collection of 40 m compounds. Next, it served as a library comparison tool to highlight biases of real-world molecules (PubChem and ChEMBL) versus the universe of all possible species represented by FDB-17, a fragment-like subset of GDB-17 containing 10 million molecules. Specific patterns, proper to some libraries and absent from others (diversity holes), were highlighted. |
|
dc.relation.ispartofseries |
ChemMedChem |
|
dc.subject |
computer chemistry |
|
dc.subject |
generative topographic mapping |
|
dc.subject |
library comparison |
|
dc.subject |
molecular diversity |
|
dc.subject |
structure analysis |
|
dc.title |
Mapping of the Available Chemical Space versus the Chemical Universe of Lead-Like Compounds |
|
dc.type |
Article |
|
dc.relation.ispartofseries-issue |
6 |
|
dc.relation.ispartofseries-volume |
13 |
|
dc.collection |
Публикации сотрудников КФУ |
|
dc.relation.startpage |
540 |
|
dc.source.id |
SCOPUS18607179-2018-13-6-SID85041137519 |
|