{Reference Type}: Journal Article {Title}: Deriving general structure-activity/selectivity relationship patterns for different subfamilies of cyclin-dependent kinase inhibitors using machine learning methods. {Author}: Kaveh S;Mani-Varnosfaderani A;Neiband MS; {Journal}: Sci Rep {Volume}: 14 {Issue}: 1 {Year}: 2024 07 3 {Factor}: 4.996 {DOI}: 10.1038/s41598-024-66173-z {Abstract}: Cyclin-dependent kinases (CDKs) play essential roles in regulating the cell cycle and are among the most critical targets for cancer therapy and drug discovery. The primary objective of this research is to derive general structure-activity relationship (SAR) patterns for modeling the selectivity and activity levels of CDK inhibitors using machine learning methods. To accomplish this, 8592 small molecules with different binding affinities to CDK1, CDK2, CDK4, CDK5, and CDK9 were collected from Binding DB, and a diverse set of descriptors was calculated for each molecule. The supervised Kohonen networks (SKN) and counter propagation artificial neural networks (CPANN) models were trained to predict the activity levels and therapeutic targets of the molecules. The validity of models was confirmed through tenfold cross-validation and external test sets. Using selected sets of molecular descriptors (e.g. hydrophilicity and total polar surface area) we derived activity and selectivity maps to elucidate local regions in chemical space for active and selective CDK inhibitors. The SKN models exhibited prediction accuracies ranging from 0.75 to 0.94 for the external test sets. The developed multivariate classifiers were used for ligand-based virtual screening of 2 million random molecules of the PubChem database, yielding areas under the receiver operating characteristic curves ranging from 0.72 to 1.00 for the SKN model. Considering the persistent challenge of achieving CDK selectivity, this research significantly contributes to addressing the issue and underscores the paramount importance of developing drugs with minimized side effects.