Home   >   CSC-OpenAccess Library   >    Manuscript Information
Comparing Genetic Evolutionary Algorithms on Three Enzymes of HIV-1: Integrase, Protease, and Reverse Transcriptome
Nafiseh Masroor, Jack Wang, Bita Pouyanfar, Yanyan Li, Ahmad Reza Hadaegh
Pages - 1 - 13     |    Revised - 30-04-2021     |    Published - 01-06-2021
Volume - 14   Issue - 1    |    Publication Date - June 2021  Table of Contents
Genetic Evolutionary Algorithms, HIV, Data Predictive Data Mining.
In this work, we utilized Quantitative Structure-Activity Relationship (QSAR) techniques to develop predictive models for inhibitors of the HIV-1 enzymes Integrase, HIV-Protease, and Reverse Transcriptase. Each predictive model was composed of quantitative drug characteristics that were selected by genetic evolutionary algorithms, such as Genetic Algorithm (GE), Differential Evolutionary Algorithm (DE), Binary Particle Swarm Optimization (BPSO), and Differential Evolution with Binary Particle Swarm Optimization (DE-BPSO). After characteristic selection, each model was tested with machine-learning algorithms such as Multiple Linear Regression (MLR), Support Vector Machine (SVM), and Multi-Layer Perceptron neural networks (MLP/ANN). We found that a combination of DE-BPSO combined with Multi-Layer Perceptron produced the most accurate predictive models as measured by R2, the statistical measure of proportion of variance in prediction values, and root-mean-square-error (RMSE) of prediction values compared to observed values. As for the models themselves: the best predictors for Integrase inhibitor included mass-weighted centred Broto-Moreau autocorrelation values, Moran autocorrelations, and eigenvalues of Burden matrices weighted by I-states; the best predictors for HIV-Protease inhibitors included the second Zagreb index value, the normalized spectral positive sum from Laplace matrix, and the connectivity-like index of order 0 from edge adjacency mat; and the best predictors for Reverse Transcriptase inhibitors included the number of hydrogen atoms, the molecular path count of order 7, the centred Broto-Moreau autocorrelation of lag 2 weighted by Sanderson electronegativity, the P_VSA-like on ionization potential, and the frequency of C – N bonds at topological distance 3.
1 Semantic Scholar 
2 refSeek 
3 BibSonomy 
4 J-Gate 
5 Scribd 
6 SlideShare 
Andrea Mauri, Viviana Consonni, Manuela Pavan, Roberto Todeschini, "DRAGON software: An easy approach to molecular descriptor calculations", vol. 56(2), pp:237-248, 2006.
Annemarie M Wensing, Vincent Calvez, Francesca Ceccherini-Silberstein, Charlotte Charpentier, Huldrych F Günthard, Roger Paredes, Robert W Shafer and Douglas D Richman, "2019 update of the drug resistance mutations in HIV-1", vol. 27(3) pp:111-121. 2019.
Biswa Ranjan Meher, Megha Vaishnavi, Venkata Satish Kumar Mattaparthi and Seema Patel, "Mutation Effects on 3D-Structural Reorganization Using HIV-1 Protease as a Case Study", In book: Encyclopedia of Bioinformatics and Computational Biology, 2018.
David Freedman, in book “Statistical Models: Theory and Practice”. Cambridge University Press. p. 26. Cambridge University Press, 2009.
E. Dukhanina, T. Lukyanova, A. Dukhanin, S. Georgieva, “The role of S100A4 protein in anticancer cytotoxicity: its presence is required on the surface of CD 4+ CD 25+ PGRPs + S100A4 + lymphocyte and undesirable on the surface of target cells” . vol. 17(4), pp: 1-20, 2017.
Falguni. Thakor, Ahmad. Hadaegh and Xiaoyu. Zhang. “Comparative study of Differential Evolutionary-Binary Particle Swarm Optimization (DE-BPSO) algorithm as a feature selection technique with different linear regression models for the analysis of HIV-1 Integrase Inhibition features of Aryl ß-Diketo Acids”. Proceedings of 9th International Conference on Bioinformatics and Computational Biology (BICOB 2017) Honolulu, Hawaii, USA, pp: 179-184, 2017.
Gene M. Ko, Srinivas Reddy, Sunil Kumar, Rajni Garg, Barbara A. Bailey and Ahmad R. Hadaegh, "Differential evolution-binary particle swarm optimization algorithm for the analysis of aryl ß-diketo acids for HIV-1 integrase inhibition", 2012 IEEE Congress on Evolutionary Computation, Brisbane, pp. 1-7, 2012.
Ian Kane, and Ahmad. Hadaegh. “Non-linear Quantitative Structure-Activity Relationship (QSAR) Models for the Prediction of HIV Drug Performance”. 24th International Conference on Software Engineering and Data Engineering (SEDE-2015), vol 1, pp: 63-68, 2015.
Jiali. Tang, Jack. Wang, and Ahmad .R. Hadaegh, “A Web Repository System for Data Mining in Drug Discovery”, 2020 International Journal of Data Mining & Knowledge Management Process (IJDKP) vol. 10, No. 1, 2020.
Jinfang Zhu. “T Helper Cell Differentiation Heterogeneity, and Plasticity”, vol. 10(10), 2018
Joint United Nations Program on HIV/AIDS(UNAIDS), UNAIDS Report on the Global AIDS Epidemic 2020, Geneva, 2020.
Julia. Mikulski, “The Ultimate Guide to AdaBoost, random forests and XGBoost. Towards Data Science”, 2020.
M.J. Pucci, Christian Callebaut, Andrea Cathcart and Karen Bush, "5.17 - Recent Epidemiological Changes in Infectious Diseases", in book "Comprehensive Medicinal Chemistry III", pp: 511-552, 2017.
Matineh. Kashani, Richard. Galvan, and Ahmad. Hadaegh, “Improving the Feature Selection for the Development of Linear Model for Discovery of HIV-1 Integrase Inhibitors”. ABDA'15 International Conference on Advances in Big Data Analytics. In Proceeding of the 2015 International Conferences on Advances on Big Data Analyses, pp: 150-154. Las Vegas, Nevada, 2015.
Michael K Gilson, Tiqing Liu, Michael Baitaluk, George Nicola, Linda Hwang and Jenny Chong, "BindingDB in 2015: A public database for medicinal chemistry, computational chemistry and systems pharmacology", Nucleic Acids Res, vol. 44(D1), pp: D1045-53, 2016.
Richard Galvan, Matineh Kashani, and Ahmad Hadaegh. “Improving Pharmacological Research of HIV-1 Integrase Inhibition Using Differential Evolution-Binary Particle Swarm Optimization and Non-Linear Adaptive Boosting Random Forest Regression”, 2015 IEEE International Workshop on Data Integration and Mining San Francisco, pp: 485-490. San Francisco, CA. 2015.
Stanford Encyclopedia of Philosophy. Darwin: From Origin of Species to Descent of Man. First published Mon Jun 17, 2019.
Wei-Shau Hu and Stephen H. Hughes, "HIV-1 reverse transcription", Cold Spring Harb Perspect Med, vol. 2(10), 2012.
Youcef Mehellou, and Erik De Clercq, "Twenty-Six Years of Anti-HIV Drug Discovery: Where Do We Stand and Where Do We Go?", vol. 53(2), pp: 521–538, 2010.
Miss Nafiseh Masroor
Computer Science and Information System, California State University San Marcos, San Marcos, 92096 - United States of America
Mr. Jack Wang
Computer Science and Information System, California State University San Marcos, San Marcos, 92096 - United States of America
Miss Bita Pouyanfar
Computer Science and Information System, California State University San Marcos, San Marcos, 92096 - United States of America
Dr. Yanyan Li
Computer Science and Information System, California State University San Marcos, San Marcos, 92096 - United States of America
Dr. Ahmad Reza Hadaegh
Computer Science and Information System, California State University San Marcos, San Marcos, 92096 - United States of America