Home   >   CSC-OpenAccess Library   >    Manuscript Information
Full Text Available

(94.89KB)
This is an Open Access publication published under CSC-OpenAccess Policy.
Development of Predictor for Sequence Derived Features From Amino Acid Sequence Using Associate Rule Mining
Manpreet Singh, Gurvinder Singh
Pages - 14 - 27     |    Revised - 31-03-2011     |    Published - 04-04-2011
Volume - 5   Issue - 1    |    Publication Date - March / April 2011  Table of Contents
MORE INFORMATION
KEYWORDS
Drug Discovery, Sequence Derived Features, Associative Rule Mining, Amino Acid
ABSTRACT
Drug Discovery process include target identification i.e. to identify a target protein whose inhibition can destroy the pathogen. In testing phase, clinical and pre-clinical trials are done on the animals and then on humans. After the discovery process, the drug or medicine is made available for public use. But if the testing of the drug is ineffective or unable to yield the appropriate results, then the whole process need to be repeated. This makes the first stage of drug discovery the most important than the other stages. The present work will assist in the process of drug discovery. The present work involves the development of a model that extracts the sequence derived features from the given amino acid sequence using associative rule mining. Associative rule mining is a data mining technique useful to identify related items and to develop rules. In the present work, various parameters of the amino acid sequence are studied that affect the sequence-derived features and some of the equations and algorithms are implemented. Input is given through text file and collective results are obtained. MATLAB environment is used for the implementation. The results are compared with the previous bioinformatics tools. The model developed assists in protein class prediction process which assists drug discoverers in the drug discovery process.
CITED BY (0)  
1 Google Scholar
2 Academic Journals Database
3 CiteSeerX
4 refSeek
5 iSEEK
6 Bielefeld Academic Search Engine (BASE)
7 Scribd
8 SlideShare
9 PdfSR
1 A. Al-Shahib, R. Breitling, and D. R. Gilbert “Predicting protein function by machine learning on amino acid sequences – a critical evaluation” BMC Genomics, 8:1-10, 2007
2
3 A. Jaiswal, A. Chhabra, U. Malhotra, S. Kohli, V. Rani “Comparative analysis of human matrix metalloproteinases: Emerging therapeutic targets in diseases” Bioinformation 6(1): 23-30, 2011
4 D. Krane and M. Raymer. “Fundamental Concepts of Bioinformatics”, Pearson Education, New Delhi, pp.1-314 (2006)
5 J. Han and M. Kamber. “Data Mining: Concepts and Techniques”, Morgan Kaufmann Publishers, pp. 226-229 (2004)
6 L. Jensen. “Prediction of Protein Function from Sequence Derived Protein Features”, Ph.D. thesis, Technical University of Denmark, 2002
7 L. Jensen, M. Skovgaard and S. Brunak. “Prediction of Novel Archaeal Enzymes from Sequence Derived Features”, Protein Science, 11: 2894-2898, 2002
8 L.J. Jensen, R. Gupta, N. Blom, D. Devos, J. Tamames, C. Kesmir, H. Nielsen, H.H. Starfeldt, K. Rapacki, C. Workman, C.A.F. Andersen, S. Knudsen, A. Krogh, A. Valencia and S. Brunak “Prediction of Human Protein Function from Post-Translational Modifications and Localization Features” Journal of Molecular Biology, 319(5): 1257-1265, 2002
9 M. Kanakubo and M. Hagiwara. “Speed up technique for Associative rule mining based on an Artificial Algorithm”, GRC book on granular computing, 38(12):318-323, 2007
10 M. Ouali, R.D. King “Cascaded multiple classifiers for secondary structure prediction” Prot Sci., 9:1162–1176, 2000
11 M. Singh, P. Singh and P.K, Wadhwa “Human Protein Function Prediction using Decision Tree Induction” International Journal of Computer Science and Network Security, USA, 7(4):92-98, 2007
12 M. Singh, Wadhwa P.K., Surinder Kaur “Predicting Protein Function using Decision Tree” World Academy of Science, Engineering and Technology, 39:350-353, 2008
13 R. Agrawal, T. Imielinski and A. Swami. “Mining Association Rules Between Sets of Items in Large Databases", SIGMOD ACM Conference, 22(2):207-216, 1993
14 R. Gupta, A. Mittal, and K. Singh. “Time series based feature extraction approach for prediction of protein structural class”, EURASIP Journal, 8(1): 1-7, 2008
15 R. Linding, L. J. Jensen, F. Diella, P. Bork, T.J. Gibson, R.B. Russell “Protein disorder prediction: implications for structural proteomics” Structure, 11:1453-1459, 2003
16 Veenu Mangat "Swarm Intelligence Based Technique for Rule Mining in the Medical Domain" International Journal of Computer Applications, 4(1):19-24, July 2010
17 Z.R. Li, H.H. Lin, L.Y. Han, L. Jiang, X. Chen, Y.Z. Chen. “PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence” Nucleic Acids Res, 34:W32-W37, 2008
18 http://www.cbs.dtu.dk/services/NetNGlyc/
19 http://www.cbs.dtu.dk/services/NetOGlyc/
20 http://www.cbs.dtu.dk/services/SignalP/
21 http://www.cbs.dtu.dk/services/TMHMM/
22 http://expasy.org/
23 http://psort.hgc.jp/
Mr. Manpreet Singh
Guru Nanak Dev Engineering College - India
mpreet78@yahoo.com
Dr. Gurvinder Singh
Guru Nanak Dev University, Amritsar - India