Home   >   CSC-OpenAccess Library   >    Manuscript Information
A Vertical Search Engine - Based on Domain Classifier
Rajashree Shettar, Rahul Bhuptani
Pages - 18 - 27     |    Revised - 15-8-2008     |    Published - 15-11-2008
Volume - 2   Issue - 4    |    Publication Date - August 2008  Table of Contents
MORE INFORMATION
KEYWORDS
domain classifier, inverted index, page rank, relevance, vertical search
ABSTRACT
The World Wide Web is growing exponentially and the dynamic, unstructured nature of the web makes it difficult to locate useful resources. Web Search engines such as Google and Alta Vista provide huge amount of information many of which might not be relevant to the users query. In this paper, we build a vertical search engine which takes a seed URL and classifies the URLs crawled as Medical or Finance domains. The filter component of the vertical search engine classifies the web pages downloaded by the crawler into appropriate domains. The web pages crawled is checked for relevance based on the domain chosen and indexed. External users query the database with keywords to search; The Domain classifiers classify the URLs into relevant domain and are presented in descending order according to the rank number. This paper focuses on two issues – page relevance to a particular domain and page contents for the search keywords to improve the quality of URLs to be listed thereby avoiding irrelevant or low-quality ones .
CITED BY (10)  
1 An, B., Qu, T., & Qi, H. (2015). Chinese MOOC Search Engine. In Intelligent Computation in Big Data Era (pp. 453-458). Springer Berlin Heidelberg.
2 Chung, S. H., Robertson, S., Minnaar, A., Cook, M., & Sun, L. (2014, January). Developing a Knowledge Management System Using an Ontological Approach in Global Organization. In ICISO (pp. 340-347).
3 Yuan, Y., Chen, D., Li, Y., Yu, D., Yan, L., & Zhu, Z. (2014). The improved Shark Search Approach for Crawling Large-scale Web Data. International Journal of Multimedia & Ubiquitous Engineering, 9(8).
4 Lalnunsanga, M. (2012). An Introduction to a Meta-meta-search Engine.
5 D. Minnie and S. Srinivasan, “Multi-Domain Meta Search Engine with an Intelligent Interface for Efficient Information Retrieval on the Web”, Trends in Computer Science, Engineering and Information Technology Communications in Computer and Information Science, 204(1), pp. 121-129, 2011.
6 D. Minnie and S. Srinivasan, “Meta Search Engines for Information Retrieval on Multiple Domains”, International Journal of Technology and Engineering System, 2(2), pp. 115-118, 2011.
7 M. Zhao, P. Zhu, T. He, “An Intelligent Topic Web Crawler Based on DTB”, in Proceedings, Web Information Systems and Mining (WISM), International Conference, Sanya, 23-24 Oct. 2010, pp. 84-86.
8 Zhao, M. S., Zhu, P., & He, T. C. (2010, October). An Intelligent Topic Web Crawler Based on DTB. In Web Information Systems and Mining (WISM), 2010 International Conference on (Vol. 1, pp. 84-86). IEEE.
9 P. L. E. Ekmobo , M. Oumsis and M. Meknassi, “Motion Tracking in MRI by Harmonic State Model: Case of Heart Left Ventricle”, International Journal of Computer Science and Security (IJCSS), 3(5), pp. 428 – 447, 2009.
10 EKOMBO, P. L. E., Oumsis, M., & Meknassi, M. (2009). Motion tracking in MRI by Harmonic State Model: Case of heart left ventricle. International Journal of Computer Science and Security (IJCSS), 3(5), 428.
1 Google Scholar 
2 ScientificCommons 
3 Academic Index 
4 CiteSeerX 
5 refSeek 
6 iSEEK 
7 Socol@r  
8 Libsearch 
9 Bielefeld Academic Search Engine (BASE) 
10 Scribd 
11 WorldCat 
12 SlideShare 
13 PDFCAST 
14 PdfSR 
15 Free-Books-Online 
“Web Page Scoring Systems for Horizontal and Vertical Search”, Michelangelo Diligenti , Marco Gori,Marco Maggini , Siena, Italy.
Baujard, O., Baujard, V., Aurel, S., Boyer, C., and Appel, R.D. “Trends in Medical Information Retrieval on the Internet”, Computers in Biology and Medicine, 28,1998.
Castillo, C. (2004). “Effective Web Crawling”, PhD thesis, University of Chile.
George Almpanidis, Constantine Kotropoulos, and Ioannis Pitas. Aristotle University of Thessaloniki, Department of Informatics. “Focused Crawling Using Latent Semantic Indexing” – An Application for Vertical Search Engines.
Google Search Technology. Online at http://www.google.com/technology/index.html.
Manber, U., Smith, M., and Gopal, B. “WebGlimpse: Combining Browsing and searching”, in Proceedings of the USENIX 1997 Annual Technical Conference.
Michael Chau and Hsinchun Chen, “Comparison of Three Vertical Search Spiders”, Journal of Computer ,Vol. 36, No. 5, 2003, ISSN 0018-9162, pp. 56-62, publisher IEEE Computer Society.
Monica Peshave, “How search engine works and a Web Crawler Application”, Dept of Computer science,University of Illinios at Springfield, Spingfield,IL 62703.
Ng Zhug Whai, “A new city university search engine”, Department of information technology.
Pascal Soucy, Guy W. Mineau, Beyond TFIDF weigting for Text Categorization in the Vector Space model, 2005.
R. Steele, “Techniques for Specialized Search Engines”, in Proc. Internet Computing, Las Vegas, 2001.
Ye Wang, Zhihua Geng, Sheng Huang, Xiaoling Wang, Aoying Zhou, “Academic Web Search Engine – generating a Survey Automatically”, Department of ComputerScience, Fudan university, China.
Mr. Rajashree Shettar
- India
rajshri.shettar@gmail.com
Mr. Rahul Bhuptani
- India


CREATE AUTHOR ACCOUNT
 
LAUNCH YOUR SPECIAL ISSUE
View all special issues >>
 
PUBLICATION VIDEOS