Home   >   CSC-OpenAccess Library   >    Manuscript Information
Full Text Available

This is an Open Access publication published under CSC-OpenAccess Policy.
A Vertical Search Engine - Based on Domain Classifier
Rajashree Shettar, Rahul Bhuptani
Pages - 18 - 27     |    Revised - 15-8-2008     |    Published - 15-11-2008
Volume - 2   Issue - 4    |    Publication Date - August 2008  Table of Contents
domain classifier, inverted index, page rank, relevance, vertical search
The World Wide Web is growing exponentially and the dynamic, unstructured nature of the web makes it difficult to locate useful resources. Web Search engines such as Google and Alta Vista provide huge amount of information many of which might not be relevant to the users query. In this paper, we build a vertical search engine which takes a seed URL and classifies the URLs crawled as Medical or Finance domains. The filter component of the vertical search engine classifies the web pages downloaded by the crawler into appropriate domains. The web pages crawled is checked for relevance based on the domain chosen and indexed. External users query the database with keywords to search; The Domain classifiers classify the URLs into relevant domain and are presented in descending order according to the rank number. This paper focuses on two issues – page relevance to a particular domain and page contents for the search keywords to improve the quality of URLs to be listed thereby avoiding irrelevant or low-quality ones .
CITED BY (10)  
1 An, B., Qu, T., & Qi, H. (2015). Chinese MOOC Search Engine. In Intelligent Computation in Big Data Era (pp. 453-458). Springer Berlin Heidelberg.
2 Chung, S. H., Robertson, S., Minnaar, A., Cook, M., & Sun, L. (2014, January). Developing a Knowledge Management System Using an Ontological Approach in Global Organization. In ICISO (pp. 340-347).
3 Yuan, Y., Chen, D., Li, Y., Yu, D., Yan, L., & Zhu, Z. (2014). The improved Shark Search Approach for Crawling Large-scale Web Data. International Journal of Multimedia & Ubiquitous Engineering, 9(8).
4 Lalnunsanga, M. (2012). An Introduction to a Meta-meta-search Engine.
5 D. Minnie and S. Srinivasan, Multi-Domain Meta Search Engine with an Intelligent Interface for Efficient Information Retrieval on the Web, Trends in Computer Science, Engineering and Information Technology Communications in Computer and Information Science, 204(1), pp. 121-129, 2011.
6 D. Minnie and S. Srinivasan, Meta Search Engines for Information Retrieval on Multiple Domains, International Journal of Technology and Engineering System, 2(2), pp. 115-118, 2011.
7 M. Zhao, P. Zhu, T. He, An Intelligent Topic Web Crawler Based on DTB, in Proceedings, Web Information Systems and Mining (WISM), International Conference, Sanya, 23-24 Oct. 2010, pp. 84-86.
8 Zhao, M. S., Zhu, P., & He, T. C. (2010, October). An Intelligent Topic Web Crawler Based on DTB. In Web Information Systems and Mining (WISM), 2010 International Conference on (Vol. 1, pp. 84-86). IEEE.
9 P. L. E. Ekmobo , M. Oumsis and M. Meknassi, Motion Tracking in MRI by Harmonic State Model: Case of Heart Left Ventricle, International Journal of Computer Science and Security (IJCSS), 3(5), pp. 428 447, 2009.
10 EKOMBO, P. L. E., Oumsis, M., & Meknassi, M. (2009). Motion tracking in MRI by Harmonic State Model: Case of heart left ventricle. International Journal of Computer Science and Security (IJCSS), 3(5), 428.
1 Google Scholar
2 ScientificCommons
3 Academic Index
4 CiteSeerX
5 refSeek
7 Socol@r
8 Libsearch
9 Bielefeld Academic Search Engine (BASE)
10 Scribd
11 WorldCat
12 SlideShare
14 PdfSR
15 Free-Books-Online
1 George Almpanidis, Constantine Kotropoulos, and Ioannis Pitas. Aristotle University of Thessaloniki, Department of Informatics. Focused Crawling Using Latent Semantic Indexing An Application for Vertical Search Engines.
2 Google Search Technology. Online at http://www.google.com/technology/index.html.
3 R. Steele, Techniques for Specialized Search Engines, in Proc. Internet Computing, Las Vegas, 2001.
4 Ng Zhug Whai, A new city university search engine, Department of information technology.
5 Pascal Soucy, Guy W. Mineau, Beyond TFIDF weigting for Text Categorization in the Vector Space model, 2005.
6 Manber, U., Smith, M., and Gopal, B. WebGlimpse: Combining Browsing and searching, in Proceedings of the USENIX 1997 Annual Technical Conference.
7 Castillo, C. (2004). Effective Web Crawling, PhD thesis, University of Chile.
8 Monica Peshave, How search engine works and a Web Crawler Application, Dept of Computer science,University of Illinios at Springfield, Spingfield,IL 62703.
9 Michael Chau and Hsinchun Chen, Comparison of Three Vertical Search Spiders, Journal of Computer ,Vol. 36, No. 5, 2003, ISSN 0018-9162, pp. 56-62, publisher IEEE Computer Society.
10 Baujard, O., Baujard, V., Aurel, S., Boyer, C., and Appel, R.D. Trends in Medical Information Retrieval on the Internet, Computers in Biology and Medicine, 28,1998.
11 Web Page Scoring Systems for Horizontal and Vertical Search, Michelangelo Diligenti , Marco Gori,Marco Maggini , Siena, Italy.
12 Ye Wang, Zhihua Geng, Sheng Huang, Xiaoling Wang, Aoying Zhou, Academic Web Search Engine generating a Survey Automatically, Department of ComputerScience, Fudan university, China.
Mr. Rajashree Shettar
- India
Mr. Rahul Bhuptani
- India