搜索引擎检索机制的研究(Java,Servlet)(包含选题审批表,任务书,开题报告,中期检查报告,毕业论文18000字,答辩ppt,程序代码)
摘 要:本文从搜索引擎的应用出发,探讨了网络蜘蛛在搜索引擎中的作用和地住,提出了网络蜘蛛的功能和设计要求。在对网络蜘蛛系统结构和工作原理所作分析的基 础上,研究了页面爬取、解析等策略和算法,并使用Java实现了一个网络蜘蛛的程序,对其运行结果做了分析。开发平台为JBuilder,用 servlet实现了简单的爬虫搜索功能。通过迷你爬虫搜索引擎,进一步学习和了解有关搜索引擎检索机制方面的问题,通过设计小程序,来达到进一步了解各 种搜索算法的特点的目的。通过这次设计,进一步熟练了包括数据结构,JAVA语言,程序设计,以及软件工程方面的知识。达到学以致用的效果,从理论到实 践,再用实践来检验理论。
关键字:爬虫;搜索引擎;JAVA;Servlet
Network Reptile's Search Engine
Abstract: The paper,discussing from the application of the search engine,searches the importance and function of Web spider in the search engine.and puts forward its demand of function and design.On the base of analyzing Web Spider’s system strtucture and working elements.this paper also researches the method and strategy of multithreading scheduler,Web page crawling and HTML parsing.And then.a program of web page crawling based on Java is applied and analyzed.Develops the platform is JBuilder, has realized the simple reptile search function with servlet. Through the miniature reptile search engine, further studies with the understanding related search engine retrieval machine-made aspect question, through the design script, achieves further understands each kind of searching algorithm the characteristic goal. Through this design, further skilled including construction of data, JAVA language, programming, as well as software engineering aspect knowledge. Achieves effect which studies for the purpose of application, from the theory to the practice, uses the practice to examine the theory again..
Keyword: spider;search engine;JAVA;Servlet
|