标题：Automate discovery of deep web interfaces
作者：Du, Xin ;Zheng, Yongqing ;Yan, Zhongmin
作者机构：[Du, Xin ;Zheng, Yongqing ;Yan, Zhongmin ] School of Computer Science and Technology, Shandong University, Jinan, China
会议名称：2nd International Conference on Information Science and Engineering, ICISE2010
会议日期：4 December 2010 through 6 December 2010
来源：2nd International Conference on Information Science and Engineering, ICISE2010 - Proceedings
关键词：Deep web; Interface extraction; Tag trees
摘要：With the rapid increase of web sources, more and more deep web databases become available. The information in these databases can only be accessed by submitting queries to back-end databases. However, the traditional search engine interfaces resemble extremely deep web interfaces. Therefore, it is difficult to distinguish them and to find deep web interfaces. This paper proposes a novel method of discovering deep web interfaces. We introduce a page division method to divide pages into separate parts. After that we remove the parts which don't contain search interfaces. At last we construct topic-specific queries to obtain results and distinguish deep web interfaces by analyzing the results. Experiment result shows that this method is effective and stable. © 2010 IEEE.