标题:Automate discovery of deep web interfaces
作者:Du, Xin ;Zheng, Yongqing ;Yan, Zhongmin
通讯作者:Zheng, Y
作者机构:[Du, Xin ;Zheng, Yongqing ;Yan, Zhongmin ] School of Computer Science and Technology, Shandong University, Jinan, China
会议名称:2nd International Conference on Information Science and Engineering, ICISE2010
会议日期:4 December 2010 through 6 December 2010
来源:2nd International Conference on Information Science and Engineering, ICISE2010 - Proceedings
出版年:2010
页码:3572-3575
DOI:10.1109/ICISE.2010.5691802
关键词:Deep web; Interface extraction; Tag trees
摘要:With the rapid increase of web sources, more and more deep web databases become available. The information in these databases can only be accessed by submitting queries to back-end databases. However, the traditional search engine interfaces resemble extremely deep web interfaces. Therefore, it is difficult to distinguish them and to find deep web interfaces. This paper proposes a novel method of discovering deep web interfaces. We introduce a page division method to divide pages into separate parts. After that we remove the parts which don't contain search interfaces. At last we construct topic-specific queries to obtain results and distinguish deep web interfaces by analyzing the results. Experiment result shows that this method is effective and stable. © 2010 IEEE.
收录类别:EI;SCOPUS
Scopus被引频次:2
资源类型:会议论文;期刊论文
原文链接:https://www.scopus.com/inward/record.uri?eid=2-s2.0-79951973531&doi=10.1109%2fICISE.2010.5691802&partnerID=40&md5=83aaeaf224562127d9a765b35588bb7d
TOP