标题:Cluster Analysis Based on Contextual Features Extraction for Conversational Corpus
作者:Qi Chen;Yue Chen;Minghu Jiang
作者机构:1College of Computer Science and Technology, Shandong University, Shandong, China;2Department of Chinese Language and Literature
来源:Journal of Computer and Communications
出版年:2015
期:5
页码:33-37
DOI:10.4236/jcc.2015.35004
摘要: Cluster analysis related to computational linguistics seldom concerned with Pragmatics level. Features of corpus on Pragmatics level related to specific situations, including backgrounds, titles and habits. To improve the accuracy of clustering for conversations collected from international students in Tsinghua University, it required contextual features. Here, we collected four-hundred conversations as a corpus and built it to Vector Space Model. With the Oxford-Duden Dictionary and other methods we modified the model and concluded into three groups. We testified our hypothesis through self-organizing map neural network. The result suggested that the modified model had a better outcome.
资源类型:期刊论文
TOP