Volume 7, Issue 4 (12-2015)                   IJICTR 2015, 7(4): 55-64 | Back to browse issues page

XML Print

Download citation:
BibTeX | RIS | EndNote | Medlars | ProCite | Reference Manager | RefWorks
Send citation to:

Shahsavar Haghighi H, Hoseini M, Shanbehzadeh J. Keyphrase Ranking Based on Second Order Co-Occurrence Analysis . IJICTR. 2015; 7 (4) :55-64
URL: http://ijict.itrc.ac.ir/article-1-83-en.html
1- Department of Computer Engineering Malek Ashtar University Tehran, Iran
2- Department of Computer Engineering Assistant Professor Malek Ashtar University Tehran, Iran
3- Department of Computer Engineering Associate Professor Kharazmi University Tehran, Iran
Abstract:   (2951 Views)
State-of-the-art researches in unsupervised automatic keyphrase extraction focused on graph analysis. Keyphrase ranking is critical step in graph-based approaches. In this paper, we follow two main purposes including choice of good candidate phrases and computing importance of candidate phrase by considering the mutual information between words. Our documents representation improves the process of candidate phrases selection by constructing a single graph for all documents in the collection. We enjoy from parallel minimum spanning tree to prune irrelevant edge relations. We also consider second order co-occurrence of words by point-wise mutual information as a similarity measure and importance of terms to increase the performance of keyphrase ranking. We formed a single graph of cooccurrence network for all documents in the collection and analyze co-occurrence network with different settings. We compare our method with three baseline approaches of keyphrase extraction. Experimental results show that applying second order co-occurrence analysis improves keyphrases identification accuracy.
Full-Text [PDF 1231 kb]   (1688 Downloads)    
Type of Study: Research | Subject: Information Technology

Add your comments about this article : Your username or Email: