International Journal of Information and Communication Technology Research
مجله بین المللی ارتباطات و فناوری اطلاعات
International Journal of Information and Communication Technology Research
Engineering & Technology
http://ijict.itrc.ac.ir
1
admin
2251-6107
2783-4425
doi
1652
25391
en
jalali
1394
9
1
gregorian
2015
12
1
7
4
online
1
fulltext
fa
Keyphrase Ranking Based on Second Order Co-Occurrence Analysis
فناوری اطلاعات
Information Technology
پژوهشي
Research
State-of-the-art researches in unsupervised automatic keyphrase extraction focused on graph analysis. Keyphrase ranking is critical step in graph-based approaches. In this paper, we follow two main purposes including choice of good candidate phrases and computing importance of candidate phrase by considering the mutual information between words. Our documents representation improves the process of candidate phrases selection by constructing a single graph for all documents in the collection. We enjoy from parallel minimum spanning tree to prune irrelevant edge relations. We also consider second order co-occurrence of words by point-wise mutual information as a similarity measure and importance of terms to increase the performance of keyphrase ranking. We formed a single graph of cooccurrence network for all documents in the collection and analyze co-occurrence network with different settings. We compare our method with three baseline approaches of keyphrase extraction. Experimental results show that applying second order co-occurrence analysis improves keyphrases identification accuracy.
graph analysis, similarity measure, point-wise mutual information, co-occurrence networks, keyphrase ranking
55
64
http://ijict.itrc.ac.ir/browse.php?a_code=A-10-27-61&slc_lang=fa&sid=1
Hosein
Shahsavar Haghighi
1003194753284600196
1003194753284600196
Yes
Mojtaba
Hoseini
1003194753284600197
1003194753284600197
No
Jamshid
Shanbehzadeh
1003194753284600198
1003194753284600198
No