Volume 5, Issue 3 (9-2013) | 2013, 5(3): 19-24


Mashayekhi M, Analoui M. Evaluating Fidelity of Persian-English Sentence-Aligned Parallel Corpus. International Journal of Information and Communication Technology Research 2013; 5(3): 19-24
URL: http://ijict.itrc.ac.ir/article-1-149-en.html
Abstract:

Bilingual corpora are among the most important resources for Natural Language Processing applications and research. The quality of a bilingual corpus can influence the results of any research that uses it as a resource. When a machine translation system is used to verify corpus quality, the quality of that system can itself affect the evaluation of the corpus. One way of evaluating software or resources in ISO is to verify their own features. This paper examines, as a factor for evaluating the fidelity of a corpus, the expectation of finding a translation for each word of each sentence by using a bilingual dictionary. Computing this expectation requires a pre-processing step, which is designed with the differences between the English and Persian languages in mind. The method combines a rule-based approach with information from a dictionary.
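
The dictionary-based expectation described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formula or the pre-processing rules, so everything below is an assumption: the function names (`preprocess`, `sentence_coverage`, `corpus_fidelity`), the lowercase-and-split pre-processing, the toy dictionary with transliterated Persian entries, and the choice to average per-sentence coverage over the corpus are all hypothetical stand-ins, not the authors' method.

```python
from typing import Dict, List, Set, Tuple


def preprocess(sentence: str) -> List[str]:
    """Hypothetical stand-in for the paper's rule-based pre-processing:
    here we only lowercase and split on whitespace."""
    return sentence.lower().split()


def sentence_coverage(
    src_tokens: List[str],
    tgt_tokens: List[str],
    dictionary: Dict[str, Set[str]],
) -> float:
    """Fraction of source words with at least one dictionary
    translation present in the aligned target sentence."""
    if not src_tokens:
        return 0.0
    tgt = set(tgt_tokens)
    # A word counts as "found" if any of its dictionary translations
    # appears among the target tokens.
    hits = sum(1 for w in src_tokens if dictionary.get(w, set()) & tgt)
    return hits / len(src_tokens)


def corpus_fidelity(
    pairs: List[Tuple[str, str]],
    dictionary: Dict[str, Set[str]],
) -> float:
    """Assumed corpus-level score: mean per-sentence coverage."""
    scores = [
        sentence_coverage(preprocess(src), preprocess(tgt), dictionary)
        for src, tgt in pairs
    ]
    return sum(scores) / len(scores) if scores else 0.0


# Toy English -> (transliterated) Persian dictionary, for illustration only.
toy_dictionary = {
    "good": {"khub"},
    "book": {"ketab"},
    "car": {"mashin"},
}

toy_pairs = [
    ("Good book", "ketab khub"),  # both words are found in the target
    ("Red car", "ketab"),         # neither word is found in the target
]

score = corpus_fidelity(toy_pairs, toy_dictionary)
```

Under these assumptions, a well-aligned pair scores close to 1 and a misaligned pair scores close to 0, so a low corpus-level mean would point to alignment or translation problems. A real implementation would need Persian-aware normalization and morphological handling in place of the `preprocess` placeholder.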

Type of Study: Research | Subject: Information Technology


Rights and permissions
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.