A WSD Model for Corpus Construction
In: International Journal of Computer Processing of Languages, Jg. 21 (2008-03-01), S. 43-53
Online
unknown
Zugriff:
A common difficulty in the annotation of large corpora is the disambiguation of the meanings of polysemous words, a task which currently depends on unassisted manual checking by human experts. In this paper, we propose RFR-SUM, an approach to the automatic disambiguation of polysemous Chinese words that calculates and then sums the collocational strengths of words in the local context of a given word. The approach is efficient in that it learns from a small annotated corpus. It can also be used to support manual checking by automatically identifying sentences where usages are borderline and require a human decision. Testing has shown that RFR-SUM outperforms two other commonly used WSD approaches.
Titel: |
A WSD Model for Corpus Construction
|
---|---|
Autor/in / Beteiligte Person: | Zhang, Xiaojun ; Qu, Weiguang ; Wu, Yunfang ; Yu, Shiwen ; Sui, Zhifang ; Li, Sujian |
Link: | |
Zeitschrift: | International Journal of Computer Processing of Languages, Jg. 21 (2008-03-01), S. 43-53 |
Veröffentlichung: | World Scientific Pub Co Pte Lt, 2008 |
Medientyp: | unknown |
ISSN: | 2010-0205 (print) ; 1793-8406 (print) |
DOI: | 10.1142/s1793840608001743 |
Schlagwort: |
|
Sonstiges: |
|