
A Novel Word Sense Disambiguation Algorithm Based on Semi-Supervised Statistical Learning

Zhehuang Huang, Yidong Chen, Xiaodong Shi

Abstract


Statistical learning theory is a framework that draws on statistics and functional analysis. It provides a strong theoretical foundation for machine learning problems in the finite-sample case. Word sense disambiguation (WSD) is a fundamental task in natural language processing: identifying which sense of a word is used in a sentence when the word has multiple meanings. At present, mainstream studies of word sense disambiguation focus on a variety of statistical machine learning techniques, but high-quality labeled data is difficult to obtain. To address this problem, we propose a novel word sense disambiguation algorithm based on semi-supervised statistical learning. First, an initial classifier with a certain accuracy rate is constructed from small-scale labeled data. Then the training data is extended using a variety of thresholds. The experimental results show that the proposed method achieves higher performance for word sense disambiguation.
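The two steps described in the abstract (train an initial classifier on a small labeled set, then grow the training set with confidently classified unlabeled examples) can be illustrated with a minimal self-training sketch. This is not the authors' implementation. The abstract does not specify how the thresholds are applied, so the descending schedule `(0.9, 0.8, 0.7)`, the use of the top posterior probability as the confidence score, the toy "bank" data, and all function names are assumptions made for illustration. The classifier is a small maximum entropy (multinomial logistic) model, chosen because the keywords mention maximum entropy.

```python
import math
from collections import defaultdict

def predict_proba(weights, words, senses):
    """Return P(sense | context words) under a softmax over summed feature weights."""
    scores = {s: sum(weights.get((w, s), 0.0) for w in set(words)) for s in senses}
    top = max(scores.values())  # subtract the max for numerical stability
    exps = {s: math.exp(v - top) for s, v in scores.items()}
    total = sum(exps.values())
    return {s: e / total for s, e in exps.items()}

def train_maxent(examples, senses, epochs=200, lr=0.5):
    """Fit a tiny maximum entropy model by stochastic gradient ascent.

    examples: list of (context_words, sense) pairs.
    Features are binary indicators of (word, sense) co-occurrence.
    """
    weights = defaultdict(float)
    for _ in range(epochs):
        for words, gold in examples:
            probs = predict_proba(weights, words, senses)
            for sense in senses:
                # Gradient of the log-likelihood: observed minus expected count.
                grad = (1.0 if sense == gold else 0.0) - probs[sense]
                for w in set(words):
                    weights[(w, sense)] += lr * grad
    return weights

def self_train(labeled, unlabeled, senses, thresholds=(0.9, 0.8, 0.7)):
    """Grow the labeled set with confident predictions, one threshold per round.

    The descending threshold schedule is an assumption, not taken from the paper.
    """
    labeled = list(labeled)
    pool = list(unlabeled)
    weights = train_maxent(labeled, senses)  # initial classifier
    for threshold in thresholds:
        remaining = []
        for words in pool:
            probs = predict_proba(weights, words, senses)
            best = max(probs, key=probs.get)
            if probs[best] >= threshold:
                labeled.append((words, best))  # accept the pseudo-label
            else:
                remaining.append(words)
        pool = remaining
        weights = train_maxent(labeled, senses)  # retrain on the extended set
    return weights, labeled

def disambiguate(weights, words, senses):
    """Pick the most probable sense for a context."""
    probs = predict_proba(weights, words, senses)
    return max(probs, key=probs.get)

# Hypothetical toy data for the ambiguous word "bank".
SENSES = ["finance", "river"]
labeled = [
    (["deposit", "money", "bank"], "finance"),
    (["loan", "interest", "bank"], "finance"),
    (["river", "water", "bank"], "river"),
    (["fishing", "shore", "bank"], "river"),
]
unlabeled = [
    ["money", "loan", "account"],
    ["water", "shore", "boat"],
    ["account", "interest"],
    ["boat", "fishing"],
]

model, grown = self_train(labeled, unlabeled, SENSES)
print(len(grown), disambiguate(model, ["account"], SENSES))
```

In this toy run, the words "account" and "boat" never occur in the labeled seed set. The model can only learn weights for them from pseudo-labeled examples, which is the benefit that semi-supervised extension is meant to provide. A lower threshold admits more pseudo-labels but also more labeling errors, so any real schedule would need to be tuned on held-out data.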



Keywords


Semi-supervised, Statistical learning, Word sense, Maximum entropy.


