Round-Trip Training Approach for Bilingually Low-Resource Statistical Machine Translation Systems

Benyamin Ahmadnia, Gholamreza Haffari, Javier Serrano

Abstract


Statistical Machine Translation (SMT) has made good progress in recent years. Because SMT systems are data-driven, they learn from millions or even billions of words of human-translated text. The quality of an SMT system depends heavily on the data used in the training step: not only on its quality and amount, but also on how relevant it is to the texts we wish to translate. However, human labeling is very costly and time-consuming. In this article we propose a round-trip training scenario, a reliable retraining approach built on a communication framework, that makes effective use of monolingual text to tackle the scarcity of training data and improve translation quality. We present detailed experimental results using Spanish-English as a high-resource language pair and Persian-Spanish as a low-resource language pair, and we demonstrate that translation quality improves in all cases.

Keywords


Natural language processing, statistical machine translation, low-resource language pairs, round-tripping approach.
