machine translation

warning: Creating default object from empty value in /home/webserver/html/aflat/modules/taxonomy/taxonomy.pages.inc on line 33.

Toward a Rule-Based System for English-Amharic Translation

Toward a Rule-Based System for English-Amharic Translation, Gasser, Michael , Proceedings of the workshop on Language technology for normalisation of less-resourced languages (SALTMIL8/AfLaT2012), Istanbul, Turkey, p.41-46, (2012)

English ↔ Luo Machine Translation




This is a rudimentary machine translation system for the language pair English - Luo (Dholuo). Its backbone is the MOSES statistical machine translation package. The system was trained on the Luo version of the New Testament. While it is therefore relatively capable of translating religious material, it is not very well suited for other registers.

[Please only submit small texts]

From English to Luo
(e.g. Who is it that overcomes the world? Only he who believes that Jesus is the son of God.)
From Luo to English
(e.g. En ng'a moloyo piny? Mana jalo moyie ni Yesu en wuod Nyasaye.)

[Processing the text might take a while]


A detailed description of this system can be found in the paper "A Knowledge-Light Approach to Luo Machine Translation and Part-of-Speech Tagging" by Guy De Pauw, Naomi Maajabu and Peter Waiganjo Wagacha.

Natural Language Processing

Description: 

Current research is focussed on resources for corpus development

Swahili Translation

Description: 

Google translate is now available for Swahili

CBM Tanzania Kiswahili Translator

Description: 

A small but functional automatic translator English to Swahili

Apertium

Description: 

Apertium is an open-source machine translation platform, initially aimed at related-language pairs but recently expanded to deal with more divergent language pairs (such as English–Catalan). The platform provides (a) a language-independent machine translation engine (b) tools to manage the linguistic data necessary to build a machine translation system for a given language pair and (c) linguistic data for a growing number of language pairs.

Current released language pairs include:

* Spanish–Catalan
* Spanish–Portuguese
* Spanish–Galician
* Occitan–Catalan
* French–Catalan
* English–Catalan
* Romanian-Spanish

With the following under development:

* English-Afrikaans
* French-Spanish

Try it out:

* http://xixona.dlsi.ua.es/apertium-unstable/

SALAMA - Swahili Language Manager

Description: 

SALAMA is a rule-based system for managing a number of applications on Swahili. It includes a tokenizer, morphological analyzer, morphological disambiguator, semantic disambiguator, syntactic analyzer, and a text-based machine translation system from Swahili to English.
In morphology we have earlier used finite state methods with two-level description. Recently we have developed an alternative method using a combination of different rule types. We have used pattern matching rules for initial analysis (sufficient e.g. for a spell checker), and the accurate analysis with due ambiguity is produced with post-processing rules.
In disambiguation and syntactic mapping we have used a Constraint Grammar parser (CG2), licensed from Connexor. This parser is also used in isolating and handling multi-word expressions.
SALAMA includes also a number of rules for constituent re-ordering, and for transferring the grammatical and lexical information from Swahili into correct English language.

African Language Research Project

Description: 

Information on the Machine Translation project by the African Language Research Project.

Syndicate content