<?xml version="1.0" encoding="UTF-8"?><xml><records><record><source-app name="Biblio" version="6.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">De Pauw, Guy</style></author><author><style face="normal" font="default" size="100%">Peter Waiganjo Wagacha</style></author><author><style face="normal" font="default" size="100%">Gilles-Maurice de Schryver</style></author></authors><secondary-authors><author><style face="normal" font="default" size="100%">De Pauw, Guy</style></author><author><style face="normal" font="default" size="100%">Gilles-Maurice de Schryver</style></author><author><style face="normal" font="default" size="100%">Lori Levin</style></author></secondary-authors></contributors><titles><title><style face="normal" font="default" size="100%">The SAWA Corpus: a Parallel Corpus English - Swahili</style></title><secondary-title><style face="normal" font="default" size="100%">Proceedings of the First Workshop on Language Technologies for African Languages (AfLaT 2009)</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2009</style></year><pub-dates><date><style  face="normal" font="default" size="100%">March</style></date></pub-dates></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://www.aclweb.org/anthology/W09-0702</style></url></web-urls><related-urls><url><style face="normal" font="default" size="100%">http://aflat.org/files/W09-0702.pdf</style></url></related-urls></urls><publisher><style face="normal" font="default" size="100%">Association for Computational Linguistics</style></publisher><pub-location><style face="normal" font="default" size="100%">Athens, Greece</style></pub-location><pages><style face="normal" font="default" size="100%">9–16</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Research in data-driven methods for Machine Translation has greatly benefited from the increasing availability of parallel corpora. Processing the same text in two different languages yields useful information on how words and phrases are translated from a source language into a target language. To investigate this, a parallel corpus is typically aligned by linking linguistic tokens in the source language to the corresponding units in the target language. An aligned parallel corpus therefore facilitates the automatic development of a machine translation system and can also bootstrap annotation through projection. In this paper, we describe data collection and annotation efforts and preliminary experimental results with a parallel corpus English - Swahili.</style></abstract></record></records></xml>
