Tool / Application
Lingala Diacritic Correction
Submitted by Guy on Mon, 2009-04-27 12:45The problem with using web texts for many languages is that many people do not have the ability to enter diacritics or characters from extended
character sets from their keyboard, and so they simply leave them off (and so Lingala "likɔngá" becomes "likonga"), or use incorrect diacritics
(e.g. using acute/grave in place of the macron in Hawaiian, Maori, etc.). Information is usually lost in this conversion, and indeed most languages
that use diacritics have many pairs of words that differ only in diacritical marks; these distinctions are lost when texts are written in plain ASCII.
The aim of charlifter is to undo this lossy conversion, and "lift" the text back to its proper form.
Swahili - English Dictionary
Submitted by djoffe on Thu, 2009-03-19 05:53Software-installable Swahili - English Dictionary (created by TshwaneDJe HLT), with over 16,000 entries and phrases. Integrates into Microsoft Word to display instant lookup results as you work. Other features include bilingual 'linked view', integration with Google web and image search, basic decomposition-based searching, and instant display of related cross-references. The dictionary is corpus-based, and includes hundreds of real-world usage examples.
Ekegusii popular sayings and Wisdom
Submitted by Mashanga on Wed, 2009-02-25 14:49Abagusii, native speakers of Ekegusii language, have in the language stored wisdom and knowhow ranging from ethics to scientifically verifiable medicine. In this forum is the tip of the iceburg - Ekegusii is richer than has been so far believed.
Ekegusii popular sayings and Wisdom
Submitted by Mashanga on Wed, 2009-02-25 14:49Abagusii, native speakers of Ekegusii language, have in the language stored wisdom and knowhow ranging from ethics to scientifically verifiable medicine. In this forum is the tip of the iceburg - Ekegusii is richer than has been so far believed.
Swahili spell checker
Submitted by Guy on Wed, 2009-02-11 15:05Free online Swahili spellchecker
Memory-Based Swahili Morphological Segmentation and Lemmatization - Demo
Submitted by Guy on Wed, 2008-06-25 10:56Coming soon: ETA 2nd week of December 2008
Somali Online Spelling Checker
Submitted by jmgurey on Sat, 2007-12-01 18:34RSOL Spelling Checker is a online tool to check the correctness of your somali text. Its vocabulary is based on RSOL Online Dictionary. Using this tool will help to collect more words from somali language and build comprehensive somali word-list.
Automatic Diacritic Restoration for African Languages
Submitted by Guy on Tue, 2007-10-23 12:16This is a demonstration system for a diacritic restoration method that is able to automatically restore diacritics on the basis of local graphemic context. It is based on the machine learning method of Memory-Based learning. We have applied the method to the African languages of Cilubà, Gĩkũyũ, Kĩkamba, Maa, Sesotho sa Leboa, Tshivenḓa and Yoruba.
You can find more information on this system in this paper
Authors:
Guy De Pauw: CNTS - Language Technology Group, University of Antwerp, Antwerp, Belgium, guy [dot] depauw [at] ua [dot] ac [dot] beGilles-Maurice de Schryver: African Languages and Cultures, Ghent University, Ghent, Belgium, gillesmaurice [dot] deschryver [at] ugent [dot] be
Peter Waiganjo Wagacha: School of Computing and Informatics, University of Nairobi, Nairobi, Kenya, waiganjo [at] uonbi [dot] ac [dot] ke
Northern Sotho Part-of-Speech Tagger (V2) - Demo
Submitted by Guy on Thu, 2007-10-11 13:05This demo showcases a part-of-speech tagger for Northern Sotho. It retrieves the morpho-syntactic categories for words in a sentence. It uses MBT, the memory-based tagger trained on a relatively small annotated corpus.
Version1: Ocotober 10 2007 (20k tokens training set)
Version2: December 8 2007 (35k tokens training set)
Type in the text you want to tag (2,500 character limit)
Example: Motho ge a sa tseba o swanetše go dumela seo gore bao ba tsebago ba mmotše.
Authors:
Guy De Pauw: CNTS - Language Technology Group, University of Antwerp, Antwerp, Belgium, guy [dot] depauw [at] ua [dot] ac [dot] be
Gilles-Maurice de Schryver: African Languages and Cultures, Ghent University, Ghent, Belgium, gillesmaurice [dot] deschryver [at] ugent [dot] be
Paper
Verbal extension sequencing: An examination from a computational perspective
Submitted by wanderson on Fri, 2007-07-13 12:47- Login or register to post comments
- Google Scholar
