Program - AfLaT2012/SALTMIL joint workshop on Language technology for normalisation of less-resourced languages

A full-day workshop at LREC 2012
Tuesday, 22 May 2012.
Lütfi Kirdar Istanbul Exhibition and Congress Centre, Istanbul, Turkey
[register here]



09:30–10:30

Invited Talk - How to build language technology resources for the next 100 years
Sjur Moshagen Nørstebø, Sámi Parliament

10:30–11:00

Coffee Break

11:00–13:00 Resource Creation
11:00–11:30 Issues in Designing a Spoken Corpus of Irish
Elaine Uí Dhonnchadha, Alessio Frenda and Brian Vaughan
11:30–12:00 Learning Morphological Rules for Amharic Verbs Using Inductive Logic Programming
Wondwossen Mulugeta and Michael Gasser
12:00–12:30 The Database of Modern Icelandic Inflection
Kristín Bjarnadóttir
12:30–13:00 Natural Language Processing for Amazigh Language: Challenges and Future Directions
Fadoua Ataa Allah and Siham Boulaknadel

13:00–14:00

Lunch Break

14:00–16:00 Resource Use
14:00–14:30 Compiling Apertium morphological dictionaries with HFST and using them in HFST applications
Tommi A. Pirinen and Francis M. Tyers
14:30–15:00 Automatic structuring and correction suggestion system for Hungarian clinical records
Borbála Siklósi, György Orosz, Attila Novák and Gábor Prószéky
15:00–15:30 Constraint Grammar based Correction of Grammatical Errors for North Sámi
Linda Wiechetek
15:30–16:00 Toward a Rule-Based System for English-Amharic Translation
Michael Gasser

16:00–16:30

Coffee Break

16:30–17:30 Poster Session
 

Technological Tools for Dictionary and Corpora Building for Minority Languages: Example of the French-based CreolesPaola Carrion Gonzalez and Emmanuel Cartier
Describing Morphologically-rich Languages using Metagrammars: a Look at Verbs in IkotaDenys Duchier, Brunelle Magnana Ekoukou, Yannick Parmentier, Simon Petitjean and Emannuel Schang
A Corpus of SantomeTjerk Hagemeijer, Iris Hendrickx, Abigail Tiny and Haldane Amaro
The Tagged Icelandic Corpus (MM)Sigrún Helgadóttir, Ásta Svavarsdóttir, Eiríkur Rögnvaldsson, Kristín Bjarnadóttir and Hrafn Loftsson
Semi-automated extraction of morphological grammars for Nguni with special reference to Southern NdebeleLaurette Pretorius and Sonja Bosch
Tagging and Verifying an Amharic News CorpusBjörn Gambäck
Resource-Light Bantu Part-of-Speech TaggingGuy De Pauw, Gilles-Maurice de Schryver and Janneke van de Loo
POS Annotated 50M Corpus of Tajik LanguageGulshan Dovudov, Vít Suchomel and Pavel Šmerk

17:30–17:45

Closing Session