::1997-1998::
My first project was the LINGUADUCT project. The research goal of this
project was to investigate contextual interpretation of natural language
through abductive reasoning and inductively acquired knowledge. In this
project, we investigated the role of algorithm bias in the context of
part-of-speech tagging. During the course of this project, we also fine-tuned
a memory-based version of Rens Bod's Data-Oriented Parsing model. The latter
work was included in my PhD thesis.
::1998-2002::
During the next four years, I was a research fellow of the Fund for scientific
research (aspirant FWO). In this project, we
investigated an agent-based evolutionary
computing approach to memory-based syntactic parsing of natural language. Starting
off with a fully fledged data-driven parser (cf. the aforementioned memory-based
DOP model), we gradually dismantled the parser's data sources to eventually arrive
at a society model for the emergence of grammatical concepts. The general framework
for this research was dubbed GRAEL (GRAmmar EvoLution). Most research was spent
investigating how an agent-based society can be used to optimize the probability mass
distribution of an annotated treebank. While the evolutionary aspect of GRAEL made sure
that several solutions were considered and optimized over time, the agent-based properties
of GRAEL ensured that the optimization process was grounded in the task at hand: parsing
natural language. You can download my PhD thesis
here.
::2002-2006::
One day after my PhD defense, I started working on the FLaVoR project.
This is a joint project between CNTS at the University of Antwerp and ESAT at the University
of Leuven. The FLaVoR project tries to develop a new modular architecture for speech recognition
which allows for the introduction of relatively intricate language models. My particular job in
this project is the development of a module for prosodic annotation and a morpho-syntactic language
model. The morphological model developed at the University of Antwerp uses memory-based learning.
You should check out the demo. The syntactic part
of the language model makes use of shallow parsing to present structural information in a sentence.
::2004::
I also took part in an investigation on speech rate in the Dutch language area. Using a corpus of
160 conversations with teachers of Dutch all over Flanders and the Netherlands, we were able to
determine average speech rate for the different areas. The results were surprising in that we indeed found
a statistically significant difference in speech rate between Dutch people and Flemish people (the former speaking
significantly faster). We also found that men speak faster than women. Less surprisingly, we also
found that older people speak slower than young people. The prototypical slow speech of people from
Limburg could not be observed.
::2006-...::
African Language Technology
Last modified: Wed Feb 9 10:27:46 CET 2005