User:Kevin Scannell

From Apertium
Revision as of 01:14, 26 April 2011 by Kevin Scannell (talk | contribs) (update for 2011)

I provided a lot of the data currently in Apertium for Irish to Scottish Gaelic machine translation, which was taken from an ad hoc system for this language pair that I created in 2005. I'd be interested in mentoring a GSOC student (or anyone else) to help finish this work. I'm also working on Irish to Manx Gaelic with help from Phil Kelly.

I'm mentoring Dhiraj Lohiya on a project involving statistical diacritic restoration for GSOC 2011; see my paper "Statistical Unicodification of African Languages" for more information.

For about ten years I have been working on developing language technology for under-resourced languages around the world. I've developed corpora and spell checkers for many (20+) languages using a web crawler, statistical methods, and contributions from native speakers. I am primarily interested in the Celtic languages, and particularly Irish. I've created a grammar checker, monolingual and parallel corpora, and a semantic network for Irish, and I do a lot of the localization of open source software (Firefox, OpenOffice.org, KDE, ...). Currently I am working on a statistical MT engine that I hope might be useful for Apertium down the road, if I ever manage to finish it!

Links: