Difference between revisions of "User:Kevin Scannell"

From Apertium
Jump to navigation Jump to search
(update for 2011)
(update)
 
Line 1: Line 1:
I provided a lot of the data currently in Apertium for [[Irish to Scottish Gaelic]] machine translation, which was taken from an ad hoc system for this language pair that I created in 2005. I'd be interested in mentoring a GSOC student (or anyone else) to help finish this work. I'm also working on Irish to Manx Gaelic with help from Phil Kelly.
+
I provided a lot of the data currently in Apertium for [[Irish to Scottish Gaelic]] machine translation, which was taken from an ad hoc system for this language pair that I created in 2005. Since then, I've developed a more mature [http://github.com/kscanne/caighdean MT system] that translates Scottish and Manx Gaelic into Irish, using statistics on the target side. I'd be happy to mentor a GSOC student (or anyone else) interested in porting that work over to Apertium.
   
 
More about me: for about 15 years I have been working on developing language technology for under-resourced languages around the world. I've developed corpora and spell checkers for many (20+) languages using a [http://crubadan.org/ web crawler], statistical methods, and contributions from native speakers. I am primarily interested in the Celtic languages, and particularly Irish. I've created a [http://borel.slu.edu/gramadoir/ grammar checker], monolingual and parallel corpora, and a [http://borel.slu.edu/lsg/ semantic network] for Irish, and do a lot of the localisation of open source software (Firefox, LibreOffice, KDE, ...)
I'm mentoring Dhiraj Lohiya on a project involving statistical diacritic restoration for GSOC 2011; see my paper [http://borel.slu.edu/pub/lre.pdf Statistical Unicodification of African Languages] for more information.
 
 
For about ten years I have been working on developing language technology for under-resourced languages around the world. I've developed corpora and spell checkers for many (20+) languages using a [http://borel.slu.edu/crubadan/ web crawler], statistical methods, and contributions from native speakers. I am primarily interested in the Celtic languages, and particularly Irish. I've created a [http://borel.slu.edu/gramadoir/ grammar checker], monolingual and parallel corpora, and a [http://borel.slu.edu/lsg/ semantic network] for Irish, and do a lot of the localization of open source software (Firefox, OpenOffice.org, KDE, ...) Currently I am working on a statistical MT engine that I hope might be useful for Apertium down the road, if I ever manage to finish it!
 
   
 
Links:
 
Links:
 
* [http://borel.slu.edu/ Home page]
 
* [http://borel.slu.edu/ Home page]
 
* [http://borel.slu.edu/nlp.html NLP projects]
 
* [http://borel.slu.edu/nlp.html NLP projects]
* [http://accentuate.us/ Accentuate.us]
+
* [http://intergaelic.com/ Intergaelic MT]
 
* [http://indigenoustweets.com/ Indigenous Tweets]
 
* [http://indigenoustweets.com/ Indigenous Tweets]
   

Latest revision as of 22:22, 19 January 2017

I provided a lot of the data currently in Apertium for Irish to Scottish Gaelic machine translation, which was taken from an ad hoc system for this language pair that I created in 2005. Since then, I've developed a more mature MT system that translates Scottish and Manx Gaelic into Irish, using statistics on the target side. I'd be happy to mentor a GSOC student (or anyone else) interested in porting that work over to Apertium.

More about me: for about 15 years I have been working on developing language technology for under-resourced languages around the world. I've developed corpora and spell checkers for many (20+) languages using a web crawler, statistical methods, and contributions from native speakers. I am primarily interested in the Celtic languages, and particularly Irish. I've created a grammar checker, monolingual and parallel corpora, and a semantic network for Irish, and do a lot of the localisation of open source software (Firefox, LibreOffice, KDE, ...)

Links: