Difference between revisions of "Ideas for Google Summer of Code/Closer integration with HFST"

From Apertium
Line 4:
  * Create a new XML-based format for [[lexc]] inspired by [[lttoolbox]] (see [[Development ideas for dictionary format]])
+ * Add a compiler for this format, with support for direction restrictions.
  * Fix [http://sourceforge.net/tracker/?func=detail&aid=3383731&group_id=224521&atid=1061990 this bug] in <code>hfst-proc</code> tokenisation.
  * Modify the HFST build process to make a "minimal" Apertium-centred install.

Revision as of 08:24, 20 February 2012

Tasks

  • Create a new XML-based format for lexc inspired by lttoolbox (see Development ideas for dictionary format)
  • Add a compiler for this format, with support for direction restrictions.
  • Fix this bug (SourceForge tracker item 3383731) in hfst-proc tokenisation.
  • Modify the HFST build process to make a "minimal" Apertium-centred install.
  • Add lttoolbox as a backend to HFST.
  • Make hfst-expand obey flag diacritics.
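
To illustrate what "obeying flag diacritics" could mean for the last task, here is a minimal sketch in Python. It is not taken from hfst-expand and does not use the HFST API. It assumes Xerox-style flag symbols of the form @OP.FEATURE.VALUE@ with the operators P, N, R, D, C and U, and checks whether a path (a list of symbols) is consistent with them. The names apply_flag, path_is_valid and surface are hypothetical and chosen only for this example.

```python
import re

# Assumed symbol shape: @OP.FEATURE@ or @OP.FEATURE.VALUE@
FLAG_RE = re.compile(r"^@([PNRDCU])\.([^.@]+)(?:\.([^.@]+))?@$")


def apply_flag(symbol, state):
    """Return the updated feature state, or None if the flag blocks the path.

    state maps feature -> (value, is_positive); it is never mutated in place.
    """
    m = FLAG_RE.match(symbol)
    if m is None:
        return state  # ordinary symbol, no effect on the state
    op, feat, val = m.groups()
    current = state.get(feat)
    new = dict(state)
    if op == "P":      # positive set
        new[feat] = (val, True)
    elif op == "N":    # negative set: "anything but val"
        new[feat] = (val, False)
    elif op == "C":    # clear
        new.pop(feat, None)
    elif op == "R":    # require
        ok = current is not None if val is None else current == (val, True)
        if not ok:
            return None
    elif op == "D":    # disallow
        blocked = current is not None if val is None else current == (val, True)
        if blocked:
            return None
    elif op == "U":    # unify
        compatible = (
            current is None
            or current == (val, True)
            or (not current[1] and current[0] != val)
        )
        if not compatible:
            return None
        new[feat] = (val, True)
    return new


def path_is_valid(symbols):
    """True if no flag diacritic on the path blocks it."""
    state = {}
    for s in symbols:
        state = apply_flag(s, state)
        if state is None:
            return False
    return True


def surface(symbols):
    """Join the non-flag symbols of a path into a string."""
    return "".join(s for s in symbols if not FLAG_RE.match(s))
```

Under these assumptions, an expander would print surface(path) only for paths where path_is_valid(path) holds, instead of listing every path in the transducer with the flags treated as ordinary symbols.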

Coding challenge

Frequently asked questions

Previous GSOC projects