Difference between revisions of "User:Firespeaker/Wishlist"

From Apertium
Jump to navigation Jump to search
Line 1: Line 1:
The following are projects that I would love to see working as part of apertium:
The following are projects that I would love to see working as part of apertium.

* integration of twol debugger into apertium-quality
== integration of twol debugger into apertium-quality ==
** I'd also reallly like to see someone adopt apertium-quality and maintain it
** I'd also reallly like to see someone adopt apertium-quality and maintain it
* module for templatic syntax
== module for templatic syntax ==
** also, for e.g. compound verbs some sort of interlanguage
** also, for e.g. compound verbs some sort of interlanguage
*** integrated into bidix format?
*** integrated into bidix format?
* an extra module to convert numbers and alphabetisms and acronyms to words (based on something in lexc) so that phonology can happen and then back to its original form
== an extra module to convert numbers and alphabetisms and acronyms to words (based on something in lexc) so that phonology can happen and then back to its original form ==
** you'd need some notation like 10:%{он%} so the form is extracted and popped back in
** you'd need some notation like 10:%{он%} so the form is extracted and popped back in
** this is likely to only work on stems that don't change, at least with a non-complicated implementation (but that's fine for most Turkic languages)
** this is likely to only work on stems that don't change, at least with a non-complicated implementation (but that's fine for most Turkic languages)

Revision as of 22:47, 30 July 2012

The following are projects that I would love to see working as part of apertium.

integration of twol debugger into apertium-quality

    • I'd also reallly like to see someone adopt apertium-quality and maintain it

module for templatic syntax

    • also, for e.g. compound verbs some sort of interlanguage
      • integrated into bidix format?

an extra module to convert numbers and alphabetisms and acronyms to words (based on something in lexc) so that phonology can happen and then back to its original form

    • you'd need some notation like 10:%{он%} so the form is extracted and popped back in
    • this is likely to only work on stems that don't change, at least with a non-complicated implementation (but that's fine for most Turkic languages)

Bugs

HFST :0 context bug

  1. %{A%}:ә <=> [ :FrontVow :Cns* :Cns ]/[ %>: | % | :0 ] _ ;
    except
    [ %{ъ%}: ]/[ %>: ] _ ;
  2. %{A%}:ә <=> [ :FrontVow :Cns* :Cns ]/[ %>: | % | [ :0 - %{ъ%}: ] ] _ ;
  • (2) avoids applying after %{ъ%}:0 %>:0 and (1) does not, but these should really work the same way