Difference between revisions of "Ideas for Google Summer of Code/Complex multiwords"

From Apertium
Jump to navigation Jump to search
m (→‎Coding challenge: remove coding challenge; I'll replace it with one I'm interested in if/when I think of one)
Line 1: Line 1:
 
{{TOCD}}
 
{{TOCD}}
   
Write a bidirectional module for specifying complex multiword units, for example ''dirección general'' and ''zračna luka''. Although in the Romance languages it is not a big problem, as soon as you start to get to languages with cases (e.g. Serbo-Croatian, Slovenian, German, Icelandic, etc.) the problem comes that you can't define a multiword of adj nom because the adjective has a lot of inflection.
+
Write a bidirectional module for specifying complex multiword units, for example ''dirección general'' and ''zračna luka''. Although in the Romance languages it is not a big problem, as soon as you start to get to languages with cases (e.g. Serbo-Croatian, Slovenian, Russian, German, Icelandic, etc.) the problem comes that you can't define a multiword of adj nom because the adjective has a lot of inflection.
   
The module should be bidirectional, that is, it should be able to be used for both analysing and for generating these multiwords.
+
The module should be bidirectional, that is, it should be able to be used for both analysing and for generating these multiwords. So, given an input of:
   
  +
*Još jedna očekivana posledica biće porast potrošnje za zdravstvo da bi se zadovoljila veća tražnje osoba te starosne grupe za ambulantnim uslugama.
==Coding challenge==
 
  +
*Another expected consequence is that health care spending will have to grow to meet the higher demands of people in this group for outpatient services.
   
  +
Default translation of "ambulantni" would be "ambulatory", but in this case we want to translate "ambulantnim uslugama" as "outpatient services".
==Frequently asked questions==
 
   
 
==Coding challenge==
==Previous GSOC projects==
 
   
  +
* Write a stream processor (see [[Apertium stream format]]) for the output of lt-proc that parses character by character, respecting [[superblanks]].
* (2010) [[User:Skh/Application_GSoC_2010]]
 
  +
 
==Frequently asked questions==
   
 
==See also==
 
==See also==
   
 
* [[Multiwords]]
 
* [[Multiwords]]
 
* [[User:Skh/Application_GSoC_2010]]
   
 
[[Category:Ideas for Google Summer of Code|Complex multiwords]]
 
[[Category:Ideas for Google Summer of Code|Complex multiwords]]

Revision as of 13:27, 14 March 2013

Write a bidirectional module for specifying complex multiword units, for example dirección general and zračna luka. Although in the Romance languages it is not a big problem, as soon as you start to get to languages with cases (e.g. Serbo-Croatian, Slovenian, Russian, German, Icelandic, etc.) the problem comes that you can't define a multiword of adj nom because the adjective has a lot of inflection.

The module should be bidirectional, that is, it should be able to be used for both analysing and for generating these multiwords. So, given an input of:

  • Još jedna očekivana posledica biće porast potrošnje za zdravstvo da bi se zadovoljila veća tražnje osoba te starosne grupe za ambulantnim uslugama.
  • Another expected consequence is that health care spending will have to grow to meet the higher demands of people in this group for outpatient services.

Default translation of "ambulantni" would be "ambulatory", but in this case we want to translate "ambulantnim uslugama" as "outpatient services".

Coding challenge

Frequently asked questions

See also