Ideas for Google Summer of Code/Robust recursive transfer
Revision as of 14:58, 9 February 2015 by Francis Tyers (talk | contribs) (Francis Tyers moved page Ideas for Google Summer of Code/Prototype recursive transfer implementations to Ideas for Google Summer of Code/Robust recursive transfer)
The purpose of this task is to create a module that parses its input and allows transfer operations on it, replacing the apertium-transfer module(s).
Currently we have a problem with pairs of distantly related languages that need long-distance constituent reordering, because the existing transfer can only do finite-state chunking. The module should be designed to work cleanly with partial input, e.g. processing word by word rather than sentence by sentence.
It should expect morphologically disambiguated input, and its own output should also be unambiguous (it should create a single parse tree).
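The requirements above (disambiguated input, word-by-word processing, a single output tree) can be illustrated with a small greedy shift-reduce sketch. This is not the prototype code and not a proposed design: the grammar in `RULES`, the `Node` class and the `IncrementalParser` class are all hypothetical, and the input is assumed to be lexical units in the disambiguated stream format `^lemma<tag>...$`. A real LALR(1)-style parser would use lookahead to choose between reductions; this toy simply reduces whenever the top of the stack matches a rule.

```python
import re
from dataclasses import dataclass, field
from typing import List, Tuple

# One disambiguated lexical unit: ^lemma<tag1><tag2>...$
LU_RE = re.compile(r"\^([^<$]+)((?:<[^>]+>)*)\$")

# Hypothetical toy grammar: (left-hand side, right-hand side categories).
RULES = [
    ("NP", ("n",)),
    ("NP", ("adj", "NP")),
    ("NP", ("det", "NP")),
    ("S", ("NP", "vblex", "NP")),
]


@dataclass
class Node:
    cat: str                       # first tag for words, rule name for phrases
    lemma: str = ""
    tags: Tuple[str, ...] = ()
    children: List["Node"] = field(default_factory=list)


def read_lu(text: str) -> Node:
    """Turn one lexical unit into a leaf node."""
    m = LU_RE.fullmatch(text)
    if m is None:
        raise ValueError(f"not a lexical unit: {text!r}")
    tags = tuple(re.findall(r"<([^>]+)>", m.group(2)))
    return Node(cat=tags[0] if tags else "unk", lemma=m.group(1), tags=tags)


class IncrementalParser:
    """Greedy shift-reduce parser that accepts one word at a time."""

    def __init__(self, rules):
        self.rules = rules
        self.stack: List[Node] = []

    def feed(self, lu: str) -> None:
        self.stack.append(read_lu(lu))   # shift
        self._reduce()

    def _reduce(self) -> None:
        changed = True
        while changed:
            changed = False
            for lhs, rhs in self.rules:
                n = len(rhs)
                top = tuple(x.cat for x in self.stack[-n:])
                if len(self.stack) >= n and top == rhs:
                    children = self.stack[-n:]
                    del self.stack[-n:]
                    self.stack.append(Node(cat=lhs, children=children))
                    changed = True
                    break

    def tree(self) -> Node:
        # Whatever is on the stack is wrapped in one root, so there is
        # always exactly one tree, even for partial or unparsable input.
        return Node(cat="ROOT", children=list(self.stack))


def bracket(node: Node) -> str:
    if not node.children:
        return node.lemma
    inner = " ".join(bracket(c) for c in node.children)
    return f"[{node.cat} {inner}]"


if __name__ == "__main__":
    parser = IncrementalParser(RULES)
    for lu in ["^the<det><def><sp>$", "^big<adj>$", "^dog<n><sg>$",
               "^see<vblex><past>$", "^cat<n><sg>$"]:
        parser.feed(lu)
        print(bracket(parser.tree()))   # a usable tree after every word
```

Because `tree()` can be called after any word, a later stage could start work before the sentence is complete; words that no rule covers simply stay as leaves under the root.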
Tasks
- Do a review of the literature on:
  - finite-state dependency parsing
  - LALR(1) grammars
- Propose a transfer rule formalism
- Write a number of transfer rules in this formalism for translating between a language pair.
- Reimplement an existing language pair in trunk using your new formalism. This will involve rewriting the existing rules to be compatible with your new formalism.
- Integrate your new rules into the existing pair.
- Evaluate the improvement
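The task list does not say which metric to use for the evaluation. One common option, assumed here purely for illustration, is word error rate (WER) against a reference translation: the word-level edit distance divided by the reference length. The function name `word_error_rate` is hypothetical, and tokenisation is simply whitespace splitting.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] = edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        cur = [i]
        for j, h in enumerate(hyp, start=1):
            cur.append(min(
                prev[j] + 1,             # deletion
                cur[j - 1] + 1,          # insertion
                prev[j - 1] + (r != h),  # substitution or match
            ))
        prev = cur
    return prev[-1] / len(ref)


if __name__ == "__main__":
    reference = "the big dog saw the cat"
    print(word_error_rate(reference, "the dog big saw the cat"))
    print(word_error_rate(reference, reference))
```

Comparing the score of the existing rules with the score of the reimplemented rules on the same test text would give one rough measure of improvement. Note that plain WER penalises a misplaced word as two errors, so it is sensitive to exactly the reordering problems this task targets.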
Coding challenge
- Install Apertium (see Minimal installation from SVN)
- Compile the prototype code described at Recursive transfer.
- Write a transfer grammar to perform word-reordering for this story for your chosen language pair.
- Optional:
  - Adjust prototype code to include support for attributes.
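To make the word-reordering and attribute items above more concrete, here is a minimal sketch of a reordering rule applied to a parse tree, with one attribute copied from the head for agreement. It is independent of the prototype code: the `Node` and `Rule` classes, the `transfer` and `linearise` functions and the adjective-noun example are all hypothetical, and lemmas are left untranslated because lexical transfer is out of scope here.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class Node:
    cat: str
    lemma: str = ""
    attrs: Dict[str, str] = field(default_factory=dict)
    children: List["Node"] = field(default_factory=list)


@dataclass
class Rule:
    parent: str                    # category of the node being rewritten
    pattern: Tuple[str, ...]       # categories of its children, in source order
    order: Tuple[int, ...]         # child indices in target order
    head: int = 0                  # index of the child that supplies attributes
    agree: Tuple[str, ...] = ()    # attributes copied from the head to all children


def transfer(node: Node, rules: List[Rule]) -> Node:
    """Return a rewritten copy of the tree; the input tree is not modified."""
    kids = [transfer(c, rules) for c in node.children]
    cats = tuple(k.cat for k in kids)
    for rule in rules:
        if rule.parent == node.cat and rule.pattern == cats:
            head = kids[rule.head]
            for kid in kids:
                for name in rule.agree:
                    if name in head.attrs:
                        kid.attrs[name] = head.attrs[name]
            kids = [kids[i] for i in rule.order]
            break                  # first matching rule wins
    return Node(node.cat, node.lemma, dict(node.attrs), kids)


def linearise(node: Node) -> List[str]:
    """Flatten the tree back into lexical units, left to right."""
    if not node.children:
        tags = "".join(f"<{v}>" for v in node.attrs.values())
        return [f"^{node.lemma}{tags}$"]
    out: List[str] = []
    for child in node.children:
        out.extend(linearise(child))
    return out


if __name__ == "__main__":
    # adj n -> n adj, with the adjective taking its number from the noun.
    rules = [Rule("NP", ("adj", "n"), order=(1, 0), head=1, agree=("num",))]
    tree = Node("NP", children=[
        Node("adj", "big", {"pos": "adj"}),
        Node("n", "dog", {"pos": "n", "num": "pl"}),
    ])
    print(" ".join(linearise(transfer(tree, rules))))
    # prints: ^dog<n><pl>$ ^big<adj><pl>$
```

Because rules apply bottom-up to every node, the same mechanism handles reordering at any depth of the tree, which is what finite-state chunking cannot express.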
Frequently asked questions
- none yet, ask us something! :)
See also
- (2011) VM for transfer: relevant for understanding how the current transfer implementation works
- Recursive transfer
- User:Mlforcada/Robust LR for Transfer
Further reading
- Elworthy, D. (1999) "A Finite-State Parser with Dependency Structure Output"
- Oflazer, K. (1999) "Dependency Parsing with an Extended Finite State Approach"
- Alshawi, H., Bangalore, S., Douglas, S. (2000) "Learning Dependency Translation Models as Collections of Finite-State Head Transducers". Computational Linguistics 26(1)