Difference between revisions of "Earley-based structural transfer for Apertium"

From Apertium
Jump to navigation Jump to search
(Change of Sharp & Streiter paper's link)
(Change of Takeda's paper link)
Line 16: Line 16:
 
==Further reading==
 
==Further reading==
   
* Koichi Takeda [http://66.102.9.104/search?q=cache:E-giecHfU8QJ:acl.ldc.upenn.edu/P/P96/P96-1020.pdf+earley+algorithm+%22machine+translation%22&hl=en&ct=clnk&cd=6&client=iceweasel-a Pattern-Based Context-Free Grammars for Machine Translation]
+
* Koichi Takeda [http://acl.ldc.upenn.edu/P/P96/P96-1020.pdf Pattern-Based Context-Free Grammars for Machine Translation] (private access)
 
:This paper proposes the use of "pattern-based" context-free grammars as a basis for building machine translation (MT) systems.
 
:This paper proposes the use of "pattern-based" context-free grammars as a basis for building machine translation (MT) systems.
 
*Randall Sharp and Oliver Streiter [http://www.iai.uni-sb.de/docs/meta93.pdf Simplifying the Complexity of Machine Translation]
 
*Randall Sharp and Oliver Streiter [http://www.iai.uni-sb.de/docs/meta93.pdf Simplifying the Complexity of Machine Translation]

Revision as of 08:29, 8 November 2007

Perhaps Earley's algorithm to parse context-free grammars (which has a left-to-right longest-match philosophy as Apertium) could be used to perform more complex syntactical transformations; this could be useful for distant language pairs containing embedded structures.

Open questions

  • Currently, Apertium uses text streams to communicate. I assume this would not be possible here.
  • When would one call the bilingual dictionary? Apertium Level 2 calls it in the first stage.
  • We should check whether this has been done before.
  • In case there is more than one parse of a sentence, there should be a way to select the most likely.

Existing parsers

Current free-software parsers which might be worth looking at:

Further reading

This paper proposes the use of "pattern-based" context-free grammars as a basis for building machine translation (MT) systems.