Difference between revisions of "Apertium system architecture"
Firespeaker (talk | contribs)
Khannatanmai (talk | contribs)
(16 intermediate revisions by 3 users not shown)
Revision as of 20:34, 11 August 2019
The pipeline
The stages
Linguistic data
stage | introduced | filenames | mode | documentation
---|---|---|---|---
morphological tagger | 2004 | — | `xxx-yyy-tagger`, `xxx-tagger` | —
morphological analysis | 2004 | `apertium-xxx.xxx.lexc` and `apertium-xxx.xxx.twol` and `apertium-xxx.xxx.twoc`, OR `apertium-xxx.xxx.dix` | `xxx-yyy-morph`, `xxx-morph` |
morphological disambiguation | 2004, 2008 | `apertium-xxx.xxx.rlx` | `xxx-yyy-disam`, `xxx-disam` | Constraint Grammar
discontiguous multiword assembly (optional) | 2017 | `apertium-xxx-yyy.xxx-yyy.lsx` | `xxx-yyy-autoseq` | Apertium separable
lexical transfer | 2004 | `apertium-xxx-yyy.xxx-yyy.dix` | `xxx-yyy-biltrans` | Bilingual dictionary
lexical selection | 2012 | `apertium-xxx-yyy.xxx-yyy.lrx` | `xxx-yyy-lex` | Lexical selection
anaphora resolution (optional) | 2019, in progress | `apertium-xxx-yyy.xxx-yyy.arx` | `xxx-yyy-anaphora` | Anaphora Resolution Module
pre-transfer |  | — | `xxx-yyy-pretransfer` | —
shallow structural transfer: chunker | 2006 | `apertium-xxx-yyy.xxx-yyy.t1x` | `xxx-yyy-chunker` | Contributing to an existing pair#Adding structural transfer (grammar) rules
shallow structural transfer: interchunk | 2006 | `apertium-xxx-yyy.xxx-yyy.t2x` | `xxx-yyy-interchunk` | Contributing to an existing pair#Adding structural transfer (grammar) rules
shallow structural transfer: postchunk | 2006 | `apertium-xxx-yyy.xxx-yyy.t3x` | `xxx-yyy-postchunk` | Contributing to an existing pair#Adding structural transfer (grammar) rules
recursive structural transfer | 2019, in progress | `apertium-xxx-yyy.xxx-yyy.rtx` | `xxx-yyy-rectransfer` | Apertium-recursive
discontiguous multiword disassembly (optional) | 2017 | `apertium-xxx-yyy.yyy-xxx.lsx` | `xxx-yyy-revautoseq` | Apertium separable
morphological generation | 2004 | `apertium-yyy.yyy.lexc` and `apertium-yyy.yyy.twol` and `apertium-yyy.yyy.twoc`, OR `apertium-yyy.yyy.dix` | `xxx-yyy-dgen` or `xxx-yyy-gener` or `xxx-yyy-generador` |
post-generation | 2004 | `apertium-yyy.post-yyy.dix` | `xxx-yyy-pgen` | Post-generator
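
The `xxx` and `yyy` placeholders in the table stand for the source and target language codes. As a rough illustration of how the naming scheme expands for a concrete pair, the following Python sketch transcribes the rows in the order the table lists them and fills in the placeholders. The names `Stage`, `STAGES`, and `expand` are hypothetical helpers for this illustration and are not part of Apertium; the codes `spa` and `cat` are only example values.

```python
from typing import NamedTuple


class Stage(NamedTuple):
    name: str
    files: tuple           # filename templates; empty where the table lists none
    modes: tuple           # mode-name templates, alternatives included
    optional: bool = False


MONO_SRC = "apertium-{src}.{src}"           # source-language data
MONO_TRG = "apertium-{trg}.{trg}"           # target-language data
PAIR = "apertium-{src}-{trg}.{src}-{trg}"   # language-pair data

# Rows transcribed from the table above, in the table's own order.
STAGES = (
    Stage("morphological tagger", (), ("{src}-{trg}-tagger", "{src}-tagger")),
    Stage("morphological analysis",
          (MONO_SRC + ".lexc", MONO_SRC + ".twol", MONO_SRC + ".twoc", MONO_SRC + ".dix"),
          ("{src}-{trg}-morph", "{src}-morph")),
    Stage("morphological disambiguation", (MONO_SRC + ".rlx",),
          ("{src}-{trg}-disam", "{src}-disam")),
    Stage("discontiguous multiword assembly", (PAIR + ".lsx",),
          ("{src}-{trg}-autoseq",), optional=True),
    Stage("lexical transfer", (PAIR + ".dix",), ("{src}-{trg}-biltrans",)),
    Stage("lexical selection", (PAIR + ".lrx",), ("{src}-{trg}-lex",)),
    Stage("anaphora resolution", (PAIR + ".arx",),
          ("{src}-{trg}-anaphora",), optional=True),
    Stage("pre-transfer", (), ("{src}-{trg}-pretransfer",)),
    Stage("chunker", (PAIR + ".t1x",), ("{src}-{trg}-chunker",)),
    Stage("interchunk", (PAIR + ".t2x",), ("{src}-{trg}-interchunk",)),
    Stage("postchunk", (PAIR + ".t3x",), ("{src}-{trg}-postchunk",)),
    Stage("recursive structural transfer", (PAIR + ".rtx",),
          ("{src}-{trg}-rectransfer",)),
    Stage("discontiguous multiword disassembly",
          ("apertium-{src}-{trg}.{trg}-{src}.lsx",),
          ("{src}-{trg}-revautoseq",), optional=True),
    Stage("morphological generation",
          (MONO_TRG + ".lexc", MONO_TRG + ".twol", MONO_TRG + ".twoc", MONO_TRG + ".dix"),
          ("{src}-{trg}-dgen", "{src}-{trg}-gener", "{src}-{trg}-generador")),
    Stage("post-generation", ("apertium-{trg}.post-{trg}.dix",),
          ("{src}-{trg}-pgen",)),
)


def expand(src, trg, include_optional=True):
    """Return (stage name, filenames, mode names) with placeholders filled in."""
    rows = []
    for stage in STAGES:
        if stage.optional and not include_optional:
            continue
        files = [f.format(src=src, trg=trg) for f in stage.files]
        modes = [m.format(src=src, trg=trg) for m in stage.modes]
        rows.append((stage.name, files, modes))
    return rows


if __name__ == "__main__":
    for name, files, modes in expand("spa", "cat"):
        print(f"{name}: modes={modes} files={files}")
```

Passing `include_optional=False` skips the three rows the table marks as optional. The sketch only mirrors the table's naming conventions; it does not check which files or modes a given language pair actually ships.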
Apertium-internal
deformatter, reformatter