Apertium has moved from SourceForge to GitHub.
If you have any questions, please come and talk to us on
If you have any questions, please come and talk to us on
#apertium
on irc.freenode.net
or contact the GitHub migration team.Apertium system architecture
From Apertium
(Difference between revisions)
Firespeaker (Talk | contribs) (→Linguistic data) |
Khannatanmai (Talk | contribs) (→Linguistic data) |
||
(14 intermediate revisions by 3 users not shown) | |||
Line 1: | Line 1: | ||
== The pipeline == |
== The pipeline == |
||
− | [[File:Apertium_system_architecture.png|1000px]] |
+ | [[File:Apertium_system_architecture.png|1200px]] |
== The stages == |
== The stages == |
||
Line 14: | Line 14: | ||
!colspan="2"| morphological tagger |
!colspan="2"| morphological tagger |
||
| 2004 |
| 2004 |
||
− | | |
+ | | — |
| <code>xxx-yyy-tagger</code>, <code>xxx-tagger</code> |
| <code>xxx-yyy-tagger</code>, <code>xxx-tagger</code> |
||
− | | |
+ | | — |
|- |
|- |
||
!colspan="2"| morphological analysis |
!colspan="2"| morphological analysis |
||
Line 30: | Line 30: | ||
| [[Constraint Grammar]] |
| [[Constraint Grammar]] |
||
|- |
|- |
||
− | !colspan="2"| discontiguous multiword processing |
+ | !colspan="2"| discontiguous multiword assembly (optional) |
− | | 2017, in progress |
+ | | 2017 |
| <code>apertium-xxx-yyy.xxx-yyy.lsx</code> |
| <code>apertium-xxx-yyy.xxx-yyy.lsx</code> |
||
| <code>xxx-yyy-autoseq</code> |
| <code>xxx-yyy-autoseq</code> |
||
Line 48: | Line 48: | ||
| [[Lexical selection]] |
| [[Lexical selection]] |
||
|- |
|- |
||
− | !rowspan="3"| structural transfer |
+ | !colspan="2"| anaphora resolution (optional) |
+ | | 2019, in progress |
||
+ | | <code>apertium-xxx-yyy.xxx-yyy.arx</code> |
||
+ | | <code>xxx-yyy-anaphora</code> |
||
+ | | [[Anaphora Resolution Module]] |
||
+ | |- |
||
+ | !colspan="2"| pre-transfer |
||
+ | | |
||
+ | | — |
||
+ | | <code>xxx-yyy-pretransfer</code> |
||
+ | | — |
||
+ | |- |
||
+ | !rowspan="3"| shallow structural transfer |
||
! chunker |
! chunker |
||
− | | |
+ | | 2006 |
| <code>apertium-xxx-yyy.xxx-yyy.t1x</code> |
| <code>apertium-xxx-yyy.xxx-yyy.t1x</code> |
||
− | | |
+ | | <code>xxx-yyy-chunker</code> |
− | | |
+ | |rowspan="3" | [[Contributing to an existing pair#Adding structural transfer (grammar) rules]] |
|- |
|- |
||
! interchunk |
! interchunk |
||
− | | |
+ | | 2006 |
| <code>apertium-xxx-yyy.xxx-yyy.t2x</code> |
| <code>apertium-xxx-yyy.xxx-yyy.t2x</code> |
||
− | | |
+ | | <code>xxx-yyy-interchunk</code> |
− | | |
||
|- |
|- |
||
! postchunk |
! postchunk |
||
− | | |
+ | | 2006 |
| <code>apertium-xxx-yyy.xxx-yyy.t3x</code> |
| <code>apertium-xxx-yyy.xxx-yyy.t3x</code> |
||
− | | |
+ | | <code>xxx-yyy-postchunk</code> |
− | | |
+ | |- |
+ | !colspan="2"| recursive structural transfer |
||
+ | | 2019, in progress |
||
+ | | <code>apertium-xxx-yyy.xxx-yyy.rtx</code> |
||
+ | | <code>xxx-yyy-rectransfer</code> |
||
+ | | [[Apertium-recursive]] |
||
|- |
|- |
||
− | !colspan="2"| reverse discontiguous multiword processing |
+ | !colspan="2"| discontiguous multiword disassembly (optional) |
− | | 2017, in progress |
+ | | 2017 |
| <code>apertium-xxx-yyy.yyy-xxx.lsx</code> |
| <code>apertium-xxx-yyy.yyy-xxx.lsx</code> |
||
| <code>xxx-yyy-revautoseq</code> |
| <code>xxx-yyy-revautoseq</code> |
||
Line 75: | Line 75: | ||
| 2004 |
| 2004 |
||
| <code>apertium-yyy.yyy.lexc</code> and<br /><code>apertium-yyy.yyy.twol</code> and<br /><code>apertium-yyy.yyy.twoc</code>,<br />OR <code>apertium-yyy.yyy.dix</code> |
| <code>apertium-yyy.yyy.lexc</code> and<br /><code>apertium-yyy.yyy.twol</code> and<br /><code>apertium-yyy.yyy.twoc</code>,<br />OR <code>apertium-yyy.yyy.dix</code> |
||
− | | |
+ | | <code>xxx-yyy-dgen</code> or <code>xxx-yyy-gener</code> or <code>xxx-yyy-generador</code> |
| |
| |
||
|- |
|- |
||
Line 81: | Line 81: | ||
| 2004 |
| 2004 |
||
| <code>apertium-yyy.post-yyy.dix</code> |
| <code>apertium-yyy.post-yyy.dix</code> |
||
− | | |
+ | | <code>xxx-yyy-pgen</code> |
− | | |
+ | | [[Post-generator]] |
|} |
|} |
||
Latest revision as of 21:34, 11 August 2019
Contents |
[edit] The pipeline
[edit] The stages
[edit] Linguistic data
stage | introduced | filenames | mode | documentation | |
---|---|---|---|---|---|
morphological tagger | 2004 | — | xxx-yyy-tagger , xxx-tagger
|
— | |
morphological analysis | 2004 | apertium-xxx.xxx.lexc andapertium-xxx.xxx.twol andapertium-xxx.xxx.twoc ,OR apertium-xxx.xxx.dix
|
xxx-yyy-morph , xxx-morph
|
||
morphological disambiguation | 2004, 2008 | apertium-xxx.xxx.rlx
|
xxx-yyy-disam , xxx-disam
|
Constraint Grammar | |
discontiguous multiword assembly (optional) | 2017 | apertium-xxx-yyy.xxx-yyy.lsx
|
xxx-yyy-autoseq
|
Apertium separable | |
lexical transfer | 2004 | apertium-xxx-yyy.xxx-yyy.dix
|
xxx-yyy-biltrans
|
Bilingual dictionary | |
lexical selection | 2012 | apertium-xxx-yyy.xxx-yyy.lrx
|
xxx-yyy-lex
|
Lexical selection | |
anaphora resolution (optional) | 2019, in progress | apertium-xxx-yyy.xxx-yyy.arx
|
xxx-yyy-anaphora
|
Anaphora Resolution Module | |
pre-transfer | — | xxx-yyy-pretransfer
|
— | ||
shallow structural transfer | chunker | 2006 | apertium-xxx-yyy.xxx-yyy.t1x
|
xxx-yyy-chunker
|
Contributing to an existing pair#Adding structural transfer (grammar) rules |
interchunk | 2006 | apertium-xxx-yyy.xxx-yyy.t2x
|
xxx-yyy-interchunk
| ||
postchunk | 2006 | apertium-xxx-yyy.xxx-yyy.t3x
|
xxx-yyy-postchunk
| ||
recursive structural transfer | 2019, in progress | apertium-xxx-yyy.xxx-yyy.rtx
|
xxx-yyy-rectransfer
|
Apertium-recursive | |
discontiguous multiword disassembly (optional) | 2017 | apertium-xxx-yyy.yyy-xxx.lsx
|
xxx-yyy-revautoseq
|
Apertium separable | |
morphological generation | 2004 | apertium-yyy.yyy.lexc andapertium-yyy.yyy.twol andapertium-yyy.yyy.twoc ,OR apertium-yyy.yyy.dix
|
xxx-yyy-dgen or xxx-yyy-gener or xxx-yyy-generador
|
||
post-generation | 2004 | apertium-yyy.post-yyy.dix
|
xxx-yyy-pgen
|
Post-generator |
[edit] Apertium-internal
deformatter, reformatter