Apertium has moved from SourceForge to GitHub.
If you have any questions, please come and talk to us on #apertium on irc.freenode.net or contact the GitHub migration team.

Apertium system architecture

From Apertium
(Difference between revisions)
Jump to: navigation, search
m (Linguistic data)
(Linguistic data)
 
(15 intermediate revisions by 3 users not shown)
Line 1: Line 1:
 
== The pipeline ==
 
== The pipeline ==
[[File:Apertium_system_architecture.png|1000px]]
+
[[File:Apertium_system_architecture.png|1200px]]
   
 
== The stages ==
 
== The stages ==
Line 14: Line 14:
 
!colspan="2"| morphological tagger
 
!colspan="2"| morphological tagger
 
| 2004
 
| 2004
|
+
|
 
| <code>xxx-yyy-tagger</code>, <code>xxx-tagger</code>
 
| <code>xxx-yyy-tagger</code>, <code>xxx-tagger</code>
|
+
|
 
|-
 
|-
 
!colspan="2"| morphological analysis
 
!colspan="2"| morphological analysis
Line 30: Line 30:
 
| [[Constraint Grammar]]
 
| [[Constraint Grammar]]
 
|-
 
|-
!colspan="2"| discontiguous multiword processing
+
!colspan="2"| discontiguous multiword assembly (optional)
| 2017, in progress
+
| 2017
 
| <code>apertium-xxx-yyy.xxx-yyy.lsx</code>
 
| <code>apertium-xxx-yyy.xxx-yyy.lsx</code>
 
| <code>xxx-yyy-autoseq</code>
 
| <code>xxx-yyy-autoseq</code>
Line 39: Line 39:
 
| 2004
 
| 2004
 
| <code>apertium-xxx-yyy.xxx-yyy.dix</code>
 
| <code>apertium-xxx-yyy.xxx-yyy.dix</code>
|
+
| <code>xxx-yyy-biltrans</code>
|
+
| [[Bilingual dictionary]]
 
|-
 
|-
 
!colspan="2"| lexical selection
 
!colspan="2"| lexical selection
 
| 2012
 
| 2012
 
| <code>apertium-xxx-yyy.xxx-yyy.lrx</code>
 
| <code>apertium-xxx-yyy.xxx-yyy.lrx</code>
|
+
| <code>xxx-yyy-lex</code>
|
+
| [[Lexical selection]]
 
|-
 
|-
!rowspan="3"| structural transfer
+
!colspan="2"| anaphora resolution (optional)
  +
| 2019, in progress
  +
| <code>apertium-xxx-yyy.xxx-yyy.arx</code>
  +
| <code>xxx-yyy-anaphora</code>
  +
| [[Anaphora Resolution Module]]
  +
|-
  +
!colspan="2"| pre-transfer
  +
|
  +
| —
  +
| <code>xxx-yyy-pretransfer</code>
  +
| —
  +
|-
  +
!rowspan="3"| shallow structural transfer
 
! chunker
 
! chunker
|
+
| 2006
 
| <code>apertium-xxx-yyy.xxx-yyy.t1x</code>
 
| <code>apertium-xxx-yyy.xxx-yyy.t1x</code>
|
+
| <code>xxx-yyy-chunker</code>
|
+
|rowspan="3" | [[Contributing to an existing pair#Adding structural transfer (grammar) rules]]
 
|-
 
|-
 
! interchunk
 
! interchunk
|
+
| 2006
 
| <code>apertium-xxx-yyy.xxx-yyy.t2x</code>
 
| <code>apertium-xxx-yyy.xxx-yyy.t2x</code>
|
+
| <code>xxx-yyy-interchunk</code>
|
 
 
|-
 
|-
 
! postchunk
 
! postchunk
|
+
| 2006
 
| <code>apertium-xxx-yyy.xxx-yyy.t3x</code>
 
| <code>apertium-xxx-yyy.xxx-yyy.t3x</code>
|
+
| <code>xxx-yyy-postchunk</code>
|
+
|-
  +
!colspan="2"| recursive structural transfer
  +
| 2019, in progress
  +
| <code>apertium-xxx-yyy.xxx-yyy.rtx</code>
  +
| <code>xxx-yyy-rectransfer</code>
  +
| [[Apertium-recursive]]
 
|-
 
|-
!colspan="2"| reverse discontiguous multiword processing
+
!colspan="2"| discontiguous multiword disassembly (optional)
| 2017, in progress
+
| 2017
 
| <code>apertium-xxx-yyy.yyy-xxx.lsx</code>
 
| <code>apertium-xxx-yyy.yyy-xxx.lsx</code>
 
| <code>xxx-yyy-revautoseq</code>
 
| <code>xxx-yyy-revautoseq</code>
Line 75: Line 75:
 
| 2004
 
| 2004
 
| <code>apertium-yyy.yyy.lexc</code> and<br /><code>apertium-yyy.yyy.twol</code> and<br /><code>apertium-yyy.yyy.twoc</code>,<br />OR <code>apertium-yyy.yyy.dix</code>
 
| <code>apertium-yyy.yyy.lexc</code> and<br /><code>apertium-yyy.yyy.twol</code> and<br /><code>apertium-yyy.yyy.twoc</code>,<br />OR <code>apertium-yyy.yyy.dix</code>
|
+
| <code>xxx-yyy-dgen</code> or <code>xxx-yyy-gener</code> or <code>xxx-yyy-generador</code>
 
|
 
|
 
|-
 
|-
Line 81: Line 81:
 
| 2004
 
| 2004
 
| <code>apertium-yyy.post-yyy.dix</code>
 
| <code>apertium-yyy.post-yyy.dix</code>
|
+
| <code>xxx-yyy-pgen</code>
|
+
| [[Post-generator]]
 
|}
 
|}
   

Latest revision as of 21:34, 11 August 2019

Contents

[edit] The pipeline

Apertium system architecture.png

[edit] The stages

[edit] Linguistic data

stage introduced filenames mode documentation
morphological tagger 2004 xxx-yyy-tagger, xxx-tagger
morphological analysis 2004 apertium-xxx.xxx.lexc and
apertium-xxx.xxx.twol and
apertium-xxx.xxx.twoc,
OR apertium-xxx.xxx.dix
xxx-yyy-morph, xxx-morph
morphological disambiguation 2004, 2008 apertium-xxx.xxx.rlx xxx-yyy-disam, xxx-disam Constraint Grammar
discontiguous multiword assembly (optional) 2017 apertium-xxx-yyy.xxx-yyy.lsx xxx-yyy-autoseq Apertium separable
lexical transfer 2004 apertium-xxx-yyy.xxx-yyy.dix xxx-yyy-biltrans Bilingual dictionary
lexical selection 2012 apertium-xxx-yyy.xxx-yyy.lrx xxx-yyy-lex Lexical selection
anaphora resolution (optional) 2019, in progress apertium-xxx-yyy.xxx-yyy.arx xxx-yyy-anaphora Anaphora Resolution Module
pre-transfer xxx-yyy-pretransfer
shallow structural transfer chunker 2006 apertium-xxx-yyy.xxx-yyy.t1x xxx-yyy-chunker Contributing to an existing pair#Adding structural transfer (grammar) rules
interchunk 2006 apertium-xxx-yyy.xxx-yyy.t2x xxx-yyy-interchunk
postchunk 2006 apertium-xxx-yyy.xxx-yyy.t3x xxx-yyy-postchunk
recursive structural transfer 2019, in progress apertium-xxx-yyy.xxx-yyy.rtx xxx-yyy-rectransfer Apertium-recursive
discontiguous multiword disassembly (optional) 2017 apertium-xxx-yyy.yyy-xxx.lsx xxx-yyy-revautoseq Apertium separable
morphological generation 2004 apertium-yyy.yyy.lexc and
apertium-yyy.yyy.twol and
apertium-yyy.yyy.twoc,
OR apertium-yyy.yyy.dix
xxx-yyy-dgen or xxx-yyy-gener or xxx-yyy-generador
post-generation 2004 apertium-yyy.post-yyy.dix xxx-yyy-pgen Post-generator

[edit] Apertium-internal

deformatter, reformatter

[edit] Example translation at each stage

[edit] See also

Personal tools