https://wiki.apertium.org/w/api.php?action=feedcontributions&user=Marcriera&feedformat=atom
Apertium - User contributions [en]
2024-03-29T07:04:08Z
User contributions
MediaWiki 1.34.1
https://wiki.apertium.org/w/index.php?title=Modes&diff=73854
Modes
2022-01-24T22:30:15Z
<p>Marcriera: /* Naming conventions */</p>
<hr />
<div>[[Les fichiers modes|En français]]<br />
<br />
There are a few ways you can use [[pipeline]]s in Apertium. One of them is '''Modes''' files. Modes files (typically called <code>modes.xml</code>) are XML files (see [https://github.com/apertium/apertium/blob/master/apertium/modes.dtd modes.dtd]) which specify which programs should be run and in what order. Normally each linguistic package has one of these files which specifies various ways in which you can use the data to perform translations. <br />
<br />
See the [https://github.com/apertium/apertium-spa-cat/blob/master/modes.xml modes file from Spanish-Catalan pair] for an example. The modes which do not say <code>install="yes"</code> are only usable with the -d switch to apertium, these are typically used during development (eg. ca-es-anmor which only performs morphological analysis on Catalan and nothing else).<br />
<br />
See [[Writing_Makefiles#Modes]] on how to ensure modes that say install="yes" are installed.<br />
<br />
== Naming conventions ==<br />
<br />
The main translation mode is always named "from-to", e.g. "sme-nob". The debug modes each have a suffix, e.g. "sme-nob-morph".<br />
<br />
Common debug mode names:<br />
<br />
* -anmor or -morph run the morphological analysers<br />
** these are used equivalently<br />
* -disam runs up until morphological (CG) disambiguation<br />
* -syntax runs up until syntactical (CG) disambiguation<br />
* -tagger runs up until probabilistic (apertium-tagger) disambiguation (or, if no .prob, up until the last disambiguation step)<br />
* -autoseq runs up until separable multiwords<br />
* -biltrans runs up until the bidix<br />
* -lex runs up until lexical selection<br />
* -anaph runs up until anaphora resolution<br />
* -transfer runs up until (1-stage) transfer<br />
* -chunker runs up until the first stage of 3-or-more-stage transfer<br />
* -interchunk runs up until the second stage of 3-stage transfer<br />
** -interchunk1 and -interchunk2 are used when the pair has 4-stage transfer<br />
* -postchunk runs up until the last stage of transfer<br />
* -dgen run up until generation (using lt-proc -d to include debug symbols)<br />
<br />
== Autogenerating debug modes ==<br />
It's a drag to create all the regular install=no debug modes all the time (e.g. foo-bar-tagger, foo-bar-chunker, etc.). <br />
<br />
If you put '''<code>gendebug="yes"</code>''' on a &lt;mode&gt; element, debug modes will be created automatically for you! If a mode comes out with the wrong suffix, you can override the guess by adding the <code>debug-suff</code> attribute. <br />
<br />
For example, the following entry<br />
<br />
<pre><br />
<mode name="sme-nob" install="yes" gendebug="yes"><br />
<pipeline><br />
<program name="hfst-proc --weight-classes 1 -w -p"><br />
<file name="sme-nob.automorf.hfst"/><br />
</program><br />
<program name="cg-proc" debug-suff="disam"><br />
<file name="sme-nob.mor.rlx.bin"/><br />
</program><br />
<program name="cg-proc -1 -n -w" debug-suff="syntax"><br />
<file name="sme-nob.syn.rlx.bin"/><br />
</program><br />
<program name="apertium-pretransfer"/><br />
<program name="lt-proc -b"><br />
<file name="sme-nob.autobil.bin"/><br />
</program><br />
<program name="cg-proc" debug-suff="lex"><br />
<file name="sme-nob.lex.bin"/><br />
</program><br />
<program name="apertium-transfer -b"><br />
<file name="apertium-sme-nob.sme-nob.t1x"/><br />
<file name="sme-nob.t1x.bin"/><br />
</program><br />
<program name="apertium-interchunk" debug-suff="interchunk1"><br />
<file name="apertium-sme-nob.sme-nob.t2x"/><br />
<file name="sme-nob.t2x.bin"/><br />
</program><br />
<program name="apertium-interchunk" debug-suff="interchunk2"><br />
<file name="apertium-sme-nob.sme-nob.t3x"/><br />
<file name="sme-nob.t3x.bin"/><br />
</program><br />
<program name="lt-proc $1"><br />
<file name="sme-nob.autogen.bin"/><br />
</program><br />
</pipeline><br />
</mode><br />
</pre><br />
would make all these modes automatically:<br />
* sme-nob.mode<br />
* sme-nob-morph.mode<br />
* sme-nob-disam.mode<br />
* sme-nob-syntax.mode<br />
* sme-nob-pretransfer.mode<br />
* sme-nob-biltrans.mode<br />
* sme-nob-lex.mode<br />
* sme-nob-chunker.mode<br />
* sme-nob-interchunk1.mode<br />
* sme-nob-interchunk2.mode<br />
* sme-nob-postchunk.mode<br />
* sme-nob-dgen.mode<br />
(but only install the sme-nob.mode), where -disam, -syntax, -interchunk1, -interchunk2 are manually specified names, and the rest are default names based on program names.<br />
<br />
== Modes hacks ==<br />
=== Statistics mode ===<br />
<br />
In order to get some statistical information about translations made using Apertium, we've hacked the main translation mode, pausing the pipeline just after disambiguation and saving the output into a temp file. After that, pipeline is resumed with temp file as stdin.<br />
<br />
As an example, you can see the /broken/ pipeline for ca-es, installed as <code>ca-es-estadistiques.mode</code><br />
<br />
<pre><br />
/usr/local/bin/lt-proc /usr/local/share/apertium/apertium-es-ca/ca-es.automorf.bin > $LOGSDIR$SEC.tmp;<br />
/usr/local/bin/apertium-tagger -g /usr/local/share/apertium/apertium-es-ca/ca-es.prob < $LOGSDIR$SEC.tmp \<br />
|/usr/local/bin/apertium-pretransfer|/usr/local/bin/apertium-transfer /usr/local/share/apertium/apertium-es-ca/apertium-es-ca.trules-ca-es.xml \<br />
/usr/local/share/apertium/apertium-es-ca/trules-ca-es.bin /usr/local/share/apertium/apertium-es-ca/ca-es.autobil.bin \<br />
|/usr/local/bin/lt-proc $1 /usr/local/share/apertium/apertium-es-ca/ca-es.autogen.bin \<br />
|/usr/local/bin/lt-proc -p /usr/local/share/apertium/apertium-es-ca/ca-es.autogen.bin<br />
</pre><br />
<br />
And an example of calling apertium with this mode would be the following<br />
<br />
<pre><br />
LOGSDIR=~/logs/apertium/; SEC=`date +%s`;<br />
echo "Ara Apertium permet extraure estadístiques" | apertium ca-es-estadistiques<br />
</pre><br />
<br />
In that example, $LOGSDIR is a folder where the logs will be saved, and $SEC is an unique ID for that log.<br />
<br />
When translation is done, we can process the log created in order to get statistics.<br />
<br />
=== Mixed modes ===<br />
<br />
See [[Mixed modes]]<br />
<br />
==See also==<br />
* https://victorio.uit.no/langtech/trunk/tools/CorpusTools/corpustools/modes.py is a Python module that parses modes files into a <code>Pipeline</code> class <br />
<br />
[[Category:Documentation]]<br />
[[Category:Documentation in English]]</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Main_Page&diff=70736
Main Page
2019-12-05T10:53:00Z
<p>Marcriera: Update pair count</p>
<hr />
<div>{{Main page header}}<br />
<div style="font-size: 95%"><br />
''[[Apertium:About|Apertium]] is a free/open-source platform for developing rule-based machine translation systems!''<br />
{|style="width:100%; background:#fbfbfb; margin-top:1.2em; border:1px solid #ccc;"<br />
|style="width:50%; vertical-align: top; padding-left:20px; padding-right:20px;"|<br />
<br />
To see the whole list of general documentation pages written in English, see [[:Category:Documentation in English|documentation in English]].<br />
<br />
Pour ceux qui sont plus à l'aise avec la langue française, une partie des pages anglaises a été traduite. Voir [[:Category:Documentation en français|documentation en français]].<br />
<br />
==Translators==<br />
<br />
The following 50 pairs have released versions and are considered to be stable:<br />
{|style="border: 0px solid #fbfbfb"<br />
|style="width:11%; border: 0px solid #fbfbfb; font-size:95%; background:#fbfbfb; vertical-align: top;"|<br />
*Spanish ⇆ Catalan ([http://www.apertium.org/?id=translatetext&dir=spa-cat spa-cat])<br />
*Spanish ← Romanian ([http://www.apertium.org/?id=translatetext&dir=ro-es es-ro])<br />
*French ⇆ Catalan ([http://www.apertium.org/?id=translatetext&dir=fra-cat fra-cat])<br />
*Occitan ⇆ Catalan ([http://www.apertium.org/?id=translatetext&dir=oc-ca oc-ca])<br />
*English ⇆ Galician ([http://www.apertium.org/?id=translatetext&dir=en-gl en-gl])<br />
*Swedish ⇆ Danish ([http://www.apertium.org/?id=translatetext&dir=swe-dan swe-dan])<br />
*Macedonian → English ([http://www.apertium.org/?id=translatetext&dir=mk-en mk-en])<br />
*Afrikaans ⇆ Dutch ([http://www.apertium.org/?id=translatetext&dir=af-nl af-nl])<br />
*Indonesian ⇆ Malaysian ([http://www.apertium.org/?id=translatetext&dir=id-ms id-ms])<br />
*Icelandic ⇆ Swedish ([http://www.apertium.org/?id=translatetext&dir=hbs-slv is-sv])<br />
*Occitan ⇆ Spanish ([http://www.apertium.org/?id=translatetext&dir=oc-es oc-es])<br />
*Spanish ⇆ Portuguese ([http://www.apertium.org/?id=translatetext&dir=es-pt es-pt])<br />
*English ⇆ Catalan ([http://www.apertium.org/?id=translatetext&dir=eng-cat eng-cat])<br />
*English ⇆ Spanish ([http://www.apertium.org/?id=translatetext&dir=eng-spa eng-spa])<br />
*Sardinian ← Italian ([http://www.apertium.org/?id=translatetext&dir=ita-srd ita-srd])<br />
*Sardinian ← Catalan ([http://www.apertium.org/?id=translatetext&dir=ita-srd cat-srd])<br />
|style="width:11%; border: 0px solid #fbfbfb; font-size:95%; background:#fbfbfb; vertical-align: top;"|<br />
*English ⇆ Esperanto ([http://apertium.saluton.dk/?id=translatetext&dir=en-eo en-eo])<br />
*Spanish → Asturian ([http://www.apertium.org/?id=translatetext&dir=spa-ast spa-ast])<br />
*Catalan ← Italian ([http://www.apertium.org/?id=translatetext&dir=ita-cat cat-ita])<br />
*Maltese → Arabic ([http://www.apertium.org/?id=translatetext&dir=mlt-ara mlt-ara])<br />
*Serbo-Croatian ⇆ Slovenian ([http://www.apertium.org/?id=translatetext&dir=hbs-slv hbs-slv])<br />
*Danish ⇆ Norwegian ([http://www.apertium.org/?id=translatetext&dir=dan-nor dan-nor])<br />
*Spanish ⇆ Galician ([http://www.apertium.org/?id=translatetext&dir=es-gl es-gl])<br />
*French ⇆ Spanish ([http://www.apertium.org/?id=translatetext&dir=fr-es fr-es])<br />
*French → Occitan ([http://www.apertium.org/?id=translatetext&dir=fra-oci fra-oci])<br />
*Esperanto ← Spanish ([http://www.apertium.org/?id=translatetext&lang=eo&dir=es-eo eo-es])<br />
*Welsh → English ([http://www.apertium.org/?id=translatetext&dir=cy-en cy-en])<br />
*Breton → French ([http://www.apertium.org/?id=translatetext&dir=br-fr br-fr])<br />
*Icelandic → English ([http://www.apertium.org/?id=translatetext&dir=isl-eng isl-eng])<br />
*Esperanto ← Catalan ([http://www.apertium.org/?id=translatetext&lang=eo&dir=ca-eo eo-ca])<br />
*North Sámi → Norwegian ([http://www.apertium.org/?id=translatetext&dir=sme-nob sme-nob])<br />
*Crimean Tatar → Turkish ([http://www.apertium.org/?id=translatetext&dir=crh-tur crh-tur])<br />
*Romanian ⇆ Catalan ([http://www.apertium.org/?id=translatetext&dir=ron-cat ron-cat])<br />
|style="width:11%; border: 0px solid #fbfbfb; font-size:95%; background:#fbfbfb; vertical-align: top;"|<br />
*Serbo-Croatian → Macedonian ([http://www.apertium.org/?id=translatetext&dir=hbs-mkd hbs-mkd])<br />
*Serbo-Croatian → English ([http://www.apertium.org/?id=translatetext&dir=hbs-eng hbs-eng])<br />
*Portuguese ⇆ Catalan ([http://www.apertium.org/?id=translatetext&dir=pt-ca pt-ca])<br />
*Portuguese ⇆ Galician ([http://www.apertium.org/?id=translatetext&dir=pt-gl pt-gl])<br />
*Basque → Spanish ([http://www.apertium.org/?id=translatetext&dir=eu-es eu-es])<br />
*Norwegian Nynorsk ⇆ Bokmål ([http://www.apertium.org/?id=translatetext&dir=nno-nob nno-nob])<br />
*Macedonian ⇆ Bulgarian ([http://www.apertium.org/?id=translatetext&dir=mk-bg mk-bg])<br />
*Esperanto ← French ([http://apertium.saluton.dk/?id=translatetext&lang=eo&dir=fr-eo eo-fr])<br />
*Basque → English ([http://www.apertium.org/?id=translatetext&dir=eu-en eu-en])<br />
*Spanish ⇆ Aragonese ([http://www.apertium.org/?id=translatetext&dir=spa-arg spa-arg])<br />
*Kazakh ⇆ Tatar ([http://www.apertium.org/?id=translatetext&dir=kaz-tat kaz-tat])<br />
*Urdu ⇆ Hindi ([http://www.apertium.org/?id=translatetext&dir=urd-hin urd-hin])<br />
*Aragonese ⇆ Catalan ([http://www.apertium.org/?id=translatetext&dir=arg-cat arg-cat])<br />
*Swedish ⇆ Norwegian ([http://www.apertium.org/?id=translatetext&dir=swe-nno swe-nor])<br />
*Belarusian ⇆ Russian ([http://www.apertium.org/?id=translatetext&dir=bel-rus bel-rus])<br />
*Russian ⇆ Ukrainian ([http://www.apertium.org/?id=translatetext&dir=rus-ukr rus-ukr])<br />
*Polish → Silesian ([http://www.apertium.org/?id=translatetext&dir=pol-szl pol-szl])<br />
|-<br />
|}<br />
<br />
Other pairs currently in development may be found in our [[Using_Git|GitHub repositories]]. You can [[test drive]] release versions [http://apertium.org here].<!--, and all versions (including versions for unreleased language pairs) [http://xixona.dlsi.ua.es/testing/ here] &mdash; this has the latest versions (up to 6 hours old).--> More information can be found on the page: [[List of language pairs]]<br />
|}<br />
</div><br />
<br />
{|style="width:100%; background:#fbfbfb; margin-top:1.2em; border:1px solid #ccc;"<br />
|style="width:50%; color:#000; vertical-align: top;padding-left:20px; padding-right:20px;"|<br />
<br />
== How to create a new language pair ==<br />
<br />
* '''[[Apertium New Language Pair HOWTO]]''' &mdash; step-by-step description of how to start a new language pair in Apertium.<br />
** [[Créer une nouvelle paire de langues]] <br />
** [[Come scrivere una nuova coppia di lingue Apertium]]<br />
** [[Amestar un par de llingües nuevu]]<br />
** [[Uputstvo za novi jezički par za Apertium]]<br />
** [[Апертиум, как се създава нова езикова двойка]]<br />
** [[Руководство по созданию новой языковой пары]]<br />
** [[Jak zrobić nową parę języków]]<br />
** [[Apertium, Tvorba novej dvojice jazykov]]<br />
** [[Як створити нову мовну пару]]<br />
** [[Nieuw talenpaar maken]]<br />
** [[अपर्टियम मे नई भाषा जोडे]]<br />
** [[अपर्टियम मा नयाँ भाषा जोड्नुहोस]]<br />
** [[অপের্তিউম নতুন ভাষা জোড়া তে হয় কিভাবে]]<br />
** [[Hvordan lage et nytt språkpar i Apertium]]<br />
** [[Oprettelse af et nyt sprogpar i Apertium]]<br />
** [[Kiel aldoni novan lingvoparon]]<br />
** [[Sevel ur c'houblad yezhoù nevez]]<br />
<br />
|style="width:50%; color:#000; vertical-align: top;padding-left:20px; padding-right:20px;"|<br />
<br />
== Documentation (other pages) ==<br />
<br />
{{main|Documentation}}<br />
* '''[[Using Git]]''' &mdash; cheatsheet on how to use Git for users and developers.<br />
* '''[[Contributing to an existing pair]]''' &mdash; some pointers for contributing to an existing pair.<br />
** [[Comment contribuer à une paire de langues existante]]<br />
* '''[[Become a language pair developer for Apertium]]''' &mdash; A quick, step-by-step guide for Ubuntu and Debian users on how to add entries to a language pair.<br />
** [[Devenir un développeur de paire de langues pour Apertium]]<br />
* '''[[Frequently Asked Questions]]''' &mdash; what it says in the link.<br />
** [[Questions fréquentes]]<br />
<br />
|}<br />
<br />
__NOTOC__<br />
__NOEDITSECTION__<br />
<br />
[[Category:Top-level categories]]</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Main_Page&diff=70735
Main Page
2019-12-05T10:52:09Z
<p>Marcriera: Romanian-Catalan, update language codes</p>
<hr />
<div>{{Main page header}}<br />
<div style="font-size: 95%"><br />
''[[Apertium:About|Apertium]] is a free/open-source platform for developing rule-based machine translation systems!''<br />
{|style="width:100%; background:#fbfbfb; margin-top:1.2em; border:1px solid #ccc;"<br />
|style="width:50%; vertical-align: top; padding-left:20px; padding-right:20px;"|<br />
<br />
To see the whole list of general documentation pages written in English, see [[:Category:Documentation in English|documentation in English]].<br />
<br />
Pour ceux qui sont plus à l'aise avec la langue française, une partie des pages anglaises a été traduite. Voir [[:Category:Documentation en français|documentation en français]].<br />
<br />
==Translators==<br />
<br />
The following 49 pairs have released versions and are considered to be stable:<br />
{|style="border: 0px solid #fbfbfb"<br />
|style="width:11%; border: 0px solid #fbfbfb; font-size:95%; background:#fbfbfb; vertical-align: top;"|<br />
*Spanish ⇆ Catalan ([http://www.apertium.org/?id=translatetext&dir=spa-cat spa-cat])<br />
*Spanish ← Romanian ([http://www.apertium.org/?id=translatetext&dir=ro-es es-ro])<br />
*French ⇆ Catalan ([http://www.apertium.org/?id=translatetext&dir=fra-cat fra-cat])<br />
*Occitan ⇆ Catalan ([http://www.apertium.org/?id=translatetext&dir=oc-ca oc-ca])<br />
*English ⇆ Galician ([http://www.apertium.org/?id=translatetext&dir=en-gl en-gl])<br />
*Swedish ⇆ Danish ([http://www.apertium.org/?id=translatetext&dir=swe-dan swe-dan])<br />
*Macedonian → English ([http://www.apertium.org/?id=translatetext&dir=mk-en mk-en])<br />
*Afrikaans ⇆ Dutch ([http://www.apertium.org/?id=translatetext&dir=af-nl af-nl])<br />
*Indonesian ⇆ Malaysian ([http://www.apertium.org/?id=translatetext&dir=id-ms id-ms])<br />
*Icelandic ⇆ Swedish ([http://www.apertium.org/?id=translatetext&dir=hbs-slv is-sv])<br />
*Occitan ⇆ Spanish ([http://www.apertium.org/?id=translatetext&dir=oc-es oc-es])<br />
*Spanish ⇆ Portuguese ([http://www.apertium.org/?id=translatetext&dir=es-pt es-pt])<br />
*English ⇆ Catalan ([http://www.apertium.org/?id=translatetext&dir=eng-cat eng-cat])<br />
*English ⇆ Spanish ([http://www.apertium.org/?id=translatetext&dir=eng-spa eng-spa])<br />
*Sardinian ← Italian ([http://www.apertium.org/?id=translatetext&dir=ita-srd ita-srd])<br />
*Sardinian ← Catalan ([http://www.apertium.org/?id=translatetext&dir=ita-srd cat-srd])<br />
|style="width:11%; border: 0px solid #fbfbfb; font-size:95%; background:#fbfbfb; vertical-align: top;"|<br />
*English ⇆ Esperanto ([http://apertium.saluton.dk/?id=translatetext&dir=en-eo en-eo])<br />
*Spanish → Asturian ([http://www.apertium.org/?id=translatetext&dir=spa-ast spa-ast])<br />
*Catalan ← Italian ([http://www.apertium.org/?id=translatetext&dir=ita-cat cat-ita])<br />
*Maltese → Arabic ([http://www.apertium.org/?id=translatetext&dir=mlt-ara mlt-ara])<br />
*Serbo-Croatian ⇆ Slovenian ([http://www.apertium.org/?id=translatetext&dir=hbs-slv hbs-slv])<br />
*Danish ⇆ Norwegian ([http://www.apertium.org/?id=translatetext&dir=dan-nor dan-nor])<br />
*Spanish ⇆ Galician ([http://www.apertium.org/?id=translatetext&dir=es-gl es-gl])<br />
*French ⇆ Spanish ([http://www.apertium.org/?id=translatetext&dir=fr-es fr-es])<br />
*French → Occitan ([http://www.apertium.org/?id=translatetext&dir=fra-oci fra-oci])<br />
*Esperanto ← Spanish ([http://www.apertium.org/?id=translatetext&lang=eo&dir=es-eo eo-es])<br />
*Welsh → English ([http://www.apertium.org/?id=translatetext&dir=cy-en cy-en])<br />
*Breton → French ([http://www.apertium.org/?id=translatetext&dir=br-fr br-fr])<br />
*Icelandic → English ([http://www.apertium.org/?id=translatetext&dir=isl-eng isl-eng])<br />
*Esperanto ← Catalan ([http://www.apertium.org/?id=translatetext&lang=eo&dir=ca-eo eo-ca])<br />
*North Sámi → Norwegian ([http://www.apertium.org/?id=translatetext&dir=sme-nob sme-nob])<br />
*Crimean Tatar → Turkish ([http://www.apertium.org/?id=translatetext&dir=crh-tur crh-tur])<br />
*Romanian ⇆ Catalan ([http://www.apertium.org/?id=translatetext&dir=ron-cat ron-cat])<br />
|style="width:11%; border: 0px solid #fbfbfb; font-size:95%; background:#fbfbfb; vertical-align: top;"|<br />
*Serbo-Croatian → Macedonian ([http://www.apertium.org/?id=translatetext&dir=hbs-mkd hbs-mkd])<br />
*Serbo-Croatian → English ([http://www.apertium.org/?id=translatetext&dir=hbs-eng hbs-eng])<br />
*Portuguese ⇆ Catalan ([http://www.apertium.org/?id=translatetext&dir=pt-ca pt-ca])<br />
*Portuguese ⇆ Galician ([http://www.apertium.org/?id=translatetext&dir=pt-gl pt-gl])<br />
*Basque → Spanish ([http://www.apertium.org/?id=translatetext&dir=eu-es eu-es])<br />
*Norwegian Nynorsk ⇆ Bokmål ([http://www.apertium.org/?id=translatetext&dir=nno-nob nno-nob])<br />
*Macedonian ⇆ Bulgarian ([http://www.apertium.org/?id=translatetext&dir=mk-bg mk-bg])<br />
*Esperanto ← French ([http://apertium.saluton.dk/?id=translatetext&lang=eo&dir=fr-eo eo-fr])<br />
*Basque → English ([http://www.apertium.org/?id=translatetext&dir=eu-en eu-en])<br />
*Spanish ⇆ Aragonese ([http://www.apertium.org/?id=translatetext&dir=spa-arg spa-arg])<br />
*Kazakh ⇆ Tatar ([http://www.apertium.org/?id=translatetext&dir=kaz-tat kaz-tat])<br />
*Urdu ⇆ Hindi ([http://www.apertium.org/?id=translatetext&dir=urd-hin urd-hin])<br />
*Aragonese ⇆ Catalan ([http://www.apertium.org/?id=translatetext&dir=arg-cat arg-cat])<br />
*Swedish ⇆ Norwegian ([http://www.apertium.org/?id=translatetext&dir=swe-nno swe-nor])<br />
*Belarusian ⇆ Russian ([http://www.apertium.org/?id=translatetext&dir=bel-rus bel-rus])<br />
*Russian ⇆ Ukrainian ([http://www.apertium.org/?id=translatetext&dir=rus-ukr rus-ukr])<br />
*Polish → Silesian ([http://www.apertium.org/?id=translatetext&dir=pol-szl pol-szl])<br />
|-<br />
|}<br />
<br />
Other pairs currently in development may be found in our [[Using_Git|GitHub repositories]]. You can [[test drive]] release versions [http://apertium.org here].<!--, and all versions (including versions for unreleased language pairs) [http://xixona.dlsi.ua.es/testing/ here] &mdash; this has the latest versions (up to 6 hours old).--> More information can be found on the page: [[List of language pairs]]<br />
|}<br />
</div><br />
<br />
{|style="width:100%; background:#fbfbfb; margin-top:1.2em; border:1px solid #ccc;"<br />
|style="width:50%; color:#000; vertical-align: top;padding-left:20px; padding-right:20px;"|<br />
<br />
== How to create a new language pair ==<br />
<br />
* '''[[Apertium New Language Pair HOWTO]]''' &mdash; step-by-step description of how to start a new language pair in Apertium.<br />
** [[Créer une nouvelle paire de langues]] <br />
** [[Come scrivere una nuova coppia di lingue Apertium]]<br />
** [[Amestar un par de llingües nuevu]]<br />
** [[Uputstvo za novi jezički par za Apertium]]<br />
** [[Апертиум, как се създава нова езикова двойка]]<br />
** [[Руководство по созданию новой языковой пары]]<br />
** [[Jak zrobić nową parę języków]]<br />
** [[Apertium, Tvorba novej dvojice jazykov]]<br />
** [[Як створити нову мовну пару]]<br />
** [[Nieuw talenpaar maken]]<br />
** [[अपर्टियम मे नई भाषा जोडे]]<br />
** [[अपर्टियम मा नयाँ भाषा जोड्नुहोस]]<br />
** [[অপের্তিউম নতুন ভাষা জোড়া তে হয় কিভাবে]]<br />
** [[Hvordan lage et nytt språkpar i Apertium]]<br />
** [[Oprettelse af et nyt sprogpar i Apertium]]<br />
** [[Kiel aldoni novan lingvoparon]]<br />
** [[Sevel ur c'houblad yezhoù nevez]]<br />
<br />
|style="width:50%; color:#000; vertical-align: top;padding-left:20px; padding-right:20px;"|<br />
<br />
== Documentation (other pages) ==<br />
<br />
{{main|Documentation}}<br />
* '''[[Using Git]]''' &mdash; cheatsheet on how to use Git for users and developers.<br />
* '''[[Contributing to an existing pair]]''' &mdash; some pointers for contributing to an existing pair.<br />
** [[Comment contribuer à une paire de langues existante]]<br />
* '''[[Become a language pair developer for Apertium]]''' &mdash; A quick, step-by-step guide for Ubuntu and Debian users on how to add entries to a language pair.<br />
** [[Devenir un développeur de paire de langues pour Apertium]]<br />
* '''[[Frequently Asked Questions]]''' &mdash; what it says in the link.<br />
** [[Questions fréquentes]]<br />
<br />
|}<br />
<br />
__NOTOC__<br />
__NOEDITSECTION__<br />
<br />
[[Category:Top-level categories]]</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Apertium-recursive/Formalism&diff=70568
Apertium-recursive/Formalism
2019-10-29T18:36:46Z
<p>Marcriera: /* Attribute Categories */</p>
<hr />
<div>=== File Structure ===<br />
<br />
A <code>.rtx</code> contains attribute categories, tag-order rules, tag-rewrite rules, and reduction rules.<br />
<br />
Comments begin with exclamation points (<code>!</code>) and end at the end of the line. To include spaces in any name, either escape the space with a backslash (<code>\</code>) or enclose the name in double quotes (<code>"</code>).<br />
<br />
=== Attribute Categories ===<br />
<br />
An attribute category can be defined like this:<br />
<br />
gender = m f GD ;<br />
number = sg pl ND ;<br />
<br />
An attribute category can also specify undefined and default values:<br />
<br />
gender = (GD m) m f GD;<br />
<br />
This defines the <code>gender</code> category as before, but with the addition that if any rule tries to read the gender of a node that doesn't have a gender tag, the result will be <code><GD></code> rather than the empty string. It also states that any remaining <code><GD></code> tags will be replaced with <code><m></code> tags in the output step.<br />
<br />
An attribute category can also specify certain values as non-overwritable.<br />
<br />
gender = m f @mf;<br />
<br />
This states that if a lexical unit has a target-language <code><mf></code> tag and a rule tries to replace that tag with something else, the <code><mf></code> tag will be used instead of the replacement.<br />
<br />
An attribute category can include another:<br />
<br />
definite = def ind;<br />
<br />
! The following are equivalent:<br />
det_type = dem [definite] pos;<br />
det_type = dem def ind pos;<br />
<br />
The name of an attribute category cannot be any of the following:<br />
<br />
{| class=wikitable<br />
|-<br />
! Name<br />
! Meaning<br />
|-<br />
| <code>lem</code><br />
| The lemma of an LU or chunk<br />
|-<br />
| <code>lemcase</code><br />
| The case of the lemma of an LU or chunk<br />
|-<br />
| <code>lemh</code> and <code>lemq</code><br />
| The first and second parts of a multiword with inner inflection. See [[Multiwords]], where lemh corresponds to the inflected portion and lemq is the portion in <g><br />
|-<br />
| <code>tags</code><br />
| All of the tags of an LU or chunk<br />
|-<br />
| <code>pos_tag</code><br />
| The first tag or an LU or chunk<br />
|-<br />
| <code>whole</code>, <code>chname</code>, <code>chcontent</code>, and <code>content</code><br />
| Internal names, some of which are included primarily for compatibility with t*x rules. Do not use these directly<br />
|}<br />
<br />
=== Tag Order ===<br />
<br />
The order of tags for each type of node must be defined like this:<br />
<br />
n: _.gender.number;<br />
adj: _.gender;<br />
NP: _.number;<br />
<br />
Where <code>_</code> represents the part of speech tag. If the part of speech tag is different between the source and target languages, the target language one will be used. The lemma head is automatically appended at the beginning of the pattern and the lemma queue is automatically attached to the end.<br />
<br />
To specify a literal tag in a pattern, put it in angle brackets:<br />
<br />
det: _.<def>.number;<br />
<br />
Which tag order to use is determined solely by the first tag in the in the pattern of the reduction rule. See the output section below for how to override this choice. See also the macro section for a more powerful version of these rules.<br />
<br />
The underscore is not mandatory and is merely a shorthand for <code>pos_tag</code>. Thus a literal part of speech tag can be used instead.<br />
<br />
=== Tag Rewrite Rules ===<br />
<br />
This is a way to convert certain sets of tags, either between two languages that have different sets of tenses, or between something like object agreement and number marking.<br />
<br />
object_agr = o1sg o1pl o2sg o2pl o3sg o3pl ;<br />
number = sg pl ;<br />
person = p1 p2 p3 ;<br />
<br />
object_agr > person: o1sg p1, o1pl p1, o2sg p2, o2pl p2, o3sg p3, o3pl p3 ;<br />
object_agr > number: o1sg sg, o1pl pl, o2sg sg, o2pl pl, o3sg sg, o3pl pl ;<br />
<br />
VP -> @v NP {2[number=1.object_agr] _1 1} ;<br />
<br />
In this example, if the verb had <code><o2sg></code>, it would be converted to <code><sg></code> when it was set as the <code>number</code> attribute of the noun.<br />
<br />
tense = farpst nearpst pst prs fut nonpst ;<br />
<br />
tense > tense: farpst pst, nearpst pst, prs nonpst, fut nonpst ;<br />
<br />
In this example, no explicit assignment needs to take place and the 4 tenses of the source language (<code>farpst, nearpst, prs, fut</code>) would be automatically converted to the 2 of the target language (<code>pst, nonpst</code>).<br />
<br />
Converting from 4 to 3 with something like <pre>tense > tense: farpst pst, nearpst pst ;</pre><br />
will also work, the unchanged tags not needing to be explicitly mentioned.<br />
<br />
Tags rewrite rules apply in the output step. When building the parse tree, only unconverted tags are used to create chunks. This makes conversions like the following one potentially dangerous:<br />
<br />
tense > tense: midpst pst, pst pri;<br />
<br />
If tense is being propagated down through multiple chunks, any <code><midpst></code> tags will get converted to <code><pst></code> and then converted again to <code><pri></code>.<br />
<br />
It is also possible to explicitly convert a value, for example when doing comparisons:<br />
<br />
1.object_agr>number<br />
1.object_agr>person<br />
<br />
These will clip <code>object_agr</code> and convert it to <code>number</code> and <code>person</code> immediately, regardless of where they are evaluated.<br />
<br />
Like attribute category definitions, tag rewrite rules can refer to entire output categories by enclosing them in square brackets. Since the replacement must result in a single value, this can only be done on the source side.<br />
<br />
pasts = farpst midpst nearpst;<br />
tense = farpst midpst nearpst past pres fut;<br />
<br />
! These are equivalent:<br />
tense > tense : [pasts] past;<br />
tense > tense : farpst past, midpst past, nearpst past;<br />
<br />
=== Reduction Rules ===<br />
<br />
Reduction rules consist of a node type, an optional name, an optional weight, a pattern, an optional condition, an optional variable setting, and an output, in that order.<br />
<br />
NP -> det n {2 _1 1};<br />
<br />
This matches a determiner followed by a noun, combines them into an NP chunk, and at output time produces "noun determiner".<br />
<br />
NP -> 1: n {1} |<br />
2: n.*.def {the@det.def.sg _ 1};<br />
<br />
Here the first rule will match any noun, while the second will match a noun with a <code><def></code> tag. Since the second rule has a higher weight, the first rule will not be applied if they both match.<br />
<br />
NP -> NP and@cnjcoo NP [$number=pl] {1 _1 2 _2 3};<br />
<br />
Here the rule specifies that the resulting chunk will be marked with a <code><pl></code> tag.<br />
<br />
AP -> adj and@cnjcoo adj ?(1.gender/sl = 3.gender/sl) {1 _1 2 _2 3};<br />
<br />
This rule will not apply if the two adjectives have different genders.<br />
<br />
The arrow can be written as either <code>-></code> or <code>→</code>.<br />
<br />
A name is written in double quotes before the weight.<br />
<br />
DP -> "de > gen" NP de@pr NP { 3 + 's@gen _1 1 } ;<br />
<br />
If the first lemma is quoted and no weight is present, it may be interpreted as a name. This issue can be dealt with by adding an explicit 0 weight.<br />
<br />
! Don't write this:<br />
VP -> "be# taller than"@v NP { 2 _1 1 } ;<br />
<br />
! Write this instead:<br />
VP -> 0: "be# taller than"@v NP { 2 _1 1 } ;<br />
<br />
The process by which rules are selected is described [[User:Popcorndude/Recursive_Transfer/Parser | here]].<br />
<br />
==== Multiple Outputs ====<br />
<br />
A rule can have multiple outputs, but any non-chunk output cannot be conditioned. A rule with multiple outputs is useful for treating certain tokens as if they occurred in a different order. To write such a rule, list multiple nodes before the arrow and wrap multiple outputs in another set of curly braces (<code>{}</code>).<br />
<br />
DP clitic -> det clitic NP { { 1[number=3.number] _1 3 } _2 2 } ;<br />
DP -> det NP { 1[number=2.number] _1 2 } ;<br />
<br />
Here the first rule is essentially equivalent to moving the clitic later in the input stream and then applying the second rule.<br />
<br />
Multi-output rules are closely related to [[#Interpolation|interpolation]], which is described below.<br />
<br />
=== Patterns ===<br />
<br />
An element of a pattern must match a single, literal part of speech tag. In order to match multiple part of speech tags, create a separate rule which matches each of them:<br />
<br />
NOM -> n {1} | np {1};<br />
<br />
To match a lemma or pseudolemma, place it before the part of speech tag, separated by <code>@</code>:<br />
<br />
NP -> the@det n {2 _1 1};<br />
<br />
It is also possible to match a category of lemmas:<br />
<br />
days = sunday monday tuesday wednesday thursday friday saturday;<br />
date -> $days@n the@det num.ord {2 _2 3 _1 1};<br />
<br />
Tags besides part of speech can be matched like this:<br />
<br />
VP -> vbser vblex.pp {1 _1 2};<br />
<br />
To match a set of tags, enclose the category name in square brackets:<br />
<br />
non_finite = pp ger;<br />
VP -> vbser vblex.[non_finite] {1 _1 2};<br />
<br />
Pattern elements can also specify values for the tags of the chunk being output by the rule.<br />
<br />
number = (ND sg) sg pl sp ND;<br />
NP: _.number;<br />
NP -> n.$number adj {1};<br />
<br />
This rule specifies that the number tag of the NP chunk should be copied from the noun. It will use the target language side if that is available. If not, it will proceed to the reference side, and then the source side. If all three of these are empty, it will use the default value <code><ND></code>. To require that a particular variable be taken from a particular side, put the side after a slash:<br />
<br />
NP: number;<br />
NP -> det.$number/ref n {1 _1 2};<br />
<br />
<code>/sl</code> refers to the source language, <code>/tl</code> to the target language, and <code>/ref</code> to anything added by anaphora resolution.<br />
<br />
If a pattern element is contributing several tags to the chunk, the following shortcut is available:<br />
<br />
NP: _.number.gender;<br />
NP -> %n adj {2 _1 1};<br />
<br />
The <code>%</code> indicates the noun is the source of all chunk tags not elsewhere specified.<br />
<br />
To specify a literal value for a chunk tag, put it in square brackets after the pattern like this:<br />
<br />
NP: _.gender.number;<br />
NP -> 0: NP cnjcoo NP [$gender=m, $number=pl] {1 _1 2 _2 3} |<br />
1: NP.f cnjcoo NP.f [$gender=f, $number=pl] {1 _1 2 _2 3} |<br />
2: NP.*.sg or@cnjcoo NP.*.sg [$gender=m, $number=sg] {1 _1 2 _2 3} |<br />
3: NP.f.sg or@cnjcoo NP.f.sg [$gender=f, $number=sg] {1 _1 2 _2 3} ;<br />
<br />
That is, treat the gender of the phrase as masculine unless both elements are feminine and the number as singular unless the conjunction is "or" and both elements are singular.<br />
<br />
These values can also be conditioned, condensing the above rules to:<br />
<br />
NP -> NP cnjcoo NP [$gender=(if (1.gender = f and 3.gender = f) f else m),<br />
$number=(if (2.lem =cl "or" and 1.number = sg and 3.number = sg) sg else pl)]<br />
{1 _1 2 _2 3} ;<br />
<br />
The pattern only looks at the source language, but it is possible to add constraints:<br />
<br />
conj_list = and or;<br />
NP: _.gender.number;<br />
NP -> %NP cnjcoo NP ?((2.lem/tl in conj_list) and ~(3.gender = 1.gender)) {1 _1 2 _2 3};<br />
<br />
This will only match the pattern if it is also the case that the target language lemma of the conjunction is "and" or "or" and the two NPs have different genders. See below for the syntax of conditions.<br />
<br />
{| class="wikitable"<br />
|-<br />
! Piece of pattern<br />
! Meaning<br />
|-<br />
| <pre>.x</pre><br />
| A literal tag <code><x></code><br />
|-<br />
| <pre>.[x]</pre><br />
| Any tag in the category <code>x</code><br />
|-<br />
| <pre>.$x</pre><br />
| When building the output chunk for this rule, the value of the <code>x</code> attribute should come from this element<br />
|-<br />
| <pre>.*</pre><br />
| 0 or more arbitrary tags. Note: this contrasts with other places in the pipeline where <code>*</code> must match at least 1 tag. A final <code>.*</code> is automatically appended to every pattern<br />
|}<br />
<br />
=== Outputs ===<br />
<br />
Output elements are written between curly braces and may be any of the following:<br />
<br />
==== Blanks ====<br />
<br />
An underscore represents a single space. An underscore followed by a number represents the superblank after that position, so <code>1 _ 2</code> is elements 1 and 2 separated by a space while <code>1 _1 2</code> is elements 1 and 2 separated by whatever separated them in the input.<br />
<br />
==== Matched Elements ====<br />
<br />
A number represents the input element in that position with its tags arranged according to the defined output pattern for its part of speech tag. It can be followed by a specification of where those tags should come from.<br />
<br />
1<br />
! the first input element<br />
<br />
1[gender=f]<br />
! the first input element with the gender tag <f><br />
<br />
1[gender=2.gender/ref]<br />
! the first input element with the gender tag of the reference side of the second input element<br />
<br />
1[gender=$gender]<br />
! the first input element with the gender tag set to a placeholder to be filled on output with the gender tag of its parent chunk<br />
<br />
These elements can also be prefixed with <code>%</code> to specify that as many tags as possible should be placeholders for tags of the parent chunk.<br />
<br />
These elements can be conjoined using +:<br />
<br />
1[gender=f] + 2<br />
<br />
This will generate something like <code>^blah<n><f>+bloop<adj>$</code>.<br />
<br />
Conjoining is currently disallowed if one side is in an if statement and the other is not. It is thus also disallowed if the tag-order rule for either element is a macro.<br />
<br />
By default, the order of the output tags is based on the output pattern corresponding to the part of speech tag in the pattern. However, it is possible to override this using parentheses:<br />
<br />
vblex: _.tense.person.number;<br />
vbinf: _.<inf>;<br />
<br />
V -> vblex.inf {1};<br />
! result: ^whatever<vblex><inf><{person}><{number}>$<br />
<br />
V -> vblex.inf {1(vbinf)};<br />
! result: ^whatever<vblex><inf>$<br />
<br />
==== Literal Lexical Units ====<br />
<br />
A new lexical unit can be inserted like this:<br />
<br />
the@det.def.mf.sp<br />
<br />
Placeholders can be included using <code>$</code>:<br />
<br />
the@det.def.$gender.sp<br />
<br />
And clips from other elements can be placed in square brackets:<br />
<br />
the@det.def.[2.gender].[3.number/sl]<br />
<br />
Literal lexical units can also be constructed via the same syntax as matched elements, but with a lemma rather than a number.<br />
<br />
the(det)[gender=2.gender, number=2.number]<br />
<br />
When constructed in this way, the tag-order specification is mandatory.<br />
<br />
==== Output Conditionals ====<br />
<br />
An output conditional evaluates a sequence of conditions and outputs the element corresponding to the first one that evaluates to true. The element to be output can be any of the possibilities listed above, the entire chunk, or another conditional.<br />
<br />
NP -> NP cnjcoo NP<br />
(if (2.lem/sl = and)<br />
{ 1 _1 3 }<br />
else<br />
{ 1 _1 2 _2 3 } );<br />
<br />
Here the rule determines what the final output will be based on the lemma of the conjunction.<br />
<br />
PP -> DP ?(1.case in might_get_pr)<br />
(if (1.prep_flag = none)<br />
{ 1 }<br />
else<br />
{ (if (1.prep_flag = to)<br />
to@pr<br />
else-if (1.prep_flag = at)<br />
at@pr<br />
else-if (1.prep_flag = in)<br />
in@pr<br />
else-if (1.prep_flag = on)<br />
on@pr<br />
else<br />
for@pr<br />
)<br />
_ 1 } );<br />
<br />
Here the rule determines first whether to add a preposition. If it is going to add a preposition, it creates a chunk and within that chunk, has another if statement to determine which preposition to add.<br />
<br />
The first clause is labeled "if", the last can be "else" or "otherwise", and intermediate ones can be "if", "else-if", or "elif". These labels follow the same rules as logical operators - that is, capitalization, "-", and "_" are all ignored.<br />
<br />
For the output of an if statement to have multiple elements, surround those elements with square brackets. Thus the conjunction rule above can be rewritten as follows:<br />
<br />
NP -> NP cnjcoo NP<br />
{ 1 _1<br />
(if (2.lem/sl = and)<br />
[ 2 _2 ]<br />
else [] )<br />
3 };<br />
<br />
=== Conditions ===<br />
<br />
Conditions are written in parentheses. A condition is a value, an operator, and another value. If the operator is "and" or "or" these values are other conditions, otherwise they are clips or strings. A condition can be negated by writing "not" before the operator.<br />
<br />
(1.case = 2.case) ! true if the first and second elements have the same case, otherwise false<br />
(1.case not = 2.case) ! the reverse of the previous line<br />
<br />
The full list of operations is as follows:<br />
<br />
{| class="wikitable" border="1"<br />
|-<br />
! Name<br />
! Description<br />
! Alternate Spellings<br />
|-<br />
| And<br />
| Evaluates to true if both arguments evaluates to true, otherwise false<br />
| &<br />
|-<br />
| Or<br />
| Evaluates to true if either argument evaluates to true, otherwise false<br />
| <nowiki>|</nowiki><br />
|-<br />
| Equal<br />
| Evaluates to true if the arguments are identical strings<br />
| =<br />
|-<br />
| IsPrefix<br />
| Evaluates to true if the right argument occurs at the beginning of the left argument<br />
| StartsWith, BeginsWith<br />
|-<br />
| IsSuffix<br />
| Evaluates to true if the right argument occurs at the end of the left argument<br />
| EndsWith<br />
|-<br />
| IsSubstring<br />
| Evaluates to true if the right argument occurs anywhere in the left argument<br />
| Contains<br />
|-<br />
| HasPrefix<br />
| Evaluates to true if the left argument begins with anything in the list named by the right argument<br />
| StartsWithList, BeginsWithList<br />
|-<br />
| HasSuffix<br />
| Evaluates to true if the left argument ends with anything in the list named by the right argument<br />
| EndsWithList<br />
|-<br />
| In<br />
| Evaluates to true if the left argument is a member of the list named by the right argument<br />
| ∈<br />
|}<br />
<br />
Any of these operators (besides And and Or) can be made to ignore case by adding one of "cl", "caseless", "fold", "foldcase".<br />
<br />
maybe_get_pr = dat obj;<br />
(1.case in maybe_get_pr)<br />
<br />
footwear = boot sock shoe sandal;<br />
((1.number = du) and (1.lem/tl in_caseless footwear))<br />
! note that "in-case-less", "incl", "IN-cl", and "__IN_CASE_LESS__" would all also work here.<br />
<br />
=== Macros ===<br />
<br />
The macro facility is a combination of tag order rules and output conditionals.<br />
<br />
det_type = dem def ind;<br />
<br />
det_dem: _.<dem>.distance;<br />
det_def: _.definite.number;<br />
<br />
det: (if (1.det_type = dem)<br />
1(det_dem)<br />
else<br />
1(def_def)<br />
);<br />
<br />
Here we define a "det" pattern which will apply the "det_dem" pattern or the "det_def" pattern to its argument based on whether that argument has a <code><dem></code> tag. Since this is a tag order pattern it will be applied to all <code><det></code>s by default and can also be manually applied to other things with the <code>3(det)</code> syntax.<br />
<br />
Macros are only allowed to clip from the input node (referred to as <code>1</code>), including any values passed in.<br />
<br />
If a macro specifies a value for an attribute, it will override anything that is passed in. Thus if the above example had <code>1(det_dem)[distance=prx]</code> rather than <code>1(det_dem)</code>, invoking it as <code>2(det)[distance=dist]</code> and as <code>2(det)[distance=med]</code> would make no difference and the output would be have <code><prx></code> regardless.<br />
<br />
A macro is not required to function as a tag order pattern and may output anything or nothing, so long as it only accesses attributes of the input node.<br />
<br />
If you want to call a macro and what node gets passed in doesn't matter, you can use the symbol <code>*</code> to represent an empty node.<br />
<br />
maybe_det: (if (1.definite = def)<br />
[the@det.def.sp _]<br />
elif (1.number = sg)<br />
[a@det.ind.sp _]<br />
else [] );<br />
DP -> n { *(maybe_det)[number=1.number, definite=$definite] 1 };<br />
<br />
This will insert "the" if the DP is definite, "a" if it's indefinite and singular, and will output only the noun otherwise.<br />
<br />
Since a macro needs to be contained in a conditional but not all macros are conditional, they permit the keyword <code>always</code> in addition to <code>if</code>, <code>else</code>, etc.<br />
<br />
vaux: (always 1(vblex));<br />
<br />
This macro is essentially an alias of <code>vblex</code>.<br />
<br />
=== Interpolation ===<br />
<br />
Sometimes it is necessary to insert words into existing nodes, such as when generating certain clitics.<br />
<br />
NP -> adj n { 2 _1 1 } ;<br />
DP -> det NP (if ($lu-count = "2")<br />
{ 1 _1 2 }<br />
else<br />
{ 1 _ >3 _1 2 } ) ;<br />
VP -> DP v.pprs { 1 < be(vaux) _1 2 } ;<br />
<br />
These rules represent a scenario where target language present progressive is marked with a clitic which is placed between a determiner and noun phrase. At the <code>VP</code> level, the clitic is created and inserted into the <code>DP</code> with a less-than sign (<code><</code>). Then at the <code>DP</code> level, the rule checks whether anything has been inserted by checking whether the value of <code>$lu-count</code> is 2, which it would be if nothing had been inserted. If <code>$lu-count</code> is not 2, then the inserted item is output in the appropriate place.<br />
<br />
The inserted value is referred to as <code>>3</code> rather than <code>3</code> to tell the compiler that it is an inserted value so as to prevent error messages about trying to access a node that doesn't exist.<br />
<br />
Some possible input and output from these rules (written monolingually for simplicity):<br />
<br />
^the<det>$ ^green<adj>$ ^frog<n>$ ^speak<v><pprs>$<br />
<br />
the NP[green frog] speak<br />
DP[the NP[green frog]] speak<br />
VP[DP[the NP[green frog]] speak]<br />
<br />
DP[the NP[green frog] be] speak<br />
the be NP[green frog] speak<br />
the be frog green speak<br />
<br />
^the<det>$ ^be<vaux>$ ^frog<n>$ ^green<adj>$ ^speak<v>$<br />
<br />
This is functions as the inverse of [[#Multiple_Outputs|rules with multiple outputs]]. The reverse of the above rules could be something like this:<br />
<br />
NP -> n adj { 2 _1 1 } ;<br />
DP -> det NP { 1 _1 2 } ;<br />
DP vaux -> det vaux NP { { 1 _1 3 } _2 2 } ;<br />
VP -> DP vaux v { 1 _1 3[tense=pprs] } ;<br />
<br />
Multi-output rules take things inside and moves them out, while interpolation takes things outside and moves them in.<br />
<br />
=== Global Variables ===<br />
<br />
For passing nodes up and down a tree, an alternative to multi-output rules and interpolation is global variables. Global variables are referred to with double dollar signs and are set in the attribute literal section of a rule.<br />
<br />
VP -> %vblex DP.$itg [$$wh_word=(if (2.itg = itg) 2)] { 1 (if (2.itg not = itg) [ _1 2 ]) } ;<br />
<br />
The value of this variable can then be included in the output step of any rule.<br />
<br />
S -> DP.nom VP { (if (2.itg = itg) [ $$wh_word _ ] ) 1 _1 2 } ;<br />
<br />
If a rule attempts to output an unset variable, the result will be no output. All variables are reset at the end of the output step.<br />
<br />
=== Clips ===<br />
<br />
When clipping a tag or lemma from a lexical unit in the input stream, <code>/sl</code> refers to source language, <code>/tl</code> refers to target language, and <code>/ref</code> refers to the output of apertium-anaphora. Chunks, meanwhile, have only 1 side, which is <code>/tl</code>. If the side is left unspecified, then <code>/tl</code> will be clipped. However, if an LU is being clipped from and the value for <code>/tl</code> is empty or is the unspecified value for that attribute category, it will try again with <code>/ref</code> and then with <code>/sl</code>.<br />
<br />
Almost anywhere that a clip or a literal value is used as a value, it can be replaced with an if statement using the same syntax as output conditionals and macros.<br />
<br />
VP -> v vaux [$negative=(if (1.negative = neg or 2.negative = neg) neg else pos)]<br />
{ 2 _1 1[tense=(if (2.lem in verb_ing) pprs) else inf)] } ;<br />
<br />
The one exception is embedding an if statement inside a conditional: <code>(x in (if ...))</code>.<br />
<br />
=== Brackets ===<br />
<br />
A summary of which means what where:<br />
<br />
{| class="wikitable"<br />
|-<br />
! Bracket<br />
! General Meaning<br />
! Uses<br />
! Examples<br />
! Comments<br />
|-<br />
|rowspan="6"| <code>()</code><br />
|rowspan="6"| Condition<br />
| If statement<br />
| <code>(if ...)</code><br />
|<br />
|-<br />
| Condition<br />
| <code>(a in b)</code><br />
| in an if statement<br />
|-<br />
| Pattern condition<br />
| <code>NP NP ?(1.case = 2.case)</code><br />
|<br />
|-<br />
| Pattern override<br />
| <code>1(vb_impers)</code><br />
| use the vb_impers output rule rather than the output rule chosen based on the pattern<br />
|-<br />
| Macro invocation<br />
| <code>*(maybe_det)</code><br />
|<br />
|-<br />
| Attribute defaults<br />
| <code>gender = (GD m) m f mf GD;</code><br />
|<br />
|-<br />
|rowspan="6"| <code>[]</code><br />
|rowspan="6"| List<br />
| Rule variable setting<br />
| <code>[$tense=past, $number=sg]</code><br />
|<br />
|-<br />
| Node variable setting<br />
| <code>1[tense=2.tense, number=2.number]</code><br />
|<br />
|-<br />
| Set inclusion<br />
| <code>tense = [finite] inf ger;</code><br />
| tense is composed of <inf>, <ger> and everything in finite<br />
|-<br />
| Group tag rewriting<br />
| <code>poss > number : [poss_sg] sg, [poss_pl] pl;</code><br />
|<br />
|-<br />
| Grouping in if statements<br />
| <code>(if (whatever) [1 _ 2])</code><br />
|<br />
|-<br />
| Pattern tag sets<br />
| <code>NP.*.[case_not_nomacc]</code><br />
|<br />
|-<br />
|rowspan="2"| <code>{}</code><br />
|rowspan="2"| Chunk<br />
| Output<br />
| <code>NP -> n { 1 } ;</code><br />
|<br />
|-<br />
| Chunk<br />
| <code>NP clitic -> det clitic NP { { 1 _ 3 } _ 2 } ;</code><br />
|<br />
|}<br />
<br />
== Correspondence with t*x ==<br />
<br />
number = sg pl;<br />
gender = m f;<br />
pre_adj = gran buen;<br />
<br />
n: _.gender.number;<br />
adj: _.gender;<br />
NP: _.number;<br />
<br />
NP -> adj n.$number ?(1.number = 2.number)<br />
(if (1.lem/tl incl pre_adj)<br />
{1[gender=2.gender] _1 2}<br />
else<br />
{2 _1 1[gender=2.gender]}<br />
) ;<br />
<br />
<transfer><br />
<section-def-cats><br />
<def-cat "n"><br />
<cat-item tags="n"/><br />
<cat-item tags="n.*"/><br />
</def-cat><br />
<def-cat "adj"><br />
<cat-item tags="adj"/><br />
<cat-item tags="adj.*"/><br />
</def-cat><br />
</section-def-cats><br />
<section-def-attrs><br />
<def-attr n="number"><br />
<attr-item tags="sg"/><br />
<attr-item tags="pl"/><br />
</def-attr><br />
<def-attr n="gender"><br />
<attr-item tags="m"/><br />
<attr-item tags="f"/><br />
</def-attr><br />
</section-def-attrs><br />
<section-def-lists><br />
<def-list n="pre_adj"><br />
<list-item v="gran"/><br />
<list-item v="buen"/><br />
</def-list><br />
</section-def-lists><br />
<section-rules><br />
<rule comment="adj n"><br />
<pattern><br />
<pattern-item n="adj"/><br />
<pattern-item n="n"/><br />
</pattern><br />
<action><br />
<choose><br />
<when><br />
<test><br />
<not><br />
<equal><br />
<clip pos="1" side="tl" part="number"/><br />
<clip pos="2" side="tl" part="number"/><br />
</equal><br />
</not><br />
</test><br />
<reject-current-rule/><br />
</when><br />
</choose><br />
<choose><br />
<when><br />
<test><br />
<in caseless="yes"><br />
<clip pos="1" side="tl" part="lem"/><br />
<list n="pre_adj"/><br />
</in><br />
</test><br />
<out><br />
<chunk name="default"><br />
<tags><br />
<tag><lit-tag v="NP"/></tag><br />
</tags><br />
<lu><br />
<clip pos="1" side="tl" part="lemh"/><br />
<lit-tag v="adj"/><br />
<clip pos="2" side="tl" part="gender"/><br />
</lu><br />
<b pos="1"/><br />
<lu><br />
<clip pos="2" side="tl" part="lemh"/><br />
<lit-tag v="n"/><br />
<clip pos="2" side="tl" part="gender"/><br />
<clip pos="2" side="tl" part="number"/><br />
</lu><br />
</chunk><br />
</out><br />
</when><br />
<otherwise><br />
<out><br />
<chunk name="default"><br />
<tags><br />
<tag><lit-tag v="NP"/></tag><br />
</tags><br />
<lu><br />
<clip pos="1" side="tl" part="lemh"/><br />
<lit-tag v="adj"/><br />
<clip pos="2" side="tl" part="gender"/><br />
<clip pos="1" side="tl" part="lemq"/><br />
</lu><br />
<b pos="1"/><br />
<lu><br />
<clip pos="2" side="tl" part="lemh"/><br />
<lit-tag v="n"/><br />
<clip pos="2" side="tl" part="gender"/><br />
<clip pos="2" side="tl" part="number"/><br />
<clip pos="2" side="tl" part="lemq"/><br />
</lu><br />
</chunk><br />
</out><br />
</otherwise><br />
</choose><br />
</action><br />
</rule><br />
</section-rules><br />
</transfer><br />
<br />
{| class="wikitable" border="1"<br />
|-<br />
| <pre><br />
number = sg pl;<br />
</pre><br />
| <pre><br />
<def-attr n="number"><br />
<attr-item tags="sg"/><br />
<attr-item tags="pl"/><br />
</def-attr><br />
<def-list n="number"><br />
<list-item v="sg"/><br />
<list-item v="pl"/><br />
</def-list><br />
</pre><br />
| It isn't shown in the above example, but each list simultaneously defines an attribute category and a list.<br />
|-<br />
| <pre>n: _.gender.number;</pre><br />
| (no direct equivalent)<br />
|<br />
|-<br />
| <pre>NP -> </pre><br />
| <pre><br />
<tags><br />
<tag><lit-tag v="NP"/></tag><br />
...<br />
</tags><br />
</pre><br />
| The further contents of <code><tags></code> is determined by <code>NP: _.number;</code>, which indicates that those contents will be a <code>number</code> tag, probably clipped from one of the inputs.<br />
|-<br />
| <pre>n</pre><br />
| <pre><def-cat n="some_unique_name"><br />
<cat-item tags="n"/><br />
<cat-item tags="n.*"/><br />
</def-cat><br />
...<br />
<pattern-item n="some_unique_name"/><br />
</pre><br />
|<br />
|-<br />
| <pre>.$number</pre><br />
| <pre><clip pos="2" side="tl" part="number"/></pre><br />
| This determines the contents of <code><tags></code> in the output chunk.<br />
|-<br />
| <pre>?</pre><br />
| <pre><br />
<choose><br />
<when><br />
<test><br />
<not><br />
...<br />
</not><br />
</test><br />
<reject-current-rule/><br />
</when><br />
</choose><br />
</pre><br />
| There is no functionality equivalent to <code><reject-current-rule shifting="yes"/></code>.<br />
|-<br />
| <pre>1.number</pre><br />
| <pre><clip pos="1" part="number"/></pre><br />
| In the example rule, the clips are written as being <code>side="tl"</code>, but an unspecified clip will actually check all three sides (target, then reference, then source) until it finds a value.<br />
|-<br />
| <pre><br />
(if (...)<br />
...<br />
else<br />
...<br />
)<br />
</pre><br />
| <pre><br />
<choose><br />
<when><br />
<test><br />
...<br />
</test><br />
<out><br />
...<br />
</out><br />
</when><br />
<otherwise><br />
<out><br />
...<br />
</out><br />
</otherwise><br />
</choose><br />
</pre><br />
|<br />
|-<br />
| <pre>(... incl ...)</pre><br />
| <pre><br />
<in caseless="yes"><br />
...<br />
<list n="..."/><br />
</in><br />
</pre><br />
|<br />
|-<br />
| <pre>_1</pre><br />
| <pre><b pos="1"/></pre><br />
|<br />
|-<br />
| <pre>1[gender=2.gender]</pre><br />
| <pre><br />
<lu><br />
<clip pos="1" side="tl" part="lemh"/><br />
<lit-tag v="adj"/><br />
<clip pos="2" side="tl" part="gender"/><br />
<clip pos="1" side="tl" part="lemq"/><br />
</lu><br />
</pre><br />
| <code><lit-tag v="adj"/></code> should actually be <code><clip pos="1" side="tl" part="pos_tag"/></code> where <code>pos_tag</code> is a special attribute that returns whatever the first tag is.<br />
|-<br />
| <pre>{ ... }</pre><br />
| <pre><br />
<chunk name="default"><br />
...<br />
</chunk><br />
</pre><br />
| It is possible to make the name be something other than default, for example with <code>n.$lem/sl</code> in the pattern.<br />
|}<br />
<br />
Technically this would compile to a rule which outputs an <code>NP</code> chunk containing the input unchanged and also a separate postchunk rule that would do the actual rearranging so that the conditionals can depend on changed values of the chunk tags.</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Task_ideas_for_Google_Code-in&diff=70540
Task ideas for Google Code-in
2019-10-26T14:47:42Z
<p>Marcriera: /* Task ideas */</p>
<hr />
<div><br />
{{TOCD}}<br />
This is the task ideas page for [https://developers.google.com/open-source/gci/ Google Code-in], here you can find ideas on interesting tasks that will improve your knowledge of Apertium and help you get into the world of open-source development.<br />
<br />
The people column lists people who you should get in contact with to request further information. All tasks are 2 hours maximum estimated amount of time that would be spent on the task by an experienced developer, however:<br />
<br />
<!--# '''this does not include time taken to [[Minimal installation from SVN|install]] / set up apertium (and relevant tools)'''.--><br />
# this is the time expected to take by an experienced developer, you may find that you spend more time on the task because of the learning curve. <br />
<br />
<!--Если ты не понимаешь английский язык или предпочитаешь работать над русским языком или другими языками России, смотри: [[Task ideas for Google Code-in/Russian]]--><br />
'''Categories:'''<br />
<br />
* {{sc|code}}: Tasks related to writing or refactoring code<br />
* {{sc|documentation}}: Tasks related to creating/editing documents and helping others learn more<br />
* {{sc|research}}: Tasks related to community management, outreach/marketting, or studying problems and recommending solutions<br />
* {{sc|quality}}: Tasks related to testing and ensuring code is of high quality.<br />
* {{sc|design}}: Tasks related to user experience research or user interface design and interaction<br />
<br />
'''Clarification of "multiple task" types'''<br />
* multi = number of students who can do a given task<br />
* dup = number of times a student can do the same task<br />
<br />
You can find descriptions of some of the mentors [[List_of_Apertium_mentors | here]].<br />
<br />
== Task ideas ==<br />
<br />
'''The current task ideas here are for 2019.''' See [[Talk:Task ideas for Google Code-in]] for task ideas from previous years.<br />
<br />
<table class="sortable wikitable"><br />
<tr><th>type</th><th>title</th><th>description</th><th>tags</th><th>mentors</th><th>bgnr?</th><th>multi?</th><th>duplicates</th></tr><br />
<br />
{{Taskidea<br />
|type=research, quality, documentation<br />
|title=Adopt a Wiki page<br />
|description=Request an Apertium wiki account and adopt a wiki page by updating and fixing any issues with it. Examples of things to update might be documentation that still refers to our SVN repo (we're on GitHub now), documentation of new features, clarification of unclear things, indicating that a page no longer reflects how things are done, or updating documentation to reflect the current options and defaults of various tools.<br />
|tags=wiki<br />
|mentors=*<br />
|multi=150<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research, code<br />
|title=expand coverage of Kyrgyz to English structural transfer<br />
|description=Find a sentence in Kyrgyz that once the lexical items are added to the bilingual dictionary is not fully (or correctly) parsed by the <tt>kir-eng-transfer</tt> Apertium mode. Determine what rule(s) need(s) to be added (or fixed) to cover this structure, and update <tt>apertium-eng-kir.kir-eng.rtx</tt> accordingly. You will first want to clone and compile [https://github.com/apertium/apertium-eng-kir apertium-eng-kir].<br />
|tags=Kyrgyz, English, recursive transfer, pairs<br />
|mentors=JNW, popcorndude<br />
|multi=150<br />
|dup=10<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|title=Add recursive transfer support to a language pair that doesn't support it<br />
|description=Make a branch of an Apertium language pair that doesn't support recursive transfer and call it "recursive transfer". Add vanilla <tt>.rtx</tt> files for both directions, and modify <tt>Makefile.am</tt> and <tt>modes.xml</tt> so that the branch compiles and runs. See [http://wiki.apertium.org/wiki/Apertium-recursive#Incorporating_Into_a_Pair this page] for instructions on how to do this.<br />
|tags=recursive transfer, pairs<br />
|mentors=JNW, popcorndude<br />
|multi=150<br />
|dup=10<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|title=Add 2 recursive transfer rules to a language pair<br />
|description=Add two recursive transfer rules to an Apertium language pair. These rules consist of, at minimum, a syntactic pattern to match, a phrase to combine them into, and an output pattern ([http://wiki.apertium.org/wiki/Apertium-recursive#Further_Documentation more documentation here]). If the language pair does not support recursive transfer, make sure [http://wiki.apertium.org/wiki/Apertium-recursive#Incorporating_Into_a_Pair to set it up] first.<br />
|tags=recursive transfer, pairs<br />
|mentors=JNW, popcorndude<br />
|multi=150<br />
|dup=20<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW, wei2912<br />
|title=Use apertium-init to bootstrap a new language pair<br />
|description=Use the [[Apertium-init]] script to bootstrap a new translation pair between two languages which have monolingual modules already in Apertium. To see if a translation pair has already been made, search our repositories on [https://github.com/apertium/ github], and especially ask on IRC. Add 100 common stems to the dictionary. Your submission should be in the form of a repository on github that we can fork to the Apertium organisation.<br />
|tags=languages, bootstrap, dictionaries, translators<br />
|beginner=yes<br />
|multi=25<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW, wei2912<br />
|title=Use apertium-init to bootstrap a new language module<br />
|description=Use the [[Apertium-init]] script to bootstrap a new language module that doesn't currently exist in Apertium. To see if a language is available, search our repositories on [https://github.com/apertium/ github], and especially ask on IRC. Add enough stems and morphology to the module so that it analyses and generates at least 100 correct forms. Your submission should be in the form of a repository on github that we can fork to the Apertium organisation. [[Task ideas for Google Code-in/Add words from frequency list|Read more about adding stems...]]<br />
|tags=languages, bootstrap, dictionaries<br />
|beginner=yes<br />
|multi=25<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW, sevilay, Unhammer, marcriera<br />
|title=Write 10 lexical selection for an existing translation pair<br />
|description=Add 10 lexical selection rules to an existing translation pair. Submit your work as a github pull request to that pair. [[Task ideas for Google Code-in/Add lexical-select rules|Read more...]]<br />
|tags=languages, bootstrap, lexical selection, translators<br />
|multi=25<br />
|dup=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW, Unhammer<br />
|title=Write 10 constraint grammar rules for an existing language module<br />
|description=Add 10 constraint grammar rules to an existing language module for a language that you know. Submit your work as a github pull request to that pair. [[Task ideas for Google Code-in/Add constraint-grammar rules|Read more...]]<br />
|tags=languages, bootstrap, constraint grammar<br />
|multi=25<br />
|dup=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=Research<br />
|mentors=anakuz, fotonzade, khannatanmai<br />
|title=Syntactic annotation of text<br />
|description=Pick a text of about 200 words and make a syntactic annotation for it according to the Universal Dependencies treebank. UD Annotatrix can be used for visualisation. Consult with your mentor about the language.<br />
|tags=UD, trees, annotation<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=JNW,ftyers, fotonzade, khannatanmai, anakuz<br />
|title=Create a UD-Apertium morphology mapping<br />
|description=Choose a language that has a Universal Dependencies treebank and tabulate a potential set of Apertium morph labels based on the (universal) UD morph labels. See Apertium's [[list of symbols]] and [http://universaldependencies.org/ UD]'s POS and feature tags for the labels.<br />
|tags=morphology, ud, dependencies<br />
|beginner=<br />
|multi=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=JNW, ftyers, fotonzade, khannatanmai, anakuz<br />
|title=Create an Apertium-UD morphology mapping<br />
|description=Choose a language that has an Apertium morphological analyser and adapt it to convert the morphology to UD morphology<br />
|tags=morphology, ud, dependencies<br />
|beginner=<br />
|multi=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|title=Install Apertium and verify that it works<br />
|description=See [[Installation]] for instructions and if you encounter any issues along the way, document them and/or improve the wiki instructions!<br />
|tags=bash<br />
|mentors=ftyers, JNW, Unhammer, anakuz, Josh, fotonzade, sevilay, eirien, wei2912<br />
|multi=150<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|title=Write a contrastive grammar<br />
|description=Document 6 differences between two (preferably related) languages and where they would need to be addressed in the [[Apertium pipeline]] (morph analysis, transfer, etc). Use a grammar book/resource for inspiration. Each difference should have no fewer than 3 examples. Put your work on the Apertium wiki under [[Language1_and_Language2/Contrastive_grammar]]. See [[Farsi_and_English/Pending_tests]] for an example of a contrastive grammar that a previous GCI student made.<br />
|mentors=Mikel, JNW, Josh, xavivars, fotonzade, sevilay, khannatanmai, dolphingarlic<br />
|tags=wiki, languages<br />
|beginner=yes<br />
|multi=40<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=Mikel, anakuz, xavivars, fotonzade, sevilay, Unhammer, eirien, dolphingarlic, wei2912, marcriera<br />
|tags=xml, dictionaries, svn<br />
|title=Add 200 new entries to the bilingual dictionary of language pair %AAA%-%BBB%<br />
|description=Our translation systems require large lexicons so as to provide production-quality coverage of any input data. This task requires the student to add 200 new words to a bidirectional dictionary. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Grow_bilingual Read more]...<br />
|multi=40<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=Mikel, anakuz, xavivars, fotonzade, ftyers, sevilay, eirien, dolphingarlic, wei2912, marcriera<br />
|tags=xml, dictionaries, svn<br />
|title=Add 500 new entries to the bilingual dictionary of language pair %AAA%-%BBB%<br />
|description=Our translation systems require large lexicons so as to provide production-quality coverage of any input data. This task requires the student to add 500 new words to a bidirectional dictionary. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Grow_bilingual Read more]...<br />
|dup=10<br />
|beginner=no<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=fotonzade, JNW, ftyers, anakuz, xavivars, Mikel, shardulc, sevilay, Unhammer, khannatanmai, dolphingarlic, wei2912, marcriera<br />
|tags=xml, dictionaries, svn<br />
|title=Post-edit 500 sentences of any public domain text from %AAA% to %BBB%<br />
|description=Many of our systems benefit from statistical methods used with (ideally public domain) bilingual data. For this task, you need to translate a public domain text from %AAA% to %BBB% using any available machine translation system and clean up the translations yourself manually. Commit the post-edited texts (in plain text format) to an existing (via pull request) or if needed new github repository for the pair in dev/ or texts/ folder. The texts are subject to mentor approval.<br />
|multi=10<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=Mikel, anakuz, xavivars, fotonzade, sevilay, khannatanmai, dolphingarlic, wei2912, marcriera<br />
|tags=disambiguation, svn<br />
|title=Disambiguate 500 tokens of text in %AAA%<br />
|description=Run some text through a morphological analyser and disambiguate the output. Discuss with the mentor beforehand to approve the choice of language and text. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Manually_disambiguate_text Read more]...<br />
|multi=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=eirien, khannatanmai, anakuz, marcriera<br />
|tags=dictionaries<br />
|title=Categorise 100 words from frequency list in %AAA% <br />
|description=Categorise words by frequency into one of the major part-of-speech categories. You will receive a frequency list. Work from top to bottom. At the beginning of each line you should put a letter which categorises the word form by its part-of-speech. For example n for noun, v for verb, etc. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Categorise_words_from_frequency_list Read more]...<br />
|multi=yes<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=eirien, anakuz, sevilay. marcriera<br />
|tags=dictionaries<br />
|title=Categorise 500 words from frequency list in %AAA% <br />
|description=Categorise words by frequency into one of the major part-of-speech categories. You will receive a frequency list. Work from top to bottom. At the beginning of each line you should put a letter which categorises the word form by its part-of-speech. For example n for noun, v for verb, etc. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Categorise_words_from_frequency_list Read more]...<br />
|multi=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=khannatanmai, sevilay<br />
|tags=evaluation<br />
|title=Evaluate an existing apertium pair %AAA% to %BBB% on a text<br />
|description= Pick an existing apertium pair and get a parallel text for that language pair. Translate %AAA% to %BBB% and evaluate the translation using an automatic evaluation metric like BLEU or/and evaluate it manually (?) [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Evaluation_of_translation_of_an_existing_pair Read more]...<br />
}}<br />
<br />
{{Taskidea<br />
|type=code, research<br />
|title=Add apertium-anaphora support to a new language pair<br />
|description=Make a branch of an Apertium language pair that doesn't use apertium-anaphora yet, and call it "anaphora". Manually add the correct antecedent to side ref of the anaphors in the output of biltrans, and modify t1x to change the anaphor based on its antecedent. Verify that it runs and gives the correct anaphor. See the [http://wiki.apertium.org/wiki/Anaphora_Resolution_Module documentation] of the apertium-anaphora module for help.<br />
|tags=apertium-anaphora, transfer<br />
|mentors=khannatanmai<br />
}}<br />
<br />
{{Taskidea<br />
|type=code, research<br />
|title=Add one markable to the arx file for a language pair<br />
|description=The arx file is where we tell the anaphora resolution algorithm which patterns to detect and score, when we want to find the antecedent of an anaphor. Add one rule in this file which can help the algorithm find out the antecedent of an anaphor - specify the pattern to detect and the positive or negative score you want to give to the noun in this pattern. See the [http://wiki.apertium.org/wiki/Anaphora_Resolution_Module documentation] of the apertium-anaphora module for help.<br />
|tags=apertium-anaphora, anaphora resolution<br />
|mentors=khannatanmai<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|title=Write a TextMate grammar for CG-3 syntax highlighting<br />
|description=We want CG-3 files to be syntax highlighted in Github. They use TextMate-compatible grammars. See [https://github.com/TinoDidriksen/cg3/issues/48 issue 48] in the CG-3 repo.<br />
|tags=cg, editors, tools<br />
|mentors=Unhammer<br />
}}<br />
<br />
</table><br />
<br />
==Mentors==<br />
<br />
These are languages that can be substituted for AAA and/or BBB for tasks each mentor is listed to mentor above.<br />
<br />
{|class=wikitable<br />
! Mentor !! Languages<br />
|-<br />
| ftyers || eng, spa, cat, fra, nor, rus, por, swe, tur, gag, aze<br />
|-<br />
| JNW || eng, spa, fra, rus, tur, gag, aze, kaz, kir, kaa, tat, bak, kum, nog, kaa, uzb, uig, crh, khk, yid<br />
|-<br />
| anakuz || grn, spa, por, rus<br />
|-<br />
| fotonzade || eng, tur, aze, uig, tat, crh, kmr, ckb, fas<br />
|-<br />
| xavivars || cat, spa, eng, fra <br />
|-<br />
| unhammer || nno, nob, swe, dan, fao, sme, ovd<br />
|-<br />
| shardulc || eng, fra, mar, hin, urd, kan<br />
|-<br />
| m-alpha || eng, fra, byv<br />
|-<br />
| popcorndude || eng, spa<br />
|-<br />
| sevilay || tur, kaz, aze, tat, gag, eng<br />
|-<br />
| eirien || sah, rus, eng<br />
|-<br />
| khannatanmai || eng, hin<br />
|-<br />
| flammie || fin, krl, olo, hun, nio, kpv, mdf, tlh, fra, swe, eng, est, ekk, vro<br />
|-<br />
| dolphingarlic || afr, deu, eng<br />
|-<br />
| wei2912 || eng, zho<br />
|-<br />
| marcriera|| cat, spa, eng, ron<br />
|}<br />
<br />
== Counts ==<br />
<br />
Last updated by [[User:Eirien|Eirien]] ([[User talk:Eirien|talk]]) 22:29, 24 October 2019 (UTC+3).<br />
<br />
{| class="sortable wikitable"<br />
|-<br />
! scope="col" | Category<br />
! scope="col" | Count<br />
|-<br />
|{{sc|code}}<br />
|align="right"|35<br />
|-<br />
|{{sc|documentation}}<br />
|align="right"|5<br />
|-<br />
|{{sc|research}}<br />
|align="right"|11<br />
|-<br />
|{{sc|quality}}<br />
|align="right"|10<br />
|-<br />
|{{sc|design}}<br />
|align="right"|5<br />
|- class="sortbottom"<br />
!Total<br />
|align="right"|66<br />
|}<br />
<br />
[[Category:Google Code-in]]</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Task_ideas_for_Google_Code-in&diff=70539
Task ideas for Google Code-in
2019-10-26T14:44:36Z
<p>Marcriera: /* Mentors */</p>
<hr />
<div><br />
{{TOCD}}<br />
This is the task ideas page for [https://developers.google.com/open-source/gci/ Google Code-in], here you can find ideas on interesting tasks that will improve your knowledge of Apertium and help you get into the world of open-source development.<br />
<br />
The people column lists people who you should get in contact with to request further information. All tasks are 2 hours maximum estimated amount of time that would be spent on the task by an experienced developer, however:<br />
<br />
<!--# '''this does not include time taken to [[Minimal installation from SVN|install]] / set up apertium (and relevant tools)'''.--><br />
# this is the time expected to take by an experienced developer, you may find that you spend more time on the task because of the learning curve. <br />
<br />
<!--Если ты не понимаешь английский язык или предпочитаешь работать над русским языком или другими языками России, смотри: [[Task ideas for Google Code-in/Russian]]--><br />
'''Categories:'''<br />
<br />
* {{sc|code}}: Tasks related to writing or refactoring code<br />
* {{sc|documentation}}: Tasks related to creating/editing documents and helping others learn more<br />
* {{sc|research}}: Tasks related to community management, outreach/marketting, or studying problems and recommending solutions<br />
* {{sc|quality}}: Tasks related to testing and ensuring code is of high quality.<br />
* {{sc|design}}: Tasks related to user experience research or user interface design and interaction<br />
<br />
'''Clarification of "multiple task" types'''<br />
* multi = number of students who can do a given task<br />
* dup = number of times a student can do the same task<br />
<br />
You can find descriptions of some of the mentors [[List_of_Apertium_mentors | here]].<br />
<br />
== Task ideas ==<br />
<br />
'''The current task ideas here are for 2019.''' See [[Talk:Task ideas for Google Code-in]] for task ideas from previous years.<br />
<br />
<table class="sortable wikitable"><br />
<tr><th>type</th><th>title</th><th>description</th><th>tags</th><th>mentors</th><th>bgnr?</th><th>multi?</th><th>duplicates</th></tr><br />
<br />
{{Taskidea<br />
|type=research, quality, documentation<br />
|title=Adopt a Wiki page<br />
|description=Request an Apertium wiki account and adopt a wiki page by updating and fixing any issues with it. Examples of things to update might be documentation that still refers to our SVN repo (we're on GitHub now), documentation of new features, clarification of unclear things, indicating that a page no longer reflects how things are done, or updating documentation to reflect the current options and defaults of various tools.<br />
|tags=wiki<br />
|mentors=*<br />
|multi=150<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research, code<br />
|title=expand coverage of Kyrgyz to English structural transfer<br />
|description=Find a sentence in Kyrgyz that once the lexical items are added to the bilingual dictionary is not fully (or correctly) parsed by the <tt>kir-eng-transfer</tt> Apertium mode. Determine what rule(s) need(s) to be added (or fixed) to cover this structure, and update <tt>apertium-eng-kir.kir-eng.rtx</tt> accordingly. You will first want to clone and compile [https://github.com/apertium/apertium-eng-kir apertium-eng-kir].<br />
|tags=Kyrgyz, English, recursive transfer, pairs<br />
|mentors=JNW, popcorndude<br />
|multi=150<br />
|dup=10<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|title=Add recursive transfer support to a language pair that doesn't support it<br />
|description=Make a branch of an Apertium language pair that doesn't support recursive transfer and call it "recursive transfer". Add vanilla <tt>.rtx</tt> files for both directions, and modify <tt>Makefile.am</tt> and <tt>modes.xml</tt> so that the branch compiles and runs. See [http://wiki.apertium.org/wiki/Apertium-recursive#Incorporating_Into_a_Pair this page] for instructions on how to do this.<br />
|tags=recursive transfer, pairs<br />
|mentors=JNW, popcorndude<br />
|multi=150<br />
|dup=10<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|title=Add 2 recursive transfer rules to a language pair<br />
|description=Add two recursive transfer rules to an Apertium language pair. These rules consist of, at minimum, a syntactic pattern to match, a phrase to combine them into, and an output pattern ([http://wiki.apertium.org/wiki/Apertium-recursive#Further_Documentation more documentation here]). If the language pair does not support recursive transfer, make sure [http://wiki.apertium.org/wiki/Apertium-recursive#Incorporating_Into_a_Pair to set it up] first.<br />
|tags=recursive transfer, pairs<br />
|mentors=JNW, popcorndude<br />
|multi=150<br />
|dup=20<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW, wei2912<br />
|title=Use apertium-init to bootstrap a new language pair<br />
|description=Use the [[Apertium-init]] script to bootstrap a new translation pair between two languages which have monolingual modules already in Apertium. To see if a translation pair has already been made, search our repositories on [https://github.com/apertium/ github], and especially ask on IRC. Add 100 common stems to the dictionary. Your submission should be in the form of a repository on github that we can fork to the Apertium organisation.<br />
|tags=languages, bootstrap, dictionaries, translators<br />
|beginner=yes<br />
|multi=25<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW, wei2912<br />
|title=Use apertium-init to bootstrap a new language module<br />
|description=Use the [[Apertium-init]] script to bootstrap a new language module that doesn't currently exist in Apertium. To see if a language is available, search our repositories on [https://github.com/apertium/ github], and especially ask on IRC. Add enough stems and morphology to the module so that it analyses and generates at least 100 correct forms. Your submission should be in the form of a repository on github that we can fork to the Apertium organisation. [[Task ideas for Google Code-in/Add words from frequency list|Read more about adding stems...]]<br />
|tags=languages, bootstrap, dictionaries<br />
|beginner=yes<br />
|multi=25<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW, sevilay, Unhammer<br />
|title=Write 10 lexical selection for an existing translation pair<br />
|description=Add 10 lexical selection rules to an existing translation pair. Submit your work as a github pull request to that pair. [[Task ideas for Google Code-in/Add lexical-select rules|Read more...]]<br />
|tags=languages, bootstrap, lexical selection, translators<br />
|multi=25<br />
|dup=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW, Unhammer<br />
|title=Write 10 constraint grammar rules for an existing language module<br />
|description=Add 10 constraint grammar rules to an existing language module for a language that you know. Submit your work as a github pull request to that pair. [[Task ideas for Google Code-in/Add constraint-grammar rules|Read more...]]<br />
|tags=languages, bootstrap, constraint grammar<br />
|multi=25<br />
|dup=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=Research<br />
|mentors=anakuz, fotonzade, khannatanmai<br />
|title=Syntactic annotation of text<br />
|description=Pick a text of about 200 words and make a syntactic annotation for it according to the Universal Dependencies treebank. UD Annotatrix can be used for visualisation. Consult with your mentor about the language.<br />
|tags=UD, trees, annotation<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=JNW,ftyers, fotonzade, khannatanmai, anakuz<br />
|title=Create a UD-Apertium morphology mapping<br />
|description=Choose a language that has a Universal Dependencies treebank and tabulate a potential set of Apertium morph labels based on the (universal) UD morph labels. See Apertium's [[list of symbols]] and [http://universaldependencies.org/ UD]'s POS and feature tags for the labels.<br />
|tags=morphology, ud, dependencies<br />
|beginner=<br />
|multi=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=JNW, ftyers, fotonzade, khannatanmai, anakuz<br />
|title=Create an Apertium-UD morphology mapping<br />
|description=Choose a language that has an Apertium morphological analyser and adapt it to convert the morphology to UD morphology<br />
|tags=morphology, ud, dependencies<br />
|beginner=<br />
|multi=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|title=Install Apertium and verify that it works<br />
|description=See [[Installation]] for instructions and if you encounter any issues along the way, document them and/or improve the wiki instructions!<br />
|tags=bash<br />
|mentors=ftyers, JNW, Unhammer, anakuz, Josh, fotonzade, sevilay, eirien, wei2912<br />
|multi=150<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|title=Write a contrastive grammar<br />
|description=Document 6 differences between two (preferably related) languages and where they would need to be addressed in the [[Apertium pipeline]] (morph analysis, transfer, etc). Use a grammar book/resource for inspiration. Each difference should have no fewer than 3 examples. Put your work on the Apertium wiki under [[Language1_and_Language2/Contrastive_grammar]]. See [[Farsi_and_English/Pending_tests]] for an example of a contrastive grammar that a previous GCI student made.<br />
|mentors=Mikel, JNW, Josh, xavivars, fotonzade, sevilay, khannatanmai, dolphingarlic<br />
|tags=wiki, languages<br />
|beginner=yes<br />
|multi=40<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=Mikel, anakuz, xavivars, fotonzade, sevilay, Unhammer, eirien, dolphingarlic, wei2912<br />
|tags=xml, dictionaries, svn<br />
|title=Add 200 new entries to the bilingual dictionary of language pair %AAA%-%BBB%<br />
|description=Our translation systems require large lexicons so as to provide production-quality coverage of any input data. This task requires the student to add 200 new words to a bidirectional dictionary. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Grow_bilingual Read more]...<br />
|multi=40<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=Mikel, anakuz, xavivars, fotonzade, ftyers, sevilay, eirien, dolphingarlic, wei2912<br />
|tags=xml, dictionaries, svn<br />
|title=Add 500 new entries to the bilingual dictionary of language pair %AAA%-%BBB%<br />
|description=Our translation systems require large lexicons so as to provide production-quality coverage of any input data. This task requires the student to add 500 new words to a bidirectional dictionary. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Grow_bilingual Read more]...<br />
|dup=10<br />
|beginner=no<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=fotonzade, JNW, ftyers, anakuz, xavivars, Mikel, shardulc, sevilay, Unhammer, khannatanmai, dolphingarlic, wei2912<br />
|tags=xml, dictionaries, svn<br />
|title=Post-edit 500 sentences of any public domain text from %AAA% to %BBB%<br />
|description=Many of our systems benefit from statistical methods used with (ideally public domain) bilingual data. For this task, you need to translate a public domain text from %AAA% to %BBB% using any available machine translation system and clean up the translations yourself manually. Commit the post-edited texts (in plain text format) to an existing (via pull request) or if needed new github repository for the pair in dev/ or texts/ folder. The texts are subject to mentor approval.<br />
|multi=10<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=Mikel, anakuz, xavivars, fotonzade, sevilay, khannatanmai, dolphingarlic, wei2912<br />
|tags=disambiguation, svn<br />
|title=Disambiguate 500 tokens of text in %AAA%<br />
|description=Run some text through a morphological analyser and disambiguate the output. Discuss with the mentor beforehand to approve the choice of language and text. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Manually_disambiguate_text Read more]...<br />
|multi=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=eirien, khannatanmai, anakuz<br />
|tags=dictionaries<br />
|title=Categorise 100 words from frequency list in %AAA% <br />
|description=Categorise words by frequency into one of the major part-of-speech categories. You will receive a frequency list. Work from top to bottom. At the beginning of each line you should put a letter which categorises the word form by its part-of-speech. For example n for noun, v for verb, etc. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Categorise_words_from_frequency_list Read more]...<br />
|multi=yes<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=eirien, anakuz, sevilay<br />
|tags=dictionaries<br />
|title=Categorise 500 words from frequency list in %AAA% <br />
|description=Categorise words by frequency into one of the major part-of-speech categories. You will receive a frequency list. Work from top to bottom. At the beginning of each line you should put a letter which categorises the word form by its part-of-speech. For example n for noun, v for verb, etc. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Categorise_words_from_frequency_list Read more]...<br />
|multi=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=khannatanmai, sevilay<br />
|tags=evaluation<br />
|title=Evaluate an existing apertium pair %AAA% to %BBB% on a text<br />
|description= Pick an existing apertium pair and get a parallel text for that language pair. Translate %AAA% to %BBB% and evaluate the translation using an automatic evaluation metric like BLEU or/and evaluate it manually (?) [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Evaluation_of_translation_of_an_existing_pair Read more]...<br />
}}<br />
<br />
{{Taskidea<br />
|type=code, research<br />
|title=Add apertium-anaphora support to a new language pair<br />
|description=Make a branch of an Apertium language pair that doesn't use apertium-anaphora yet, and call it "anaphora". Manually add the correct antecedent to side ref of the anaphors in the output of biltrans, and modify t1x to change the anaphor based on its antecedent. Verify that it runs and gives the correct anaphor. See the [http://wiki.apertium.org/wiki/Anaphora_Resolution_Module documentation] of the apertium-anaphora module for help.<br />
|tags=apertium-anaphora, transfer<br />
|mentors=khannatanmai<br />
}}<br />
<br />
{{Taskidea<br />
|type=code, research<br />
|title=Add one markable to the arx file for a language pair<br />
|description=The arx file is where we tell the anaphora resolution algorithm which patterns to detect and score, when we want to find the antecedent of an anaphor. Add one rule in this file which can help the algorithm find out the antecedent of an anaphor - specify the pattern to detect and the positive or negative score you want to give to the noun in this pattern. See the [http://wiki.apertium.org/wiki/Anaphora_Resolution_Module documentation] of the apertium-anaphora module for help.<br />
|tags=apertium-anaphora, anaphora resolution<br />
|mentors=khannatanmai<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|title=Write a TextMate grammar for CG-3 syntax highlighting<br />
|description=We want CG-3 files to be syntax highlighted in Github. They use TextMate-compatible grammars. See [https://github.com/TinoDidriksen/cg3/issues/48 issue 48] in the CG-3 repo.<br />
|tags=cg, editors, tools<br />
|mentors=Unhammer<br />
}}<br />
<br />
</table><br />
<br />
==Mentors==<br />
<br />
These are languages that can be substituted for AAA and/or BBB for tasks each mentor is listed to mentor above.<br />
<br />
{|class=wikitable<br />
! Mentor !! Languages<br />
|-<br />
| ftyers || eng, spa, cat, fra, nor, rus, por, swe, tur, gag, aze<br />
|-<br />
| JNW || eng, spa, fra, rus, tur, gag, aze, kaz, kir, kaa, tat, bak, kum, nog, kaa, uzb, uig, crh, khk, yid<br />
|-<br />
| anakuz || grn, spa, por, rus<br />
|-<br />
| fotonzade || eng, tur, aze, uig, tat, crh, kmr, ckb, fas<br />
|-<br />
| xavivars || cat, spa, eng, fra <br />
|-<br />
| unhammer || nno, nob, swe, dan, fao, sme, ovd<br />
|-<br />
| shardulc || eng, fra, mar, hin, urd, kan<br />
|-<br />
| m-alpha || eng, fra, byv<br />
|-<br />
| popcorndude || eng, spa<br />
|-<br />
| sevilay || tur, kaz, aze, tat, gag, eng<br />
|-<br />
| eirien || sah, rus, eng<br />
|-<br />
| khannatanmai || eng, hin<br />
|-<br />
| flammie || fin, krl, olo, hun, nio, kpv, mdf, tlh, fra, swe, eng, est, ekk, vro<br />
|-<br />
| dolphingarlic || afr, deu, eng<br />
|-<br />
| wei2912 || eng, zho<br />
|-<br />
| marcriera|| cat, spa, eng, ron<br />
|}<br />
<br />
== Counts ==<br />
<br />
Last updated by [[User:Eirien|Eirien]] ([[User talk:Eirien|talk]]) 22:29, 24 October 2019 (UTC+3).<br />
<br />
{| class="sortable wikitable"<br />
|-<br />
! scope="col" | Category<br />
! scope="col" | Count<br />
|-<br />
|{{sc|code}}<br />
|align="right"|35<br />
|-<br />
|{{sc|documentation}}<br />
|align="right"|5<br />
|-<br />
|{{sc|research}}<br />
|align="right"|11<br />
|-<br />
|{{sc|quality}}<br />
|align="right"|10<br />
|-<br />
|{{sc|design}}<br />
|align="right"|5<br />
|- class="sortbottom"<br />
!Total<br />
|align="right"|66<br />
|}<br />
<br />
[[Category:Google Code-in]]</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Calculating_coverage&diff=69883
Calculating coverage
2019-06-10T18:19:23Z
<p>Marcriera: /* Faster coverage testing with frequency lists */</p>
<hr />
<div><br />
[[Calculer la couverture|En français]]<br />
<br />
==Simple bidix-trimmed coverage testing==<br />
<br />
First install apertium-cleanstream:<br />
<br />
svn checkout https://svn.code.sf.net/p/apertium/svn/trunk/apertium-tools/apertium-cleanstream<br />
cd apertium-cleanstream<br />
make<br />
sudo cp apertium-cleanstream /usr/local/bin<br />
<br />
'''Note - After Apertium's migration to GitHub, this tool is read-only on the SourceForge repository and does not exist on GitHub. If you are interested in migrating this tool to GitHub, see [[Migrating tools to GitHub]].'''<br />
<br />
Then save this as coverage.sh:<br />
<br />
#!/bin/bash<br />
mode=$1<br />
outfile=/tmp/$mode.clean<br />
apertium -d . $mode | apertium-cleanstream -n > $outfile<br />
total=$(grep -c '^\^' $outfile)<br />
unknown=$(grep -c '/\*' $outfile)<br />
bidix_unknown=$(grep -c '/@' $outfile)<br />
known_percent=$(calc -p "round( 100*($total-$unknown-$bidix_unknown)/$total, 3)")<br />
echo "$known_percent % known tokens ($unknown unknown, $bidix_unknown bidix-unknown of total $total tokens)"<br />
echo "Top unknown words:"<br />
grep '/[*@]' $outfile | sort | uniq -c | sort -nr | head<br />
<br />
And run it like<br />
<br />
cat asm.corpus | bash coverage.sh asm-eng-biltrans<br />
<br />
(The bidix-unknown count should always be 0 if your pair uses [[lt-trim|automatic analyser trimming]].)<br />
<br />
==TODO: paradigm-coverage (less naïve)==<br />
On an analysed corpus, we can sum frequencies into bins for each lemma+mainpos, so if the analysed corpus contains<br />
<br />
<pre><br />
musa/mus<n><f><sg><def>/muse<vblex><past><br />
mus/mus<n><f><sg><ind>/mus<n><f><pl><ind>/muse<vblex><imp><br />
musene/mus<n><f><pl><def><br />
</pre><br />
then output has<br />
<pre><br />
3 mus<n><f><br />
2 muse<vblex><br />
</pre><br />
and we can find paradigms that are likely to mess up disambiguation, or where we need to ensure that the bidix contains the highest-frequency paradigm (since the bidix is typically smaller than the monodix).<br />
<br />
We could also weight these numbers by number of unique forms in the pardef; if the verb pardef has 6 unique forms and then noun only 3, then the above output should be even more skewed:<br />
<pre><br />
0.33 mus<n><f><br />
0.75 muse<vblex><br />
</pre><br />
<br />
==Faster coverage testing with frequency lists==<br />
<br />
If words appear several times in your corpus, why bother analysing them several times? We can make a frequency list first and add together the frequencies. This script does some very stupid tokenisation and creates a frequency list:<br />
<br />
make-freqlist.sh:<br />
<pre><br />
#!/bin/bash<br />
<br />
if [[ -t 0 ]]; then<br />
echo "Expecting a corpus on stdin"<br />
exit 2<br />
fi<br />
<br />
tr '[:space:][:punct:]' '\n' | grep . | sort | uniq -c | sort -nr<br />
</pre><br />
And this script runs your analyser, summing up the frequencies:<br />
<br />
freqlist-coverage.sh:<br />
<pre><br />
#!/bin/bash<br />
<br />
set -e -u<br />
<br />
if [[ $# -eq 0 || -t 0 ]]; then<br />
echo "Expecting apertium arguments and a 'sort|uniq -c|sort -nr' style frequency list on stdin"<br />
echo "For example:"<br />
echo "\$ < spa.freqlist $0 -d . spa-morph"<br />
exit 2<br />
fi<br />
<br />
sed 's%^ *%<apertium-notrans>%;s% %</apertium-notrans>%;s%$% .%' |<br />
apertium -f html-noent "$@" |<br />
awk -F'</?apertium-notrans>| *\\^\\./\\.<sent><clb>\\$' '<br />
/[/][*@]/ {<br />
unknown+=$2<br />
if(!printed) print "Top unknown tokens:"<br />
if(++printed<10) print $2,$3<br />
next<br />
}<br />
{<br />
known+=$2<br />
}<br />
END {<br />
total=known+unknown<br />
known_pct=100*known/total<br />
unk_pct=100*unknown/total<br />
print known_pct" % known of total "total" tokens"<br />
}'<br />
</pre> <br />
<br />
Usage:<br />
<br />
$ chmod +x make-freqlist.sh freqlist-coverage.sh<br />
$ bzcat ~/corpora/nno.txt.bz2 |./make-freqlist.sh > nno.freqlist<br />
$ < nno.freqlist ./freqlist-coverage.sh -d ~/apertium-svn/languages/apertium-nno/ nno-morph<br />
<br />
==coverage.py==<br />
<br />
https://svn.code.sf.net/p/apertium/svn/trunk/apertium-tools/coverage.py is a coverage script that wraps curl and bzcat. <br />
<br />
<br />
'''Note - After Apertium's migration to GitHub, this tool is read-only on the SourceForge repository and does not exist on GitHub. If you are interested in migrating this tool to GitHub, see [[Migrating tools to GitHub]].'''<br />
<br />
=See also==<br />
<br />
* [[Wikipedia dumps]]<br />
* [[Cleanstream]]<br />
<br />
[[Category:Documentation]]<br />
[[Category:Documentation in English]]</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan&diff=68039
Romanian and Catalan
2018-11-28T10:08:17Z
<p>Marcriera: </p>
<hr />
<div>This is a language pair translating between [[Romanian]] and [[Catalan]]. The pair is currently located in [https://github.com/apertium/apertium-ron-cat nursery].<br />
<br />
==Current status==<br />
<br />
''Last update: 28 Nov 2018''<br />
<br />
'''Bidix entries:''' 22,989<br />
<br />
'''Bidix paradigms:''' 21<br />
<br />
<br />
'''Transfer rules (Romanian to Catalan):''' 118 (T1X), 7 (T2X), 1 (T3X)<br />
<br />
'''Transfer rules (Catalan to Romanian):''' 33 (T1X), 5 (T2X), 1 (T3X), 1 (T4X)<br />
<br />
<br />
'''Coverage (Romanian to Catalan):''' 86.8% (Wikipedia)<br />
<br />
'''Coverage (Catalan to Romanian):''' 88.7% (Wikipedia)</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Transfer_Rules&diff=68038
Romanian and Catalan/Transfer Rules
2018-11-28T10:06:59Z
<p>Marcriera: /* Catalan → Romanian */</p>
<hr />
<div>This page provides general documentation on the transfer rules used in the Romanian-Catalan pair. The tables are generated from the data and comments found within the rule files using an automated script ('''transfer_documentation.sh''', available [https://github.com/MarcRiera/apertium-scripts here]).<br />
<br />
== Romanian → Catalan ==<br />
<br />
=== Chunk (T1X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Macro name<br />
! style="width: 10%" | Arguments<br />
! style="width: 60%" | Description<br />
|-<br />
!1<br />
|mod_maj<br />
| style="text-align:center" |1<br />
|Canvia la majúscula inicial d'un lema a minúscula<br />
|-<br />
!2<br />
|conj_verb1<br />
| style="text-align:center" |1<br />
|Conjuga un verb simple<br />
|-<br />
!3<br />
|conj_verb2<br />
| style="text-align:center" |2<br />
|Conjuga un verb compost doble<br />
|-<br />
!4<br />
|conj_verb3<br />
| style="text-align:center" |3<br />
|Conjuga un verb compost triple<br />
|-<br />
!5<br />
|f_concord1<br />
| style="text-align:center" |1<br />
|Realitza la concordança d'un element i desa els valors a les variables del chunk<br />
|-<br />
!6<br />
|f_concord2<br />
| style="text-align:center" |2<br />
|Realitza la concordança de dos elements i desa els valors a les variables del chunk<br />
|-<br />
!7<br />
|f_concord3<br />
| style="text-align:center" |3<br />
|Realitza la concordança de tres elements i desa els valors a les variables del chunk<br />
|-<br />
!8<br />
|f_concord4<br />
| style="text-align:center" |4<br />
|Realitza la concordança de quatre elements i desa els valors a les variables del chunk<br />
|-<br />
!9<br />
|f_concord5<br />
| style="text-align:center" |5<br />
|Realitza la concordança de cinc elements i desa els valors a les variables del chunk<br />
|-<br />
!10<br />
|f_concord1_i<br />
| style="text-align:center" |1<br />
|Realitza la concordança d'un element intern i desa els valors a les variables internes<br />
|-<br />
!11<br />
|f_concord2_i<br />
| style="text-align:center" |2<br />
|Realitza la concordança de dos elements interns i desa els valors a les variables internes<br />
|-<br />
!12<br />
|f_concord3_i<br />
| style="text-align:center" |3<br />
|Realitza la concordança de tres elements interns i desa els valors a les variables internes<br />
|-<br />
!13<br />
|f_concord4_i<br />
| style="text-align:center" |4<br />
|Realitza la concordança de quatre elements interns i desa els valors a les variables internes<br />
|-<br />
!14<br />
|f_concord5_i<br />
| style="text-align:center" |5<br />
|Realitza la concordança de cinc elements interns i desa els valors a les variables internes<br />
|-<br />
!15<br />
|n_gen_nbr<br />
| style="text-align:center" |0<br />
|Neteja les variables de gènere i nombre<br />
|-<br />
!16<br />
|f_bcond<br />
| style="text-align:center" |2<br />
|Undocumented macro!<br />
|-<br />
!17<br />
|f_outverb2<br />
| style="text-align:center" |2<br />
|Undocumented macro!<br />
|-<br />
!18<br />
|f_outverb3<br />
| style="text-align:center" |3<br />
|Undocumented macro!<br />
|-<br />
!19<br />
|prn_verb<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|-<br />
!20<br />
|prn_verbp<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|-<br />
!21<br />
|prn_verbenc<br />
| style="text-align:center" |2<br />
|Undocumented macro!<br />
|-<br />
!22<br />
|f_modgen<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|-<br />
!23<br />
|f_lexicAdj<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|}<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|nom <br />
|NOM: casă → casa<br />
|-<br />
!2<br />
|num nom <br />
|NUM + NOM: 3 case → 3 cases<br />
|-<br />
!3<br />
|num prepde nom <br />
|NUM + de + NOM: 20 de case → 20 cases<br />
|-<br />
!4<br />
|nom adjnorm <br />
|NOM + ADJECTIU<br />
|-<br />
!5<br />
|nom adjpos <br />
|NOM + ADJECTIU POSSESSIU<br />
|-<br />
!6<br />
|num nom adjnorm <br />
|NUM + NOM + ADJECTIU<br />
|-<br />
!7<br />
|num prepde nom adjnorm <br />
|NUM + de + NOM + ADJECTIU<br />
|-<br />
!8<br />
|nom adjnorm adjnorm <br />
|NOM + ADJECTIU + ADJECTIU<br />
|-<br />
!9<br />
|adj nom <br />
|ADJECTIU + NOM<br />
|-<br />
!10<br />
|det nom <br />
|DETERMINANT + NOM<br />
|-<br />
!11<br />
|predet nom <br />
|tot + NOM<br />
|-<br />
!12<br />
|nom nom_dgdef adjpos <br />
|NOM + NOM GENITIU + ADJECTIU POSSESSIU<br />
|-<br />
!13<br />
|nom np_dg <br />
|NOM + NOM PROPI GENITIU<br />
|-<br />
!14<br />
|nom adj_lui np <br />
|NOM + lui + NOM PROPI<br />
|-<br />
!15<br />
|nom adjnorm det_al np_dg <br />
|NOM + ADJECTIU + al + NOM PROPI GENITIU<br />
|-<br />
!16<br />
|nom adjnorm det_al adj_lui np <br />
|NOM + ADJECTIU + al + lui + NOM PROPI<br />
|-<br />
!17<br />
|det_cel num nom <br />
|cel + NUMERAL + NOM<br />
|-<br />
!18<br />
|det_cel adv_mai adjnorm nom <br />
|cel + mai + ADJECTIU + NOM<br />
|-<br />
!19<br />
|det_cel adv_mai adjnorm nom adjnorm <br />
|cel + mai + ADJECTIU + NOM + ADJECTIU<br />
|-<br />
!20<br />
|det_cel adv_mai adjnorm nom adjnorm adjnorm <br />
|cel + mai + ADJECTIU + NOM + ADJECTIU + ADJECTIU<br />
|-<br />
!21<br />
|np <br />
|NOM PROPI<br />
|-<br />
!22<br />
|num <br />
|NUMERAL<br />
|-<br />
!23<br />
|prn_tnitg <br />
|PRONOM<br />
|-<br />
!24<br />
|prnpro <br />
|CLÍTIC<br />
|-<br />
!25<br />
|det_al nom_dg <br />
|al + NOM GENITIU<br />
|-<br />
!26<br />
|det_al nom_dg adjnorm <br />
|al + NOM GENITIU + ADJECTIU<br />
|-<br />
!27<br />
|det_al nom_dg adjnorm adjnorm <br />
|al + NOM GENITIU + ADJECTIU + ADJECTIU<br />
|-<br />
!28<br />
|adj <br />
|ADJ<br />
|-<br />
!29<br />
|pp <br />
|REGLA: PP<br />
|-<br />
!30<br />
|pp prnenc <br />
|REGLA: PP + ENC (per si no ho reconeix cap altra regla)<br />
|-<br />
!31<br />
|prepnum num_xifra <br />
|prep + NUMERAL (any)<br />
|-<br />
!32<br />
|prepnum nom_any num_xifra <br />
|prep + NOM (any) + NUMERAL (any)<br />
|-<br />
!33<br />
|prep num_xifra mesos num_xifra <br />
|prep + NUMERAL (dia) + NOM (mes) + NUMERAL (any)<br />
|-<br />
!34<br />
|vbcnj <br />
|REGLA: VERB CONJUGAT<br />
|-<br />
!35<br />
|advnu vbcnj <br />
|REGLA: NU + VERB CONJUGAT<br />
|-<br />
!36<br />
|prnproref vbcnj <br />
|REGLA: CLÍTIC + VERB CONJUGAT<br />
|-<br />
!37<br />
|advnu prnproref vbcnj <br />
|REGLA: NU + CLÍTIC + VERB CONJUGAT<br />
|-<br />
!38<br />
|vbcnj_doble1 vbcnj_doble2 <br />
|REGLA: VERB CONJUGAT DOBLE<br />
|-<br />
!39<br />
|advnu vbcnj_doble1 vbcnj_doble2 <br />
|REGLA: NU + VERB CONJUGAT DOBLE<br />
|-<br />
!40<br />
|vbcnj_doble1 vbcnj_doble2 prnenc <br />
|REGLA: VERB CONJUGAT DOBLE + CLÍTIC (-o)<br />
|-<br />
!41<br />
|advnu vbcnj_doble1 vbcnj_doble2 prnenc <br />
|REGLA: NU + VERB CONJUGAT DOBLE + CLÍTIC (-o)<br />
|-<br />
!42<br />
|vbcnj_triple1 vbcnj_triple2 vbcnj_triple3 <br />
|REGLA: VERB CONJUGAT TRIPLE<br />
|-<br />
!43<br />
|advnu vbcnj_triple1 vbcnj_triple2 vbcnj_triple3 <br />
|REGLA: NU + VERB CONJUGAT TRIPLE<br />
|-<br />
!44<br />
|inf <br />
|REGLA: INF<br />
|-<br />
!45<br />
|ger <br />
|REGLA: GER<br />
|-<br />
!46<br />
|cnjcoo <br />
|REGLA: CNJCOO<br />
|-<br />
!47<br />
|sent <br />
|Reinicialitza les variables<br />
|-<br />
!48<br />
|advnu vbcnj advdecât nom <br />
|Undocumented rule!<br />
|-<br />
!49<br />
|advnu vbcnj advdecât detind nom <br />
|Undocumented rule!<br />
|-<br />
!50<br />
|num mesos <br />
|Undocumented rule!<br />
|-<br />
!51<br />
|nom verbfi adj <br />
|Undocumented rule!<br />
|-<br />
!52<br />
|nom verbfi pp <br />
|Undocumented rule!<br />
|-<br />
!53<br />
|nom det_al relcărui nom <br />
|Undocumented rule!<br />
|-<br />
!54<br />
|prep_în nptop <br />
|Undocumented rule!<br />
|-<br />
!55<br />
|prep nom adjpos <br />
|Undocumented rule!<br />
|-<br />
!56<br />
|vbcnj nom adjpos <br />
|Undocumented rule!<br />
|-<br />
!57<br />
|verbtrebui conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!58<br />
|num mesos num <br />
|Undocumented rule!<br />
|-<br />
!59<br />
|prep nom adj <br />
|Undocumented rule!<br />
|-<br />
!60<br />
|prep nom adj adj <br />
|Undocumented rule!<br />
|-<br />
!61<br />
|prep nom nom adj adj <br />
|Undocumented rule!<br />
|-<br />
!62<br />
|vbcnj nom adj <br />
|Undocumented rule!<br />
|-<br />
!63<br />
|vbcnj nom det nom <br />
|Undocumented rule!<br />
|-<br />
!64<br />
|preppe npant <br />
|Undocumented rule!<br />
|-<br />
!65<br />
|preppe prn <br />
|Undocumented rule!<br />
|-<br />
!66<br />
|preppe det <br />
|Undocumented rule!<br />
|-<br />
!67<br />
|vbcnj prep nom nom <br />
|Undocumented rule!<br />
|-<br />
!68<br />
|prep nom nom_dg <br />
|Undocumented rule!<br />
|-<br />
!69<br />
|prep nom <br />
|Undocumented rule!<br />
|-<br />
!70<br />
|prep nom adj adj adj <br />
|Undocumented rule!<br />
|-<br />
!71<br />
|det_predet <br />
|Undocumented rule!<br />
|-<br />
!72<br />
|adjtot nom <br />
|Undocumented rule!<br />
|-<br />
!73<br />
|vbcnj dettuturor <br />
|Undocumented rule!<br />
|-<br />
!74<br />
|nom dettuturor <br />
|Undocumented rule!<br />
|-<br />
!75<br />
|relat <br />
|Undocumented rule!<br />
|-<br />
!76<br />
|vbcnj nom nom <br />
|Undocumented rule!<br />
|-<br />
!77<br />
|adj nom det_al nom <br />
|Undocumented rule!<br />
|-<br />
!78<br />
|nom adj det_al nom <br />
|Undocumented rule!<br />
|-<br />
!79<br />
|prep nom adj det_al np <br />
|Undocumented rule!<br />
|-<br />
!80<br />
|prep nom adj det_al nom <br />
|Undocumented rule!<br />
|-<br />
!81<br />
|nom det_al nom <br />
|Undocumented rule!<br />
|-<br />
!82<br />
|nom det nom <br />
|Undocumented rule!<br />
|-<br />
!83<br />
|nom det_al detind nom <br />
|Undocumented rule!<br />
|-<br />
!84<br />
|det nom det_al nom <br />
|Undocumented rule!<br />
|-<br />
!85<br />
|det adj nom det_al nom <br />
|Undocumented rule!<br />
|-<br />
!86<br />
|nom adv adj <br />
|Undocumented rule!<br />
|-<br />
!87<br />
|det_cel adv_mai adj <br />
|Undocumented rule!<br />
|-<br />
!88<br />
|advnespus prepde adj <br />
|Undocumented rule!<br />
|-<br />
!89<br />
|prep prepa prnpro inf <br />
|Undocumented rule!<br />
|-<br />
!90<br />
|prep prepa inf <br />
|Undocumented rule!<br />
|-<br />
!91<br />
|nom det_cel adj <br />
|Undocumented rule!<br />
|-<br />
!92<br />
|det nom adj <br />
|Undocumented rule!<br />
|-<br />
!93<br />
|det adj nom <br />
|Undocumented rule!<br />
|-<br />
!94<br />
|nom nom <br />
|Undocumented rule!<br />
|-<br />
!95<br />
|advnu adv_mai vbcnj <br />
|Undocumented rule!<br />
|-<br />
!96<br />
|vrea verbtrebui conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!97<br />
|conjnici advnu vbcnj <br />
|Undocumented rule!<br />
|-<br />
!98<br />
|advnici- advnu vbcnj <br />
|Undocumented rule!<br />
|-<br />
!99<br />
|advnu vbcnj advdecât <br />
|Undocumented rule!<br />
|-<br />
!100<br />
|vbaveacni verbtrebui conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!101<br />
|vbaveapri verbtrebui conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!102<br />
|vbcnj nom <br />
|Undocumented rule!<br />
|-<br />
!103<br />
|vbcnj conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!104<br />
|adv adj nom <br />
|Undocumented rule!<br />
|-<br />
!105<br />
|num adj nom <br />
|Undocumented rule!<br />
|-<br />
!106<br />
|vbcnj det_cel num nom <br />
|Undocumented rule!<br />
|-<br />
!107<br />
|vbcnj prep nom <br />
|Undocumented rule!<br />
|-<br />
!108<br />
|vrea vbcnj prep prn <br />
|Undocumented rule!<br />
|-<br />
!109<br />
|vrea vbcnj prep nom <br />
|Undocumented rule!<br />
|-<br />
!110<br />
|vbaveacni vbcnj prep nom <br />
|Undocumented rule!<br />
|-<br />
!111<br />
|prepde pp <br />
|Undocumented rule!<br />
|-<br />
!112<br />
|catverb prnenc <br />
|Undocumented rule!<br />
|-<br />
!113<br />
|catverb prnenc prnenc <br />
|Undocumented rule!<br />
|-<br />
!114<br />
|advca nom <br />
|Undocumented rule!<br />
|-<br />
!115<br />
|advca det nom <br />
|Undocumented rule!<br />
|-<br />
!116<br />
|adv adj advca <br />
|Undocumented rule!<br />
|-<br />
!117<br />
|prep nom nptop <br />
|Undocumented rule!<br />
|-<br />
!118<br />
|prepdintr- detind <br />
|Undocumented rule!<br />
|}<br />
<br />
=== Interchunk (T2X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Macro name<br />
! style="width: 10%" | Arguments<br />
! style="width: 60%" | Description<br />
|-<br />
!1<br />
|elimina_GDND<br />
| style="text-align:center" |1<br />
|Elimina les etiquetes 'GD' i 'ND'<br />
|-<br />
!2<br />
|f_concord_SN_SV<br />
| style="text-align:center" |2<br />
|Realitza la concordança d'un SN i un SV<br />
|-<br />
!3<br />
|f_concord_SN_SN_SV<br />
| style="text-align:center" |3<br />
|Realitza la concordança de dos SN i un SV<br />
|-<br />
!4<br />
|f_concord2<br />
| style="text-align:center" |2<br />
|Realitza la concordança entre dos elements<br />
|}<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|SN <br />
|SN<br />
|-<br />
!2<br />
|SAdj <br />
|SAdj<br />
|-<br />
!3<br />
|SN SAdj <br />
|SN + SAdj<br />
|-<br />
!4<br />
|SN SV <br />
|SN + SV<br />
|-<br />
!5<br />
|SN cnjcoo SN SV <br />
|SN + cnjcoo + SN + SV<br />
|-<br />
!6<br />
|SN SV_cop SAdj <br />
|SN + SV (copulatiu) + SAdj<br />
|-<br />
!7<br />
|SP <br />
|SP<br />
|}<br />
<br />
=== Postchunk (T3X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|sent <br />
|CHUNK:<br />
|}<br />
<br />
== Catalan → Romanian ==<br />
<br />
=== Chunk (T1X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Macro name<br />
! style="width: 10%" | Arguments<br />
! style="width: 60%" | Description<br />
|-<br />
!1<br />
|mod_maj<br />
| style="text-align:center" |1<br />
|Canvia la majúscula inicial d'un lema a minúscula<br />
|-<br />
!2<br />
|mod_maj2<br />
| style="text-align:center" |2<br />
|Canvia la minúscula inicial d'un lema a majúscula si un altre lema té majúscula inicial<br />
|-<br />
!3<br />
|conj_verb1<br />
| style="text-align:center" |1<br />
|Conjuga un verb simple<br />
|-<br />
!4<br />
|conj_verb2<br />
| style="text-align:center" |2<br />
|Conjuga un verb compost doble<br />
|-<br />
!5<br />
|gen_clitic1<br />
| style="text-align:center" |2<br />
|Genera un clític per a un verb<br />
|-<br />
!6<br />
|gen_clitic2<br />
| style="text-align:center" |3<br />
|Genera dos clítics per a un verb<br />
|-<br />
!7<br />
|f_concord1<br />
| style="text-align:center" |1<br />
|Realitza la concordança d'un element i desa els valors a les variables<br />
|-<br />
!8<br />
|f_concord2<br />
| style="text-align:center" |2<br />
|Realitza la concordança de dos elements i desa els valors a les variables<br />
|-<br />
!9<br />
|arregla_np<br />
| style="text-align:center" |1<br />
|Treu l'etiqueta de genitiu d'un nom propi i en determina l'article<br />
|}<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|nom <br />
|NOM: casa → casă<br />
|-<br />
!2<br />
|artdef nom <br />
|DET + NOM: la casa → casa<br />
|-<br />
!3<br />
|detpos nom <br />
|DETPOS + NOM: la meva casa → casa mea<br />
|-<br />
!4<br />
|nom adjpos <br />
|NOM + ADJPOS: casa meva → casa mea<br />
|-<br />
!5<br />
|artdef nom pr_de np <br />
|DET + NOM + de + NOM PROPI: la casa de Maria → casa Mariei / la casa de Mihai → casa lui Mihai<br />
|-<br />
!6<br />
|adjord nom <br />
|ADJORD + NOM: tercera posició → poziția a treia<br />
|-<br />
!7<br />
|artdef adjord nom <br />
|DET + ADJORD + NOM: la tercera posició → poziția a treia<br />
|-<br />
!8<br />
|adj_det nom <br />
|ADJECTIU + NOM: moltes cases → multe case<br />
|-<br />
!9<br />
|num nom <br />
|NUMERAL + NOM: cinc cases → cinci case / trenta cases → treizeci de case<br />
|-<br />
!10<br />
|np <br />
|NOM PROPI<br />
|-<br />
!11<br />
|prn <br />
|PRONOM<br />
|-<br />
!12<br />
|det_predet <br />
|DET/PREDET<br />
|-<br />
!13<br />
|detpos <br />
|DETPOS<br />
|-<br />
!14<br />
|num <br />
|NUMERAL<br />
|-<br />
!15<br />
|adj <br />
|ADJECTIU: bonic → frumos<br />
|-<br />
!16<br />
|pp <br />
|PARTICIPI<br />
|-<br />
!17<br />
|vbcnj <br />
|VERB CONJUGAT SIMPLE<br />
|-<br />
!18<br />
|inf <br />
|INFINITIU<br />
|-<br />
!19<br />
|ger <br />
|GERUNDI<br />
|-<br />
!20<br />
|vbcnj_doble1 vbcnj_doble2 <br />
|VERB CONJUGAT DOBLE<br />
|-<br />
!21<br />
|vbcnj_doble1 vbcnj_doble2 prnenc <br />
|VERB CONJUGAT DOBLE + ENC<br />
|-<br />
!22<br />
|vbcnj_doble1 vbcnj_doble2 prnenc prnenc <br />
|VERB CONJUGAT DOBLE + ENC + ENC<br />
|-<br />
!23<br />
|vbimp prnenc <br />
|REGLA: IMPERATIU ENC<br />
|-<br />
!24<br />
|vbimp prnenc prnenc <br />
|REGLA: IMPERATIU ENC ENC<br />
|-<br />
!25<br />
|ger prnenc <br />
|REGLA: GER ENC<br />
|-<br />
!26<br />
|ger prnenc prnenc <br />
|REGLA: GER ENC ENC<br />
|-<br />
!27<br />
|inf_sol prnenc <br />
|REGLA: INF ENC<br />
|-<br />
!28<br />
|inf_sol prnenc prnenc <br />
|REGLA: INF ENC ENC<br />
|-<br />
!29<br />
|vbhaver <br />
|REGLA: VBHAVER (per si apareix sol)<br />
|-<br />
!30<br />
|prnproenc <br />
|PRONOM CLÍTIC (per si apareix sol)<br />
|-<br />
!31<br />
|artdef_m_sg num_xifra <br />
|DETERMINANT + NUMERAL: el 1992 → în 1992<br />
|-<br />
!32<br />
|rel <br />
|RELATIU<br />
|-<br />
!33<br />
|sent <br />
|Reinicialitza les variables<br />
|}<br />
<br />
=== Interchunk (T2X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Macro name<br />
! style="width: 10%" | Arguments<br />
! style="width: 60%" | Description<br />
|-<br />
!1<br />
|elimina_GDND<br />
| style="text-align:center" |1<br />
|Elimina les etiquetes 'GD' i 'ND'.<br />
|-<br />
!2<br />
|elimina_GDND_adj<br />
| style="text-align:center" |1<br />
|Elimina les etiquetes 'GD' i 'ND' d'un adjectiu<br />
|-<br />
!3<br />
|f_concord_SN_SV<br />
| style="text-align:center" |2<br />
|Realitza la concordança d'un SN i un SV<br />
|}<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|SN <br />
|REGLA: SN<br />
|-<br />
!2<br />
|SA <br />
|REGLA: SA<br />
|-<br />
!3<br />
|SN SV <br />
|SN + SV<br />
|-<br />
!4<br />
|DET <br />
|REGLA: DET<br />
|-<br />
!5<br />
|NUM <br />
|REGLA: NUM<br />
|}<br />
<br />
=== Postchunk (T3X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|sent <br />
|CHUNK:<br />
|}<br />
<br />
=== Extra transfer (T4X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|ajust_genere <br />
|REGLA: ADJ/DET/PREDET/PRNPRO (ajusta el gènere perquè es generin correctament i elimina el cas per als determinants)<br />
|}<br />
<br />
<br />
[[Category:Romanian and Catalan|*]]</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Transfer_Rules&diff=68037
Romanian and Catalan/Transfer Rules
2018-11-28T10:05:37Z
<p>Marcriera: /* Interchunk (T2X) */</p>
<hr />
<div>This page provides general documentation on the transfer rules used in the Romanian-Catalan pair. The tables are generated from the data and comments found within the rule files using an automated script ('''transfer_documentation.sh''', available [https://github.com/MarcRiera/apertium-scripts here]).<br />
<br />
== Romanian → Catalan ==<br />
<br />
=== Chunk (T1X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Macro name<br />
! style="width: 10%" | Arguments<br />
! style="width: 60%" | Description<br />
|-<br />
!1<br />
|mod_maj<br />
| style="text-align:center" |1<br />
|Canvia la majúscula inicial d'un lema a minúscula<br />
|-<br />
!2<br />
|conj_verb1<br />
| style="text-align:center" |1<br />
|Conjuga un verb simple<br />
|-<br />
!3<br />
|conj_verb2<br />
| style="text-align:center" |2<br />
|Conjuga un verb compost doble<br />
|-<br />
!4<br />
|conj_verb3<br />
| style="text-align:center" |3<br />
|Conjuga un verb compost triple<br />
|-<br />
!5<br />
|f_concord1<br />
| style="text-align:center" |1<br />
|Realitza la concordança d'un element i desa els valors a les variables del chunk<br />
|-<br />
!6<br />
|f_concord2<br />
| style="text-align:center" |2<br />
|Realitza la concordança de dos elements i desa els valors a les variables del chunk<br />
|-<br />
!7<br />
|f_concord3<br />
| style="text-align:center" |3<br />
|Realitza la concordança de tres elements i desa els valors a les variables del chunk<br />
|-<br />
!8<br />
|f_concord4<br />
| style="text-align:center" |4<br />
|Realitza la concordança de quatre elements i desa els valors a les variables del chunk<br />
|-<br />
!9<br />
|f_concord5<br />
| style="text-align:center" |5<br />
|Realitza la concordança de cinc elements i desa els valors a les variables del chunk<br />
|-<br />
!10<br />
|f_concord1_i<br />
| style="text-align:center" |1<br />
|Realitza la concordança d'un element intern i desa els valors a les variables internes<br />
|-<br />
!11<br />
|f_concord2_i<br />
| style="text-align:center" |2<br />
|Realitza la concordança de dos elements interns i desa els valors a les variables internes<br />
|-<br />
!12<br />
|f_concord3_i<br />
| style="text-align:center" |3<br />
|Realitza la concordança de tres elements interns i desa els valors a les variables internes<br />
|-<br />
!13<br />
|f_concord4_i<br />
| style="text-align:center" |4<br />
|Realitza la concordança de quatre elements interns i desa els valors a les variables internes<br />
|-<br />
!14<br />
|f_concord5_i<br />
| style="text-align:center" |5<br />
|Realitza la concordança de cinc elements interns i desa els valors a les variables internes<br />
|-<br />
!15<br />
|n_gen_nbr<br />
| style="text-align:center" |0<br />
|Neteja les variables de gènere i nombre<br />
|-<br />
!16<br />
|f_bcond<br />
| style="text-align:center" |2<br />
|Undocumented macro!<br />
|-<br />
!17<br />
|f_outverb2<br />
| style="text-align:center" |2<br />
|Undocumented macro!<br />
|-<br />
!18<br />
|f_outverb3<br />
| style="text-align:center" |3<br />
|Undocumented macro!<br />
|-<br />
!19<br />
|prn_verb<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|-<br />
!20<br />
|prn_verbp<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|-<br />
!21<br />
|prn_verbenc<br />
| style="text-align:center" |2<br />
|Undocumented macro!<br />
|-<br />
!22<br />
|f_modgen<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|-<br />
!23<br />
|f_lexicAdj<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|}<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|nom <br />
|NOM: casă → casa<br />
|-<br />
!2<br />
|num nom <br />
|NUM + NOM: 3 case → 3 cases<br />
|-<br />
!3<br />
|num prepde nom <br />
|NUM + de + NOM: 20 de case → 20 cases<br />
|-<br />
!4<br />
|nom adjnorm <br />
|NOM + ADJECTIU<br />
|-<br />
!5<br />
|nom adjpos <br />
|NOM + ADJECTIU POSSESSIU<br />
|-<br />
!6<br />
|num nom adjnorm <br />
|NUM + NOM + ADJECTIU<br />
|-<br />
!7<br />
|num prepde nom adjnorm <br />
|NUM + de + NOM + ADJECTIU<br />
|-<br />
!8<br />
|nom adjnorm adjnorm <br />
|NOM + ADJECTIU + ADJECTIU<br />
|-<br />
!9<br />
|adj nom <br />
|ADJECTIU + NOM<br />
|-<br />
!10<br />
|det nom <br />
|DETERMINANT + NOM<br />
|-<br />
!11<br />
|predet nom <br />
|tot + NOM<br />
|-<br />
!12<br />
|nom nom_dgdef adjpos <br />
|NOM + NOM GENITIU + ADJECTIU POSSESSIU<br />
|-<br />
!13<br />
|nom np_dg <br />
|NOM + NOM PROPI GENITIU<br />
|-<br />
!14<br />
|nom adj_lui np <br />
|NOM + lui + NOM PROPI<br />
|-<br />
!15<br />
|nom adjnorm det_al np_dg <br />
|NOM + ADJECTIU + al + NOM PROPI GENITIU<br />
|-<br />
!16<br />
|nom adjnorm det_al adj_lui np <br />
|NOM + ADJECTIU + al + lui + NOM PROPI<br />
|-<br />
!17<br />
|det_cel num nom <br />
|cel + NUMERAL + NOM<br />
|-<br />
!18<br />
|det_cel adv_mai adjnorm nom <br />
|cel + mai + ADJECTIU + NOM<br />
|-<br />
!19<br />
|det_cel adv_mai adjnorm nom adjnorm <br />
|cel + mai + ADJECTIU + NOM + ADJECTIU<br />
|-<br />
!20<br />
|det_cel adv_mai adjnorm nom adjnorm adjnorm <br />
|cel + mai + ADJECTIU + NOM + ADJECTIU + ADJECTIU<br />
|-<br />
!21<br />
|np <br />
|NOM PROPI<br />
|-<br />
!22<br />
|num <br />
|NUMERAL<br />
|-<br />
!23<br />
|prn_tnitg <br />
|PRONOM<br />
|-<br />
!24<br />
|prnpro <br />
|CLÍTIC<br />
|-<br />
!25<br />
|det_al nom_dg <br />
|al + NOM GENITIU<br />
|-<br />
!26<br />
|det_al nom_dg adjnorm <br />
|al + NOM GENITIU + ADJECTIU<br />
|-<br />
!27<br />
|det_al nom_dg adjnorm adjnorm <br />
|al + NOM GENITIU + ADJECTIU + ADJECTIU<br />
|-<br />
!28<br />
|adj <br />
|ADJ<br />
|-<br />
!29<br />
|pp <br />
|REGLA: PP<br />
|-<br />
!30<br />
|pp prnenc <br />
|REGLA: PP + ENC (per si no ho reconeix cap altra regla)<br />
|-<br />
!31<br />
|prepnum num_xifra <br />
|prep + NUMERAL (any)<br />
|-<br />
!32<br />
|prepnum nom_any num_xifra <br />
|prep + NOM (any) + NUMERAL (any)<br />
|-<br />
!33<br />
|prep num_xifra mesos num_xifra <br />
|prep + NUMERAL (dia) + NOM (mes) + NUMERAL (any)<br />
|-<br />
!34<br />
|vbcnj <br />
|REGLA: VERB CONJUGAT<br />
|-<br />
!35<br />
|advnu vbcnj <br />
|REGLA: NU + VERB CONJUGAT<br />
|-<br />
!36<br />
|prnproref vbcnj <br />
|REGLA: CLÍTIC + VERB CONJUGAT<br />
|-<br />
!37<br />
|advnu prnproref vbcnj <br />
|REGLA: NU + CLÍTIC + VERB CONJUGAT<br />
|-<br />
!38<br />
|vbcnj_doble1 vbcnj_doble2 <br />
|REGLA: VERB CONJUGAT DOBLE<br />
|-<br />
!39<br />
|advnu vbcnj_doble1 vbcnj_doble2 <br />
|REGLA: NU + VERB CONJUGAT DOBLE<br />
|-<br />
!40<br />
|vbcnj_doble1 vbcnj_doble2 prnenc <br />
|REGLA: VERB CONJUGAT DOBLE + CLÍTIC (-o)<br />
|-<br />
!41<br />
|advnu vbcnj_doble1 vbcnj_doble2 prnenc <br />
|REGLA: NU + VERB CONJUGAT DOBLE + CLÍTIC (-o)<br />
|-<br />
!42<br />
|vbcnj_triple1 vbcnj_triple2 vbcnj_triple3 <br />
|REGLA: VERB CONJUGAT TRIPLE<br />
|-<br />
!43<br />
|advnu vbcnj_triple1 vbcnj_triple2 vbcnj_triple3 <br />
|REGLA: NU + VERB CONJUGAT TRIPLE<br />
|-<br />
!44<br />
|inf <br />
|REGLA: INF<br />
|-<br />
!45<br />
|ger <br />
|REGLA: GER<br />
|-<br />
!46<br />
|cnjcoo <br />
|REGLA: CNJCOO<br />
|-<br />
!47<br />
|sent <br />
|Reinicialitza les variables<br />
|-<br />
!48<br />
|advnu vbcnj advdecât nom <br />
|Undocumented rule!<br />
|-<br />
!49<br />
|advnu vbcnj advdecât detind nom <br />
|Undocumented rule!<br />
|-<br />
!50<br />
|num mesos <br />
|Undocumented rule!<br />
|-<br />
!51<br />
|nom verbfi adj <br />
|Undocumented rule!<br />
|-<br />
!52<br />
|nom verbfi pp <br />
|Undocumented rule!<br />
|-<br />
!53<br />
|nom det_al relcărui nom <br />
|Undocumented rule!<br />
|-<br />
!54<br />
|prep_în nptop <br />
|Undocumented rule!<br />
|-<br />
!55<br />
|prep nom adjpos <br />
|Undocumented rule!<br />
|-<br />
!56<br />
|vbcnj nom adjpos <br />
|Undocumented rule!<br />
|-<br />
!57<br />
|verbtrebui conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!58<br />
|num mesos num <br />
|Undocumented rule!<br />
|-<br />
!59<br />
|prep nom adj <br />
|Undocumented rule!<br />
|-<br />
!60<br />
|prep nom adj adj <br />
|Undocumented rule!<br />
|-<br />
!61<br />
|prep nom nom adj adj <br />
|Undocumented rule!<br />
|-<br />
!62<br />
|vbcnj nom adj <br />
|Undocumented rule!<br />
|-<br />
!63<br />
|vbcnj nom det nom <br />
|Undocumented rule!<br />
|-<br />
!64<br />
|preppe npant <br />
|Undocumented rule!<br />
|-<br />
!65<br />
|preppe prn <br />
|Undocumented rule!<br />
|-<br />
!66<br />
|preppe det <br />
|Undocumented rule!<br />
|-<br />
!67<br />
|vbcnj prep nom nom <br />
|Undocumented rule!<br />
|-<br />
!68<br />
|prep nom nom_dg <br />
|Undocumented rule!<br />
|-<br />
!69<br />
|prep nom <br />
|Undocumented rule!<br />
|-<br />
!70<br />
|prep nom adj adj adj <br />
|Undocumented rule!<br />
|-<br />
!71<br />
|det_predet <br />
|Undocumented rule!<br />
|-<br />
!72<br />
|adjtot nom <br />
|Undocumented rule!<br />
|-<br />
!73<br />
|vbcnj dettuturor <br />
|Undocumented rule!<br />
|-<br />
!74<br />
|nom dettuturor <br />
|Undocumented rule!<br />
|-<br />
!75<br />
|relat <br />
|Undocumented rule!<br />
|-<br />
!76<br />
|vbcnj nom nom <br />
|Undocumented rule!<br />
|-<br />
!77<br />
|adj nom det_al nom <br />
|Undocumented rule!<br />
|-<br />
!78<br />
|nom adj det_al nom <br />
|Undocumented rule!<br />
|-<br />
!79<br />
|prep nom adj det_al np <br />
|Undocumented rule!<br />
|-<br />
!80<br />
|prep nom adj det_al nom <br />
|Undocumented rule!<br />
|-<br />
!81<br />
|nom det_al nom <br />
|Undocumented rule!<br />
|-<br />
!82<br />
|nom det nom <br />
|Undocumented rule!<br />
|-<br />
!83<br />
|nom det_al detind nom <br />
|Undocumented rule!<br />
|-<br />
!84<br />
|det nom det_al nom <br />
|Undocumented rule!<br />
|-<br />
!85<br />
|det adj nom det_al nom <br />
|Undocumented rule!<br />
|-<br />
!86<br />
|nom adv adj <br />
|Undocumented rule!<br />
|-<br />
!87<br />
|det_cel adv_mai adj <br />
|Undocumented rule!<br />
|-<br />
!88<br />
|advnespus prepde adj <br />
|Undocumented rule!<br />
|-<br />
!89<br />
|prep prepa prnpro inf <br />
|Undocumented rule!<br />
|-<br />
!90<br />
|prep prepa inf <br />
|Undocumented rule!<br />
|-<br />
!91<br />
|nom det_cel adj <br />
|Undocumented rule!<br />
|-<br />
!92<br />
|det nom adj <br />
|Undocumented rule!<br />
|-<br />
!93<br />
|det adj nom <br />
|Undocumented rule!<br />
|-<br />
!94<br />
|nom nom <br />
|Undocumented rule!<br />
|-<br />
!95<br />
|advnu adv_mai vbcnj <br />
|Undocumented rule!<br />
|-<br />
!96<br />
|vrea verbtrebui conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!97<br />
|conjnici advnu vbcnj <br />
|Undocumented rule!<br />
|-<br />
!98<br />
|advnici- advnu vbcnj <br />
|Undocumented rule!<br />
|-<br />
!99<br />
|advnu vbcnj advdecât <br />
|Undocumented rule!<br />
|-<br />
!100<br />
|vbaveacni verbtrebui conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!101<br />
|vbaveapri verbtrebui conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!102<br />
|vbcnj nom <br />
|Undocumented rule!<br />
|-<br />
!103<br />
|vbcnj conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!104<br />
|adv adj nom <br />
|Undocumented rule!<br />
|-<br />
!105<br />
|num adj nom <br />
|Undocumented rule!<br />
|-<br />
!106<br />
|vbcnj det_cel num nom <br />
|Undocumented rule!<br />
|-<br />
!107<br />
|vbcnj prep nom <br />
|Undocumented rule!<br />
|-<br />
!108<br />
|vrea vbcnj prep prn <br />
|Undocumented rule!<br />
|-<br />
!109<br />
|vrea vbcnj prep nom <br />
|Undocumented rule!<br />
|-<br />
!110<br />
|vbaveacni vbcnj prep nom <br />
|Undocumented rule!<br />
|-<br />
!111<br />
|prepde pp <br />
|Undocumented rule!<br />
|-<br />
!112<br />
|catverb prnenc <br />
|Undocumented rule!<br />
|-<br />
!113<br />
|catverb prnenc prnenc <br />
|Undocumented rule!<br />
|-<br />
!114<br />
|advca nom <br />
|Undocumented rule!<br />
|-<br />
!115<br />
|advca det nom <br />
|Undocumented rule!<br />
|-<br />
!116<br />
|adv adj advca <br />
|Undocumented rule!<br />
|-<br />
!117<br />
|prep nom nptop <br />
|Undocumented rule!<br />
|-<br />
!118<br />
|prepdintr- detind <br />
|Undocumented rule!<br />
|}<br />
<br />
=== Interchunk (T2X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Macro name<br />
! style="width: 10%" | Arguments<br />
! style="width: 60%" | Description<br />
|-<br />
!1<br />
|elimina_GDND<br />
| style="text-align:center" |1<br />
|Elimina les etiquetes 'GD' i 'ND'<br />
|-<br />
!2<br />
|f_concord_SN_SV<br />
| style="text-align:center" |2<br />
|Realitza la concordança d'un SN i un SV<br />
|-<br />
!3<br />
|f_concord_SN_SN_SV<br />
| style="text-align:center" |3<br />
|Realitza la concordança de dos SN i un SV<br />
|-<br />
!4<br />
|f_concord2<br />
| style="text-align:center" |2<br />
|Realitza la concordança entre dos elements<br />
|}<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|SN <br />
|SN<br />
|-<br />
!2<br />
|SAdj <br />
|SAdj<br />
|-<br />
!3<br />
|SN SAdj <br />
|SN + SAdj<br />
|-<br />
!4<br />
|SN SV <br />
|SN + SV<br />
|-<br />
!5<br />
|SN cnjcoo SN SV <br />
|SN + cnjcoo + SN + SV<br />
|-<br />
!6<br />
|SN SV_cop SAdj <br />
|SN + SV (copulatiu) + SAdj<br />
|-<br />
!7<br />
|SP <br />
|SP<br />
|}<br />
<br />
=== Postchunk (T3X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|sent <br />
|CHUNK:<br />
|}<br />
<br />
== Catalan → Romanian ==<br />
<br />
=== Chunk (T1X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Macro name<br />
! style="width: 10%" | Arguments<br />
! style="width: 60%" | Description<br />
|-<br />
!1<br />
|mod_maj<br />
| style="text-align:center" |1<br />
|Canvia la majúscula inicial d'un lema a minúscula<br />
|-<br />
!2<br />
|mod_maj2<br />
| style="text-align:center" |2<br />
|Canvia la minúscula inicial d'un lema a majúscula si un altre lema té majúscula inicial<br />
|-<br />
!3<br />
|conj_verb1<br />
| style="text-align:center" |1<br />
|Conjuga un verb simple<br />
|-<br />
!4<br />
|conj_verb2<br />
| style="text-align:center" |2<br />
|Conjuga un verb compost doble<br />
|-<br />
!5<br />
|gen_clitic1<br />
| style="text-align:center" |2<br />
|Genera un clític per a un verb<br />
|-<br />
!6<br />
|gen_clitic2<br />
| style="text-align:center" |3<br />
|Genera dos clítics per a un verb<br />
|-<br />
!7<br />
|f_concord1<br />
| style="text-align:center" |1<br />
|Realitza la concordança d'un element i desa els valors a les variables<br />
|-<br />
!8<br />
|f_concord2<br />
| style="text-align:center" |2<br />
|Realitza la concordança de dos elements i desa els valors a les variables<br />
|-<br />
!9<br />
|arregla_np<br />
| style="text-align:center" |1<br />
|Treu l'etiqueta de genitiu d'un nom propi i en determina l'article<br />
|}<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|nom <br />
|NOM: casa → casă<br />
|-<br />
!2<br />
|artdef nom <br />
|DET + NOM: la casa → casa<br />
|-<br />
!3<br />
|detpos nom <br />
|DETPOS + NOM: la meva casa → casa mea<br />
|-<br />
!4<br />
|nom adjpos <br />
|NOM + ADJPOS: casa meva → casa mea<br />
|-<br />
!5<br />
|artdef nom pr_de np <br />
|DET + NOM + de + NOM PROPI: la casa de Maria → casa Mariei / la casa de Mihai → casa lui Mihai<br />
|-<br />
!6<br />
|adjord nom <br />
|ADJORD + NOM: tercera posició → poziția a treia<br />
|-<br />
!7<br />
|artdef adjord nom <br />
|DET + ADJORD + NOM: la tercera posició → poziția a treia<br />
|-<br />
!8<br />
|adj_det nom <br />
|ADJECTIU + NOM: moltes cases → multe case<br />
|-<br />
!9<br />
|num nom <br />
|NUMERAL + NOM: cinc cases → cinci case / trenta cases → treizeci de case<br />
|-<br />
!10<br />
|np <br />
|NOM PROPI<br />
|-<br />
!11<br />
|prn <br />
|PRONOM<br />
|-<br />
!12<br />
|det_predet <br />
|DET/PREDET<br />
|-<br />
!13<br />
|detpos <br />
|DETPOS<br />
|-<br />
!14<br />
|num <br />
|NUMERAL<br />
|-<br />
!15<br />
|adj <br />
|ADJECTIU: bonic → frumos<br />
|-<br />
!16<br />
|pp <br />
|PARTICIPI<br />
|-<br />
!17<br />
|vbcnj <br />
|VERB CONJUGAT SIMPLE<br />
|-<br />
!18<br />
|inf <br />
|INFINITIU<br />
|-<br />
!19<br />
|ger <br />
|GERUNDI<br />
|-<br />
!20<br />
|vbcnj_doble1 vbcnj_doble2 <br />
|VERB CONJUGAT DOBLE<br />
|-<br />
!21<br />
|vbcnj_doble1 vbcnj_doble2 prnenc <br />
|VERB CONJUGAT DOBLE + ENC<br />
|-<br />
!22<br />
|vbcnj_doble1 vbcnj_doble2 prnenc prnenc <br />
|VERB CONJUGAT DOBLE + ENC + ENC<br />
|-<br />
!23<br />
|vbimp prnenc <br />
|REGLA: IMPERATIU ENC<br />
|-<br />
!24<br />
|vbimp prnenc prnenc <br />
|REGLA: IMPERATIU ENC ENC<br />
|-<br />
!25<br />
|ger prnenc <br />
|REGLA: GER ENC<br />
|-<br />
!26<br />
|ger prnenc prnenc <br />
|REGLA: GER ENC ENC<br />
|-<br />
!27<br />
|inf_sol prnenc <br />
|REGLA: INF ENC<br />
|-<br />
!28<br />
|inf_sol prnenc prnenc <br />
|REGLA: INF ENC ENC<br />
|-<br />
!29<br />
|vbhaver <br />
|REGLA: VBHAVER (per si apareix sol)<br />
|-<br />
!30<br />
|prnproenc <br />
|PRONOM CLÍTIC (per si apareix sol)<br />
|-<br />
!31<br />
|artdef_m_sg num_xifra <br />
|DETERMINANT + NUMERAL: el 1992 → în 1992<br />
|-<br />
!32<br />
|rel <br />
|RELATIU<br />
|-<br />
!33<br />
|sent <br />
|Reinicialitza les variables<br />
|}<br />
<br />
=== Interchunk (T2X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Macro name<br />
! style="width: 10%" | Arguments<br />
! style="width: 60%" | Description<br />
|-<br />
!1<br />
|elimina_GDND<br />
| style="text-align:center" |1<br />
|Elimina les etiquetes 'GD' i 'ND'.<br />
|-<br />
!2<br />
|elimina_GDND_adj<br />
| style="text-align:center" |1<br />
|Elimina les etiquetes 'GD' i 'ND' d'un adjectiu<br />
|-<br />
!3<br />
|f_concord_SN_SV<br />
| style="text-align:center" |2<br />
|Realitza la concordança d'un SN i un SV<br />
|}<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|SN <br />
|REGLA: SN<br />
|-<br />
!2<br />
|SA <br />
|REGLA: SA<br />
|-<br />
!3<br />
|SN SV <br />
|SN + SV<br />
|-<br />
!4<br />
|DET <br />
|REGLA: DET<br />
|-<br />
!5<br />
|NUM <br />
|REGLA: NUM<br />
|}<br />
<br />
=== Postchunk (T3X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|sent <br />
|CHUNK:<br />
|}<br />
<br />
<br />
[[Category:Romanian and Catalan|*]]</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Transfer_Rules&diff=68036
Romanian and Catalan/Transfer Rules
2018-11-28T10:04:56Z
<p>Marcriera: /* Chunk (T1X) */</p>
<hr />
<div>This page provides general documentation on the transfer rules used in the Romanian-Catalan pair. The tables are generated from the data and comments found within the rule files using an automated script ('''transfer_documentation.sh''', available [https://github.com/MarcRiera/apertium-scripts here]).<br />
<br />
== Romanian → Catalan ==<br />
<br />
=== Chunk (T1X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Macro name<br />
! style="width: 10%" | Arguments<br />
! style="width: 60%" | Description<br />
|-<br />
!1<br />
|mod_maj<br />
| style="text-align:center" |1<br />
|Canvia la majúscula inicial d'un lema a minúscula<br />
|-<br />
!2<br />
|conj_verb1<br />
| style="text-align:center" |1<br />
|Conjuga un verb simple<br />
|-<br />
!3<br />
|conj_verb2<br />
| style="text-align:center" |2<br />
|Conjuga un verb compost doble<br />
|-<br />
!4<br />
|conj_verb3<br />
| style="text-align:center" |3<br />
|Conjuga un verb compost triple<br />
|-<br />
!5<br />
|f_concord1<br />
| style="text-align:center" |1<br />
|Realitza la concordança d'un element i desa els valors a les variables del chunk<br />
|-<br />
!6<br />
|f_concord2<br />
| style="text-align:center" |2<br />
|Realitza la concordança de dos elements i desa els valors a les variables del chunk<br />
|-<br />
!7<br />
|f_concord3<br />
| style="text-align:center" |3<br />
|Realitza la concordança de tres elements i desa els valors a les variables del chunk<br />
|-<br />
!8<br />
|f_concord4<br />
| style="text-align:center" |4<br />
|Realitza la concordança de quatre elements i desa els valors a les variables del chunk<br />
|-<br />
!9<br />
|f_concord5<br />
| style="text-align:center" |5<br />
|Realitza la concordança de cinc elements i desa els valors a les variables del chunk<br />
|-<br />
!10<br />
|f_concord1_i<br />
| style="text-align:center" |1<br />
|Realitza la concordança d'un element intern i desa els valors a les variables internes<br />
|-<br />
!11<br />
|f_concord2_i<br />
| style="text-align:center" |2<br />
|Realitza la concordança de dos elements interns i desa els valors a les variables internes<br />
|-<br />
!12<br />
|f_concord3_i<br />
| style="text-align:center" |3<br />
|Realitza la concordança de tres elements interns i desa els valors a les variables internes<br />
|-<br />
!13<br />
|f_concord4_i<br />
| style="text-align:center" |4<br />
|Realitza la concordança de quatre elements interns i desa els valors a les variables internes<br />
|-<br />
!14<br />
|f_concord5_i<br />
| style="text-align:center" |5<br />
|Realitza la concordança de cinc elements interns i desa els valors a les variables internes<br />
|-<br />
!15<br />
|n_gen_nbr<br />
| style="text-align:center" |0<br />
|Neteja les variables de gènere i nombre<br />
|-<br />
!16<br />
|f_bcond<br />
| style="text-align:center" |2<br />
|Undocumented macro!<br />
|-<br />
!17<br />
|f_outverb2<br />
| style="text-align:center" |2<br />
|Undocumented macro!<br />
|-<br />
!18<br />
|f_outverb3<br />
| style="text-align:center" |3<br />
|Undocumented macro!<br />
|-<br />
!19<br />
|prn_verb<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|-<br />
!20<br />
|prn_verbp<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|-<br />
!21<br />
|prn_verbenc<br />
| style="text-align:center" |2<br />
|Undocumented macro!<br />
|-<br />
!22<br />
|f_modgen<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|-<br />
!23<br />
|f_lexicAdj<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|}<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|nom <br />
|NOM: casă → casa<br />
|-<br />
!2<br />
|num nom <br />
|NUM + NOM: 3 case → 3 cases<br />
|-<br />
!3<br />
|num prepde nom <br />
|NUM + de + NOM: 20 de case → 20 cases<br />
|-<br />
!4<br />
|nom adjnorm <br />
|NOM + ADJECTIU<br />
|-<br />
!5<br />
|nom adjpos <br />
|NOM + ADJECTIU POSSESSIU<br />
|-<br />
!6<br />
|num nom adjnorm <br />
|NUM + NOM + ADJECTIU<br />
|-<br />
!7<br />
|num prepde nom adjnorm <br />
|NUM + de + NOM + ADJECTIU<br />
|-<br />
!8<br />
|nom adjnorm adjnorm <br />
|NOM + ADJECTIU + ADJECTIU<br />
|-<br />
!9<br />
|adj nom <br />
|ADJECTIU + NOM<br />
|-<br />
!10<br />
|det nom <br />
|DETERMINANT + NOM<br />
|-<br />
!11<br />
|predet nom <br />
|tot + NOM<br />
|-<br />
!12<br />
|nom nom_dgdef adjpos <br />
|NOM + NOM GENITIU + ADJECTIU POSSESSIU<br />
|-<br />
!13<br />
|nom np_dg <br />
|NOM + NOM PROPI GENITIU<br />
|-<br />
!14<br />
|nom adj_lui np <br />
|NOM + lui + NOM PROPI<br />
|-<br />
!15<br />
|nom adjnorm det_al np_dg <br />
|NOM + ADJECTIU + al + NOM PROPI GENITIU<br />
|-<br />
!16<br />
|nom adjnorm det_al adj_lui np <br />
|NOM + ADJECTIU + al + lui + NOM PROPI<br />
|-<br />
!17<br />
|det_cel num nom <br />
|cel + NUMERAL + NOM<br />
|-<br />
!18<br />
|det_cel adv_mai adjnorm nom <br />
|cel + mai + ADJECTIU + NOM<br />
|-<br />
!19<br />
|det_cel adv_mai adjnorm nom adjnorm <br />
|cel + mai + ADJECTIU + NOM + ADJECTIU<br />
|-<br />
!20<br />
|det_cel adv_mai adjnorm nom adjnorm adjnorm <br />
|cel + mai + ADJECTIU + NOM + ADJECTIU + ADJECTIU<br />
|-<br />
!21<br />
|np <br />
|NOM PROPI<br />
|-<br />
!22<br />
|num <br />
|NUMERAL<br />
|-<br />
!23<br />
|prn_tnitg <br />
|PRONOM<br />
|-<br />
!24<br />
|prnpro <br />
|CLÍTIC<br />
|-<br />
!25<br />
|det_al nom_dg <br />
|al + NOM GENITIU<br />
|-<br />
!26<br />
|det_al nom_dg adjnorm <br />
|al + NOM GENITIU + ADJECTIU<br />
|-<br />
!27<br />
|det_al nom_dg adjnorm adjnorm <br />
|al + NOM GENITIU + ADJECTIU + ADJECTIU<br />
|-<br />
!28<br />
|adj <br />
|ADJ<br />
|-<br />
!29<br />
|pp <br />
|REGLA: PP<br />
|-<br />
!30<br />
|pp prnenc <br />
|REGLA: PP + ENC (per si no ho reconeix cap altra regla)<br />
|-<br />
!31<br />
|prepnum num_xifra <br />
|prep + NUMERAL (any)<br />
|-<br />
!32<br />
|prepnum nom_any num_xifra <br />
|prep + NOM (any) + NUMERAL (any)<br />
|-<br />
!33<br />
|prep num_xifra mesos num_xifra <br />
|prep + NUMERAL (dia) + NOM (mes) + NUMERAL (any)<br />
|-<br />
!34<br />
|vbcnj <br />
|REGLA: VERB CONJUGAT<br />
|-<br />
!35<br />
|advnu vbcnj <br />
|REGLA: NU + VERB CONJUGAT<br />
|-<br />
!36<br />
|prnproref vbcnj <br />
|REGLA: CLÍTIC + VERB CONJUGAT<br />
|-<br />
!37<br />
|advnu prnproref vbcnj <br />
|REGLA: NU + CLÍTIC + VERB CONJUGAT<br />
|-<br />
!38<br />
|vbcnj_doble1 vbcnj_doble2 <br />
|REGLA: VERB CONJUGAT DOBLE<br />
|-<br />
!39<br />
|advnu vbcnj_doble1 vbcnj_doble2 <br />
|REGLA: NU + VERB CONJUGAT DOBLE<br />
|-<br />
!40<br />
|vbcnj_doble1 vbcnj_doble2 prnenc <br />
|REGLA: VERB CONJUGAT DOBLE + CLÍTIC (-o)<br />
|-<br />
!41<br />
|advnu vbcnj_doble1 vbcnj_doble2 prnenc <br />
|REGLA: NU + VERB CONJUGAT DOBLE + CLÍTIC (-o)<br />
|-<br />
!42<br />
|vbcnj_triple1 vbcnj_triple2 vbcnj_triple3 <br />
|REGLA: VERB CONJUGAT TRIPLE<br />
|-<br />
!43<br />
|advnu vbcnj_triple1 vbcnj_triple2 vbcnj_triple3 <br />
|REGLA: NU + VERB CONJUGAT TRIPLE<br />
|-<br />
!44<br />
|inf <br />
|REGLA: INF<br />
|-<br />
!45<br />
|ger <br />
|REGLA: GER<br />
|-<br />
!46<br />
|cnjcoo <br />
|REGLA: CNJCOO<br />
|-<br />
!47<br />
|sent <br />
|Reinicialitza les variables<br />
|-<br />
!48<br />
|advnu vbcnj advdecât nom <br />
|Undocumented rule!<br />
|-<br />
!49<br />
|advnu vbcnj advdecât detind nom <br />
|Undocumented rule!<br />
|-<br />
!50<br />
|num mesos <br />
|Undocumented rule!<br />
|-<br />
!51<br />
|nom verbfi adj <br />
|Undocumented rule!<br />
|-<br />
!52<br />
|nom verbfi pp <br />
|Undocumented rule!<br />
|-<br />
!53<br />
|nom det_al relcărui nom <br />
|Undocumented rule!<br />
|-<br />
!54<br />
|prep_în nptop <br />
|Undocumented rule!<br />
|-<br />
!55<br />
|prep nom adjpos <br />
|Undocumented rule!<br />
|-<br />
!56<br />
|vbcnj nom adjpos <br />
|Undocumented rule!<br />
|-<br />
!57<br />
|verbtrebui conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!58<br />
|num mesos num <br />
|Undocumented rule!<br />
|-<br />
!59<br />
|prep nom adj <br />
|Undocumented rule!<br />
|-<br />
!60<br />
|prep nom adj adj <br />
|Undocumented rule!<br />
|-<br />
!61<br />
|prep nom nom adj adj <br />
|Undocumented rule!<br />
|-<br />
!62<br />
|vbcnj nom adj <br />
|Undocumented rule!<br />
|-<br />
!63<br />
|vbcnj nom det nom <br />
|Undocumented rule!<br />
|-<br />
!64<br />
|preppe npant <br />
|Undocumented rule!<br />
|-<br />
!65<br />
|preppe prn <br />
|Undocumented rule!<br />
|-<br />
!66<br />
|preppe det <br />
|Undocumented rule!<br />
|-<br />
!67<br />
|vbcnj prep nom nom <br />
|Undocumented rule!<br />
|-<br />
!68<br />
|prep nom nom_dg <br />
|Undocumented rule!<br />
|-<br />
!69<br />
|prep nom <br />
|Undocumented rule!<br />
|-<br />
!70<br />
|prep nom adj adj adj <br />
|Undocumented rule!<br />
|-<br />
!71<br />
|det_predet <br />
|Undocumented rule!<br />
|-<br />
!72<br />
|adjtot nom <br />
|Undocumented rule!<br />
|-<br />
!73<br />
|vbcnj dettuturor <br />
|Undocumented rule!<br />
|-<br />
!74<br />
|nom dettuturor <br />
|Undocumented rule!<br />
|-<br />
!75<br />
|relat <br />
|Undocumented rule!<br />
|-<br />
!76<br />
|vbcnj nom nom <br />
|Undocumented rule!<br />
|-<br />
!77<br />
|adj nom det_al nom <br />
|Undocumented rule!<br />
|-<br />
!78<br />
|nom adj det_al nom <br />
|Undocumented rule!<br />
|-<br />
!79<br />
|prep nom adj det_al np <br />
|Undocumented rule!<br />
|-<br />
!80<br />
|prep nom adj det_al nom <br />
|Undocumented rule!<br />
|-<br />
!81<br />
|nom det_al nom <br />
|Undocumented rule!<br />
|-<br />
!82<br />
|nom det nom <br />
|Undocumented rule!<br />
|-<br />
!83<br />
|nom det_al detind nom <br />
|Undocumented rule!<br />
|-<br />
!84<br />
|det nom det_al nom <br />
|Undocumented rule!<br />
|-<br />
!85<br />
|det adj nom det_al nom <br />
|Undocumented rule!<br />
|-<br />
!86<br />
|nom adv adj <br />
|Undocumented rule!<br />
|-<br />
!87<br />
|det_cel adv_mai adj <br />
|Undocumented rule!<br />
|-<br />
!88<br />
|advnespus prepde adj <br />
|Undocumented rule!<br />
|-<br />
!89<br />
|prep prepa prnpro inf <br />
|Undocumented rule!<br />
|-<br />
!90<br />
|prep prepa inf <br />
|Undocumented rule!<br />
|-<br />
!91<br />
|nom det_cel adj <br />
|Undocumented rule!<br />
|-<br />
!92<br />
|det nom adj <br />
|Undocumented rule!<br />
|-<br />
!93<br />
|det adj nom <br />
|Undocumented rule!<br />
|-<br />
!94<br />
|nom nom <br />
|Undocumented rule!<br />
|-<br />
!95<br />
|advnu adv_mai vbcnj <br />
|Undocumented rule!<br />
|-<br />
!96<br />
|vrea verbtrebui conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!97<br />
|conjnici advnu vbcnj <br />
|Undocumented rule!<br />
|-<br />
!98<br />
|advnici- advnu vbcnj <br />
|Undocumented rule!<br />
|-<br />
!99<br />
|advnu vbcnj advdecât <br />
|Undocumented rule!<br />
|-<br />
!100<br />
|vbaveacni verbtrebui conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!101<br />
|vbaveapri verbtrebui conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!102<br />
|vbcnj nom <br />
|Undocumented rule!<br />
|-<br />
!103<br />
|vbcnj conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!104<br />
|adv adj nom <br />
|Undocumented rule!<br />
|-<br />
!105<br />
|num adj nom <br />
|Undocumented rule!<br />
|-<br />
!106<br />
|vbcnj det_cel num nom <br />
|Undocumented rule!<br />
|-<br />
!107<br />
|vbcnj prep nom <br />
|Undocumented rule!<br />
|-<br />
!108<br />
|vrea vbcnj prep prn <br />
|Undocumented rule!<br />
|-<br />
!109<br />
|vrea vbcnj prep nom <br />
|Undocumented rule!<br />
|-<br />
!110<br />
|vbaveacni vbcnj prep nom <br />
|Undocumented rule!<br />
|-<br />
!111<br />
|prepde pp <br />
|Undocumented rule!<br />
|-<br />
!112<br />
|catverb prnenc <br />
|Undocumented rule!<br />
|-<br />
!113<br />
|catverb prnenc prnenc <br />
|Undocumented rule!<br />
|-<br />
!114<br />
|advca nom <br />
|Undocumented rule!<br />
|-<br />
!115<br />
|advca det nom <br />
|Undocumented rule!<br />
|-<br />
!116<br />
|adv adj advca <br />
|Undocumented rule!<br />
|-<br />
!117<br />
|prep nom nptop <br />
|Undocumented rule!<br />
|-<br />
!118<br />
|prepdintr- detind <br />
|Undocumented rule!<br />
|}<br />
<br />
=== Interchunk (T2X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Macro name<br />
! style="width: 10%" | Arguments<br />
! style="width: 60%" | Description<br />
|-<br />
!1<br />
|elimina_GDND<br />
| style="text-align:center" |1<br />
|Elimina les etiquetes 'GD' i 'ND'<br />
|-<br />
!2<br />
|f_concord_SN_SV<br />
| style="text-align:center" |2<br />
|Realitza la concordança d'un SN i un SV<br />
|-<br />
!3<br />
|f_concord_SN_SN_SV<br />
| style="text-align:center" |3<br />
|Realitza la concordança de dos SN i un SV<br />
|-<br />
!4<br />
|f_concord2<br />
| style="text-align:center" |2<br />
|Realitza la concordança entre dos elements<br />
|}<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|SN <br />
|SN<br />
|-<br />
!2<br />
|SAdj <br />
|SAdj<br />
|-<br />
!3<br />
|SN SAdj <br />
|SN + SAdj<br />
|-<br />
!4<br />
|SN SV <br />
|SN + SV<br />
|-<br />
!5<br />
|SN cnjcoo SN SV <br />
|SN + cnjcoo + SN + SV<br />
|-<br />
!6<br />
|SN SV_cop SAdj <br />
|SN + SV (copulatiu) + SAdj<br />
|-<br />
!7<br />
|SP <br />
|SP<br />
|}<br />
<br />
=== Postchunk (T3X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|sent <br />
|CHUNK:<br />
|}<br />
<br />
== Catalan → Romanian ==<br />
<br />
=== Chunk (T1X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Macro name<br />
! style="width: 10%" | Arguments<br />
! style="width: 60%" | Description<br />
|-<br />
!1<br />
|mod_maj<br />
| style="text-align:center" |1<br />
|Canvia la majúscula inicial d'un lema a minúscula<br />
|-<br />
!2<br />
|mod_maj2<br />
| style="text-align:center" |2<br />
|Canvia la minúscula inicial d'un lema a majúscula si un altre lema té majúscula inicial<br />
|-<br />
!3<br />
|conj_verb1<br />
| style="text-align:center" |1<br />
|Conjuga un verb simple<br />
|-<br />
!4<br />
|conj_verb2<br />
| style="text-align:center" |2<br />
|Conjuga un verb compost doble<br />
|-<br />
!5<br />
|gen_clitic1<br />
| style="text-align:center" |2<br />
|Genera un clític per a un verb<br />
|-<br />
!6<br />
|gen_clitic2<br />
| style="text-align:center" |3<br />
|Genera dos clítics per a un verb<br />
|-<br />
!7<br />
|f_concord1<br />
| style="text-align:center" |1<br />
|Realitza la concordança d'un element i desa els valors a les variables<br />
|-<br />
!8<br />
|f_concord2<br />
| style="text-align:center" |2<br />
|Realitza la concordança de dos elements i desa els valors a les variables<br />
|-<br />
!9<br />
|arregla_np<br />
| style="text-align:center" |1<br />
|Treu l'etiqueta de genitiu d'un nom propi i en determina l'article<br />
|}<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|nom <br />
|NOM: casa → casă<br />
|-<br />
!2<br />
|artdef nom <br />
|DET + NOM: la casa → casa<br />
|-<br />
!3<br />
|detpos nom <br />
|DETPOS + NOM: la meva casa → casa mea<br />
|-<br />
!4<br />
|nom adjpos <br />
|NOM + ADJPOS: casa meva → casa mea<br />
|-<br />
!5<br />
|artdef nom pr_de np <br />
|DET + NOM + de + NOM PROPI: la casa de Maria → casa Mariei / la casa de Mihai → casa lui Mihai<br />
|-<br />
!6<br />
|adjord nom <br />
|ADJORD + NOM: tercera posició → poziția a treia<br />
|-<br />
!7<br />
|artdef adjord nom <br />
|DET + ADJORD + NOM: la tercera posició → poziția a treia<br />
|-<br />
!8<br />
|adj_det nom <br />
|ADJECTIU + NOM: moltes cases → multe case<br />
|-<br />
!9<br />
|num nom <br />
|NUMERAL + NOM: cinc cases → cinci case / trenta cases → treizeci de case<br />
|-<br />
!10<br />
|np <br />
|NOM PROPI<br />
|-<br />
!11<br />
|prn <br />
|PRONOM<br />
|-<br />
!12<br />
|det_predet <br />
|DET/PREDET<br />
|-<br />
!13<br />
|detpos <br />
|DETPOS<br />
|-<br />
!14<br />
|num <br />
|NUMERAL<br />
|-<br />
!15<br />
|adj <br />
|ADJECTIU: bonic → frumos<br />
|-<br />
!16<br />
|pp <br />
|PARTICIPI<br />
|-<br />
!17<br />
|vbcnj <br />
|VERB CONJUGAT SIMPLE<br />
|-<br />
!18<br />
|inf <br />
|INFINITIU<br />
|-<br />
!19<br />
|ger <br />
|GERUNDI<br />
|-<br />
!20<br />
|vbcnj_doble1 vbcnj_doble2 <br />
|VERB CONJUGAT DOBLE<br />
|-<br />
!21<br />
|vbcnj_doble1 vbcnj_doble2 prnenc <br />
|VERB CONJUGAT DOBLE + ENC<br />
|-<br />
!22<br />
|vbcnj_doble1 vbcnj_doble2 prnenc prnenc <br />
|VERB CONJUGAT DOBLE + ENC + ENC<br />
|-<br />
!23<br />
|vbimp prnenc <br />
|REGLA: IMPERATIU ENC<br />
|-<br />
!24<br />
|vbimp prnenc prnenc <br />
|REGLA: IMPERATIU ENC ENC<br />
|-<br />
!25<br />
|ger prnenc <br />
|REGLA: GER ENC<br />
|-<br />
!26<br />
|ger prnenc prnenc <br />
|REGLA: GER ENC ENC<br />
|-<br />
!27<br />
|inf_sol prnenc <br />
|REGLA: INF ENC<br />
|-<br />
!28<br />
|inf_sol prnenc prnenc <br />
|REGLA: INF ENC ENC<br />
|-<br />
!29<br />
|vbhaver <br />
|REGLA: VBHAVER (per si apareix sol)<br />
|-<br />
!30<br />
|prnproenc <br />
|PRONOM CLÍTIC (per si apareix sol)<br />
|-<br />
!31<br />
|artdef_m_sg num_xifra <br />
|DETERMINANT + NUMERAL: el 1992 → în 1992<br />
|-<br />
!32<br />
|rel <br />
|RELATIU<br />
|-<br />
!33<br />
|sent <br />
|Reinicialitza les variables<br />
|}<br />
<br />
=== Interchunk (T2X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|SN <br />
|REGLA: SN<br />
|-<br />
!2<br />
|SA <br />
|REGLA: SA<br />
|}<br />
<br />
=== Postchunk (T3X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|sent <br />
|CHUNK:<br />
|}<br />
<br />
<br />
[[Category:Romanian and Catalan|*]]</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Transfer_Rules&diff=68035
Romanian and Catalan/Transfer Rules
2018-11-28T10:02:56Z
<p>Marcriera: /* Interchunk (T2X) */</p>
<hr />
<div>This page provides general documentation on the transfer rules used in the Romanian-Catalan pair. The tables are generated from the data and comments found within the rule files using an automated script ('''transfer_documentation.sh''', available [https://github.com/MarcRiera/apertium-scripts here]).<br />
<br />
== Romanian → Catalan ==<br />
<br />
=== Chunk (T1X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Macro name<br />
! style="width: 10%" | Arguments<br />
! style="width: 60%" | Description<br />
|-<br />
!1<br />
|mod_maj<br />
| style="text-align:center" |1<br />
|Canvia la majúscula inicial d'un lema a minúscula<br />
|-<br />
!2<br />
|conj_verb1<br />
| style="text-align:center" |1<br />
|Conjuga un verb simple<br />
|-<br />
!3<br />
|conj_verb2<br />
| style="text-align:center" |2<br />
|Conjuga un verb compost doble<br />
|-<br />
!4<br />
|conj_verb3<br />
| style="text-align:center" |3<br />
|Conjuga un verb compost triple<br />
|-<br />
!5<br />
|f_concord1<br />
| style="text-align:center" |1<br />
|Realitza la concordança d'un element i desa els valors a les variables del chunk<br />
|-<br />
!6<br />
|f_concord2<br />
| style="text-align:center" |2<br />
|Realitza la concordança de dos elements i desa els valors a les variables del chunk<br />
|-<br />
!7<br />
|f_concord3<br />
| style="text-align:center" |3<br />
|Realitza la concordança de tres elements i desa els valors a les variables del chunk<br />
|-<br />
!8<br />
|f_concord4<br />
| style="text-align:center" |4<br />
|Realitza la concordança de quatre elements i desa els valors a les variables del chunk<br />
|-<br />
!9<br />
|f_concord5<br />
| style="text-align:center" |5<br />
|Realitza la concordança de cinc elements i desa els valors a les variables del chunk<br />
|-<br />
!10<br />
|f_concord1_i<br />
| style="text-align:center" |1<br />
|Realitza la concordança d'un element intern i desa els valors a les variables internes<br />
|-<br />
!11<br />
|f_concord2_i<br />
| style="text-align:center" |2<br />
|Realitza la concordança de dos elements interns i desa els valors a les variables internes<br />
|-<br />
!12<br />
|f_concord3_i<br />
| style="text-align:center" |3<br />
|Realitza la concordança de tres elements interns i desa els valors a les variables internes<br />
|-<br />
!13<br />
|f_concord4_i<br />
| style="text-align:center" |4<br />
|Realitza la concordança de quatre elements interns i desa els valors a les variables internes<br />
|-<br />
!14<br />
|f_concord5_i<br />
| style="text-align:center" |5<br />
|Realitza la concordança de cinc elements interns i desa els valors a les variables internes<br />
|-<br />
!15<br />
|n_gen_nbr<br />
| style="text-align:center" |0<br />
|Neteja les variables de gènere i nombre<br />
|-<br />
!16<br />
|f_bcond<br />
| style="text-align:center" |2<br />
|Undocumented macro!<br />
|-<br />
!17<br />
|f_outverb2<br />
| style="text-align:center" |2<br />
|Undocumented macro!<br />
|-<br />
!18<br />
|f_outverb3<br />
| style="text-align:center" |3<br />
|Undocumented macro!<br />
|-<br />
!19<br />
|prn_verb<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|-<br />
!20<br />
|prn_verbp<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|-<br />
!21<br />
|prn_verbenc<br />
| style="text-align:center" |2<br />
|Undocumented macro!<br />
|-<br />
!22<br />
|f_modgen<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|-<br />
!23<br />
|f_lexicAdj<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|}<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|nom <br />
|NOM: casă → casa<br />
|-<br />
!2<br />
|num nom <br />
|NUM + NOM: 3 case → 3 cases<br />
|-<br />
!3<br />
|num prepde nom <br />
|NUM + de + NOM: 20 de case → 20 cases<br />
|-<br />
!4<br />
|nom adjnorm <br />
|NOM + ADJECTIU<br />
|-<br />
!5<br />
|nom adjpos <br />
|NOM + ADJECTIU POSSESSIU<br />
|-<br />
!6<br />
|num nom adjnorm <br />
|NUM + NOM + ADJECTIU<br />
|-<br />
!7<br />
|num prepde nom adjnorm <br />
|NUM + de + NOM + ADJECTIU<br />
|-<br />
!8<br />
|nom adjnorm adjnorm <br />
|NOM + ADJECTIU + ADJECTIU<br />
|-<br />
!9<br />
|adj nom <br />
|ADJECTIU + NOM<br />
|-<br />
!10<br />
|det nom <br />
|DETERMINANT + NOM<br />
|-<br />
!11<br />
|predet nom <br />
|tot + NOM<br />
|-<br />
!12<br />
|nom nom_dgdef adjpos <br />
|NOM + NOM GENITIU + ADJECTIU POSSESSIU<br />
|-<br />
!13<br />
|nom np_dg <br />
|NOM + NOM PROPI GENITIU<br />
|-<br />
!14<br />
|nom adj_lui np <br />
|NOM + lui + NOM PROPI<br />
|-<br />
!15<br />
|nom adjnorm det_al np_dg <br />
|NOM + ADJECTIU + al + NOM PROPI GENITIU<br />
|-<br />
!16<br />
|nom adjnorm det_al adj_lui np <br />
|NOM + ADJECTIU + al + lui + NOM PROPI<br />
|-<br />
!17<br />
|det_cel num nom <br />
|cel + NUMERAL + NOM<br />
|-<br />
!18<br />
|det_cel adv_mai adjnorm nom <br />
|cel + mai + ADJECTIU + NOM<br />
|-<br />
!19<br />
|det_cel adv_mai adjnorm nom adjnorm <br />
|cel + mai + ADJECTIU + NOM + ADJECTIU<br />
|-<br />
!20<br />
|det_cel adv_mai adjnorm nom adjnorm adjnorm <br />
|cel + mai + ADJECTIU + NOM + ADJECTIU + ADJECTIU<br />
|-<br />
!21<br />
|np <br />
|NOM PROPI<br />
|-<br />
!22<br />
|num <br />
|NUMERAL<br />
|-<br />
!23<br />
|prn_tnitg <br />
|PRONOM<br />
|-<br />
!24<br />
|prnpro <br />
|CLÍTIC<br />
|-<br />
!25<br />
|det_al nom_dg <br />
|al + NOM GENITIU<br />
|-<br />
!26<br />
|det_al nom_dg adjnorm <br />
|al + NOM GENITIU + ADJECTIU<br />
|-<br />
!27<br />
|det_al nom_dg adjnorm adjnorm <br />
|al + NOM GENITIU + ADJECTIU + ADJECTIU<br />
|-<br />
!28<br />
|adj <br />
|ADJ<br />
|-<br />
!29<br />
|pp <br />
|REGLA: PP<br />
|-<br />
!30<br />
|pp prnenc <br />
|REGLA: PP + ENC (per si no ho reconeix cap altra regla)<br />
|-<br />
!31<br />
|prepnum num_xifra <br />
|prep + NUMERAL (any)<br />
|-<br />
!32<br />
|prepnum nom_any num_xifra <br />
|prep + NOM (any) + NUMERAL (any)<br />
|-<br />
!33<br />
|prep num_xifra mesos num_xifra <br />
|prep + NUMERAL (dia) + NOM (mes) + NUMERAL (any)<br />
|-<br />
!34<br />
|vbcnj <br />
|REGLA: VERB CONJUGAT<br />
|-<br />
!35<br />
|advnu vbcnj <br />
|REGLA: NU + VERB CONJUGAT<br />
|-<br />
!36<br />
|prnproref vbcnj <br />
|REGLA: CLÍTIC + VERB CONJUGAT<br />
|-<br />
!37<br />
|advnu prnproref vbcnj <br />
|REGLA: NU + CLÍTIC + VERB CONJUGAT<br />
|-<br />
!38<br />
|vbcnj_doble1 vbcnj_doble2 <br />
|REGLA: VERB CONJUGAT DOBLE<br />
|-<br />
!39<br />
|advnu vbcnj_doble1 vbcnj_doble2 <br />
|REGLA: NU + VERB CONJUGAT DOBLE<br />
|-<br />
!40<br />
|vbcnj_doble1 vbcnj_doble2 prnenc <br />
|REGLA: VERB CONJUGAT DOBLE + CLÍTIC (-o)<br />
|-<br />
!41<br />
|advnu vbcnj_doble1 vbcnj_doble2 prnenc <br />
|REGLA: NU + VERB CONJUGAT DOBLE + CLÍTIC (-o)<br />
|-<br />
!42<br />
|vbcnj_triple1 vbcnj_triple2 vbcnj_triple3 <br />
|REGLA: VERB CONJUGAT TRIPLE<br />
|-<br />
!43<br />
|advnu vbcnj_triple1 vbcnj_triple2 vbcnj_triple3 <br />
|REGLA: NU + VERB CONJUGAT TRIPLE<br />
|-<br />
!44<br />
|inf <br />
|REGLA: INF<br />
|-<br />
!45<br />
|ger <br />
|REGLA: GER<br />
|-<br />
!46<br />
|cnjcoo <br />
|REGLA: CNJCOO<br />
|-<br />
!47<br />
|sent <br />
|Reinicialitza les variables<br />
|-<br />
!48<br />
|advnu vbcnj advdecât nom <br />
|Undocumented rule!<br />
|-<br />
!49<br />
|advnu vbcnj advdecât detind nom <br />
|Undocumented rule!<br />
|-<br />
!50<br />
|num mesos <br />
|Undocumented rule!<br />
|-<br />
!51<br />
|nom verbfi adj <br />
|Undocumented rule!<br />
|-<br />
!52<br />
|nom verbfi pp <br />
|Undocumented rule!<br />
|-<br />
!53<br />
|nom det_al relcărui nom <br />
|Undocumented rule!<br />
|-<br />
!54<br />
|prep_în nptop <br />
|Undocumented rule!<br />
|-<br />
!55<br />
|prep nom adjpos <br />
|Undocumented rule!<br />
|-<br />
!56<br />
|vbcnj nom adjpos <br />
|Undocumented rule!<br />
|-<br />
!57<br />
|verbtrebui conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!58<br />
|num mesos num <br />
|Undocumented rule!<br />
|-<br />
!59<br />
|prep nom adj <br />
|Undocumented rule!<br />
|-<br />
!60<br />
|prep nom adj adj <br />
|Undocumented rule!<br />
|-<br />
!61<br />
|prep nom nom adj adj <br />
|Undocumented rule!<br />
|-<br />
!62<br />
|vbcnj nom adj <br />
|Undocumented rule!<br />
|-<br />
!63<br />
|vbcnj nom det nom <br />
|Undocumented rule!<br />
|-<br />
!64<br />
|preppe npant <br />
|Undocumented rule!<br />
|-<br />
!65<br />
|preppe prn <br />
|Undocumented rule!<br />
|-<br />
!66<br />
|preppe det <br />
|Undocumented rule!<br />
|-<br />
!67<br />
|vbcnj prep nom nom <br />
|Undocumented rule!<br />
|-<br />
!68<br />
|prep nom nom_dg <br />
|Undocumented rule!<br />
|-<br />
!69<br />
|prep nom <br />
|Undocumented rule!<br />
|-<br />
!70<br />
|prep nom adj adj adj <br />
|Undocumented rule!<br />
|-<br />
!71<br />
|det_predet <br />
|Undocumented rule!<br />
|-<br />
!72<br />
|adjtot nom <br />
|Undocumented rule!<br />
|-<br />
!73<br />
|vbcnj dettuturor <br />
|Undocumented rule!<br />
|-<br />
!74<br />
|nom dettuturor <br />
|Undocumented rule!<br />
|-<br />
!75<br />
|relat <br />
|Undocumented rule!<br />
|-<br />
!76<br />
|vbcnj nom nom <br />
|Undocumented rule!<br />
|-<br />
!77<br />
|adj nom det_al nom <br />
|Undocumented rule!<br />
|-<br />
!78<br />
|nom adj det_al nom <br />
|Undocumented rule!<br />
|-<br />
!79<br />
|prep nom adj det_al np <br />
|Undocumented rule!<br />
|-<br />
!80<br />
|prep nom adj det_al nom <br />
|Undocumented rule!<br />
|-<br />
!81<br />
|nom det_al nom <br />
|Undocumented rule!<br />
|-<br />
!82<br />
|nom det nom <br />
|Undocumented rule!<br />
|-<br />
!83<br />
|nom det_al detind nom <br />
|Undocumented rule!<br />
|-<br />
!84<br />
|det nom det_al nom <br />
|Undocumented rule!<br />
|-<br />
!85<br />
|det adj nom det_al nom <br />
|Undocumented rule!<br />
|-<br />
!86<br />
|nom adv adj <br />
|Undocumented rule!<br />
|-<br />
!87<br />
|det_cel adv_mai adj <br />
|Undocumented rule!<br />
|-<br />
!88<br />
|advnespus prepde adj <br />
|Undocumented rule!<br />
|-<br />
!89<br />
|prep prepa prnpro inf <br />
|Undocumented rule!<br />
|-<br />
!90<br />
|prep prepa inf <br />
|Undocumented rule!<br />
|-<br />
!91<br />
|nom det_cel adj <br />
|Undocumented rule!<br />
|-<br />
!92<br />
|det nom adj <br />
|Undocumented rule!<br />
|-<br />
!93<br />
|det adj nom <br />
|Undocumented rule!<br />
|-<br />
!94<br />
|nom nom <br />
|Undocumented rule!<br />
|-<br />
!95<br />
|advnu adv_mai vbcnj <br />
|Undocumented rule!<br />
|-<br />
!96<br />
|vrea verbtrebui conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!97<br />
|conjnici advnu vbcnj <br />
|Undocumented rule!<br />
|-<br />
!98<br />
|advnici- advnu vbcnj <br />
|Undocumented rule!<br />
|-<br />
!99<br />
|advnu vbcnj advdecât <br />
|Undocumented rule!<br />
|-<br />
!100<br />
|vbaveacni verbtrebui conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!101<br />
|vbaveapri verbtrebui conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!102<br />
|vbcnj nom <br />
|Undocumented rule!<br />
|-<br />
!103<br />
|vbcnj conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!104<br />
|adv adj nom <br />
|Undocumented rule!<br />
|-<br />
!105<br />
|num adj nom <br />
|Undocumented rule!<br />
|-<br />
!106<br />
|vbcnj det_cel num nom <br />
|Undocumented rule!<br />
|-<br />
!107<br />
|vbcnj prep nom <br />
|Undocumented rule!<br />
|-<br />
!108<br />
|vrea vbcnj prep prn <br />
|Undocumented rule!<br />
|-<br />
!109<br />
|vrea vbcnj prep nom <br />
|Undocumented rule!<br />
|-<br />
!110<br />
|vbaveacni vbcnj prep nom <br />
|Undocumented rule!<br />
|-<br />
!111<br />
|prepde pp <br />
|Undocumented rule!<br />
|-<br />
!112<br />
|catverb prnenc <br />
|Undocumented rule!<br />
|-<br />
!113<br />
|catverb prnenc prnenc <br />
|Undocumented rule!<br />
|-<br />
!114<br />
|advca nom <br />
|Undocumented rule!<br />
|-<br />
!115<br />
|advca det nom <br />
|Undocumented rule!<br />
|-<br />
!116<br />
|adv adj advca <br />
|Undocumented rule!<br />
|-<br />
!117<br />
|prep nom nptop <br />
|Undocumented rule!<br />
|-<br />
!118<br />
|prepdintr- detind <br />
|Undocumented rule!<br />
|}<br />
<br />
=== Interchunk (T2X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Macro name<br />
! style="width: 10%" | Arguments<br />
! style="width: 60%" | Description<br />
|-<br />
!1<br />
|elimina_GDND<br />
| style="text-align:center" |1<br />
|Elimina les etiquetes 'GD' i 'ND'<br />
|-<br />
!2<br />
|f_concord_SN_SV<br />
| style="text-align:center" |2<br />
|Realitza la concordança d'un SN i un SV<br />
|-<br />
!3<br />
|f_concord_SN_SN_SV<br />
| style="text-align:center" |3<br />
|Realitza la concordança de dos SN i un SV<br />
|-<br />
!4<br />
|f_concord2<br />
| style="text-align:center" |2<br />
|Realitza la concordança entre dos elements<br />
|}<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|SN <br />
|SN<br />
|-<br />
!2<br />
|SAdj <br />
|SAdj<br />
|-<br />
!3<br />
|SN SAdj <br />
|SN + SAdj<br />
|-<br />
!4<br />
|SN SV <br />
|SN + SV<br />
|-<br />
!5<br />
|SN cnjcoo SN SV <br />
|SN + cnjcoo + SN + SV<br />
|-<br />
!6<br />
|SN SV_cop SAdj <br />
|SN + SV (copulatiu) + SAdj<br />
|-<br />
!7<br />
|SP <br />
|SP<br />
|}<br />
<br />
=== Postchunk (T3X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|sent <br />
|CHUNK:<br />
|}<br />
<br />
== Catalan → Romanian ==<br />
<br />
=== Chunk (T1X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Macro name<br />
! style="width: 10%" | Arguments<br />
! style="width: 60%" | Description<br />
|-<br />
!1<br />
|conj_verb1<br />
| style="text-align:center" |1<br />
|Conjuga un verb simple<br />
|}<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|nom <br />
|casa > casă<br />
|-<br />
!2<br />
|artdef nom <br />
|la casa > casa<br />
|-<br />
!3<br />
|np <br />
|REGLA: NOM PROPI<br />
|-<br />
!4<br />
|adj <br />
|REGLA: ADJ<br />
|-<br />
!5<br />
|pp <br />
|REGLA: PP<br />
|-<br />
!6<br />
|vbcnj <br />
|REGLA: VERB CONJUGAT<br />
|-<br />
!7<br />
|inf <br />
|REGLA: INF<br />
|-<br />
!8<br />
|ger <br />
|REGLA: GER<br />
|-<br />
!9<br />
|vbhaver <br />
|REGLA: VBHAVER (per si apareix sol)<br />
|-<br />
!10<br />
|sent <br />
|Resets variables.<br />
|}<br />
<br />
=== Interchunk (T2X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|SN <br />
|REGLA: SN<br />
|-<br />
!2<br />
|SA <br />
|REGLA: SA<br />
|}<br />
<br />
=== Postchunk (T3X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|sent <br />
|CHUNK:<br />
|}<br />
<br />
<br />
[[Category:Romanian and Catalan|*]]</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Transfer_Rules&diff=68034
Romanian and Catalan/Transfer Rules
2018-11-28T10:01:49Z
<p>Marcriera: /* Chunk (T1X) */</p>
<hr />
<div>This page provides general documentation on the transfer rules used in the Romanian-Catalan pair. The tables are generated from the data and comments found within the rule files using an automated script ('''transfer_documentation.sh''', available [https://github.com/MarcRiera/apertium-scripts here]).<br />
<br />
== Romanian → Catalan ==<br />
<br />
=== Chunk (T1X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Macro name<br />
! style="width: 10%" | Arguments<br />
! style="width: 60%" | Description<br />
|-<br />
!1<br />
|mod_maj<br />
| style="text-align:center" |1<br />
|Canvia la majúscula inicial d'un lema a minúscula<br />
|-<br />
!2<br />
|conj_verb1<br />
| style="text-align:center" |1<br />
|Conjuga un verb simple<br />
|-<br />
!3<br />
|conj_verb2<br />
| style="text-align:center" |2<br />
|Conjuga un verb compost doble<br />
|-<br />
!4<br />
|conj_verb3<br />
| style="text-align:center" |3<br />
|Conjuga un verb compost triple<br />
|-<br />
!5<br />
|f_concord1<br />
| style="text-align:center" |1<br />
|Realitza la concordança d'un element i desa els valors a les variables del chunk<br />
|-<br />
!6<br />
|f_concord2<br />
| style="text-align:center" |2<br />
|Realitza la concordança de dos elements i desa els valors a les variables del chunk<br />
|-<br />
!7<br />
|f_concord3<br />
| style="text-align:center" |3<br />
|Realitza la concordança de tres elements i desa els valors a les variables del chunk<br />
|-<br />
!8<br />
|f_concord4<br />
| style="text-align:center" |4<br />
|Realitza la concordança de quatre elements i desa els valors a les variables del chunk<br />
|-<br />
!9<br />
|f_concord5<br />
| style="text-align:center" |5<br />
|Realitza la concordança de cinc elements i desa els valors a les variables del chunk<br />
|-<br />
!10<br />
|f_concord1_i<br />
| style="text-align:center" |1<br />
|Realitza la concordança d'un element intern i desa els valors a les variables internes<br />
|-<br />
!11<br />
|f_concord2_i<br />
| style="text-align:center" |2<br />
|Realitza la concordança de dos elements interns i desa els valors a les variables internes<br />
|-<br />
!12<br />
|f_concord3_i<br />
| style="text-align:center" |3<br />
|Realitza la concordança de tres elements interns i desa els valors a les variables internes<br />
|-<br />
!13<br />
|f_concord4_i<br />
| style="text-align:center" |4<br />
|Realitza la concordança de quatre elements interns i desa els valors a les variables internes<br />
|-<br />
!14<br />
|f_concord5_i<br />
| style="text-align:center" |5<br />
|Realitza la concordança de cinc elements interns i desa els valors a les variables internes<br />
|-<br />
!15<br />
|n_gen_nbr<br />
| style="text-align:center" |0<br />
|Neteja les variables de gènere i nombre<br />
|-<br />
!16<br />
|f_bcond<br />
| style="text-align:center" |2<br />
|Undocumented macro!<br />
|-<br />
!17<br />
|f_outverb2<br />
| style="text-align:center" |2<br />
|Undocumented macro!<br />
|-<br />
!18<br />
|f_outverb3<br />
| style="text-align:center" |3<br />
|Undocumented macro!<br />
|-<br />
!19<br />
|prn_verb<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|-<br />
!20<br />
|prn_verbp<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|-<br />
!21<br />
|prn_verbenc<br />
| style="text-align:center" |2<br />
|Undocumented macro!<br />
|-<br />
!22<br />
|f_modgen<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|-<br />
!23<br />
|f_lexicAdj<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|}<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|nom <br />
|NOM: casă → casa<br />
|-<br />
!2<br />
|num nom <br />
|NUM + NOM: 3 case → 3 cases<br />
|-<br />
!3<br />
|num prepde nom <br />
|NUM + de + NOM: 20 de case → 20 cases<br />
|-<br />
!4<br />
|nom adjnorm <br />
|NOM + ADJECTIU<br />
|-<br />
!5<br />
|nom adjpos <br />
|NOM + ADJECTIU POSSESSIU<br />
|-<br />
!6<br />
|num nom adjnorm <br />
|NUM + NOM + ADJECTIU<br />
|-<br />
!7<br />
|num prepde nom adjnorm <br />
|NUM + de + NOM + ADJECTIU<br />
|-<br />
!8<br />
|nom adjnorm adjnorm <br />
|NOM + ADJECTIU + ADJECTIU<br />
|-<br />
!9<br />
|adj nom <br />
|ADJECTIU + NOM<br />
|-<br />
!10<br />
|det nom <br />
|DETERMINANT + NOM<br />
|-<br />
!11<br />
|predet nom <br />
|tot + NOM<br />
|-<br />
!12<br />
|nom nom_dgdef adjpos <br />
|NOM + NOM GENITIU + ADJECTIU POSSESSIU<br />
|-<br />
!13<br />
|nom np_dg <br />
|NOM + NOM PROPI GENITIU<br />
|-<br />
!14<br />
|nom adj_lui np <br />
|NOM + lui + NOM PROPI<br />
|-<br />
!15<br />
|nom adjnorm det_al np_dg <br />
|NOM + ADJECTIU + al + NOM PROPI GENITIU<br />
|-<br />
!16<br />
|nom adjnorm det_al adj_lui np <br />
|NOM + ADJECTIU + al + lui + NOM PROPI<br />
|-<br />
!17<br />
|det_cel num nom <br />
|cel + NUMERAL + NOM<br />
|-<br />
!18<br />
|det_cel adv_mai adjnorm nom <br />
|cel + mai + ADJECTIU + NOM<br />
|-<br />
!19<br />
|det_cel adv_mai adjnorm nom adjnorm <br />
|cel + mai + ADJECTIU + NOM + ADJECTIU<br />
|-<br />
!20<br />
|det_cel adv_mai adjnorm nom adjnorm adjnorm <br />
|cel + mai + ADJECTIU + NOM + ADJECTIU + ADJECTIU<br />
|-<br />
!21<br />
|np <br />
|NOM PROPI<br />
|-<br />
!22<br />
|num <br />
|NUMERAL<br />
|-<br />
!23<br />
|prn_tnitg <br />
|PRONOM<br />
|-<br />
!24<br />
|prnpro <br />
|CLÍTIC<br />
|-<br />
!25<br />
|det_al nom_dg <br />
|al + NOM GENITIU<br />
|-<br />
!26<br />
|det_al nom_dg adjnorm <br />
|al + NOM GENITIU + ADJECTIU<br />
|-<br />
!27<br />
|det_al nom_dg adjnorm adjnorm <br />
|al + NOM GENITIU + ADJECTIU + ADJECTIU<br />
|-<br />
!28<br />
|adj <br />
|ADJ<br />
|-<br />
!29<br />
|pp <br />
|REGLA: PP<br />
|-<br />
!30<br />
|pp prnenc <br />
|REGLA: PP + ENC (per si no ho reconeix cap altra regla)<br />
|-<br />
!31<br />
|prepnum num_xifra <br />
|prep + NUMERAL (any)<br />
|-<br />
!32<br />
|prepnum nom_any num_xifra <br />
|prep + NOM (any) + NUMERAL (any)<br />
|-<br />
!33<br />
|prep num_xifra mesos num_xifra <br />
|prep + NUMERAL (dia) + NOM (mes) + NUMERAL (any)<br />
|-<br />
!34<br />
|vbcnj <br />
|REGLA: VERB CONJUGAT<br />
|-<br />
!35<br />
|advnu vbcnj <br />
|REGLA: NU + VERB CONJUGAT<br />
|-<br />
!36<br />
|prnproref vbcnj <br />
|REGLA: CLÍTIC + VERB CONJUGAT<br />
|-<br />
!37<br />
|advnu prnproref vbcnj <br />
|REGLA: NU + CLÍTIC + VERB CONJUGAT<br />
|-<br />
!38<br />
|vbcnj_doble1 vbcnj_doble2 <br />
|REGLA: VERB CONJUGAT DOBLE<br />
|-<br />
!39<br />
|advnu vbcnj_doble1 vbcnj_doble2 <br />
|REGLA: NU + VERB CONJUGAT DOBLE<br />
|-<br />
!40<br />
|vbcnj_doble1 vbcnj_doble2 prnenc <br />
|REGLA: VERB CONJUGAT DOBLE + CLÍTIC (-o)<br />
|-<br />
!41<br />
|advnu vbcnj_doble1 vbcnj_doble2 prnenc <br />
|REGLA: NU + VERB CONJUGAT DOBLE + CLÍTIC (-o)<br />
|-<br />
!42<br />
|vbcnj_triple1 vbcnj_triple2 vbcnj_triple3 <br />
|REGLA: VERB CONJUGAT TRIPLE<br />
|-<br />
!43<br />
|advnu vbcnj_triple1 vbcnj_triple2 vbcnj_triple3 <br />
|REGLA: NU + VERB CONJUGAT TRIPLE<br />
|-<br />
!44<br />
|inf <br />
|REGLA: INF<br />
|-<br />
!45<br />
|ger <br />
|REGLA: GER<br />
|-<br />
!46<br />
|cnjcoo <br />
|REGLA: CNJCOO<br />
|-<br />
!47<br />
|sent <br />
|Reinicialitza les variables<br />
|-<br />
!48<br />
|advnu vbcnj advdecât nom <br />
|Undocumented rule!<br />
|-<br />
!49<br />
|advnu vbcnj advdecât detind nom <br />
|Undocumented rule!<br />
|-<br />
!50<br />
|num mesos <br />
|Undocumented rule!<br />
|-<br />
!51<br />
|nom verbfi adj <br />
|Undocumented rule!<br />
|-<br />
!52<br />
|nom verbfi pp <br />
|Undocumented rule!<br />
|-<br />
!53<br />
|nom det_al relcărui nom <br />
|Undocumented rule!<br />
|-<br />
!54<br />
|prep_în nptop <br />
|Undocumented rule!<br />
|-<br />
!55<br />
|prep nom adjpos <br />
|Undocumented rule!<br />
|-<br />
!56<br />
|vbcnj nom adjpos <br />
|Undocumented rule!<br />
|-<br />
!57<br />
|verbtrebui conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!58<br />
|num mesos num <br />
|Undocumented rule!<br />
|-<br />
!59<br />
|prep nom adj <br />
|Undocumented rule!<br />
|-<br />
!60<br />
|prep nom adj adj <br />
|Undocumented rule!<br />
|-<br />
!61<br />
|prep nom nom adj adj <br />
|Undocumented rule!<br />
|-<br />
!62<br />
|vbcnj nom adj <br />
|Undocumented rule!<br />
|-<br />
!63<br />
|vbcnj nom det nom <br />
|Undocumented rule!<br />
|-<br />
!64<br />
|preppe npant <br />
|Undocumented rule!<br />
|-<br />
!65<br />
|preppe prn <br />
|Undocumented rule!<br />
|-<br />
!66<br />
|preppe det <br />
|Undocumented rule!<br />
|-<br />
!67<br />
|vbcnj prep nom nom <br />
|Undocumented rule!<br />
|-<br />
!68<br />
|prep nom nom_dg <br />
|Undocumented rule!<br />
|-<br />
!69<br />
|prep nom <br />
|Undocumented rule!<br />
|-<br />
!70<br />
|prep nom adj adj adj <br />
|Undocumented rule!<br />
|-<br />
!71<br />
|det_predet <br />
|Undocumented rule!<br />
|-<br />
!72<br />
|adjtot nom <br />
|Undocumented rule!<br />
|-<br />
!73<br />
|vbcnj dettuturor <br />
|Undocumented rule!<br />
|-<br />
!74<br />
|nom dettuturor <br />
|Undocumented rule!<br />
|-<br />
!75<br />
|relat <br />
|Undocumented rule!<br />
|-<br />
!76<br />
|vbcnj nom nom <br />
|Undocumented rule!<br />
|-<br />
!77<br />
|adj nom det_al nom <br />
|Undocumented rule!<br />
|-<br />
!78<br />
|nom adj det_al nom <br />
|Undocumented rule!<br />
|-<br />
!79<br />
|prep nom adj det_al np <br />
|Undocumented rule!<br />
|-<br />
!80<br />
|prep nom adj det_al nom <br />
|Undocumented rule!<br />
|-<br />
!81<br />
|nom det_al nom <br />
|Undocumented rule!<br />
|-<br />
!82<br />
|nom det nom <br />
|Undocumented rule!<br />
|-<br />
!83<br />
|nom det_al detind nom <br />
|Undocumented rule!<br />
|-<br />
!84<br />
|det nom det_al nom <br />
|Undocumented rule!<br />
|-<br />
!85<br />
|det adj nom det_al nom <br />
|Undocumented rule!<br />
|-<br />
!86<br />
|nom adv adj <br />
|Undocumented rule!<br />
|-<br />
!87<br />
|det_cel adv_mai adj <br />
|Undocumented rule!<br />
|-<br />
!88<br />
|advnespus prepde adj <br />
|Undocumented rule!<br />
|-<br />
!89<br />
|prep prepa prnpro inf <br />
|Undocumented rule!<br />
|-<br />
!90<br />
|prep prepa inf <br />
|Undocumented rule!<br />
|-<br />
!91<br />
|nom det_cel adj <br />
|Undocumented rule!<br />
|-<br />
!92<br />
|det nom adj <br />
|Undocumented rule!<br />
|-<br />
!93<br />
|det adj nom <br />
|Undocumented rule!<br />
|-<br />
!94<br />
|nom nom <br />
|Undocumented rule!<br />
|-<br />
!95<br />
|advnu adv_mai vbcnj <br />
|Undocumented rule!<br />
|-<br />
!96<br />
|vrea verbtrebui conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!97<br />
|conjnici advnu vbcnj <br />
|Undocumented rule!<br />
|-<br />
!98<br />
|advnici- advnu vbcnj <br />
|Undocumented rule!<br />
|-<br />
!99<br />
|advnu vbcnj advdecât <br />
|Undocumented rule!<br />
|-<br />
!100<br />
|vbaveacni verbtrebui conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!101<br />
|vbaveapri verbtrebui conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!102<br />
|vbcnj nom <br />
|Undocumented rule!<br />
|-<br />
!103<br />
|vbcnj conjsub vbcnj <br />
|Undocumented rule!<br />
|-<br />
!104<br />
|adv adj nom <br />
|Undocumented rule!<br />
|-<br />
!105<br />
|num adj nom <br />
|Undocumented rule!<br />
|-<br />
!106<br />
|vbcnj det_cel num nom <br />
|Undocumented rule!<br />
|-<br />
!107<br />
|vbcnj prep nom <br />
|Undocumented rule!<br />
|-<br />
!108<br />
|vrea vbcnj prep prn <br />
|Undocumented rule!<br />
|-<br />
!109<br />
|vrea vbcnj prep nom <br />
|Undocumented rule!<br />
|-<br />
!110<br />
|vbaveacni vbcnj prep nom <br />
|Undocumented rule!<br />
|-<br />
!111<br />
|prepde pp <br />
|Undocumented rule!<br />
|-<br />
!112<br />
|catverb prnenc <br />
|Undocumented rule!<br />
|-<br />
!113<br />
|catverb prnenc prnenc <br />
|Undocumented rule!<br />
|-<br />
!114<br />
|advca nom <br />
|Undocumented rule!<br />
|-<br />
!115<br />
|advca det nom <br />
|Undocumented rule!<br />
|-<br />
!116<br />
|adv adj advca <br />
|Undocumented rule!<br />
|-<br />
!117<br />
|prep nom nptop <br />
|Undocumented rule!<br />
|-<br />
!118<br />
|prepdintr- detind <br />
|Undocumented rule!<br />
|}<br />
<br />
=== Interchunk (T2X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|SN <br />
|REGLA: SN<br />
|}<br />
<br />
=== Postchunk (T3X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|sent <br />
|CHUNK:<br />
|}<br />
<br />
== Catalan → Romanian ==<br />
<br />
=== Chunk (T1X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Macro name<br />
! style="width: 10%" | Arguments<br />
! style="width: 60%" | Description<br />
|-<br />
!1<br />
|conj_verb1<br />
| style="text-align:center" |1<br />
|Conjuga un verb simple<br />
|}<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|nom <br />
|casa > casă<br />
|-<br />
!2<br />
|artdef nom <br />
|la casa > casa<br />
|-<br />
!3<br />
|np <br />
|REGLA: NOM PROPI<br />
|-<br />
!4<br />
|adj <br />
|REGLA: ADJ<br />
|-<br />
!5<br />
|pp <br />
|REGLA: PP<br />
|-<br />
!6<br />
|vbcnj <br />
|REGLA: VERB CONJUGAT<br />
|-<br />
!7<br />
|inf <br />
|REGLA: INF<br />
|-<br />
!8<br />
|ger <br />
|REGLA: GER<br />
|-<br />
!9<br />
|vbhaver <br />
|REGLA: VBHAVER (per si apareix sol)<br />
|-<br />
!10<br />
|sent <br />
|Resets variables.<br />
|}<br />
<br />
=== Interchunk (T2X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|SN <br />
|REGLA: SN<br />
|-<br />
!2<br />
|SA <br />
|REGLA: SA<br />
|}<br />
<br />
=== Postchunk (T3X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|sent <br />
|CHUNK:<br />
|}<br />
<br />
<br />
[[Category:Romanian and Catalan|*]]</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian&diff=68033
Romanian
2018-11-28T09:53:25Z
<p>Marcriera: </p>
<hr />
<div>'''Romanian''' ([[Wikipedia:Romanian language]]) is a [[Romance language]]. It is spoken primarily in Romania and Moldova. It is available in Apertium as a standalone analyser/generator ([[Romanian#Apertium-ron|apertium-ron]]) and as a component of several pairs which translate to/from Romanian.<br />
<br />
* '''[[apertium-es-ro|Spanish-Romanian]]''' (trunk) <br />
* [[apertium-ron-cat|Romanian-Catalan]] (nursery) <br />
* [[apertium-ro-it|Romanian-Italian]] (nursery)<br />
* ''[[apertium-fra-ron|French-Romanian]]'' (incubator)<br />
* ''[[apertium-ron-ina|Romanian-Interlingua]]'' (incubator)<br />
* ''[[apertium-ron-rup|Romanian-Aromanian]]'' (incubator)<br />
<br />
== Apertium-ron ==<br />
<br />
===Current status===<br />
<br />
''Last update: 28 Nov 2018''<br />
<br />
'''Dix entries:''' 22,230<br />
<br />
'''Dix paradigms:''' 909<br />
<br />
== Materials ==<br />
<br />
<br />
===General===<br />
<br />
* Sala, Marius. "Limba română, limbă romanică."&nbsp;''Romance Philology''&nbsp;(2011).<br />
* Niculescu, Alexandru.&nbsp;''Outline history of the Romanian language''. Editura Științifică și Enciclopedică, 1981.<br />
* Todi, Aida.&nbsp;''Istoria limbii române''. Editura Universitară, 2008.<br />
<br />
===Comparative Studies===<br />
<br />
* [http://dl.acm.org/citation.cfm?id=1596842 Girju, Roxana. "The syntax and semantics of prepositions in the task of automatic interpretation of nominal phrases and compounds: A cross-linguistic study." ''Computational Linguistics'' 35, no. 2 (2009): 185-228.]<br />
* [https://books.google.ro/books?hl=en&lr=&id=UKmQESh-VCcC&oi=fnd&pg=PR7&dq=Romanian+&ots=QTCkN55NFz&sig=2FrQE5Y_zfAjsGThuepTrcfkFII&redir_esc=y#v=onepage&q=Romanian&f=false Dobrovie-Sorin, Carmen. ''The syntax of Romanian: Comparative studies in Romance''. Vol. 40. Walter de Gruyter, 1994.]<br />
* Niculescu, Alexandru.&nbsp;''Individualitatea limbii române între limbile romanice''. Vol. 3. Editura Științifică și Enciclopedică, 1999.<br />
<br />
===Machine Translation Tools and Techniques===<br />
<br />
* [http://mt-archive.info/IIS-2009-Irimia.pdf Irimia, Elena. "EBMT experiments for the English-Romanian language pair." ''Recent Advances in Intelligent Information Systems'', ISBN (2009): 978-83.]<br />
* [http://www.racai.ro/media/Tufis-Mason-LREC1998.pdf Tufis, Dan, and Oliver Mason. "Tagging romanian texts: a case study for qtag, a language independent probabilistic tagger." In ''Proceedings of the First International Conference on Language Resources and Evaluation (LREC)'', vol. 1, pp. 589-596. 1998.]<br />
* [http://link.springer.com/chapter/10.1007/3-540-36456-0_26 Şerban, Gabriela, and Doina Tătar. "Word sense disambiguation for untagged corpus: application to Romanian language." In ''Computational Linguistics and Intelligent Text Processing'', pp. 268-272. Springer Berlin Heidelberg, 2003.]<br />
* [http://www.aclweb.org/anthology/W/W08/W08-19.pdf#page=65 Curteanu, Neculai, Alex Moruz, and Diana Trandabăţ. "Extracting Sense Trees from the Romanian Thesaurus by Sense Segmentation & Dependency Parsing." In ''Proceedings of the workshop on Cognitive Aspects of the Lexicon'', pp. 55-63. Association for Computational Linguistics, 2008.]<br />
* [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.149.3908&rep=rep1&type=pdf Cristea, Dan, Marius Răschip, Corina Forăscu, Gabriela Haja, Cristina Florescu, Bogdan Aldea, and Elena Dănilă. "The Digital Form of the Thesaurus Dictionary of the Romanian Language." In ''Proceedings of the 4th International IEEE Conference SpeD'', pp. 195-206. 2007.]<br />
* [http://digital.library.unt.edu/ark:/67531/metadc31007/m2/1/high_res_d/Mihalcea-2008-Multilingual_Subjectivity_Analysis_Using.pdf Banea, Carmen, Rada Mihalcea, Janyce Wiebe, and Samer Hassan. "Multilingual subjectivity analysis using machine translation." In ''Proceedings of the Conference on Empirical Methods in Natural Language Processing'', pp. 127-135. Association for Computational Linguistics, 2008.]<br />
* [http://link.springer.com/chapter/10.1007/978-3-642-40495-5_27 Colhon, Mihaela. "Automatic Lexical Alignment between Syntactically Weak Related Languages. Application for English and Romanian." In ''Computational Collective Intelligence. Technologies and Applications'', pp. 266-275. Springer Berlin Heidelberg, 2013.]<br />
* Tufiş, D., A. M. Barbu, V. Pătraşcu, G. Rotariu, and C. Popescu. "Corpora and Corpus-Based Morpho-Lexical Processing."&nbsp;''Recent Advances in Romanian Language Technology, Editura Academiei''&nbsp;(1997): 35-56.<br />
* Frunza, Oana, Diana Inkpen, and David Nadeau. "A text processing tool for the romanian language." In Proc. of the EuroLAN 2005 Workshop on Cross-Language Knowledge Induction. 2005.<br />
<br />
===Grammars===<br />
<br />
* Irimia, Dumitru.&nbsp;''Gramatica limbii române''. Polirom, 1997.<br />
* Iordan, Iorgu, and Vladimir Robu.&nbsp;''Limba română contemporană''. Editura Didactică și Pedagogică, 1978.<br />
* Frâncu, Constantin. "Gramatica limbii române vechi (1521-1780)." (2009).<br />
* Romalo, Valeria Guţu, ed.&nbsp;''Gramatica limbii române''. Vol. 2. Editura Academiei Române, 2005.<br />
* Dimitriu, Corneliu.&nbsp;''Tratat de gramatică a limbii române''. Institutul European, 2002.<br />
* [http://limba-romana.ucoz.ro/index/gramatica_limbii_romane/0-17 http://limba-romana.ucoz.ro/index/gramatica_limbii_romane/0-17]<br />
* [http://gramaticalimbiiromane.ro/ http://gramaticalimbiiromane.ro/]<br />
<br />
===Morphology===<br />
<br />
* Iordan, Iorgu, Valeria Guţu Romalo, and Alexandru Niculescu.&nbsp;''Structura morfologică a limbii române contemporane''. Editura Sțiințifică, 1967.<br />
* Ciompec, G. "Limba română contemporană. Fonetică, fonologie, morfologie." (1985).<br />
* Pitiriciu, Silvia. "Prepoziția–aspecte de morfologie funcțională." Analele Universității de Vest din Timișoara. Seria Științe Filologice.<br />
<br />
===Miscellaneous===<br />
<br />
* Bateman, Nicoleta, and Maria Polinsky. "Romanian as a two-gender language." Hypothesis A/Hypothesis B (2010): 41-78.<br />
* [https://www.researchgate.net/profile/Dan_Tufis/publication/228608020_Automatic_diacritics_insertion_in_Romanian_texts/links/00b49524aa91ad629d000000.pdf Tufiş, Dan, and Adrian Chiţu. "Automatic diacritics insertion in Romanian texts." In ''Proceedings of the International Conference on Computational Lexicography'', Pecs, Hungary, pp. 185-194. 1999.]<br />
<br />
===Dictionaries===<br />
<br />
* [http://www.toatecuvintele.ro/ http://www.toatecuvintele.ro/]<br />
* [https://dexonline.ro/ https://dexonline.ro/], GPL, contains all the important monolingual dictionaries<br />
* [https://ro.wiktionary.org/wiki/Pagina_principal%C4%83 Romanian Wiktionary]<br />
<br />
===Corpora===<br />
<br />
* [http://ws.racai.ro:9191/repository/browse/romanian-balanced-corpus/8997021cfb6811e2a8ad00237df3e35848d354472f7b4f76985dcfbbaa26d04d/ Romanian Balanced Corpus (ROMBAC)]<br />
* RoWaC Corpus<br />
* [https://ro.wikipedia.org/wiki/Pagina_principal%C4%83 Romanian Wikipedia]<br />
* [http://corpora.heliohost.org/ http://corpora.heliohost.org/]<br />
* [http://mailman.uib.no/public/corpora/2007-July/004732.html http://mailman.uib.no/public/corpora/2007-July/004732.html]<br />
* [http://www.statmt.org/europarl/ http://www.statmt.org/europarl/]<br />
* [http://www.bibliaortodoxa.ro/ The Bible in Romanian]<br />
* [http://adevarul.ro/ The Daily Newspaper ][http://adevarul.ro/ Adevărul]<br />
<br />
<br />
<br />
<br />
[[Category:Romanian]]<br />
[[Category:Languages]]<br />
[[Category:Romance languages]]<br />
[[Category:Documentation in English]]</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Workplan&diff=67529
Romanian and Catalan/Workplan
2018-08-14T10:56:20Z
<p>Marcriera: </p>
<hr />
<div>You can find the detailed goals for each week [[User:Marcriera/Proposal2018#Workplan|here]].<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Work done<br />
! style="width: 6%" | Bidix<br />
! style="width: 15%" | WER / PER<br />
! style="width: 9%" | Coverage<br />
|-<br />
! Post-application period<br />
| style="text-align:center" | 28 March - 13 May<br />
|<br />
* Build frequency lists for Romanian and Catalan<br />
* Fix broken bidix entries<br />
* Improve testvoc scripts<br />
| style="text-align:center" | 12,819<br />
| style="text-align:center" | ron > cat (~36%)<br>cat > ron (~61%)<br />
| style="text-align:center" | ron (79%)<br>cat (78%)<br />
|-<br />
! 1<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Fix transfer rules that didn't work as expected<br />
| style="text-align:center" | 14,890 (~15,000)<br />
| style="text-align:center" | ron > cat ~33% (~34%)<br>cat > ron ~56% (~60%)<br />
| style="text-align:center" | ron 82.2% (80.1%)<br>cat 82.7% (79.1%)<br />
|-<br />
! 2<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries (adj clean)<br />
* Rewrite ron-cat transfer rules to use chunking<br />
* Improve freqlist generation script (explains the leap in coverage)<br />
| style="text-align:center" | 17,026 (~17,000)<br />
| style="text-align:center" | ron > cat ~31% (~32%)<br>cat > ron ~53% (~59%)<br />
| style="text-align:center" | ron 83.6% (81.1%)<br>cat 86.2% (80%)<br />
|-<br />
! 3<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Plan transfer rule changes<br />
| style="text-align:center" | 18,487 (~19,000)<br />
| style="text-align:center" | ron > cat ~31% (~30%)<br>cat > ron ~53% (~58%)<br />
| style="text-align:center" | ron 84.9% (81.9%)<br>cat 86.4% (80.9%)<br />
|-<br />
! 4<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Upgrade transfer rules and write rules for new patterns<br />
* Add CG to Romanian to improve disambiguation<br />
| style="text-align:center" | 20,324 (~21,000)<br />
| style="text-align:center" | ron > cat ~31% (~28%)<br>cat > ron ~53% (~57%)<br />
| style="text-align:center" | ron 85.4% (82.7%)<br>cat 86.8% (81.7%)<br />
|-<br />
! 5<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Upgrade transfer rules and write rules for new patterns<br />
* Add more Romanian CG rules<br />
* Improve evaluation of test texts with diff files<br />
'''First evaluation'''<br />
| style="text-align:center" | 22,381 (~23,000)<br />
| style="text-align:center" | ron > cat ~30% (~26%)<br>cat > ron ~53% (~56%)<br />
| style="text-align:center" | ron 86.1% (83.4%)<br>cat 87.4% (82.4%)<br />
|-<br />
! 6<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Write new transfer rules<br />
| style="text-align:center" | 22,640 (~25,000)<br />
| style="text-align:center" | ron > cat ~30% (~25%)<br>cat > ron ~51% (~53%)<br />
| style="text-align:center" | ron 86.4% (84.1%)<br>cat 88.2% (83%)<br />
|-<br />
! 7<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
* Upgrade Indonesian-Malaysian pair to monolingual package system (testvoc clean)<br />
* Upgrade Welsh-English pair to monolingual package system<br />
* Testvoc fixes (Romanian-Catalan and Welsh-English)<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 8<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
* Testvoc fixes (Romanian-Catalan and Welsh-English) <br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 9<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
* Upgrade Catalan-Italian pair to monolingual package system (testvoc clean)<br />
* Testvoc fixes (Romanian-Catalan)<br />
'''Second evaluation'''<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 10<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
* Upgrade Afrikaans-Dutch pair to monolingual package system (testvoc clean)<br />
* Testvoc fixes (Romanian-Catalan and Welsh-English)<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 11<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Write new transfer rules<br />
* Add Romanian CG rules<br />
| style="text-align:center" | 22,995 (~27,000)<br />
| style="text-align:center" | ron > cat ~29% (~24%)<br>cat > ron ~46% (~50%)<br />
| style="text-align:center" | ron 86.8% (84.7%)<br>cat 88.7% (83.6%)<br />
|-<br />
! 12<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
* Fix broken bidix entries<br />
* Write new transfer rules<br />
* Add Romanian CG rules<br />
| style="text-align:center" | 23,009 (~29,000)<br />
| style="text-align:center" | ron > cat ~29% (~23%)<br>cat > ron ~46% (~47%)<br />
| style="text-align:center" | ron 86.8% (85.3%)<br>cat 88.7% (84.2%)<br />
|-<br />
! 13<br />
| style="text-align:center" | 6 August - 14 August<br />
|<br />
* Fix broken bidix entries<br />
* Write new transfer rules<br />
'''Final evaluation'''<br />
| style="text-align:center" | 23,015 (~31,000)<br />
| style="text-align:center" | ron > cat ~29% (~22%)<br>cat > ron ~46% (~45%)<br />
| style="text-align:center" | ron 86.8% (85.8%)<br>cat 88.7% (84.7%)<br />
|}</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/GSOC_2018&diff=67528
Romanian and Catalan/GSOC 2018
2018-08-14T10:48:04Z
<p>Marcriera: </p>
<hr />
<div>This page serves as a summary of all the work done in the Romanian-Catalan pair during Google Summer of Code 2018. It also includes information on the upgrade of four language pairs which was carried out during the same period. For a more detailed workplan of the project, please check [[Romanian_and_Catalan/Workplan|this page]].<br />
<br />
==Main work==<br />
<br />
The main goal of this project was to upgrade the Romanian-Catalan language pair to the monolingual package system and develop it to bring it to release quality. In addition, four other language pairs have been upgraded to the monolingual package system to ease future development. As such, the commits are distributed into many repositories, which are listed below:<br />
<br />
====Romanian-Catalan pair====<br />
<br />
* Romanian: [https://github.com/apertium/apertium-ron/commits?author=MarcRiera apertium-ron]<br />
* Catalan: [https://github.com/apertium/apertium-cat/commits?author=MarcRiera apertium-cat] (only commits after March 2018 are to be considered as part of the project)<br />
* Romanian-Catalan: [https://github.com/apertium/apertium-ron-cat/commits?author=MarcRiera apertium-ron-cat]<br />
<br />
====Other pairs====<br />
<br />
* Indonesian-Malay: [https://github.com/apertium/apertium-ind-zlm/pull/2 apertium-ind-zlm] [https://github.com/apertium/apertium-ind/pull/1 apertium-ind] [https://github.com/apertium/apertium-zlm/pull/1 apertium-zlm]<br />
* Welsh-English: [https://github.com/apertium/apertium-cym-eng/pull/4 apertium-cym-eng] [https://github.com/apertium/apertium-cym/pull/1 apertium-cym]<br />
* Catalan-Italian: [https://github.com/apertium/apertium-cat-ita/pull/3 apertium-cat-ita]<br />
* Afrikaans-Dutch: [https://github.com/apertium/apertium-afr-nld/pull/6 apertium-afr-nld] [https://github.com/apertium/apertium-afr/pull/1 apertium-afr] [https://github.com/apertium/apertium-nld/pull/1 apertium-nld]<br />
<br />
===Dictionaries===<br />
<br />
The bilingual dictionary, initially comprising 12,819 entries, has been expanded to 23,015 entries, obtained from crossdics with Romanian-Spanish and frequency lists. While this is substantially lower than the initial goal of 31,000 entries, a great part of the existing entries were not working due to a lack of gender tags, incorrect direction tags in monolingual dictionaries, or a lack of corresponding entries in the monolingual dictionaries. As such, fixing these existing entries has been given priority, and in the end it has proven to be fruitful, with Wikipedia coverage surpassing expectations (86.8% for Romanian and 88.7% for Catalan).<br />
<br />
The Romanian monolingual dictionary, given its limited development in Apertium, contained many inaccuracies and errors which have been fixed as part of the project. The Catalan side, on the other hand, has not needed major changes due to extensive development and consensus between developers.<br />
<br />
Despite the existence a few dictionary-related testvoc errors in both directions which prevent the immediate release of the pair at the end of GSoC, they are easy to fix and the pair should be clean with a few more days of work.<br />
<br />
===Transfer rules===<br />
<br />
Before this project, the pair contained many transfer rules for the Romanian → Catalan direction, with most of them being ported from the Romanian-Spanish pair. These were single-step transfer rules which have been kept in the pair as chunking rules after being partially rewritten. In addition, basic interchunk rules have been added, allowing gender and number match between noun/pronoun and verb chunks. This has reduced the PER in this direction from 36% to 29%.<br />
<br />
The other direction (Catalan → Romanian) did not have any previous transfer rules and has been written from scratch. Most of the work has been done in the chunking stage, but there are basic interchunk rules too. Also, due to the way gender is handled in Romanian, a fourth step has been added to transfer to generate adjectives and determiners correctly. In this direction, the PER has been reduced from 61% to 46%.<br />
<br />
Given the effort needed to develop transfer rules and the tight schedule, it has not been possible to reduce PER as much as desired. However, the results are good enough to allow for a release of the pair in both directions, and the current transfer rules will ease future development.<br />
<br />
===Other language pair upgrades===<br />
<br />
In addition to the main work on the Romanian-Catalan pair, four other pairs (Indonesian-Malay, Welsh-English, Catalan-Italian and Afrikaans-Dutch) have been upgraded to the monolingual package system. All four pairs had been released in the past, but they relied on embedded monolingual data that made future development tedious and difficult. The upgrade has included the separation of the monolingual data (the upgrade itself) and the fix of all testvoc errors which had emerged after the change. In the case of Welsh-English, which had been published with errors, it has been impossible to fix all of them due to a lack of knowledge of the Welsh language, but the amount of errors has been reduced nonetheless.<br />
<br />
==Supporting work==<br />
<br />
===Automatic evaluation===<br />
<br />
The development of transfer rules has required the adoption of an effective way to constantly evaluate translations and see the differences in results between versions. This has been possible thanks to the work of Xavi Ivars (mentor) and Jaume Ortolà (Apertium contributor), who have included the pair in an automatic diff system powered by Softcatalà.<br />
<br />
Inspired by this idea, a local system of equal functionality was written for more intensive, personal use. The evaluations have greatly helped to detect errors and flaws in the pair and tweak the transfer rules for better results.<br />
<br />
===Documentation===<br />
<br />
As the pair had very little documentation (limited to a few inline comments in transfer rule files and dictionaries), documentation has also been taken into consideration during development. The following wiki pages have been created or modified to reflect information on the pair, and will be updated in the future with more useful data:<br />
<br />
* [[Romanian and Catalan]]<br />
* [[Romanian and Catalan/Transfer Rules]]<br />
<br />
==Future work==<br />
<br />
===Work needed for release===<br />
<br />
* Clean up testvoc errors<br />
<br />
===Other plans===<br />
<br />
* Finish the rewrite of old T1X rules for Romanian → Catalan<br />
* Expand transfer rules to handle more complex structures, specially verb-pronoun structures<br />
<br />
==Final words==<br />
<br />
Working with Apertium during Google Summer of Code 2018 for a second time has been a great experience that has allowed me to get to know Apertium even better and feel more confident to keep contributing in the future. The experience gained from the first time has allowed me to embark on more complex challenges and find the best solutions to solve problems. Hopefully, the good results of the Romanian-Catalan pair will make Apertium a good and reliable alternative to commercial machine translation systems between these two languages. I also hope that the upgrade of the other four pairs will encourage other developers to work on these pairs in the future.<br />
<br />
Last but not least, I would like to thank not only my mentors (Xavi Ivars and Hèctor Alòs) but also other Apertium contributors and friends such as Fran Tyers and Jaume Ortolà, who have all provided me with invaluable feedback and help.</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian&diff=67527
Romanian
2018-08-14T10:16:20Z
<p>Marcriera: </p>
<hr />
<div>'''Romanian''' ([[Wikipedia:Romanian language]]) is a [[Romance language]]. It is spoken primarily in Romania and Moldova.<br />
<br />
In [[Apertium]], there is a stand-alone Romanian language module (in '''[http://svn.code.sf.net/p/apertium/svn/languages/apertium-ron/ languages/apertium-ron]'''), and the following are the language pairs involving Romanian (see [[list of language pairs]]):<br />
<br />
* '''[[apertium-es-ro|Spanish-Romanian]]''' (trunk) <br />
* [[apertium-ron-cat|Romanian-Catalan]] (nursery) <br />
* [[apertium-ro-it|Romanian-Italian]] (nursery)<br />
* ''[[apertium-fra-ron|French-Romanian]]'' (incubator)<br />
* ''[[apertium-ron-ina|Romanian-Interlingua]]'' (incubator)<br />
* ''[[apertium-ron-rup|Romanian-Aromanian]]'' (incubator)<br />
<br />
<br />
== Materials ==<br />
<br />
<br />
===General===<br />
<br />
* Sala, Marius. "Limba română, limbă romanică."&nbsp;''Romance Philology''&nbsp;(2011).<br />
* Niculescu, Alexandru.&nbsp;''Outline history of the Romanian language''. Editura Științifică și Enciclopedică, 1981.<br />
* Todi, Aida.&nbsp;''Istoria limbii române''. Editura Universitară, 2008.<br />
<br />
===Comparative Studies===<br />
<br />
* [http://dl.acm.org/citation.cfm?id=1596842 Girju, Roxana. "The syntax and semantics of prepositions in the task of automatic interpretation of nominal phrases and compounds: A cross-linguistic study." ''Computational Linguistics'' 35, no. 2 (2009): 185-228.]<br />
* [https://books.google.ro/books?hl=en&lr=&id=UKmQESh-VCcC&oi=fnd&pg=PR7&dq=Romanian+&ots=QTCkN55NFz&sig=2FrQE5Y_zfAjsGThuepTrcfkFII&redir_esc=y#v=onepage&q=Romanian&f=false Dobrovie-Sorin, Carmen. ''The syntax of Romanian: Comparative studies in Romance''. Vol. 40. Walter de Gruyter, 1994.]<br />
* Niculescu, Alexandru.&nbsp;''Individualitatea limbii române între limbile romanice''. Vol. 3. Editura Științifică și Enciclopedică, 1999.<br />
<br />
===Machine Translation Tools and Techniques===<br />
<br />
* [http://mt-archive.info/IIS-2009-Irimia.pdf Irimia, Elena. "EBMT experiments for the English-Romanian language pair." ''Recent Advances in Intelligent Information Systems'', ISBN (2009): 978-83.]<br />
* [http://www.racai.ro/media/Tufis-Mason-LREC1998.pdf Tufis, Dan, and Oliver Mason. "Tagging romanian texts: a case study for qtag, a language independent probabilistic tagger." In ''Proceedings of the First International Conference on Language Resources and Evaluation (LREC)'', vol. 1, pp. 589-596. 1998.]<br />
* [http://link.springer.com/chapter/10.1007/3-540-36456-0_26 Şerban, Gabriela, and Doina Tătar. "Word sense disambiguation for untagged corpus: application to Romanian language." In ''Computational Linguistics and Intelligent Text Processing'', pp. 268-272. Springer Berlin Heidelberg, 2003.]<br />
* [http://www.aclweb.org/anthology/W/W08/W08-19.pdf#page=65 Curteanu, Neculai, Alex Moruz, and Diana Trandabăţ. "Extracting Sense Trees from the Romanian Thesaurus by Sense Segmentation & Dependency Parsing." In ''Proceedings of the workshop on Cognitive Aspects of the Lexicon'', pp. 55-63. Association for Computational Linguistics, 2008.]<br />
* [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.149.3908&rep=rep1&type=pdf Cristea, Dan, Marius Răschip, Corina Forăscu, Gabriela Haja, Cristina Florescu, Bogdan Aldea, and Elena Dănilă. "The Digital Form of the Thesaurus Dictionary of the Romanian Language." In ''Proceedings of the 4th International IEEE Conference SpeD'', pp. 195-206. 2007.]<br />
* [http://digital.library.unt.edu/ark:/67531/metadc31007/m2/1/high_res_d/Mihalcea-2008-Multilingual_Subjectivity_Analysis_Using.pdf Banea, Carmen, Rada Mihalcea, Janyce Wiebe, and Samer Hassan. "Multilingual subjectivity analysis using machine translation." In ''Proceedings of the Conference on Empirical Methods in Natural Language Processing'', pp. 127-135. Association for Computational Linguistics, 2008.]<br />
* [http://link.springer.com/chapter/10.1007/978-3-642-40495-5_27 Colhon, Mihaela. "Automatic Lexical Alignment between Syntactically Weak Related Languages. Application for English and Romanian." In ''Computational Collective Intelligence. Technologies and Applications'', pp. 266-275. Springer Berlin Heidelberg, 2013.]<br />
* Tufiş, D., A. M. Barbu, V. Pătraşcu, G. Rotariu, and C. Popescu. "Corpora and Corpus-Based Morpho-Lexical Processing."&nbsp;''Recent Advances in Romanian Language Technology, Editura Academiei''&nbsp;(1997): 35-56.<br />
* Frunza, Oana, Diana Inkpen, and David Nadeau. "A text processing tool for the romanian language." In Proc. of the EuroLAN 2005 Workshop on Cross-Language Knowledge Induction. 2005.<br />
<br />
===Grammars===<br />
<br />
* Irimia, Dumitru.&nbsp;''Gramatica limbii române''. Polirom, 1997.<br />
* Iordan, Iorgu, and Vladimir Robu.&nbsp;''Limba română contemporană''. Editura Didactică și Pedagogică, 1978.<br />
* Frâncu, Constantin. "Gramatica limbii române vechi (1521-1780)." (2009).<br />
* Romalo, Valeria Guţu, ed.&nbsp;''Gramatica limbii române''. Vol. 2. Editura Academiei Române, 2005.<br />
* Dimitriu, Corneliu.&nbsp;''Tratat de gramatică a limbii române''. Institutul European, 2002.<br />
* [http://limba-romana.ucoz.ro/index/gramatica_limbii_romane/0-17 http://limba-romana.ucoz.ro/index/gramatica_limbii_romane/0-17]<br />
* [http://gramaticalimbiiromane.ro/ http://gramaticalimbiiromane.ro/]<br />
<br />
===Morphology===<br />
<br />
* Iordan, Iorgu, Valeria Guţu Romalo, and Alexandru Niculescu.&nbsp;''Structura morfologică a limbii române contemporane''. Editura Sțiințifică, 1967.<br />
* Ciompec, G. "Limba română contemporană. Fonetică, fonologie, morfologie." (1985).<br />
* Pitiriciu, Silvia. "Prepoziția–aspecte de morfologie funcțională." Analele Universității de Vest din Timișoara. Seria Științe Filologice.<br />
<br />
===Miscellaneous===<br />
<br />
* Bateman, Nicoleta, and Maria Polinsky. "Romanian as a two-gender language." Hypothesis A/Hypothesis B (2010): 41-78.<br />
* [https://www.researchgate.net/profile/Dan_Tufis/publication/228608020_Automatic_diacritics_insertion_in_Romanian_texts/links/00b49524aa91ad629d000000.pdf Tufiş, Dan, and Adrian Chiţu. "Automatic diacritics insertion in Romanian texts." In ''Proceedings of the International Conference on Computational Lexicography'', Pecs, Hungary, pp. 185-194. 1999.]<br />
<br />
===Dictionaries===<br />
<br />
* [http://www.toatecuvintele.ro/ http://www.toatecuvintele.ro/]<br />
* [https://dexonline.ro/ https://dexonline.ro/], GPL, contains all the important monolingual dictionaries<br />
* [https://ro.wiktionary.org/wiki/Pagina_principal%C4%83 Romanian Wiktionary]<br />
<br />
===Corpora===<br />
<br />
* [http://ws.racai.ro:9191/repository/browse/romanian-balanced-corpus/8997021cfb6811e2a8ad00237df3e35848d354472f7b4f76985dcfbbaa26d04d/ Romanian Balanced Corpus (ROMBAC)]<br />
* RoWaC Corpus<br />
* [https://ro.wikipedia.org/wiki/Pagina_principal%C4%83 Romanian Wikipedia]<br />
* [http://corpora.heliohost.org/ http://corpora.heliohost.org/]<br />
* [http://mailman.uib.no/public/corpora/2007-July/004732.html http://mailman.uib.no/public/corpora/2007-July/004732.html]<br />
* [http://www.statmt.org/europarl/ http://www.statmt.org/europarl/]<br />
* [http://www.bibliaortodoxa.ro/ The Bible in Romanian]<br />
* [http://adevarul.ro/ The Daily Newspaper ][http://adevarul.ro/ Adevărul]<br />
<br />
<br />
<br />
<br />
[[Category:Romanian]]<br />
[[Category:Languages]]</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/GSOC_2018&diff=67526
Romanian and Catalan/GSOC 2018
2018-08-14T09:55:23Z
<p>Marcriera: </p>
<hr />
<div>This page serves as a summary of all the work done in the Romanian-Catalan pair during Google Summer of Code 2018. It also includes information on the upgrade of four language pairs which was carried out during the same period. For a more detailed workplan of the project, please check [[Romanian_and_Catalan/Workplan|this page]].<br />
<br />
==Main work==<br />
<br />
The main goal of this project was to upgrade the Romanian-Catalan language pair to the monolingual package system and develop it to bring it to release quality. In addition, four other language pairs have been upgraded to the monolingual package system to ease future development. As such, the commits are distributed into many repositories, which are listed below:<br />
<br />
====Romanian-Catalan pair====<br />
<br />
* Romanian: [https://github.com/apertium/apertium-ron/commits?author=MarcRiera apertium-ron]<br />
* Catalan: [https://github.com/apertium/apertium-cat/commits?author=MarcRiera apertium-cat] (only commits after March 2018 are to be considered as part of the project)<br />
* Romanian-Catalan: [https://github.com/apertium/apertium-ron-cat/commits?author=MarcRiera apertium-ron-cat]<br />
<br />
====Other pairs====<br />
<br />
* Indonesian-Malay: [https://github.com/apertium/apertium-ind-zlm/pull/2 apertium-ind-zlm] [https://github.com/apertium/apertium-ind/pull/1 apertium-ind] [https://github.com/apertium/apertium-zlm/pull/1 apertium-zlm]<br />
* Welsh-English: [https://github.com/apertium/apertium-cym-eng/pull/4 apertium-cym-eng] [https://github.com/apertium/apertium-cym/pull/1 apertium-cym]<br />
* Catalan-Italian: [https://github.com/apertium/apertium-cat-ita/pull/3 apertium-cat-ita]<br />
* Afrikaans-Dutch: [https://github.com/apertium/apertium-afr-nld/pull/6 apertium-afr-nld] [https://github.com/apertium/apertium-afr/pull/1 apertium-afr] [https://github.com/apertium/apertium-nld/pull/1 apertium-nld]<br />
<br />
===Dictionaries===<br />
<br />
The bilingual dictionary, initially comprising 12,819 entries, has been expanded to 23,015 entries, obtained from crossdics with Romanian-Spanish and frequency lists. While this is substantially lower than the initial goal of 31,000 entries, a great part of the existing entries were not working due to a lack of gender tags, incorrect direction tags in monolingual dictionaries, or a lack of corresponding entries in the monolingual dictionaries. As such, fixing these existing entries has been given priority, and in the end it has proven to be fruitful, with Wikipedia coverage surpassing expectations (86.8% for Romanian and 88.7% for Catalan).<br />
<br />
The Romanian monolingual dictionary, given its limited development in Apertium, contained many inaccuracies and errors which have been fixed as part of the project. The Catalan side, on the other hand, has not needed major changes due to extensive development and consensus between developers.<br />
<br />
Despite the existence a few dictionary-related testvoc errors in both directions which prevent the immediate release of the pair at the end of GSoC, they are easy to fix and the pair should be clean with a few more days of work.<br />
<br />
===Transfer rules===<br />
<br />
Before this project, the pair contained many transfer rules for the Romanian → Catalan direction, with most of them being ported from the Romanian-Spanish pair. These were single-step transfer rules which have been kept in the pair as chunking rules after being partially rewritten. In addition, basic interchunk rules have been added, allowing gender and number match between noun/pronoun and verb chunks. This has reduced the PER in this direction from 36% to 29%.<br />
<br />
The other direction (Catalan → Romanian) did not have any previous transfer rules and has been written from scratch. Most of the work has been done in the chunking stage, but there are basic interchunk rules too. Also, due to the way gender is handled in Romanian, a fourth step has been added to transfer to generate adjectives and determiners correctly. In this direction, the PER has been reduced from 61% to 46%.<br />
<br />
Given the effort needed to develop transfer rules and the tight schedule, it has not been possible to reduce PER as much as desired. However, the results are good enough to allow for a release of the pair in both directions, and the current transfer rules will ease future development.<br />
<br />
===Other language pair upgrades===<br />
<br />
<br />
<br />
==Supporting work==<br />
<br />
<br />
<br />
===Documentation===<br />
<br />
<br />
<br />
==Future work==<br />
<br />
===Work needed for release===<br />
<br />
* Clean up testvoc errors<br />
<br />
===Other plans===<br />
<br />
* Finish the rewrite of old T1X rules for Romanian → Catalan<br />
* Expand transfer rules to handle more complex structures, specially verb-pronoun structures<br />
<br />
==Final words==</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/GSOC_2018&diff=67503
Romanian and Catalan/GSOC 2018
2018-08-13T12:38:53Z
<p>Marcriera: Created page with "This page serves as a summary of all the work done in the Romanian-Catalan pair during Google Summer of Code 2018. It also includes information on the upgrade of four language..."</p>
<hr />
<div>This page serves as a summary of all the work done in the Romanian-Catalan pair during Google Summer of Code 2018. It also includes information on the upgrade of four language pairs which was carried out during the same period. For a more detailed workplan of the project, please check [[Romanian_and_Catalan/Workplan|this page]].<br />
<br />
==Main work==<br />
<br />
<br />
<br />
===Dictionaries===<br />
<br />
<br />
<br />
===Transfer rules===<br />
<br />
<br />
<br />
==Supporting work==<br />
<br />
<br />
<br />
===Documentation===<br />
<br />
<br />
<br />
==Future work==<br />
<br />
===Work needed for release===<br />
<br />
<br />
<br />
===Other plans===<br />
<br />
<br />
<br />
==Final words==</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Language_and_pair_maintainer&diff=67500
Language and pair maintainer
2018-08-13T11:43:54Z
<p>Marcriera: /* Unreleased pairs with a maintainer */</p>
<hr />
<div>Feel free to add your name, if you are not listed.<br />
<br />
== Released pairs with or without a maintainer ==<br />
<br />
Here is a list of current language pair (linguistic package) maintainers.<br />
<br />
{|class=wikitable<br />
! Pair name !! First release !! Languages !! Maintainer(s) !! Release<br />
|-<br />
|<code>[[apertium-es-ca]] </code>||align=center| 2005 || Spanish ⇆ Catalan || || {{release-tc|1.0.5}}<br />
|-<br />
|<code>[[apertium-es-gl]] </code>||align=center| 2005 || Spanish ⇆ Galician || [[User:Antonmoura|Antón Moura]], [[User:Oscarsenra|Óscar Senra]] || {{release-tc|1.0.6}}<br />
|-<br />
|<code>[[apertium-es-pt]] </code>||align=center| 2005 || Spanish ⇆ Portuguese || || {{release-tc|1.0.3}}<br />
|-<br />
|colspan=5|<br />
|-<br />
|<code>[[apertium-en-ca]] </code>||align=center| 2006 || English ⇆ Catalan || || {{release-tc|0.8.4}}<br />
|-<br />
|<code>[[apertium-oc-ca]] </code>||align=center| 2006 || Occitan ⇆ Catalan || [[User:Gramirez|Gramirez]] || {{release-tc|1.0.5}}<br />
|-<br />
|<code>[[apertium-fr-ca]] </code>||align=center| 2006 || French ⇆ Catalan || [[User:Gramirez|Gramirez]] || {{release-tc|1.0.0}}<br />
|-<br />
|colspan=5|<br />
|-<br />
|<code>[[apertium-es-ro]] </code>||align=center| 2007 || Spanish ← Romanian || || {{release-tc|0.7.1}}<br />
|-<br />
|<code>[[apertium-fr-es]] </code>||align=center| 2007 || French ⇆ Spanish || || {{release-tc|0.8.0}}<br />
|-<br />
|colspan=5|<br />
|-<br />
|<code>[[apertium-cy-en]] </code>||align=center| 2008 || Welsh ⇆ English || [[User:Donnek|Donnek]], [[User:Francis Tyers|Francis Tyers]] || {{release-tc|0.1.0}}<br />
|-<br />
|<code>[[apertium-en-es]] </code>||align=center| 2008 || English ⇆ Spanish || || {{release-tc|0.6.0}}<br />
|-<br />
|<code>[[apertium-eo-ca]] </code>||align=center| 2008 || Esperanto ← Catalan || [[User:Hectoralos|Hèctor Alòs]] || {{release-tc|0.9.0}}<br />
|-<br />
|<code>[[apertium-eo-es]] </code>||align=center| 2008 || Esperanto ← Spanish || [[User:Hectoralos|Hèctor Alòs]] || {{release-tc|0.9.0}}<br />
|-<br />
|<code>[[apertium-eo-en]] </code>||align=center| 2008 || Esperanto ⇆ English || [[User:Jacob_Nordfalk|Jacob Nordfalk]] || {{release-tc|1.0.2}}<br />
|-<br />
|<code>[[apertium-eu-es]] </code>||align=center| 2008 || Basque ⇆ Spanish || || {{release-tc|0.3.1}}<br />
|-<br />
|<code>[[apertium-oc-es]] </code>||align=center| 2008 || Occitan ⇆ Spanish || [[User:Gramirez|Gramirez]] || {{release-tc|1.0.5}}<br />
|-<br />
|<code>[[apertium-pt-ca]] </code>||align=center| 2008 || Portuguese ⇆ Catalan || || {{release-tc|0.8.1}}<br />
|-<br />
|<code>[[apertium-en-gl]] </code>||align=center| 2008 || English ⇆ Galician || [[User:Antonmoura|Antón Moura]], [[User:Oscarsenra|Óscar Senra]] || {{release-tc|0.5.1}}<br />
|-<br />
|<code>[[apertium-pt-gl]] </code>||align=center| 2008 || Portuguese ⇆ Galician || [[User:Antonmoura|Antón Moura]], [[User:Oscarsenra|Óscar Senra]] || {{release-tc|0.9.1}}<br />
|-<br />
|colspan=5|<br />
|-<br />
|<code>[[apertium-nn-nb]] </code>||align=center| 2009 || Norwegian Nynorsk ⇆ Norwegian Bokmål || [[User:Unhammer|Kevin Brubeck Unhammer]], [[User:Trondtr|Trond Trosterud]] ||{{release-tc|0.6.5}}<br />
|-<br />
|<code>[[apertium-sv-da]] </code>||align=center| 2009 || Swedish → Danish || [[User:mkrist|Michael Kristensen]], [[User:Jacob_Nordfalk|Jacob Nordfalk]] , [[User:tunedal|Per Tunedal]] || {{release-tc|0.6.0}}<br />
|-<br />
|<code>[[apertium-br-fr]] </code>||align=center| 2009 || Breton → French || [[User:Fulup|Fulup]], [[User:Francis Tyers|Francis Tyers]] || {{release-tc|0.4.0}}<br />
|-<br />
|<code>[[apertium-es-ast]] </code>||align=center| 2009 || Spanish → Asturian || || {{release-tc|1.0.0}}<br />
|-<br />
|colspan=5|<br />
|-<br />
|<code>[[apertium-is-en]] </code>||align=center| 2010 || Icelandic → English || [[User:Hrafn|Hrafn]], [[User:Martha|Martha]] ||{{release-tc|0.1.0}}<br />
|-<br />
|<code>[[apertium-mk-bg]] </code>||align=center| 2010 || Macedonian ⇆ Bulgarian || [[User:Tihomir|Tihomir Rangelov]] ||{{release-tc|0.2.0}}<br />
|-<br />
|<code>[[apertium-mk-en]] </code>||align=center| 2010 || Macedonian ⇆ English || [[User:Francis Tyers|Francis Tyers]] ||{{release-tc|0.1.0}}<br />
|-<br />
|colspan=5|<br />
|-<br />
|<code>[[apertium-ca-it]] </code>||align=center| 2011 || Catalan ← Italian || ||{{release-tc|0.1.0}}<br />
|-<br />
|<code>[[apertium-eo-fr]] </code>||align=center| 2011 || Esperanto ← French<br>Esperanto → French || [[User:Hectoralos|Hèctor Alòs]]<br>[[User:Bech|Bernard Chardonneau]] ||{{release-tc|0.9.0}}<br />
|-<br />
|<code>[[apertium-spa-arg]] </code>||align=center| 2011 || Spanish ⇆ Aragonese || [[User:Juanpabl|Juan Pablo Martínez]] ||{{release-tc|0.3.0}}<br />
|-<br />
|colspan=5|<br />
|-<br />
|<code>[[apertium-sme-nob]] </code>||align=center| 2012 || Northern Sámi → Norwegian Bokmål || [[User:Unhammer|Kevin Brubeck Unhammer]], [[User:Trondtr|Trond Trosterud]], Lene Antonsen, … || {{release-tc|0.5.0}}<br />
|-<br />
|colspan=5|<br />
|-<br />
|<code>[[apertium-kaz-tat]] </code>||align=center| 2013 || Kazakh ⇆ Tatar || [[User:ilnar.salimzyan|Ilnar Salimzyan]]||{{release-tc|0.2.1}}<br />
|-<br />
|<code>[[apertium-arg-cat]] </code>||align=center| 2015 || Aragonese ⇆ Catalan || [[User:Juanpabl|Juan Pablo Martínez]] ||{{release-tc|0.1.0}}<br />
|-<br />
|<code>[[apertium-srd-ita]] </code>||align=center| 2016 || Sardinian ← Italian || [[User:Hectoralos|Hèctor Alòs]]||{{release-tc|0.9.5}}<br />
|-<br />
|<code>[[apertium-crh-tur]] </code>||align=center| 2017 || Crimean Tatar → Turkish || ||<br />
|-<br />
|<code>[[apertium-cat-srd]] </code>||align=center| 2017 || Sardinian ← Catalan || [[User:Hectoralos|Hèctor Alòs]]||{{release-tc|1.0.0}} <br />
|}<br />
<br />
== Unreleased pairs with a maintainer ==<br />
<br />
{|class=wikitable<br />
! Pair name !! Languages !! Maintainer<br />
|-<br />
|<code>[[apertium-bn-en]] </code>|| Bengali ⇆ English || [[User:darthxaher|Abu Zaher Md. Faridee]] <br />
|-<br />
|<code>[[apertium-sme-fin]] </code>|| Northern Sámi ← Finnish || [[User:pyry|Ryan Johnson]], [[User:Trondtr|Trond Trosterud]]<br />
|-<br />
|<code>[[apertium-tg-fa]]</code>|| Tajik ⇆ Persian || [[User:Francis Tyers|Francis Tyers]]<br />
|-<br />
|<code>[[apertium-en-af]] </code>|| English ⇆ Afrikaans || [[User:Francis Tyers|Francis Tyers]]<br />
|-<br />
|<code>[[apertium-en-pl]] </code>|| English ⇆ Polish || [[User:Jimregan|Jimregan]]<br />
|-<br />
|<code>[[apertium-tat-rus]] </code>|| Tatar → Russian || [[User:ilnar.salimzyan|Ilnar Salimzyan]]<br />
|-<br />
|<code>[[apertium-en-fr]] </code>|| English ⇆ French || [[User:Dom|Dom]]<br />
|-<br />
|<code>[[apertium-eng-cat]] </code>|| English ⇆ Catalan || [[User:Marcriera|Marc Riera]]<br />
|-<br />
|<code>[[apertium-ron-cat]] </code>|| Romanian ⇆ Catalan || [[User:Marcriera|Marc Riera]]<br />
|}<br />
<br />
==Matrix==<br />
<br />
{|class=wikitable<br />
|-<br />
! From/To !! ES !! CA !! GL !! AST !! PT !! OC !! FR !! RO !! IT !! AN !! !! EN !! AF !! NN !! NB !! DA !! SV !! IS !! !! CY !! BR !! !! MK !! BG !! SH !! PL !! !! FA !! TG !! !! BN !! !! FIN !! SME !! !! EU !! !! EO<br />
|-<br />
| ES || {{LangPair|}} ||X||X||X||X||X||X||X||D||X|| ||X|| || || || || || || || || || || || || || || || || || || || || || || ||X|| ||X<br />
|-<br />
| CA ||X|| {{LangPair|}} ||X|| || ||X||X||D|| ||X|| ||X|| || || || || || || || || || || || || || || || || || || || || || || || || ||X<br />
|-<br />
| GL ||X|| || {{LangPair|}} || ||X|| || || || || || ||X|| || || || || || || || || || || || || || || || || || || || || || || || || || <br />
|-<br />
| AST|| || || || {{LangPair|}} || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || <br />
|-<br />
| PT ||X||X||X|| || {{LangPair|}} || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || <br />
|-<br />
| OC ||X||X|| || || || {{LangPair|}} || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || <br />
|-<br />
| FR ||X||X|| || || || || {{LangPair|}} || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || ||X<br />
|-<br />
| RO ||X||D|| || || || || || {{LangPair|}} || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || <br />
|-<br />
| IT ||D||X|| || || || || || || {{LangPair|}} || || || || || || || || || || || || || || || || || || || || || || || || || || || || || <br />
|-<br />
| AN ||X||X|| || || || || || || || {{LangPair|}} || || || || || || || || || || || || || || || || || || || || || || || || || || || || <br />
|-<br />
| || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || ||<br />
|-<br />
| EN ||X||X||X|| || || || || || || || || {{LangPair|}} ||D|| || || || || || ||X|| || ||X|| || ||D|| || || || ||D|| || || || || || ||X<br />
|-<br />
| AF || || || || || || || || || || || ||D|| {{LangPair|}} || || || || || || || || || || || || || || || || || || || || || || || || || <br />
|-<br />
| NN || || || || || || || || || || || || || || {{LangPair|}} ||X|| || || || || || || || || || || || || || || || || || || || || || || <br />
|-<br />
| NB || || || || || || || || || || || || || ||X|| {{LangPair|}} || || || || || || || || || || || || || || || || || || || || || || || <br />
|-<br />
| DA || || || || || || || || || || || || || || || || {{LangPair|}} || || || || || || || || || || || || || || || || || || || || || || <br />
|-<br />
| SV || || || || || || || || || || || || || || || ||X|| {{LangPair|}} || || || || || || || || || || || || || || || || || || || || || <br />
|-<br />
| IS || || || || || || || || || || || ||X|| || || || || || {{LangPair|}} || || || || || || || || || || || || || || || || || || || || <br />
|-<br />
! From/To !! ES !! CA !! GL !! AST !! PT !! OC !! FR !! RO !! IT !! AN !! !! EN !! AF !! NN !! NB !! DA !! SV !! IS !! !! CY !! BR !! !! MK !! BG !! SH !! PL !! !! FA !! TG !! !! BN !! !! FIN !! SME !! !! EU !! !! EO<br />
|-<br />
| CY || || || || || || || || || || || ||X|| || || || || || || || {{LangPair|}} || || || || || || || || || || || || || || || || || ||<br />
|-<br />
| BR || || || || || || ||X|| || || || || || || || || || || || || || {{LangPair|}} || || || || || || || || || || || || || || || || ||<br />
|-<br />
| || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || ||<br />
|-<br />
|-<br />
| MK || || || || || || || || || || || ||X|| || || || || || || || || || || {{LangPair|}} ||X||D|| || || || || || || || || || || || ||<br />
|-<br />
| BG || || || || || || || || || || || || || || || || || || || || || || ||X|| {{LangPair|}} || || || || || || || || || || || || || ||<br />
|-<br />
| SH || || || || || || || || || || || || || || || || || || || || || || ||D|| || {{LangPair|}} || || || || || || || || || || || || || <br />
|-<br />
| PL || || || || || || || || || || || ||D|| || || || || || || || || || || || || || {{LangPair|}} || || || || || || || || || || || || <br />
|-<br />
| || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || ||<br />
|-<br />
| FA || || || || || || || || || || || || || || || || || || || || || || || || || || || || {{LangPair|}} ||D|| || || || || || || || || <br />
|-<br />
| TG || || || || || || || || || || || || || || || || || || || || || || || || || || || ||D|| {{LangPair|}} || || || || || || || || || <br />
|-<br />
| || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || ||<br />
|-<br />
| BN || || || || || || || || || || || ||D|| || || || || || || || || || || || || || || || || || || {{LangPair|}} || || || || || || || <br />
|-<br />
| || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || ||<br />
|-<br />
| FIN|| || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || {{LangPair|}} ||D|| || || ||<br />
|-<br />
| SME|| || || || || || || || || || || || || || ||D|| || || || || || || || || || || || || || || || || || || {{LangPair|}} || || || || <br />
|-<br />
| || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || ||<br />
|-<br />
| EU ||X|| || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || {{LangPair|}} || ||<br />
|-<br />
| || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || || ||<br />
|-<br />
| EO || || || || || || || || || || || ||X|| || || || || || || || || || || || || || || || || || || || || || || || || || {{LangPair|}}<br />
|-<br />
|}<br />
----<br />
[[Category:Languages]]<br />
[[Category:Language_pairs]]<br />
[[Category:Statistics]]<br />
[[Category:Documentation in English]]</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Workplan&diff=67376
Romanian and Catalan/Workplan
2018-08-06T11:12:43Z
<p>Marcriera: </p>
<hr />
<div>You can find the detailed goals for each week [[User:Marcriera/Proposal2018#Workplan|here]].<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Work done<br />
! style="width: 6%" | Bidix<br />
! style="width: 15%" | WER / PER<br />
! style="width: 9%" | Coverage<br />
|-<br />
! Post-application period<br />
| style="text-align:center" | 28 March - 13 May<br />
|<br />
* Build frequency lists for Romanian and Catalan<br />
* Fix broken bidix entries<br />
* Improve testvoc scripts<br />
| style="text-align:center" | 12,819<br />
| style="text-align:center" | ron > cat (~36%)<br>cat > ron (~61%)<br />
| style="text-align:center" | ron (79%)<br>cat (78%)<br />
|-<br />
! 1<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Fix transfer rules that didn't work as expected<br />
| style="text-align:center" | 14,890 (~15,000)<br />
| style="text-align:center" | ron > cat ~33% (~34%)<br>cat > ron ~56% (~60%)<br />
| style="text-align:center" | ron 82.2% (80.1%)<br>cat 82.7% (79.1%)<br />
|-<br />
! 2<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries (adj clean)<br />
* Rewrite ron-cat transfer rules to use chunking<br />
* Improve freqlist generation script (explains the leap in coverage)<br />
| style="text-align:center" | 17,026 (~17,000)<br />
| style="text-align:center" | ron > cat ~31% (~32%)<br>cat > ron ~53% (~59%)<br />
| style="text-align:center" | ron 83.6% (81.1%)<br>cat 86.2% (80%)<br />
|-<br />
! 3<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Plan transfer rule changes<br />
| style="text-align:center" | 18,487 (~19,000)<br />
| style="text-align:center" | ron > cat ~31% (~30%)<br>cat > ron ~53% (~58%)<br />
| style="text-align:center" | ron 84.9% (81.9%)<br>cat 86.4% (80.9%)<br />
|-<br />
! 4<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Upgrade transfer rules and write rules for new patterns<br />
* Add CG to Romanian to improve disambiguation<br />
| style="text-align:center" | 20,324 (~21,000)<br />
| style="text-align:center" | ron > cat ~31% (~28%)<br>cat > ron ~53% (~57%)<br />
| style="text-align:center" | ron 85.4% (82.7%)<br>cat 86.8% (81.7%)<br />
|-<br />
! 5<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Upgrade transfer rules and write rules for new patterns<br />
* Add more Romanian CG rules<br />
* Improve evaluation of test texts with diff files<br />
'''First evaluation'''<br />
| style="text-align:center" | 22,381 (~23,000)<br />
| style="text-align:center" | ron > cat ~30% (~26%)<br>cat > ron ~53% (~56%)<br />
| style="text-align:center" | ron 86.1% (83.4%)<br>cat 87.4% (82.4%)<br />
|-<br />
! 6<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Write new transfer rules<br />
| style="text-align:center" | 22,640 (~25,000)<br />
| style="text-align:center" | ron > cat ~30% (~25%)<br>cat > ron ~51% (~53%)<br />
| style="text-align:center" | ron 86.4% (84.1%)<br>cat 88.2% (83%)<br />
|-<br />
! 7<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
* Upgrade Indonesian-Malaysian pair to monolingual package system (testvoc clean)<br />
* Upgrade Welsh-English pair to monolingual package system<br />
* Testvoc fixes (Romanian-Catalan and Welsh-English)<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 8<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
* Testvoc fixes (Romanian-Catalan and Welsh-English) <br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 9<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
* Upgrade Catalan-Italian pair to monolingual package system (testvoc clean)<br />
* Testvoc fixes (Romanian-Catalan)<br />
'''Second evaluation'''<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 10<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
* Upgrade Afrikaans-Dutch pair to monolingual package system (testvoc clean)<br />
* Testvoc fixes (Romanian-Catalan and Welsh-English)<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 11<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Write new transfer rules<br />
* Add Romanian CG rules<br />
| style="text-align:center" | 22,995 (~27,000)<br />
| style="text-align:center" | ron > cat ~29% (~24%)<br>cat > ron ~46% (~50%)<br />
| style="text-align:center" | ron 86.8% (84.7%)<br>cat 88.7% (83.6%)<br />
|-<br />
! 12<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
* Fix broken bidix entries<br />
* Write new transfer rules<br />
* Add Romanian CG rules<br />
| style="text-align:center" | 23,009 (~29,000)<br />
| style="text-align:center" | ron > cat ~29% (~23%)<br>cat > ron ~46% (~47%)<br />
| style="text-align:center" | ron 86.8% (85.3%)<br>cat 88.7% (84.2%)<br />
|-<br />
! 13<br />
| style="text-align:center" | 6 August - 14 August<br />
|<br />
'''Final evaluation'''<br />
| style="text-align:center" | (~31,000)<br />
| style="text-align:center" | ron > cat (~22%)<br>cat > ron (~45%)<br />
| style="text-align:center" | ron (85.8%)<br>cat (84.7%)<br />
|}</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Workplan&diff=67348
Romanian and Catalan/Workplan
2018-07-29T22:49:49Z
<p>Marcriera: </p>
<hr />
<div>You can find the detailed goals for each week [[User:Marcriera/Proposal2018#Workplan|here]].<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Work done<br />
! style="width: 6%" | Bidix<br />
! style="width: 15%" | WER / PER<br />
! style="width: 9%" | Coverage<br />
|-<br />
! Post-application period<br />
| style="text-align:center" | 28 March - 13 May<br />
|<br />
* Build frequency lists for Romanian and Catalan<br />
* Fix broken bidix entries<br />
* Improve testvoc scripts<br />
| style="text-align:center" | 12,819<br />
| style="text-align:center" | ron > cat (~36%)<br>cat > ron (~61%)<br />
| style="text-align:center" | ron (79%)<br>cat (78%)<br />
|-<br />
! 1<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Fix transfer rules that didn't work as expected<br />
| style="text-align:center" | 14,890 (~15,000)<br />
| style="text-align:center" | ron > cat ~33% (~34%)<br>cat > ron ~56% (~60%)<br />
| style="text-align:center" | ron 82.2% (80.1%)<br>cat 82.7% (79.1%)<br />
|-<br />
! 2<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries (adj clean)<br />
* Rewrite ron-cat transfer rules to use chunking<br />
* Improve freqlist generation script (explains the leap in coverage)<br />
| style="text-align:center" | 17,026 (~17,000)<br />
| style="text-align:center" | ron > cat ~31% (~32%)<br>cat > ron ~53% (~59%)<br />
| style="text-align:center" | ron 83.6% (81.1%)<br>cat 86.2% (80%)<br />
|-<br />
! 3<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Plan transfer rule changes<br />
| style="text-align:center" | 18,487 (~19,000)<br />
| style="text-align:center" | ron > cat ~31% (~30%)<br>cat > ron ~53% (~58%)<br />
| style="text-align:center" | ron 84.9% (81.9%)<br>cat 86.4% (80.9%)<br />
|-<br />
! 4<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Upgrade transfer rules and write rules for new patterns<br />
* Add CG to Romanian to improve disambiguation<br />
| style="text-align:center" | 20,324 (~21,000)<br />
| style="text-align:center" | ron > cat ~31% (~28%)<br>cat > ron ~53% (~57%)<br />
| style="text-align:center" | ron 85.4% (82.7%)<br>cat 86.8% (81.7%)<br />
|-<br />
! 5<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Upgrade transfer rules and write rules for new patterns<br />
* Add more Romanian CG rules<br />
* Improve evaluation of test texts with diff files<br />
'''First evaluation'''<br />
| style="text-align:center" | 22,381 (~23,000)<br />
| style="text-align:center" | ron > cat ~30% (~26%)<br>cat > ron ~53% (~56%)<br />
| style="text-align:center" | ron 86.1% (83.4%)<br>cat 87.4% (82.4%)<br />
|-<br />
! 6<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Write new transfer rules<br />
| style="text-align:center" | 22,640 (~25,000)<br />
| style="text-align:center" | ron > cat ~30% (~25%)<br>cat > ron ~51% (~53%)<br />
| style="text-align:center" | ron 86.4% (84.1%)<br>cat 88.2% (83%)<br />
|-<br />
! 7<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
* Upgrade Indonesian-Malaysian pair to monolingual package system (testvoc clean)<br />
* Upgrade Welsh-English pair to monolingual package system<br />
* Testvoc fixes (Romanian-Catalan and Welsh-English)<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 8<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
* Testvoc fixes (Romanian-Catalan and Welsh-English) <br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 9<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
* Upgrade Catalan-Italian pair to monolingual package system (testvoc clean)<br />
* Testvoc fixes (Romanian-Catalan)<br />
'''Second evaluation'''<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 10<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
* Upgrade Afrikaans-Dutch pair to monolingual package system (testvoc clean)<br />
* Testvoc fixes (Romanian-Catalan and Welsh-English)<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 11<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Write new transfer rules<br />
* Add Romanian CG rules<br />
| style="text-align:center" | 22,995 (~27,000)<br />
| style="text-align:center" | ron > cat ~29% (~24%)<br>cat > ron ~46% (~50%)<br />
| style="text-align:center" | ron 86.8% (84.7%)<br>cat 88.7% (83.6%)<br />
|-<br />
! 12<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
| style="text-align:center" | (~29,000)<br />
| style="text-align:center" | ron > cat (~23%)<br>cat > ron (~47%)<br />
| style="text-align:center" | ron (85.3%)<br>cat (84.2%)<br />
|-<br />
! 13<br />
| style="text-align:center" | 6 August - 14 August<br />
|<br />
'''Final evaluation'''<br />
| style="text-align:center" | (~31,000)<br />
| style="text-align:center" | ron > cat (~22%)<br>cat > ron (~45%)<br />
| style="text-align:center" | ron (85.8%)<br>cat (84.7%)<br />
|}</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Workplan&diff=67340
Romanian and Catalan/Workplan
2018-07-22T15:45:23Z
<p>Marcriera: </p>
<hr />
<div>You can find the detailed goals for each week [[User:Marcriera/Proposal2018#Workplan|here]].<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Work done<br />
! style="width: 6%" | Bidix<br />
! style="width: 15%" | WER / PER<br />
! style="width: 9%" | Coverage<br />
|-<br />
! Post-application period<br />
| style="text-align:center" | 28 March - 13 May<br />
|<br />
* Build frequency lists for Romanian and Catalan<br />
* Fix broken bidix entries<br />
* Improve testvoc scripts<br />
| style="text-align:center" | 12,819<br />
| style="text-align:center" | ron > cat (~36%)<br>cat > ron (~61%)<br />
| style="text-align:center" | ron (79%)<br>cat (78%)<br />
|-<br />
! 1<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Fix transfer rules that didn't work as expected<br />
| style="text-align:center" | 14,890 (~15,000)<br />
| style="text-align:center" | ron > cat ~33% (~34%)<br>cat > ron ~56% (~60%)<br />
| style="text-align:center" | ron 82.2% (80.1%)<br>cat 82.7% (79.1%)<br />
|-<br />
! 2<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries (adj clean)<br />
* Rewrite ron-cat transfer rules to use chunking<br />
* Improve freqlist generation script (explains the leap in coverage)<br />
| style="text-align:center" | 17,026 (~17,000)<br />
| style="text-align:center" | ron > cat ~31% (~32%)<br>cat > ron ~53% (~59%)<br />
| style="text-align:center" | ron 83.6% (81.1%)<br>cat 86.2% (80%)<br />
|-<br />
! 3<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Plan transfer rule changes<br />
| style="text-align:center" | 18,487 (~19,000)<br />
| style="text-align:center" | ron > cat ~31% (~30%)<br>cat > ron ~53% (~58%)<br />
| style="text-align:center" | ron 84.9% (81.9%)<br>cat 86.4% (80.9%)<br />
|-<br />
! 4<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Upgrade transfer rules and write rules for new patterns<br />
* Add CG to Romanian to improve disambiguation<br />
| style="text-align:center" | 20,324 (~21,000)<br />
| style="text-align:center" | ron > cat ~31% (~28%)<br>cat > ron ~53% (~57%)<br />
| style="text-align:center" | ron 85.4% (82.7%)<br>cat 86.8% (81.7%)<br />
|-<br />
! 5<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Upgrade transfer rules and write rules for new patterns<br />
* Add more Romanian CG rules<br />
* Improve evaluation of test texts with diff files<br />
'''First evaluation'''<br />
| style="text-align:center" | 22,381 (~23,000)<br />
| style="text-align:center" | ron > cat ~30% (~26%)<br>cat > ron ~53% (~56%)<br />
| style="text-align:center" | ron 86.1% (83.4%)<br>cat 87.4% (82.4%)<br />
|-<br />
! 6<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Write new transfer rules<br />
| style="text-align:center" | 22,640 (~25,000)<br />
| style="text-align:center" | ron > cat ~30% (~25%)<br>cat > ron ~51% (~53%)<br />
| style="text-align:center" | ron 86.4% (84.1%)<br>cat 88.2% (83%)<br />
|-<br />
! 7<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
* Upgrade Indonesian-Malaysian pair to monolingual package system (testvoc clean)<br />
* Upgrade Welsh-English pair to monolingual package system<br />
* Testvoc fixes (Romanian-Catalan and Welsh-English)<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 8<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
* Testvoc fixes (Romanian-Catalan and Welsh-English) <br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 9<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
* Upgrade Catalan-Italian pair to monolingual package system (testvoc clean)<br />
* Testvoc fixes (Romanian-Catalan)<br />
'''Second evaluation'''<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 10<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
* Upgrade Afrikaans-Dutch pair to monolingual package system (testvoc clean)<br />
* Testvoc fixes (Romanian-Catalan and Welsh-English)<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 11<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
| style="text-align:center" | (~27,000)<br />
| style="text-align:center" | ron > cat (~24%)<br>cat > ron (~50%)<br />
| style="text-align:center" | ron (84.7%)<br>cat (83.6%)<br />
|-<br />
! 12<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
| style="text-align:center" | (~29,000)<br />
| style="text-align:center" | ron > cat (~23%)<br>cat > ron (~47%)<br />
| style="text-align:center" | ron (85.3%)<br>cat (84.2%)<br />
|-<br />
! 13<br />
| style="text-align:center" | 6 August - 14 August<br />
|<br />
'''Final evaluation'''<br />
| style="text-align:center" | (~31,000)<br />
| style="text-align:center" | ron > cat (~22%)<br>cat > ron (~45%)<br />
| style="text-align:center" | ron (85.8%)<br>cat (84.7%)<br />
|}</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Workplan&diff=67323
Romanian and Catalan/Workplan
2018-07-16T10:39:46Z
<p>Marcriera: </p>
<hr />
<div>You can find the detailed goals for each week [[User:Marcriera/Proposal2018#Workplan|here]].<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Work done<br />
! style="width: 6%" | Bidix<br />
! style="width: 15%" | WER / PER<br />
! style="width: 9%" | Coverage<br />
|-<br />
! Post-application period<br />
| style="text-align:center" | 28 March - 13 May<br />
|<br />
* Build frequency lists for Romanian and Catalan<br />
* Fix broken bidix entries<br />
* Improve testvoc scripts<br />
| style="text-align:center" | 12,819<br />
| style="text-align:center" | ron > cat (~36%)<br>cat > ron (~61%)<br />
| style="text-align:center" | ron (79%)<br>cat (78%)<br />
|-<br />
! 1<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Fix transfer rules that didn't work as expected<br />
| style="text-align:center" | 14,890 (~15,000)<br />
| style="text-align:center" | ron > cat ~33% (~34%)<br>cat > ron ~56% (~60%)<br />
| style="text-align:center" | ron 82.2% (80.1%)<br>cat 82.7% (79.1%)<br />
|-<br />
! 2<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries (adj clean)<br />
* Rewrite ron-cat transfer rules to use chunking<br />
* Improve freqlist generation script (explains the leap in coverage)<br />
| style="text-align:center" | 17,026 (~17,000)<br />
| style="text-align:center" | ron > cat ~31% (~32%)<br>cat > ron ~53% (~59%)<br />
| style="text-align:center" | ron 83.6% (81.1%)<br>cat 86.2% (80%)<br />
|-<br />
! 3<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Plan transfer rule changes<br />
| style="text-align:center" | 18,487 (~19,000)<br />
| style="text-align:center" | ron > cat ~31% (~30%)<br>cat > ron ~53% (~58%)<br />
| style="text-align:center" | ron 84.9% (81.9%)<br>cat 86.4% (80.9%)<br />
|-<br />
! 4<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Upgrade transfer rules and write rules for new patterns<br />
* Add CG to Romanian to improve disambiguation<br />
| style="text-align:center" | 20,324 (~21,000)<br />
| style="text-align:center" | ron > cat ~31% (~28%)<br>cat > ron ~53% (~57%)<br />
| style="text-align:center" | ron 85.4% (82.7%)<br>cat 86.8% (81.7%)<br />
|-<br />
! 5<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Upgrade transfer rules and write rules for new patterns<br />
* Add more Romanian CG rules<br />
* Improve evaluation of test texts with diff files<br />
'''First evaluation'''<br />
| style="text-align:center" | 22,381 (~23,000)<br />
| style="text-align:center" | ron > cat ~30% (~26%)<br>cat > ron ~53% (~56%)<br />
| style="text-align:center" | ron 86.1% (83.4%)<br>cat 87.4% (82.4%)<br />
|-<br />
! 6<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Write new transfer rules<br />
| style="text-align:center" | 22,640 (~25,000)<br />
| style="text-align:center" | ron > cat ~30% (~25%)<br>cat > ron ~51% (~53%)<br />
| style="text-align:center" | ron 86.4% (84.1%)<br>cat 88.2% (83%)<br />
|-<br />
! 7<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
* Upgrade Indonesian-Malaysian pair to monolingual package system (testvoc clean)<br />
* Upgrade Welsh-English pair to monolingual package system<br />
* Testvoc fixes (Romanian-Catalan and Welsh-English)<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 8<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
* Testvoc fixes (Romanian-Catalan and Welsh-English) <br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 9<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
* Upgrade Catalan-Italian pair to monolingual package system (testvoc clean)<br />
* Testvoc fixes (Romanian-Catalan)<br />
'''Second evaluation'''<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 10<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 11<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
| style="text-align:center" | (~27,000)<br />
| style="text-align:center" | ron > cat (~24%)<br>cat > ron (~50%)<br />
| style="text-align:center" | ron (84.7%)<br>cat (83.6%)<br />
|-<br />
! 12<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
| style="text-align:center" | (~29,000)<br />
| style="text-align:center" | ron > cat (~23%)<br>cat > ron (~47%)<br />
| style="text-align:center" | ron (85.3%)<br>cat (84.2%)<br />
|-<br />
! 13<br />
| style="text-align:center" | 6 August - 14 August<br />
|<br />
'''Final evaluation'''<br />
| style="text-align:center" | (~31,000)<br />
| style="text-align:center" | ron > cat (~22%)<br>cat > ron (~45%)<br />
| style="text-align:center" | ron (85.8%)<br>cat (84.7%)<br />
|}</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Workplan&diff=67309
Romanian and Catalan/Workplan
2018-07-09T14:45:20Z
<p>Marcriera: </p>
<hr />
<div>You can find the detailed goals for each week [[User:Marcriera/Proposal2018#Workplan|here]].<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Work done<br />
! style="width: 6%" | Bidix<br />
! style="width: 15%" | WER / PER<br />
! style="width: 9%" | Coverage<br />
|-<br />
! Post-application period<br />
| style="text-align:center" | 28 March - 13 May<br />
|<br />
* Build frequency lists for Romanian and Catalan<br />
* Fix broken bidix entries<br />
* Improve testvoc scripts<br />
| style="text-align:center" | 12,819<br />
| style="text-align:center" | ron > cat (~36%)<br>cat > ron (~61%)<br />
| style="text-align:center" | ron (79%)<br>cat (78%)<br />
|-<br />
! 1<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Fix transfer rules that didn't work as expected<br />
| style="text-align:center" | 14,890 (~15,000)<br />
| style="text-align:center" | ron > cat ~33% (~34%)<br>cat > ron ~56% (~60%)<br />
| style="text-align:center" | ron 82.2% (80.1%)<br>cat 82.7% (79.1%)<br />
|-<br />
! 2<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries (adj clean)<br />
* Rewrite ron-cat transfer rules to use chunking<br />
* Improve freqlist generation script (explains the leap in coverage)<br />
| style="text-align:center" | 17,026 (~17,000)<br />
| style="text-align:center" | ron > cat ~31% (~32%)<br>cat > ron ~53% (~59%)<br />
| style="text-align:center" | ron 83.6% (81.1%)<br>cat 86.2% (80%)<br />
|-<br />
! 3<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Plan transfer rule changes<br />
| style="text-align:center" | 18,487 (~19,000)<br />
| style="text-align:center" | ron > cat ~31% (~30%)<br>cat > ron ~53% (~58%)<br />
| style="text-align:center" | ron 84.9% (81.9%)<br>cat 86.4% (80.9%)<br />
|-<br />
! 4<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Upgrade transfer rules and write rules for new patterns<br />
* Add CG to Romanian to improve disambiguation<br />
| style="text-align:center" | 20,324 (~21,000)<br />
| style="text-align:center" | ron > cat ~31% (~28%)<br>cat > ron ~53% (~57%)<br />
| style="text-align:center" | ron 85.4% (82.7%)<br>cat 86.8% (81.7%)<br />
|-<br />
! 5<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Upgrade transfer rules and write rules for new patterns<br />
* Add more Romanian CG rules<br />
* Improve evaluation of test texts with diff files<br />
'''First evaluation'''<br />
| style="text-align:center" | 22,381 (~23,000)<br />
| style="text-align:center" | ron > cat ~30% (~26%)<br>cat > ron ~53% (~56%)<br />
| style="text-align:center" | ron 86.1% (83.4%)<br>cat 87.4% (82.4%)<br />
|-<br />
! 6<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Write new transfer rules<br />
| style="text-align:center" | 22,640 (~25,000)<br />
| style="text-align:center" | ron > cat ~30% (~25%)<br>cat > ron ~51% (~53%)<br />
| style="text-align:center" | ron 86.4% (84.1%)<br>cat 88.2% (83%)<br />
|-<br />
! 7<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
* Upgrade Indonesian-Malaysian pair to monolingual package system (testvoc clean)<br />
* Upgrade Welsh-English pair to monolingual package system<br />
* Testvoc fixes (Romanian-Catalan and Welsh-English)<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 8<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
* Testvoc fixes (Romanian-Catalan and Welsh-English) <br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 9<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
'''Second evaluation'''<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 10<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 11<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
| style="text-align:center" | (~27,000)<br />
| style="text-align:center" | ron > cat (~24%)<br>cat > ron (~50%)<br />
| style="text-align:center" | ron (84.7%)<br>cat (83.6%)<br />
|-<br />
! 12<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
| style="text-align:center" | (~29,000)<br />
| style="text-align:center" | ron > cat (~23%)<br>cat > ron (~47%)<br />
| style="text-align:center" | ron (85.3%)<br>cat (84.2%)<br />
|-<br />
! 13<br />
| style="text-align:center" | 6 August - 14 August<br />
|<br />
'''Final evaluation'''<br />
| style="text-align:center" | (~31,000)<br />
| style="text-align:center" | ron > cat (~22%)<br>cat > ron (~45%)<br />
| style="text-align:center" | ron (85.8%)<br>cat (84.7%)<br />
|}</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Workplan&diff=67259
Romanian and Catalan/Workplan
2018-07-02T09:07:06Z
<p>Marcriera: </p>
<hr />
<div>You can find the detailed goals for each week [[User:Marcriera/Proposal2018#Workplan|here]].<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Work done<br />
! style="width: 6%" | Bidix<br />
! style="width: 15%" | WER / PER<br />
! style="width: 9%" | Coverage<br />
|-<br />
! Post-application period<br />
| style="text-align:center" | 28 March - 13 May<br />
|<br />
* Build frequency lists for Romanian and Catalan<br />
* Fix broken bidix entries<br />
* Improve testvoc scripts<br />
| style="text-align:center" | 12,819<br />
| style="text-align:center" | ron > cat (~36%)<br>cat > ron (~61%)<br />
| style="text-align:center" | ron (79%)<br>cat (78%)<br />
|-<br />
! 1<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Fix transfer rules that didn't work as expected<br />
| style="text-align:center" | 14,890 (~15,000)<br />
| style="text-align:center" | ron > cat ~33% (~34%)<br>cat > ron ~56% (~60%)<br />
| style="text-align:center" | ron 82.2% (80.1%)<br>cat 82.7% (79.1%)<br />
|-<br />
! 2<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries (adj clean)<br />
* Rewrite ron-cat transfer rules to use chunking<br />
* Improve freqlist generation script (explains the leap in coverage)<br />
| style="text-align:center" | 17,026 (~17,000)<br />
| style="text-align:center" | ron > cat ~31% (~32%)<br>cat > ron ~53% (~59%)<br />
| style="text-align:center" | ron 83.6% (81.1%)<br>cat 86.2% (80%)<br />
|-<br />
! 3<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Plan transfer rule changes<br />
| style="text-align:center" | 18,487 (~19,000)<br />
| style="text-align:center" | ron > cat ~31% (~30%)<br>cat > ron ~53% (~58%)<br />
| style="text-align:center" | ron 84.9% (81.9%)<br>cat 86.4% (80.9%)<br />
|-<br />
! 4<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Upgrade transfer rules and write rules for new patterns<br />
* Add CG to Romanian to improve disambiguation<br />
| style="text-align:center" | 20,324 (~21,000)<br />
| style="text-align:center" | ron > cat ~31% (~28%)<br>cat > ron ~53% (~57%)<br />
| style="text-align:center" | ron 85.4% (82.7%)<br>cat 86.8% (81.7%)<br />
|-<br />
! 5<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Upgrade transfer rules and write rules for new patterns<br />
* Add more Romanian CG rules<br />
* Improve evaluation of test texts with diff files<br />
'''First evaluation'''<br />
| style="text-align:center" | 22,381 (~23,000)<br />
| style="text-align:center" | ron > cat ~30% (~26%)<br>cat > ron ~53% (~56%)<br />
| style="text-align:center" | ron 86.1% (83.4%)<br>cat 87.4% (82.4%)<br />
|-<br />
! 6<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Write new transfer rules<br />
| style="text-align:center" | 22,640 (~25,000)<br />
| style="text-align:center" | ron > cat ~30% (~25%)<br>cat > ron ~51% (~53%)<br />
| style="text-align:center" | ron 86.4% (84.1%)<br>cat 88.2% (83%)<br />
|-<br />
! 7<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
* Upgrade Indonesian-Malaysian pair to monolingual package system (testvoc clean)<br />
* Upgrade Welsh-English pair to monolingual package system<br />
* Testvoc fixes (Romanian-Catalan and Welsh-English)<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 8<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 9<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
'''Second evaluation'''<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 10<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 11<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
| style="text-align:center" | (~27,000)<br />
| style="text-align:center" | ron > cat (~24%)<br>cat > ron (~50%)<br />
| style="text-align:center" | ron (84.7%)<br>cat (83.6%)<br />
|-<br />
! 12<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
| style="text-align:center" | (~29,000)<br />
| style="text-align:center" | ron > cat (~23%)<br>cat > ron (~47%)<br />
| style="text-align:center" | ron (85.3%)<br>cat (84.2%)<br />
|-<br />
! 13<br />
| style="text-align:center" | 6 August - 14 August<br />
|<br />
'''Final evaluation'''<br />
| style="text-align:center" | (~31,000)<br />
| style="text-align:center" | ron > cat (~22%)<br>cat > ron (~45%)<br />
| style="text-align:center" | ron (85.8%)<br>cat (84.7%)<br />
|}</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Workplan&diff=67220
Romanian and Catalan/Workplan
2018-06-25T10:17:48Z
<p>Marcriera: </p>
<hr />
<div>You can find the detailed goals for each week [[User:Marcriera/Proposal2018#Workplan|here]].<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Work done<br />
! style="width: 6%" | Bidix<br />
! style="width: 15%" | WER / PER<br />
! style="width: 9%" | Coverage<br />
|-<br />
! Post-application period<br />
| style="text-align:center" | 28 March - 13 May<br />
|<br />
* Build frequency lists for Romanian and Catalan<br />
* Fix broken bidix entries<br />
* Improve testvoc scripts<br />
| style="text-align:center" | 12,819<br />
| style="text-align:center" | ron > cat (~36%)<br>cat > ron (~61%)<br />
| style="text-align:center" | ron (79%)<br>cat (78%)<br />
|-<br />
! 1<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Fix transfer rules that didn't work as expected<br />
| style="text-align:center" | 14,890 (~15,000)<br />
| style="text-align:center" | ron > cat ~33% (~34%)<br>cat > ron ~56% (~60%)<br />
| style="text-align:center" | ron 82.2% (80.1%)<br>cat 82.7% (79.1%)<br />
|-<br />
! 2<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries (adj clean)<br />
* Rewrite ron-cat transfer rules to use chunking<br />
* Improve freqlist generation script (explains the leap in coverage)<br />
| style="text-align:center" | 17,026 (~17,000)<br />
| style="text-align:center" | ron > cat ~31% (~32%)<br>cat > ron ~53% (~59%)<br />
| style="text-align:center" | ron 83.6% (81.1%)<br>cat 86.2% (80%)<br />
|-<br />
! 3<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Plan transfer rule changes<br />
| style="text-align:center" | 18,487 (~19,000)<br />
| style="text-align:center" | ron > cat ~31% (~30%)<br>cat > ron ~53% (~58%)<br />
| style="text-align:center" | ron 84.9% (81.9%)<br>cat 86.4% (80.9%)<br />
|-<br />
! 4<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Upgrade transfer rules and write rules for new patterns<br />
* Add CG to Romanian to improve disambiguation<br />
| style="text-align:center" | 20,324 (~21,000)<br />
| style="text-align:center" | ron > cat ~31% (~28%)<br>cat > ron ~53% (~57%)<br />
| style="text-align:center" | ron 85.4% (82.7%)<br>cat 86.8% (81.7%)<br />
|-<br />
! 5<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Upgrade transfer rules and write rules for new patterns<br />
* Add more Romanian CG rules<br />
* Improve evaluation of test texts with diff files<br />
'''First evaluation'''<br />
| style="text-align:center" | 22,381 (~23,000)<br />
| style="text-align:center" | ron > cat ~30% (~26%)<br>cat > ron ~53% (~56%)<br />
| style="text-align:center" | ron 86.1% (83.4%)<br>cat 87.4% (82.4%)<br />
|-<br />
! 6<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Write new transfer rules<br />
| style="text-align:center" | 22,640 (~25,000)<br />
| style="text-align:center" | ron > cat ~30% (~25%)<br>cat > ron ~51% (~53%)<br />
| style="text-align:center" | ron 86.4% (84.1%)<br>cat 88.2% (83%)<br />
|-<br />
! 7<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
| style="text-align:center" | (~27,000)<br />
| style="text-align:center" | ron > cat (~24%)<br>cat > ron (~50%)<br />
| style="text-align:center" | ron (84.7%)<br>cat (83.6%)<br />
|-<br />
! 8<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
| style="text-align:center" | (~29,000)<br />
| style="text-align:center" | ron > cat (~23%)<br>cat > ron (~47%)<br />
| style="text-align:center" | ron (85.3%)<br>cat (84.2%)<br />
|-<br />
! 9<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
'''Second evaluation'''<br />
| style="text-align:center" | (~31,000)<br />
| style="text-align:center" | ron > cat (~22%)<br>cat > ron (~45%)<br />
| style="text-align:center" | ron (85.8%)<br>cat (84.7%)<br />
|-<br />
! 10<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 11<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 12<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 13<br />
| style="text-align:center" | 6 August - 14 August<br />
|<br />
'''Final evaluation'''<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|}</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Workplan&diff=67219
Romanian and Catalan/Workplan
2018-06-25T10:16:43Z
<p>Marcriera: </p>
<hr />
<div>You can find the detailed goals for each week [[User:Marcriera/Proposal2018#Workplan|here]].<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Work done<br />
! style="width: 5%" | Bidix<br />
! style="width: 15%" | WER / PER<br />
! style="width: 10%" | Coverage<br />
|-<br />
! Post-application period<br />
| style="text-align:center" | 28 March - 13 May<br />
|<br />
* Build frequency lists for Romanian and Catalan<br />
* Fix broken bidix entries<br />
* Improve testvoc scripts<br />
| style="text-align:center" | 12,819<br />
| style="text-align:center" | ron > cat (~36%)<br>cat > ron (~61%)<br />
| style="text-align:center" | ron (79%)<br>cat (78%)<br />
|-<br />
! 1<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Fix transfer rules that didn't work as expected<br />
| style="text-align:center" | 14,890 (~15,000)<br />
| style="text-align:center" | ron > cat ~33% (~34%)<br>cat > ron ~56% (~60%)<br />
| style="text-align:center" | ron 82.2% (80.1%)<br>cat 82.7% (79.1%)<br />
|-<br />
! 2<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries (adj clean)<br />
* Rewrite ron-cat transfer rules to use chunking<br />
* Improve freqlist generation script (explains the leap in coverage)<br />
| style="text-align:center" | 17,026 (~17,000)<br />
| style="text-align:center" | ron > cat ~31% (~32%)<br>cat > ron ~53% (~59%)<br />
| style="text-align:center" | ron 83.6% (81.1%)<br>cat 86.2% (80%)<br />
|-<br />
! 3<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Plan transfer rule changes<br />
| style="text-align:center" | 18,487 (~19,000)<br />
| style="text-align:center" | ron > cat ~31% (~30%)<br>cat > ron ~53% (~58%)<br />
| style="text-align:center" | ron 84.9% (81.9%)<br>cat 86.4% (80.9%)<br />
|-<br />
! 4<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Upgrade transfer rules and write rules for new patterns<br />
* Add CG to Romanian to improve disambiguation<br />
| style="text-align:center" | 20,324 (~21,000)<br />
| style="text-align:center" | ron > cat ~31% (~28%)<br>cat > ron ~53% (~57%)<br />
| style="text-align:center" | ron 85.4% (82.7%)<br>cat 86.8% (81.7%)<br />
|-<br />
! 5<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Upgrade transfer rules and write rules for new patterns<br />
* Add more Romanian CG rules<br />
* Improve evaluation of test texts with diff files<br />
'''First evaluation'''<br />
| style="text-align:center" | 22,381 (~23,000)<br />
| style="text-align:center" | ron > cat ~30% (~26%)<br>cat > ron ~53% (~56%)<br />
| style="text-align:center" | ron 86.1% (83.4%)<br>cat 87.4% (82.4%)<br />
|-<br />
! 6<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Write new transfer rules<br />
| style="text-align:center" | 22,640 (~25,000)<br />
| style="text-align:center" | ron > cat ~30% (~25%)<br>cat > ron ~51% (~53%)<br />
| style="text-align:center" | ron 86.4% (84.1%)<br>cat 88.2% (83%)<br />
|-<br />
! 7<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
| style="text-align:center" | (~27,000)<br />
| style="text-align:center" | ron > cat (~24%)<br>cat > ron (~50%)<br />
| style="text-align:center" | ron (84.7%)<br>cat (83.6%)<br />
|-<br />
! 8<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
| style="text-align:center" | (~29,000)<br />
| style="text-align:center" | ron > cat (~23%)<br>cat > ron (~47%)<br />
| style="text-align:center" | ron (85.3%)<br>cat (84.2%)<br />
|-<br />
! 9<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
'''Second evaluation'''<br />
| style="text-align:center" | (~31,000)<br />
| style="text-align:center" | ron > cat (~22%)<br>cat > ron (~45%)<br />
| style="text-align:center" | ron (85.8%)<br>cat (84.7%)<br />
|-<br />
! 10<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 11<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 12<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 13<br />
| style="text-align:center" | 6 August - 14 August<br />
|<br />
'''Final evaluation'''<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|}</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Workplan&diff=67178
Romanian and Catalan/Workplan
2018-06-17T12:10:47Z
<p>Marcriera: </p>
<hr />
<div>You can find the detailed goals for each week [[User:Marcriera/Proposal2018#Workplan|here]].<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Work done<br />
! style="width: 5%" | Bidix<br />
! style="width: 15%" | WER / PER<br />
! style="width: 10%" | Coverage<br />
|-<br />
! Post-application period<br />
| style="text-align:center" | 28 March - 13 May<br />
|<br />
* Build frequency lists for Romanian and Catalan<br />
* Fix broken bidix entries<br />
* Improve testvoc scripts<br />
| style="text-align:center" | 12,819<br />
| style="text-align:center" | ron > cat (~36%)<br>cat > ron (~61%)<br />
| style="text-align:center" | ron (79%)<br>cat (78%)<br />
|-<br />
! 1<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Fix transfer rules that didn't work as expected<br />
| style="text-align:center" | 14,890 (~15,000)<br />
| style="text-align:center" | ron > cat ~33% (~34%)<br>cat > ron ~56% (~60%)<br />
| style="text-align:center" | ron 82.2% (80.1%)<br>cat 82.7% (79.1%)<br />
|-<br />
! 2<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries (adj clean)<br />
* Rewrite ron-cat transfer rules to use chunking<br />
* Improve freqlist generation script (explains the leap in coverage)<br />
| style="text-align:center" | 17,026 (~17,000)<br />
| style="text-align:center" | ron > cat ~31% (~32%)<br>cat > ron ~53% (~59%)<br />
| style="text-align:center" | ron 83.6% (81.1%)<br>cat 86.2% (80%)<br />
|-<br />
! 3<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Plan transfer rule changes<br />
| style="text-align:center" | 18,487 (~19,000)<br />
| style="text-align:center" | ron > cat ~31% (~30%)<br>cat > ron ~53% (~58%)<br />
| style="text-align:center" | ron 84.9% (81.9%)<br>cat 86.4% (80.9%)<br />
|-<br />
! 4<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Upgrade transfer rules and write rules for new patterns<br />
* Add CG to Romanian to improve disambiguation<br />
| style="text-align:center" | 20,324 (~21,000)<br />
| style="text-align:center" | ron > cat ~31% (~28%)<br>cat > ron ~53% (~57%)<br />
| style="text-align:center" | ron 85.4% (82.7%)<br>cat 86.8% (81.7%)<br />
|-<br />
! 5<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Upgrade transfer rules and write rules for new patterns<br />
* Add more Romanian CG rules<br />
* Improve evaluation of test texts with diff files<br />
'''First evaluation'''<br />
| style="text-align:center" | 22,381 (~23,000)<br />
| style="text-align:center" | ron > cat ~30% (~26%)<br>cat > ron ~53% (~56%)<br />
| style="text-align:center" | ron 86.1% (83.4%)<br>cat 87.4% (82.4%)<br />
|-<br />
! 6<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
| style="text-align:center" | (~25,000)<br />
| style="text-align:center" | ron > cat (~25%)<br>cat > ron (~53%)<br />
| style="text-align:center" | ron (84.1%)<br>cat (83%)<br />
|-<br />
! 7<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
| style="text-align:center" | (~27,000)<br />
| style="text-align:center" | ron > cat (~24%)<br>cat > ron (~50%)<br />
| style="text-align:center" | ron (84.7%)<br>cat (83.6%)<br />
|-<br />
! 8<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
| style="text-align:center" | (~29,000)<br />
| style="text-align:center" | ron > cat (~23%)<br>cat > ron (~47%)<br />
| style="text-align:center" | ron (85.3%)<br>cat (84.2%)<br />
|-<br />
! 9<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
'''Second evaluation'''<br />
| style="text-align:center" | (~31,000)<br />
| style="text-align:center" | ron > cat (~22%)<br>cat > ron (~45%)<br />
| style="text-align:center" | ron (85.8%)<br>cat (84.7%)<br />
|-<br />
! 10<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 11<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 12<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 13<br />
| style="text-align:center" | 6 August - 14 August<br />
|<br />
'''Final evaluation'''<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|}</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Workplan&diff=67140
Romanian and Catalan/Workplan
2018-06-11T08:49:39Z
<p>Marcriera: </p>
<hr />
<div>You can find the detailed goals for each week [[User:Marcriera/Proposal2018#Workplan|here]].<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Work done<br />
! style="width: 5%" | Bidix<br />
! style="width: 15%" | WER / PER<br />
! style="width: 10%" | Coverage<br />
|-<br />
! Post-application period<br />
| style="text-align:center" | 28 March - 13 May<br />
|<br />
* Build frequency lists for Romanian and Catalan<br />
* Fix broken bidix entries<br />
* Improve testvoc scripts<br />
| style="text-align:center" | 12,819<br />
| style="text-align:center" | ron > cat (~36%)<br>cat > ron (~61%)<br />
| style="text-align:center" | ron (79%)<br>cat (78%)<br />
|-<br />
! 1<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Fix transfer rules that didn't work as expected<br />
| style="text-align:center" | 14,890 (~15,000)<br />
| style="text-align:center" | ron > cat ~33% (~34%)<br>cat > ron ~56% (~60%)<br />
| style="text-align:center" | ron 82.2% (80.1%)<br>cat 82.7% (79.1%)<br />
|-<br />
! 2<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries (adj clean)<br />
* Rewrite ron-cat transfer rules to use chunking<br />
* Improve freqlist generation script (explains the leap in coverage)<br />
| style="text-align:center" | 17,026 (~17,000)<br />
| style="text-align:center" | ron > cat ~31% (~32%)<br>cat > ron ~53% (~59%)<br />
| style="text-align:center" | ron 83.6% (81.1%)<br>cat 86.2% (80%)<br />
|-<br />
! 3<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Plan transfer rule changes<br />
| style="text-align:center" | 18,487 (~19,000)<br />
| style="text-align:center" | ron > cat ~31% (~30%)<br>cat > ron ~53% (~58%)<br />
| style="text-align:center" | ron 84.9% (81.9%)<br>cat 86.4% (80.9%)<br />
|-<br />
! 4<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Upgrade transfer rules and write rules for new patterns<br />
* Add CG to Romanian to improve disambiguation<br />
| style="text-align:center" | 20,324 (~21,000)<br />
| style="text-align:center" | ron > cat ~31% (~28%)<br>cat > ron ~53% (~57%)<br />
| style="text-align:center" | ron 85.4% (82.7%)<br>cat 86.8% (81.7%)<br />
|-<br />
! 5<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
'''First evaluation'''<br />
| style="text-align:center" | (~23,000)<br />
| style="text-align:center" | ron > cat (~26%)<br>cat > ron (~56%)<br />
| style="text-align:center" | ron (83.4%)<br>cat (82.4%)<br />
|-<br />
! 6<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
| style="text-align:center" | (~25,000)<br />
| style="text-align:center" | ron > cat (~25%)<br>cat > ron (~53%)<br />
| style="text-align:center" | ron (84.1%)<br>cat (83%)<br />
|-<br />
! 7<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
| style="text-align:center" | (~27,000)<br />
| style="text-align:center" | ron > cat (~24%)<br>cat > ron (~50%)<br />
| style="text-align:center" | ron (84.7%)<br>cat (83.6%)<br />
|-<br />
! 8<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
| style="text-align:center" | (~29,000)<br />
| style="text-align:center" | ron > cat (~23%)<br>cat > ron (~47%)<br />
| style="text-align:center" | ron (85.3%)<br>cat (84.2%)<br />
|-<br />
! 9<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
'''Second evaluation'''<br />
| style="text-align:center" | (~31,000)<br />
| style="text-align:center" | ron > cat (~22%)<br>cat > ron (~45%)<br />
| style="text-align:center" | ron (85.8%)<br>cat (84.7%)<br />
|-<br />
! 10<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 11<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 12<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 13<br />
| style="text-align:center" | 6 August - 14 August<br />
|<br />
'''Final evaluation'''<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|}</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Transfer_Rules&diff=67135
Romanian and Catalan/Transfer Rules
2018-06-10T21:34:59Z
<p>Marcriera: /* Chunk (T1X) */</p>
<hr />
<div>This page provides general documentation on the transfer rules used in the Romanian-Catalan pair. The tables are generated from the data and comments found within the rule files using an automated script ('''transfer_documentation.sh''', available [https://github.com/MarcRiera/apertium-scripts here]).<br />
<br />
== Romanian → Catalan ==<br />
<br />
=== Chunk (T1X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Macro name<br />
! style="width: 10%" | Arguments<br />
! style="width: 60%" | Description<br />
|-<br />
!1<br />
|mod_maj<br />
| style="text-align:center" |1<br />
|Canvia la majúscula inicial d'un lema a minúscula<br />
|-<br />
!2<br />
|conj_verb1<br />
| style="text-align:center" |1<br />
|Conjuga un verb simple<br />
|-<br />
!3<br />
|conj_verb2<br />
| style="text-align:center" |2<br />
|Conjuga un verb compost doble<br />
|-<br />
!4<br />
|conj_verb3<br />
| style="text-align:center" |3<br />
|Conjuga un verb compost triple<br />
|-<br />
!5<br />
|f_concord1<br />
| style="text-align:center" |1<br />
|Realitza la concordança d'un element i desa els valors a les variables<br />
|-<br />
!6<br />
|f_concord2<br />
| style="text-align:center" |2<br />
|Realitza la concordança de dos elements i desa els valors a les variables<br />
|-<br />
!7<br />
|f_bcond<br />
| style="text-align:center" |2<br />
|Undocumented macro!<br />
|-<br />
!8<br />
|f_outverb2<br />
| style="text-align:center" |2<br />
|Undocumented macro!<br />
|-<br />
!9<br />
|f_outverb3<br />
| style="text-align:center" |3<br />
|Undocumented macro!<br />
|-<br />
!10<br />
|prn_verb<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|-<br />
!11<br />
|prn_verbp<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|-<br />
!12<br />
|prn_verbenc<br />
| style="text-align:center" |2<br />
|Undocumented macro!<br />
|-<br />
!13<br />
|f_modgen<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|-<br />
!14<br />
|f_concord3<br />
| style="text-align:center" |3<br />
|Undocumented macro!<br />
|-<br />
!15<br />
|f_concord4<br />
| style="text-align:center" |4<br />
|Undocumented macro!<br />
|-<br />
!16<br />
|f_lexicAdj<br />
| style="text-align:center" |1<br />
|Undocumented macro!<br />
|}<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|nom <br />
|NOM<br />
|-<br />
!2<br />
|num nom <br />
|NUM + NOM<br />
|-<br />
!3<br />
|num prepde nom <br />
|NUM + de + NOM<br />
|-<br />
!4<br />
|nom adjnorm <br />
|NOM + ADJECTIU<br />
|-<br />
!5<br />
|nom adjpos <br />
|NOM + ADJECTIU POSSESSIU<br />
|-<br />
!6<br />
|num <br />
|NUMERAL<br />
|-<br />
!7<br />
|adj <br />
|ADJ<br />
|-<br />
!8<br />
|pp <br />
|REGLA: PP<br />
|-<br />
!9<br />
|vbcnj <br />
|REGLA: VERB CONJUGAT<br />
|-<br />
!10<br />
|advnu vbcnj <br />
|REGLA: NU + VERB CONJUGAT<br />
|-<br />
!11<br />
|vbcnj_doble1 vbcnj_doble2 <br />
|REGLA: VERB CONJUGAT DOBLE<br />
|-<br />
!12<br />
|advnu vbcnj_doble1 vbcnj_doble2 <br />
|REGLA: NU + VERB CONJUGAT DOBLE<br />
|-<br />
!13<br />
|vbcnj_triple1 vbcnj_triple2 vbcnj_triple3 <br />
|REGLA: VERB CONJUGAT TRIPLE<br />
|-<br />
!14<br />
|advnu vbcnj_triple1 vbcnj_triple2 vbcnj_triple3 <br />
|REGLA: NU + VERB CONJUGAT DOBLE<br />
|-<br />
!15<br />
|sent <br />
|Reinicialitza les variables<br />
|-<br />
!16<br />
|vbaveapri pp prnenc <br />
|Undocumented rule!<br />
|-<br />
!17<br />
|det nom nom <br />
|Undocumented rule!<br />
|-<br />
!18<br />
|advnu catverb advdecât nom <br />
|Undocumented rule!<br />
|-<br />
!19<br />
|advnu catverb advdecât detind nom <br />
|Undocumented rule!<br />
|-<br />
!20<br />
|nom nom adjpos <br />
|Undocumented rule!<br />
|-<br />
!21<br />
|prepîn nom xifra <br />
|Undocumented rule!<br />
|-<br />
!22<br />
|num mesos <br />
|Undocumented rule!<br />
|-<br />
!23<br />
|nom verbfi adj <br />
|Undocumented rule!<br />
|-<br />
!24<br />
|nom verbfi pp <br />
|Undocumented rule!<br />
|-<br />
!25<br />
|deta nom <br />
|Undocumented rule!<br />
|-<br />
!26<br />
|deta nom adj <br />
|Undocumented rule!<br />
|-<br />
!27<br />
|deta nom adj adj <br />
|Undocumented rule!<br />
|-<br />
!28<br />
|nom deta np <br />
|Undocumented rule!<br />
|-<br />
!29<br />
|nom adj adj <br />
|Undocumented rule!<br />
|-<br />
!30<br />
|nom adj deta np <br />
|Undocumented rule!<br />
|-<br />
!31<br />
|nom deta relcărui nom <br />
|Undocumented rule!<br />
|-<br />
!32<br />
|prepîn nptop <br />
|Undocumented rule!<br />
|-<br />
!33<br />
|prep nom adjpos <br />
|Undocumented rule!<br />
|-<br />
!34<br />
|catverb nom adjpos <br />
|Undocumented rule!<br />
|-<br />
!35<br />
|verbtrebui conjsub catverb <br />
|Undocumented rule!<br />
|-<br />
!36<br />
|num mesos num <br />
|Undocumented rule!<br />
|-<br />
!37<br />
|prep nom adj <br />
|Undocumented rule!<br />
|-<br />
!38<br />
|prep nom adj adj <br />
|Undocumented rule!<br />
|-<br />
!39<br />
|prep nom nom adj adj <br />
|Undocumented rule!<br />
|-<br />
!40<br />
|catverb nom adj <br />
|Undocumented rule!<br />
|-<br />
!41<br />
|catverb nom det nom <br />
|Undocumented rule!<br />
|-<br />
!42<br />
|np <br />
|Undocumented rule!<br />
|-<br />
!43<br />
|preppe npant <br />
|Undocumented rule!<br />
|-<br />
!44<br />
|preppe prn <br />
|Undocumented rule!<br />
|-<br />
!45<br />
|preppe det <br />
|Undocumented rule!<br />
|-<br />
!46<br />
|catverb prep nom nom <br />
|Undocumented rule!<br />
|-<br />
!47<br />
|prep nom nom <br />
|Undocumented rule!<br />
|-<br />
!48<br />
|prep nom <br />
|Undocumented rule!<br />
|-<br />
!49<br />
|prep nom adj <br />
|Undocumented rule!<br />
|-<br />
!50<br />
|prep nom adj adj <br />
|Undocumented rule!<br />
|-<br />
!51<br />
|prep nom adj adj adj <br />
|Undocumented rule!<br />
|-<br />
!52<br />
|det <br />
|Undocumented rule!<br />
|-<br />
!53<br />
|prn <br />
|Undocumented rule!<br />
|-<br />
!54<br />
|adjtot nom <br />
|Undocumented rule!<br />
|-<br />
!55<br />
|catverb dettuturor <br />
|Undocumented rule!<br />
|-<br />
!56<br />
|nom dettuturor <br />
|Undocumented rule!<br />
|-<br />
!57<br />
|nom adjlui np <br />
|Undocumented rule!<br />
|-<br />
!58<br />
|relat <br />
|Undocumented rule!<br />
|-<br />
!59<br />
|adjpos <br />
|Undocumented rule!<br />
|-<br />
!60<br />
|catverb nom nom <br />
|Undocumented rule!<br />
|-<br />
!61<br />
|adj nom <br />
|Undocumented rule!<br />
|-<br />
!62<br />
|adj nom deta nom <br />
|Undocumented rule!<br />
|-<br />
!63<br />
|nom adj deta nom <br />
|Undocumented rule!<br />
|-<br />
!64<br />
|prep nom adj deta np <br />
|Undocumented rule!<br />
|-<br />
!65<br />
|prep nom adj deta nom <br />
|Undocumented rule!<br />
|-<br />
!66<br />
|catverb nom adj deta nom <br />
|Undocumented rule!<br />
|-<br />
!67<br />
|nom deta nom <br />
|Undocumented rule!<br />
|-<br />
!68<br />
|nom det nom <br />
|Undocumented rule!<br />
|-<br />
!69<br />
|nom deta detind nom <br />
|Undocumented rule!<br />
|-<br />
!70<br />
|det nom deta nom <br />
|Undocumented rule!<br />
|-<br />
!71<br />
|det adj nom deta nom <br />
|Undocumented rule!<br />
|-<br />
!72<br />
|nom adv adj <br />
|Undocumented rule!<br />
|-<br />
!73<br />
|detcel advmai adj <br />
|Undocumented rule!<br />
|-<br />
!74<br />
|advnespus prepde adj <br />
|Undocumented rule!<br />
|-<br />
!75<br />
|prep prepa prnpro catverb <br />
|Undocumented rule!<br />
|-<br />
!76<br />
|prep prepa catverb <br />
|Undocumented rule!<br />
|-<br />
!77<br />
|nom detcel adj <br />
|Undocumented rule!<br />
|-<br />
!78<br />
|det nom adj <br />
|Undocumented rule!<br />
|-<br />
!79<br />
|det adj nom <br />
|Undocumented rule!<br />
|-<br />
!80<br />
|det nom <br />
|Undocumented rule!<br />
|-<br />
!81<br />
|nom nom <br />
|Undocumented rule!<br />
|-<br />
!82<br />
|advnu advmai catverb <br />
|Undocumented rule!<br />
|-<br />
!83<br />
|vrea verbtrebui conjsub catverb <br />
|Undocumented rule!<br />
|-<br />
!84<br />
|conjnici advnu catverb <br />
|Undocumented rule!<br />
|-<br />
!85<br />
|advnici- advnu catverb <br />
|Undocumented rule!<br />
|-<br />
!86<br />
|advnu catverb advdecât <br />
|Undocumented rule!<br />
|-<br />
!87<br />
|vbaveacni verbtrebui conjsub catverb <br />
|Undocumented rule!<br />
|-<br />
!88<br />
|vbaveapri verbtrebui conjsub catverb <br />
|Undocumented rule!<br />
|-<br />
!89<br />
|catverb nom <br />
|Undocumented rule!<br />
|-<br />
!90<br />
|catverb nom adj <br />
|Undocumented rule!<br />
|-<br />
!91<br />
|catverb conjsub catverb <br />
|Undocumented rule!<br />
|-<br />
!92<br />
|adv adj nom <br />
|Undocumented rule!<br />
|-<br />
!93<br />
|num adj nom <br />
|Undocumented rule!<br />
|-<br />
!94<br />
|num nom adj <br />
|Undocumented rule!<br />
|-<br />
!95<br />
|catverb detcel num nom <br />
|Undocumented rule!<br />
|-<br />
!96<br />
|detcel num nom <br />
|Undocumented rule!<br />
|-<br />
!97<br />
|det nom <br />
|Undocumented rule!<br />
|-<br />
!98<br />
|vbaveacni catverb pp <br />
|Undocumented rule!<br />
|-<br />
!99<br />
|catverb prep nom <br />
|Undocumented rule!<br />
|-<br />
!100<br />
|vrea catverb prep prn <br />
|Undocumented rule!<br />
|-<br />
!101<br />
|vrea catverb prep nom <br />
|Undocumented rule!<br />
|-<br />
!102<br />
|vbaveacni catverb prep nom <br />
|Undocumented rule!<br />
|-<br />
!103<br />
|prepde pp <br />
|Undocumented rule!<br />
|-<br />
!104<br />
|catverb prnenc <br />
|Undocumented rule!<br />
|-<br />
!105<br />
|catverb prnenc prnenc <br />
|Undocumented rule!<br />
|-<br />
!106<br />
|prnproref verbospron <br />
|Undocumented rule!<br />
|-<br />
!107<br />
|advca nom <br />
|Undocumented rule!<br />
|-<br />
!108<br />
|advca det nom <br />
|Undocumented rule!<br />
|-<br />
!109<br />
|adv adj advca <br />
|Undocumented rule!<br />
|-<br />
!110<br />
|nom nptop <br />
|Undocumented rule!<br />
|-<br />
!111<br />
|prep nom nptop <br />
|Undocumented rule!<br />
|-<br />
!112<br />
|prepdintr- detind <br />
|Undocumented rule!<br />
|}<br />
<br />
=== Interchunk (T2X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|SN <br />
|REGLA: SN<br />
|}<br />
<br />
=== Postchunk (T3X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|sent <br />
|CHUNK:<br />
|}<br />
<br />
== Catalan → Romanian ==<br />
<br />
=== Chunk (T1X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Macro name<br />
! style="width: 10%" | Arguments<br />
! style="width: 60%" | Description<br />
|-<br />
!1<br />
|conj_verb1<br />
| style="text-align:center" |1<br />
|Conjuga un verb simple<br />
|}<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|nom <br />
|casa > casă<br />
|-<br />
!2<br />
|artdef nom <br />
|la casa > casa<br />
|-<br />
!3<br />
|np <br />
|REGLA: NOM PROPI<br />
|-<br />
!4<br />
|adj <br />
|REGLA: ADJ<br />
|-<br />
!5<br />
|pp <br />
|REGLA: PP<br />
|-<br />
!6<br />
|vbcnj <br />
|REGLA: VERB CONJUGAT<br />
|-<br />
!7<br />
|inf <br />
|REGLA: INF<br />
|-<br />
!8<br />
|ger <br />
|REGLA: GER<br />
|-<br />
!9<br />
|vbhaver <br />
|REGLA: VBHAVER (per si apareix sol)<br />
|-<br />
!10<br />
|sent <br />
|Resets variables.<br />
|}<br />
<br />
=== Interchunk (T2X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|SN <br />
|REGLA: SN<br />
|-<br />
!2<br />
|SA <br />
|REGLA: SA<br />
|}<br />
<br />
=== Postchunk (T3X) ===<br />
<br />
{|class="wikitable"<br />
! style="width: 5%" | #<br />
! style="width: 25%" | Rule pattern<br />
! style="width: 70%" | Description<br />
|-<br />
!1<br />
|sent <br />
|CHUNK:<br />
|}<br />
<br />
<br />
[[Category:Romanian and Catalan|*]]</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Workplan&diff=67111
Romanian and Catalan/Workplan
2018-06-04T08:29:16Z
<p>Marcriera: </p>
<hr />
<div>You can find the detailed goals for each week [[User:Marcriera/Proposal2018#Workplan|here]].<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Work done<br />
! style="width: 5%" | Bidix<br />
! style="width: 15%" | WER / PER<br />
! style="width: 10%" | Coverage<br />
|-<br />
! Post-application period<br />
| style="text-align:center" | 28 March - 13 May<br />
|<br />
* Build frequency lists for Romanian and Catalan<br />
* Fix broken bidix entries<br />
* Improve testvoc scripts<br />
| style="text-align:center" | 12,819<br />
| style="text-align:center" | ron > cat (~36%)<br>cat > ron (~61%)<br />
| style="text-align:center" | ron (79%)<br>cat (78%)<br />
|-<br />
! 1<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Fix transfer rules that didn't work as expected<br />
| style="text-align:center" | 14,890 (~15,000)<br />
| style="text-align:center" | ron > cat ~33% (~34%)<br>cat > ron ~56% (~60%)<br />
| style="text-align:center" | ron 82.2% (80.1%)<br>cat 82.7% (79.1%)<br />
|-<br />
! 2<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries (adj clean)<br />
* Rewrite ron-cat transfer rules to use chunking<br />
* Improve freqlist generation script (explains the leap in coverage)<br />
| style="text-align:center" | 17,026 (~17,000)<br />
| style="text-align:center" | ron > cat ~31% (~32%)<br>cat > ron ~53% (~59%)<br />
| style="text-align:center" | ron 83.6% (81.1%)<br>cat 86.2% (80%)<br />
|-<br />
! 3<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Plan transfer rule changes<br />
| style="text-align:center" | 18,487 (~19,000)<br />
| style="text-align:center" | ron > cat ~31% (~30%)<br>cat > ron ~53% (~58%)<br />
| style="text-align:center" | ron 84.9% (81.9%)<br>cat 86.4% (80.9%)<br />
|-<br />
! 4<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
| style="text-align:center" | (~21,000)<br />
| style="text-align:center" | ron > cat (~28%)<br>cat > ron (~57%)<br />
| style="text-align:center" | ron (82.7%)<br>cat (81.7%)<br />
|-<br />
! 5<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
'''First evaluation'''<br />
| style="text-align:center" | (~23,000)<br />
| style="text-align:center" | ron > cat (~26%)<br>cat > ron (~56%)<br />
| style="text-align:center" | ron (83.4%)<br>cat (82.4%)<br />
|-<br />
! 6<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
| style="text-align:center" | (~25,000)<br />
| style="text-align:center" | ron > cat (~25%)<br>cat > ron (~53%)<br />
| style="text-align:center" | ron (84.1%)<br>cat (83%)<br />
|-<br />
! 7<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
| style="text-align:center" | (~27,000)<br />
| style="text-align:center" | ron > cat (~24%)<br>cat > ron (~50%)<br />
| style="text-align:center" | ron (84.7%)<br>cat (83.6%)<br />
|-<br />
! 8<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
| style="text-align:center" | (~29,000)<br />
| style="text-align:center" | ron > cat (~23%)<br>cat > ron (~47%)<br />
| style="text-align:center" | ron (85.3%)<br>cat (84.2%)<br />
|-<br />
! 9<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
'''Second evaluation'''<br />
| style="text-align:center" | (~31,000)<br />
| style="text-align:center" | ron > cat (~22%)<br>cat > ron (~45%)<br />
| style="text-align:center" | ron (85.8%)<br>cat (84.7%)<br />
|-<br />
! 10<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 11<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 12<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 13<br />
| style="text-align:center" | 6 August - 14 August<br />
|<br />
'''Final evaluation'''<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|}</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Workplan&diff=67110
Romanian and Catalan/Workplan
2018-06-04T08:28:14Z
<p>Marcriera: </p>
<hr />
<div>You can find the detailed goals for each week [[User:Marcriera/Proposal2018#Workplan|here]].<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Work done<br />
! style="width: 5%" | Bidix<br />
! style="width: 15%" | WER / PER<br />
! style="width: 10%" | Coverage<br />
|-<br />
! Post-application period<br />
| style="text-align:center" | 28 March - 13 May<br />
|<br />
* Build frequency lists for Romanian and Catalan<br />
* Fix broken bidix entries<br />
* Improve testvoc scripts<br />
| style="text-align:center" | 12,819<br />
| style="text-align:center" | ron > cat (~36%)<br>cat > ron (~61%)<br />
| style="text-align:center" | ron (79%)<br>cat (78%)<br />
|-<br />
! 1<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Fix transfer rules that didn't work as expected<br />
| style="text-align:center" | 14,890 (~15,000)<br />
| style="text-align:center" | ron > cat ~33% (~34%)<br>cat > ron ~56% (~60%)<br />
| style="text-align:center" | ron 82.2% (80.1%)<br>cat 82.7% (79.1%)<br />
|-<br />
! 2<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries (adj clean)<br />
* Rewrite ron-cat transfer rules to use chunking<br />
* Improve freqlist generation script (explains the leap in coverage)<br />
| style="text-align:center" | 17,026 (~17,000)<br />
| style="text-align:center" | ron > cat ~31% (~32%)<br>cat > ron ~53% (~59%)<br />
| style="text-align:center" | ron 83.6% (81.1%)<br>cat 86.2% (80%)<br />
|-<br />
! 3<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Plan transfer rule changes<br />
| style="text-align:center" | (~19,000)<br />
| style="text-align:center" | ron > cat ~31% (~30%)<br>cat > ron ~53% (~58%)<br />
| style="text-align:center" | ron 84.9% (81.9%)<br>cat 86.4% (80.9%)<br />
|-<br />
! 4<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
| style="text-align:center" | (~21,000)<br />
| style="text-align:center" | ron > cat (~28%)<br>cat > ron (~57%)<br />
| style="text-align:center" | ron (82.7%)<br>cat (81.7%)<br />
|-<br />
! 5<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
'''First evaluation'''<br />
| style="text-align:center" | (~23,000)<br />
| style="text-align:center" | ron > cat (~26%)<br>cat > ron (~56%)<br />
| style="text-align:center" | ron (83.4%)<br>cat (82.4%)<br />
|-<br />
! 6<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
| style="text-align:center" | (~25,000)<br />
| style="text-align:center" | ron > cat (~25%)<br>cat > ron (~53%)<br />
| style="text-align:center" | ron (84.1%)<br>cat (83%)<br />
|-<br />
! 7<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
| style="text-align:center" | (~27,000)<br />
| style="text-align:center" | ron > cat (~24%)<br>cat > ron (~50%)<br />
| style="text-align:center" | ron (84.7%)<br>cat (83.6%)<br />
|-<br />
! 8<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
| style="text-align:center" | (~29,000)<br />
| style="text-align:center" | ron > cat (~23%)<br>cat > ron (~47%)<br />
| style="text-align:center" | ron (85.3%)<br>cat (84.2%)<br />
|-<br />
! 9<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
'''Second evaluation'''<br />
| style="text-align:center" | (~31,000)<br />
| style="text-align:center" | ron > cat (~22%)<br>cat > ron (~45%)<br />
| style="text-align:center" | ron (85.8%)<br>cat (84.7%)<br />
|-<br />
! 10<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 11<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 12<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 13<br />
| style="text-align:center" | 6 August - 14 August<br />
|<br />
'''Final evaluation'''<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|}</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Workplan&diff=67077
Romanian and Catalan/Workplan
2018-05-27T21:47:47Z
<p>Marcriera: </p>
<hr />
<div>You can find the detailed goals for each week [[User:Marcriera/Proposal2018#Workplan|here]].<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Work done<br />
! style="width: 5%" | Bidix<br />
! style="width: 15%" | WER / PER<br />
! style="width: 10%" | Coverage<br />
|-<br />
! Post-application period<br />
| style="text-align:center" | 28 March - 13 May<br />
|<br />
* Build frequency lists for Romanian and Catalan<br />
* Fix broken bidix entries<br />
* Improve testvoc scripts<br />
| style="text-align:center" | 12,819<br />
| style="text-align:center" | ron > cat (~36%)<br>cat > ron (~61%)<br />
| style="text-align:center" | ron (79%)<br>cat (78%)<br />
|-<br />
! 1<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Fix transfer rules that didn't work as expected<br />
| style="text-align:center" | 14,890 (~15,000)<br />
| style="text-align:center" | ron > cat ~33% (~34%)<br>cat > ron ~56% (~60%)<br />
| style="text-align:center" | ron 82.2% (80.1%)<br>cat 82.7% (79.1%)<br />
|-<br />
! 2<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries (adj clean)<br />
* Rewrite ron-cat transfer rules to use chunking<br />
* Improve freqlist generation script (explains the leap in coverage)<br />
| style="text-align:center" | 17,026 (~17,000)<br />
| style="text-align:center" | ron > cat ~31% (~32%)<br>cat > ron ~53% (~59%)<br />
| style="text-align:center" | ron 83.6% (81.1%)<br>cat 86.2% (80%)<br />
|-<br />
! 3<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
| style="text-align:center" | (~19,000)<br />
| style="text-align:center" | ron > cat (~30%)<br>cat > ron (~58%)<br />
| style="text-align:center" | ron (81.9%)<br>cat (80.9%)<br />
|-<br />
! 4<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
| style="text-align:center" | (~21,000)<br />
| style="text-align:center" | ron > cat (~28%)<br>cat > ron (~57%)<br />
| style="text-align:center" | ron (82.7%)<br>cat (81.7%)<br />
|-<br />
! 5<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
'''First evaluation'''<br />
| style="text-align:center" | (~23,000)<br />
| style="text-align:center" | ron > cat (~26%)<br>cat > ron (~56%)<br />
| style="text-align:center" | ron (83.4%)<br>cat (82.4%)<br />
|-<br />
! 6<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
| style="text-align:center" | (~25,000)<br />
| style="text-align:center" | ron > cat (~25%)<br>cat > ron (~53%)<br />
| style="text-align:center" | ron (84.1%)<br>cat (83%)<br />
|-<br />
! 7<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
| style="text-align:center" | (~27,000)<br />
| style="text-align:center" | ron > cat (~24%)<br>cat > ron (~50%)<br />
| style="text-align:center" | ron (84.7%)<br>cat (83.6%)<br />
|-<br />
! 8<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
| style="text-align:center" | (~29,000)<br />
| style="text-align:center" | ron > cat (~23%)<br>cat > ron (~47%)<br />
| style="text-align:center" | ron (85.3%)<br>cat (84.2%)<br />
|-<br />
! 9<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
'''Second evaluation'''<br />
| style="text-align:center" | (~31,000)<br />
| style="text-align:center" | ron > cat (~22%)<br>cat > ron (~45%)<br />
| style="text-align:center" | ron (85.8%)<br>cat (84.7%)<br />
|-<br />
! 10<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 11<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 12<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 13<br />
| style="text-align:center" | 6 August - 14 August<br />
|<br />
'''Final evaluation'''<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|}</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Workplan&diff=67071
Romanian and Catalan/Workplan
2018-05-26T21:50:25Z
<p>Marcriera: </p>
<hr />
<div>You can find the detailed goals for each week [[User:Marcriera/Proposal2018#Workplan|here]].<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Work done<br />
! style="width: 5%" | Bidix<br />
! style="width: 15%" | WER / PER<br />
! style="width: 10%" | Coverage<br />
|-<br />
! Post-application period<br />
| style="text-align:center" | 28 March - 13 May<br />
|<br />
* Build frequency lists for Romanian and Catalan<br />
* Fix broken bidix entries<br />
* Improve testvoc scripts<br />
| style="text-align:center" | 12,819<br />
| style="text-align:center" | ron > cat (~36%)<br>cat > ron (~61%)<br />
| style="text-align:center" | ron (79%)<br>cat (78%)<br />
|-<br />
! 1<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Fix transfer rules that didn't work as expected<br />
| style="text-align:center" | 14,890 (~15,000)<br />
| style="text-align:center" | ron > cat ~33% (~34%)<br>cat > ron ~56% (~60%)<br />
| style="text-align:center" | ron 82.2% (80.1%)<br>cat 82.7% (79.1%)<br />
|-<br />
! 2<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries (adj clean)<br />
* Rewrite ron-cat transfer rules to use chunking<br />
* Improve freqlist generation script (explains the leap in coverage)<br />
| style="text-align:center" | 17,026 (~17,000)<br />
| style="text-align:center" | ron > cat (~32%)<br>cat > ron (~59%)<br />
| style="text-align:center" | ron 83.6% (81.1%)<br>cat 86.2% (80%)<br />
|-<br />
! 3<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
| style="text-align:center" | (~19,000)<br />
| style="text-align:center" | ron > cat (~30%)<br>cat > ron (~58%)<br />
| style="text-align:center" | ron (81.9%)<br>cat (80.9%)<br />
|-<br />
! 4<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
| style="text-align:center" | (~21,000)<br />
| style="text-align:center" | ron > cat (~28%)<br>cat > ron (~57%)<br />
| style="text-align:center" | ron (82.7%)<br>cat (81.7%)<br />
|-<br />
! 5<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
'''First evaluation'''<br />
| style="text-align:center" | (~23,000)<br />
| style="text-align:center" | ron > cat (~26%)<br>cat > ron (~56%)<br />
| style="text-align:center" | ron (83.4%)<br>cat (82.4%)<br />
|-<br />
! 6<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
| style="text-align:center" | (~25,000)<br />
| style="text-align:center" | ron > cat (~25%)<br>cat > ron (~53%)<br />
| style="text-align:center" | ron (84.1%)<br>cat (83%)<br />
|-<br />
! 7<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
| style="text-align:center" | (~27,000)<br />
| style="text-align:center" | ron > cat (~24%)<br>cat > ron (~50%)<br />
| style="text-align:center" | ron (84.7%)<br>cat (83.6%)<br />
|-<br />
! 8<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
| style="text-align:center" | (~29,000)<br />
| style="text-align:center" | ron > cat (~23%)<br>cat > ron (~47%)<br />
| style="text-align:center" | ron (85.3%)<br>cat (84.2%)<br />
|-<br />
! 9<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
'''Second evaluation'''<br />
| style="text-align:center" | (~31,000)<br />
| style="text-align:center" | ron > cat (~22%)<br>cat > ron (~45%)<br />
| style="text-align:center" | ron (85.8%)<br>cat (84.7%)<br />
|-<br />
! 10<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 11<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 12<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 13<br />
| style="text-align:center" | 6 August - 14 August<br />
|<br />
'''Final evaluation'''<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|}</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Workplan&diff=67068
Romanian and Catalan/Workplan
2018-05-25T14:51:12Z
<p>Marcriera: </p>
<hr />
<div>You can find the detailed goals for each week [[User:Marcriera/Proposal2018#Workplan|here]].<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Work done<br />
! style="width: 5%" | Bidix<br />
! style="width: 15%" | WER / PER<br />
! style="width: 10%" | Coverage<br />
|-<br />
! Post-application period<br />
| style="text-align:center" | 28 March - 13 May<br />
|<br />
* Build frequency lists for Romanian and Catalan<br />
* Fix broken bidix entries<br />
* Improve testvoc scripts<br />
| style="text-align:center" | 12,819<br />
| style="text-align:center" | ron > cat (~36%)<br>cat > ron (~61%)<br />
| style="text-align:center" | ron (79%)<br>cat (78%)<br />
|-<br />
! 1<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Fix transfer rules that didn't work as expected<br />
| style="text-align:center" | 14,890 (~15,000)<br />
| style="text-align:center" | ron > cat ~33% (~34%)<br>cat > ron ~56% (~60%)<br />
| style="text-align:center" | ron 82.2% (80.1%)<br>cat 82.7% (79.1%)<br />
|-<br />
! 2<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
| style="text-align:center" | 17,026 (~17,000)<br />
| style="text-align:center" | ron > cat (~32%)<br>cat > ron (~59%)<br />
| style="text-align:center" | ron (81.1%)<br>cat (80%)<br />
|-<br />
! 3<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
| style="text-align:center" | (~19,000)<br />
| style="text-align:center" | ron > cat (~30%)<br>cat > ron (~58%)<br />
| style="text-align:center" | ron (81.9%)<br>cat (80.9%)<br />
|-<br />
! 4<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
| style="text-align:center" | (~21,000)<br />
| style="text-align:center" | ron > cat (~28%)<br>cat > ron (~57%)<br />
| style="text-align:center" | ron (82.7%)<br>cat (81.7%)<br />
|-<br />
! 5<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
'''First evaluation'''<br />
| style="text-align:center" | (~23,000)<br />
| style="text-align:center" | ron > cat (~26%)<br>cat > ron (~56%)<br />
| style="text-align:center" | ron (83.4%)<br>cat (82.4%)<br />
|-<br />
! 6<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
| style="text-align:center" | (~25,000)<br />
| style="text-align:center" | ron > cat (~25%)<br>cat > ron (~53%)<br />
| style="text-align:center" | ron (84.1%)<br>cat (83%)<br />
|-<br />
! 7<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
| style="text-align:center" | (~27,000)<br />
| style="text-align:center" | ron > cat (~24%)<br>cat > ron (~50%)<br />
| style="text-align:center" | ron (84.7%)<br>cat (83.6%)<br />
|-<br />
! 8<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
| style="text-align:center" | (~29,000)<br />
| style="text-align:center" | ron > cat (~23%)<br>cat > ron (~47%)<br />
| style="text-align:center" | ron (85.3%)<br>cat (84.2%)<br />
|-<br />
! 9<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
'''Second evaluation'''<br />
| style="text-align:center" | (~31,000)<br />
| style="text-align:center" | ron > cat (~22%)<br>cat > ron (~45%)<br />
| style="text-align:center" | ron (85.8%)<br>cat (84.7%)<br />
|-<br />
! 10<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 11<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 12<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 13<br />
| style="text-align:center" | 6 August - 14 August<br />
|<br />
'''Final evaluation'''<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|}</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Catalan&diff=67063
Catalan
2018-05-24T15:08:57Z
<p>Marcriera: /* Proper nouns */</p>
<hr />
<div>'''Catalan''' ([[Wikipedia:Catalan language]]) is a [[Romance language]]. It is available in Apertium as a standalone analyser/generator ([[Catalan#Apertium-cat|apertium-cat]]) and as a component of several pairs which translate to/from Catalan.<br />
<br />
== Language pairs ==<br />
''See also: [[List of language pairs]]''<br />
<br />
In [[trunk]]:<br />
<br />
{|class=wikitable<br />
! width=200 | Pair name !! width=350 | Languages !! width=130 | Last update<br />
|-<br />
| align=center | <code>[[apertium-arg-cat]]</code> || align=center | Aragonese <-> Catalan || align=center | 17 Aug 2016<br />
|-<br />
| align=center | <code>[[apertium-ca-it]]</code> || align=center | Catalan <-> Italian || align=center | 12 Oct 2014<br />
|-<br />
| align=center | <code>[[apertium-en-ca]]</code> || align=center | English <-> Catalan || align=center | 28 Mar 2016<br />
|-<br />
| align=center | <code>[[apertium-eo-ca]]</code> || align=center | Esperanto <-- Catalan || align=center | 13 Dec 2015<br />
|-<br />
| align=center | <code>[[apertium-fra-cat]]</code> || align=center | French <-> Catalan || align=center | 18 Apr 2017<br />
|-<br />
| align=center | <code>[[apertium-oc-ca]]</code> || align=center | Occitan <-> Catalan || align=center | 13 Dec 2015<br />
|-<br />
| align=center | <code>[[apertium-pt-ca]]</code> || align=center | Portuguese <-> Catalan || align=center | 13 Dec 2015<br />
|-<br />
| align=center | <code>[[apertium-spa-cat]]</code> || align=center | Spanish <-> Catalan || align=center | 01 Apr 2017<br />
|-<br />
|}<br />
<br />
In [[staging]]:<br />
<br />
{|class=wikitable<br />
! width=200 | Pair name !! width=350 | Languages !! width=130 | Last update<br />
|-<br />
| align=center | <code>[[apertium-cat-glg]]</code> || align=center | Catalan <-> Galician || align=center | 18 Nov 2016<br />
|-<br />
| align=center | <code>[[apertium-cat-srd]]</code> || align=center | Catalan <-> Sardinian || align=center | 03 Apr 2017<br />
|-<br />
|}<br />
<br />
In [[nursery]]:<br />
<br />
{|class=wikitable<br />
! width=200 | Pair name !! width=350 | Languages !! width=130 | Last update<br />
|-<br />
| align=center | <code>[[apertium-ca-ro]]</code> || align=center | Catalan <-> Romanian || align=center | 30 Sep 2015<br />
|-<br />
|}<br />
<br />
In [[incubator]]:<br />
<br />
{|class=wikitable<br />
! width=200 | Pair name !! width=350 | Languages !! width=130 | Last update<br />
|-<br />
| align=center | <code>[[apertium-cat-cos]]</code> || align=center | Catalan <-> Corsican || align=center | 05 Nov 2013<br />
|-<br />
| align=center | <code>[[apertium-cat-ina]]</code> || align=center | Catalan <-> Interlingua || align=center | 07 Jan 2016<br />
|-<br />
| align=center | <code>[[apertium-eng-cat]]</code> || align=center | English <-> Catalan || align=center | 24 Jan 2016<br />
|-<br />
| align=center | <code>[[apertium-por-cat]]</code> || align=center | Portuguese <-> Catalan || align=center | 22 Jan 2016<br />
|-<br />
|}<br />
<br />
== Apertium-cat ==<br />
<br />
===Current status===<br />
<br />
''Last update: 28 Aug 2017''<br />
<br />
'''Dix entries:''' 56,588<br />
<br />
'''Dix paradigms:''' 607<br />
<br />
'''Coverage:''' 94.04% (Wikipedia)<br />
<br />
===Dictionary guidelines===<br />
<br />
The current Catalan dictionary is quite big (more than 55,000 entries), so tidiness is essential to ensure future development:<br />
<br />
* Keep entries sorted alphabetically.<br />
* Keep entries grouped by type and tags (do not mix different types of proper nouns together).<br />
* Check the file with apertium-dixtools (to update the number of entries and remove duplicates).<br />
<br />
====Proper nouns====<br />
<br />
Catalan proper nouns (names, toponyms, acronyms, etc.) should all have gender and number. They were once removed, but they should be specified using the following paradigms:<br />
<br />
* Toponyms <np><top><m><sg>: Iran__np<br />
* Toponyms <np><top><f><sg>: Àfrica__np<br />
* Toponyms <np><top><m><pl>: Estats_Units__np<br />
* Toponyms <np><top><f><pl>: Balears__np<br />
<br />
* Antroponyms <np><ant><m><sg>: Marc__np<br />
* Antroponyms <np><ant><f><sg>: Maria__np<br />
* Family names <np><cog><mf><sp>: Saussure__np<br />
<br />
* Others <np><al><m><sg>: Linux__np<br />
* Others <np><al><f><sg>: Wikipedia__np<br />
* Others <np><al><m><pl>: Jocs_Olímpics__np<br />
* Others <np><al><f><pl>: Falles__np<br />
* Others <np><al><mf><sp>: Honda__np<br />
<br />
===Future work===<br />
<br />
* Add support for proper noun articles (en/na).<br />
* Restore gender and number to proper nouns that still do not have them.<br />
* Tweak entries related to proper nouns with translations (kings, queens, etc.).<br />
<br />
<br />
''For further documentation about Catalan in Apertium, check: [[:Category:Catalan]]''<br />
<br />
[[Category:Catalan]]<br />
[[Category:Languages]]<br />
[[Category:Romance languages]]<br />
[[Category:Documentation in English]]</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Workplan&diff=67050
Romanian and Catalan/Workplan
2018-05-21T11:18:36Z
<p>Marcriera: </p>
<hr />
<div>You can find the detailed goals for each week [[User:Marcriera/Proposal2018#Workplan|here]].<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Work done<br />
! style="width: 5%" | Bidix<br />
! style="width: 15%" | WER / PER<br />
! style="width: 10%" | Coverage<br />
|-<br />
! Post-application period<br />
| style="text-align:center" | 28 March - 13 May<br />
|<br />
* Build frequency lists for Romanian and Catalan<br />
* Fix broken bidix entries<br />
* Improve testvoc scripts<br />
| style="text-align:center" | 12,819<br />
| style="text-align:center" | ron > cat (~36%)<br>cat > ron (~61%)<br />
| style="text-align:center" | ron (79%)<br>cat (78%)<br />
|-<br />
! 1<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Fix transfer rules that didn't work as expected<br />
| style="text-align:center" | 14,890 (~15,000)<br />
| style="text-align:center" | ron > cat ~33% (~34%)<br>cat > ron ~56% (~60%)<br />
| style="text-align:center" | ron 82.2% (80.1%)<br>cat 82.7% (79.1%)<br />
|-<br />
! 2<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
| style="text-align:center" | (~17,000)<br />
| style="text-align:center" | ron > cat (~32%)<br>cat > ron (~59%)<br />
| style="text-align:center" | ron (81.1%)<br>cat (80%)<br />
|-<br />
! 3<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
| style="text-align:center" | (~19,000)<br />
| style="text-align:center" | ron > cat (~30%)<br>cat > ron (~58%)<br />
| style="text-align:center" | ron (81.9%)<br>cat (80.9%)<br />
|-<br />
! 4<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
| style="text-align:center" | (~21,000)<br />
| style="text-align:center" | ron > cat (~28%)<br>cat > ron (~57%)<br />
| style="text-align:center" | ron (82.7%)<br>cat (81.7%)<br />
|-<br />
! 5<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
'''First evaluation'''<br />
| style="text-align:center" | (~23,000)<br />
| style="text-align:center" | ron > cat (~26%)<br>cat > ron (~56%)<br />
| style="text-align:center" | ron (83.4%)<br>cat (82.4%)<br />
|-<br />
! 6<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
| style="text-align:center" | (~25,000)<br />
| style="text-align:center" | ron > cat (~25%)<br>cat > ron (~53%)<br />
| style="text-align:center" | ron (84.1%)<br>cat (83%)<br />
|-<br />
! 7<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
| style="text-align:center" | (~27,000)<br />
| style="text-align:center" | ron > cat (~24%)<br>cat > ron (~50%)<br />
| style="text-align:center" | ron (84.7%)<br>cat (83.6%)<br />
|-<br />
! 8<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
| style="text-align:center" | (~29,000)<br />
| style="text-align:center" | ron > cat (~23%)<br>cat > ron (~47%)<br />
| style="text-align:center" | ron (85.3%)<br>cat (84.2%)<br />
|-<br />
! 9<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
'''Second evaluation'''<br />
| style="text-align:center" | (~31,000)<br />
| style="text-align:center" | ron > cat (~22%)<br>cat > ron (~45%)<br />
| style="text-align:center" | ron (85.8%)<br>cat (84.7%)<br />
|-<br />
! 10<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 11<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 12<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 13<br />
| style="text-align:center" | 6 August - 14 August<br />
|<br />
'''Final evaluation'''<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|}</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Workplan&diff=67048
Romanian and Catalan/Workplan
2018-05-21T08:03:26Z
<p>Marcriera: </p>
<hr />
<div>You can find the detailed goals for each week [[User:Marcriera/Proposal2018#Workplan|here]].<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Work done<br />
! style="width: 5%" | Bidix<br />
! style="width: 15%" | WER / PER<br />
! style="width: 10%" | Coverage<br />
|-<br />
! Post-application period<br />
| style="text-align:center" | 28 March - 13 May<br />
|<br />
* Build frequency lists for Romanian and Catalan<br />
* Fix broken bidix entries<br />
* Improve testvoc scripts<br />
| style="text-align:center" | 12,819<br />
| style="text-align:center" | ron > cat (~36%)<br>cat > ron (~61%)<br />
| style="text-align:center" | ron (79%)<br>cat (78%)<br />
|-<br />
! 1<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new entries to dictionaries<br />
* Fix broken bidix entries<br />
* Fix transfer rules that didn't work as expected<br />
| style="text-align:center" | 14,890 (~15,000)<br />
| style="text-align:center" | ron > cat (~34%)<br>cat > ron (~60%)<br />
| style="text-align:center" | ron 82.2% (80.1%)<br>cat 82.7% (79.1%)<br />
|-<br />
! 2<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
| style="text-align:center" | (~17,000)<br />
| style="text-align:center" | ron > cat (~32%)<br>cat > ron (~59%)<br />
| style="text-align:center" | ron (81.1%)<br>cat (80%)<br />
|-<br />
! 3<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
| style="text-align:center" | (~19,000)<br />
| style="text-align:center" | ron > cat (~30%)<br>cat > ron (~58%)<br />
| style="text-align:center" | ron (81.9%)<br>cat (80.9%)<br />
|-<br />
! 4<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
| style="text-align:center" | (~21,000)<br />
| style="text-align:center" | ron > cat (~28%)<br>cat > ron (~57%)<br />
| style="text-align:center" | ron (82.7%)<br>cat (81.7%)<br />
|-<br />
! 5<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
'''First evaluation'''<br />
| style="text-align:center" | (~23,000)<br />
| style="text-align:center" | ron > cat (~26%)<br>cat > ron (~56%)<br />
| style="text-align:center" | ron (83.4%)<br>cat (82.4%)<br />
|-<br />
! 6<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
| style="text-align:center" | (~25,000)<br />
| style="text-align:center" | ron > cat (~25%)<br>cat > ron (~53%)<br />
| style="text-align:center" | ron (84.1%)<br>cat (83%)<br />
|-<br />
! 7<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
| style="text-align:center" | (~27,000)<br />
| style="text-align:center" | ron > cat (~24%)<br>cat > ron (~50%)<br />
| style="text-align:center" | ron (84.7%)<br>cat (83.6%)<br />
|-<br />
! 8<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
| style="text-align:center" | (~29,000)<br />
| style="text-align:center" | ron > cat (~23%)<br>cat > ron (~47%)<br />
| style="text-align:center" | ron (85.3%)<br>cat (84.2%)<br />
|-<br />
! 9<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
'''Second evaluation'''<br />
| style="text-align:center" | (~31,000)<br />
| style="text-align:center" | ron > cat (~22%)<br>cat > ron (~45%)<br />
| style="text-align:center" | ron (85.8%)<br>cat (84.7%)<br />
|-<br />
! 10<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 11<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 12<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 13<br />
| style="text-align:center" | 6 August - 14 August<br />
|<br />
'''Final evaluation'''<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|}</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Workplan&diff=67038
Romanian and Catalan/Workplan
2018-05-19T23:10:49Z
<p>Marcriera: </p>
<hr />
<div>You can find the detailed goals for each week [[User:Marcriera/Proposal2018#Workplan|here]].<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Work done<br />
! style="width: 5%" | Bidix<br />
! style="width: 15%" | WER / PER<br />
! style="width: 10%" | Coverage<br />
|-<br />
! Post-application period<br />
| style="text-align:center" | 28 March - 13 May<br />
|<br />
* Build frequency lists for Romanian and Catalan<br />
* Fix broken bidix entries<br />
* Improve testvoc scripts<br />
| style="text-align:center" | 12,819<br />
| style="text-align:center" | ron > cat (~36%)<br>cat > ron (~61%)<br />
| style="text-align:center" | ron (79%)<br>cat (78%)<br />
|-<br />
! 1<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
| style="text-align:center" | 14,890 (~15,000)<br />
| style="text-align:center" | ron > cat (~34%)<br>cat > ron (~60%)<br />
| style="text-align:center" | ron 82.2% (80.1%)<br>cat 82.7% (79.1%)<br />
|-<br />
! 2<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
| style="text-align:center" | (~17,000)<br />
| style="text-align:center" | ron > cat (~32%)<br>cat > ron (~59%)<br />
| style="text-align:center" | ron (81.1%)<br>cat (80%)<br />
|-<br />
! 3<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
| style="text-align:center" | (~19,000)<br />
| style="text-align:center" | ron > cat (~30%)<br>cat > ron (~58%)<br />
| style="text-align:center" | ron (81.9%)<br>cat (80.9%)<br />
|-<br />
! 4<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
| style="text-align:center" | (~21,000)<br />
| style="text-align:center" | ron > cat (~28%)<br>cat > ron (~57%)<br />
| style="text-align:center" | ron (82.7%)<br>cat (81.7%)<br />
|-<br />
! 5<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
'''First evaluation'''<br />
| style="text-align:center" | (~23,000)<br />
| style="text-align:center" | ron > cat (~26%)<br>cat > ron (~56%)<br />
| style="text-align:center" | ron (83.4%)<br>cat (82.4%)<br />
|-<br />
! 6<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
| style="text-align:center" | (~25,000)<br />
| style="text-align:center" | ron > cat (~25%)<br>cat > ron (~53%)<br />
| style="text-align:center" | ron (84.1%)<br>cat (83%)<br />
|-<br />
! 7<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
| style="text-align:center" | (~27,000)<br />
| style="text-align:center" | ron > cat (~24%)<br>cat > ron (~50%)<br />
| style="text-align:center" | ron (84.7%)<br>cat (83.6%)<br />
|-<br />
! 8<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
| style="text-align:center" | (~29,000)<br />
| style="text-align:center" | ron > cat (~23%)<br>cat > ron (~47%)<br />
| style="text-align:center" | ron (85.3%)<br>cat (84.2%)<br />
|-<br />
! 9<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
'''Second evaluation'''<br />
| style="text-align:center" | (~31,000)<br />
| style="text-align:center" | ron > cat (~22%)<br>cat > ron (~45%)<br />
| style="text-align:center" | ron (85.8%)<br>cat (84.7%)<br />
|-<br />
! 10<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 11<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 12<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 13<br />
| style="text-align:center" | 6 August - 14 August<br />
|<br />
'''Final evaluation'''<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|}</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan&diff=67019
Romanian and Catalan
2018-05-16T08:16:00Z
<p>Marcriera: </p>
<hr />
<div>This is a language pair translating between [[Romanian]] and [[Catalan]]. The pair is currently located in [https://github.com/apertium/apertium-ron-cat nursery].<br />
<br />
==Current status==<br />
<br />
''Last update: 15 May 2018''<br />
<br />
'''Bidix entries:''' 13,242<br />
<br />
'''Bidix paradigms:''' 7</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan&diff=67011
Romanian and Catalan
2018-05-14T22:28:31Z
<p>Marcriera: </p>
<hr />
<div>This is a language pair translating between [[Romanian]] and [[Catalan]]. The pair is currently located in [https://github.com/apertium/apertium-ron-cat incubator].<br />
<br />
==Current status==<br />
<br />
''Last update: 15 May 2018''<br />
<br />
'''Bidix entries:''' 13,242<br />
<br />
'''Bidix paradigms:''' 7</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Workplan&diff=66957
Romanian and Catalan/Workplan
2018-05-09T22:03:50Z
<p>Marcriera: </p>
<hr />
<div>You can find the detailed goals for each week [[User:Marcriera/Proposal2018#Workplan|here]].<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Work done<br />
! style="width: 5%" | Bidix<br />
! style="width: 15%" | WER / PER<br />
! style="width: 10%" | Coverage<br />
|-<br />
! Post-application period<br />
| style="text-align:center" | 28 March - 13 May<br />
|<br />
* Build frequency lists for Romanian and Catalan<br />
* Fix broken bidix entries<br />
* Improve testvoc scripts<br />
| style="text-align:center" | 12,819<br />
| style="text-align:center" | ron > cat (~36%)<br>cat > ron (~61%)<br />
| style="text-align:center" | ron (79%)<br>cat (78%)<br />
|-<br />
! 1<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
| style="text-align:center" | (~15,000)<br />
| style="text-align:center" | ron > cat (~34%)<br>cat > ron (~60%)<br />
| style="text-align:center" | ron (80.1%)<br>cat (79.1%)<br />
|-<br />
! 2<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
| style="text-align:center" | (~17,000)<br />
| style="text-align:center" | ron > cat (~32%)<br>cat > ron (~59%)<br />
| style="text-align:center" | ron (81.1%)<br>cat (80%)<br />
|-<br />
! 3<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
| style="text-align:center" | (~19,000)<br />
| style="text-align:center" | ron > cat (~30%)<br>cat > ron (~58%)<br />
| style="text-align:center" | ron (81.9%)<br>cat (80.9%)<br />
|-<br />
! 4<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
| style="text-align:center" | (~21,000)<br />
| style="text-align:center" | ron > cat (~28%)<br>cat > ron (~57%)<br />
| style="text-align:center" | ron (82.7%)<br>cat (81.7%)<br />
|-<br />
! 5<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
'''First evaluation'''<br />
| style="text-align:center" | (~23,000)<br />
| style="text-align:center" | ron > cat (~26%)<br>cat > ron (~56%)<br />
| style="text-align:center" | ron (83.4%)<br>cat (82.4%)<br />
|-<br />
! 6<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
| style="text-align:center" | (~25,000)<br />
| style="text-align:center" | ron > cat (~25%)<br>cat > ron (~53%)<br />
| style="text-align:center" | ron (84.1%)<br>cat (83%)<br />
|-<br />
! 7<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
| style="text-align:center" | (~27,000)<br />
| style="text-align:center" | ron > cat (~24%)<br>cat > ron (~50%)<br />
| style="text-align:center" | ron (84.7%)<br>cat (83.6%)<br />
|-<br />
! 8<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
| style="text-align:center" | (~29,000)<br />
| style="text-align:center" | ron > cat (~23%)<br>cat > ron (~47%)<br />
| style="text-align:center" | ron (85.3%)<br>cat (84.2%)<br />
|-<br />
! 9<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
'''Second evaluation'''<br />
| style="text-align:center" | (~31,000)<br />
| style="text-align:center" | ron > cat (~22%)<br>cat > ron (~45%)<br />
| style="text-align:center" | ron (85.8%)<br>cat (84.7%)<br />
|-<br />
! 10<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 11<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 12<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 13<br />
| style="text-align:center" | 6 August - 14 August<br />
|<br />
'''Final evaluation'''<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|}</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Workplan&diff=66956
Romanian and Catalan/Workplan
2018-05-09T22:02:04Z
<p>Marcriera: </p>
<hr />
<div>You can find the detailed goals for each week [[User:Marcriera/Proposal2018#Workplan|here]].<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Work done<br />
! style="width: 5%" | Bidix<br />
! style="width: 15%" | WER / PER<br />
! style="width: 10%" | Coverage<br />
|-<br />
! Post-application period<br />
| style="text-align:center" | 28 March - 13 May<br />
|<br />
* Build frequency lists for Romanian and Catalan<br />
* Fix broken bidix entries<br />
* Improve testvoc scripts<br />
| style="text-align:center" | 12,748<br />
| style="text-align:center" | ron > cat (~36%)<br>cat > ron (~61%)<br />
| style="text-align:center" | ron (79%)<br>cat (78%)<br />
|-<br />
! 1<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
| style="text-align:center" | (~15,000)<br />
| style="text-align:center" | ron > cat (~34%)<br>cat > ron (~60%)<br />
| style="text-align:center" | ron (80.1%)<br>cat (79.1%)<br />
|-<br />
! 2<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
| style="text-align:center" | (~17,000)<br />
| style="text-align:center" | ron > cat (~32%)<br>cat > ron (~59%)<br />
| style="text-align:center" | ron (81.1%)<br>cat (80%)<br />
|-<br />
! 3<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
| style="text-align:center" | (~19,000)<br />
| style="text-align:center" | ron > cat (~30%)<br>cat > ron (~58%)<br />
| style="text-align:center" | ron (81.9%)<br>cat (80.9%)<br />
|-<br />
! 4<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
| style="text-align:center" | (~21,000)<br />
| style="text-align:center" | ron > cat (~28%)<br>cat > ron (~57%)<br />
| style="text-align:center" | ron (82.7%)<br>cat (81.7%)<br />
|-<br />
! 5<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
'''First evaluation'''<br />
| style="text-align:center" | (~23,000)<br />
| style="text-align:center" | ron > cat (~26%)<br>cat > ron (~56%)<br />
| style="text-align:center" | ron (83.4%)<br>cat (82.4%)<br />
|-<br />
! 6<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
| style="text-align:center" | (~25,000)<br />
| style="text-align:center" | ron > cat (~25%)<br>cat > ron (~53%)<br />
| style="text-align:center" | ron (84.1%)<br>cat (83%)<br />
|-<br />
! 7<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
| style="text-align:center" | (~27,000)<br />
| style="text-align:center" | ron > cat (~24%)<br>cat > ron (~50%)<br />
| style="text-align:center" | ron (84.7%)<br>cat (83.6%)<br />
|-<br />
! 8<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
| style="text-align:center" | (~29,000)<br />
| style="text-align:center" | ron > cat (~23%)<br>cat > ron (~47%)<br />
| style="text-align:center" | ron (85.3%)<br>cat (84.2%)<br />
|-<br />
! 9<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
'''Second evaluation'''<br />
| style="text-align:center" | (~31,000)<br />
| style="text-align:center" | ron > cat (~22%)<br>cat > ron (~45%)<br />
| style="text-align:center" | ron (85.8%)<br>cat (84.7%)<br />
|-<br />
! 10<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 11<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 12<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 13<br />
| style="text-align:center" | 6 August - 14 August<br />
|<br />
'''Final evaluation'''<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|}</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Romanian_and_Catalan/Workplan&diff=66955
Romanian and Catalan/Workplan
2018-05-09T22:01:25Z
<p>Marcriera: Created page with "You can find the detailed goals for each week here. {|class="wikitable" ! style="width: 10%" | Week ! style="width: 15%" | Dates ! st..."</p>
<hr />
<div>You can find the detailed goals for each week [[User:Marcriera/Proposal2018#Workplan|here]].<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Goals<br />
! style="width: 5%" | Bidix<br />
! style="width: 15%" | WER / PER<br />
! style="width: 10%" | Coverage<br />
|-<br />
! Post-application period<br />
| style="text-align:center" | 28 March - 13 May<br />
|<br />
* Build frequency lists for Romanian and Catalan<br />
* Fix broken bidix entries<br />
* Improve testvoc scripts<br />
| style="text-align:center" | 12,748<br />
| style="text-align:center" | ron > cat (~36%)<br>cat > ron (~61%)<br />
| style="text-align:center" | ron (79%)<br>cat (78%)<br />
|-<br />
! 1<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
| style="text-align:center" | (~15,000)<br />
| style="text-align:center" | ron > cat (~34%)<br>cat > ron (~60%)<br />
| style="text-align:center" | ron (80.1%)<br>cat (79.1%)<br />
|-<br />
! 2<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
| style="text-align:center" | (~17,000)<br />
| style="text-align:center" | ron > cat (~32%)<br>cat > ron (~59%)<br />
| style="text-align:center" | ron (81.1%)<br>cat (80%)<br />
|-<br />
! 3<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
| style="text-align:center" | (~19,000)<br />
| style="text-align:center" | ron > cat (~30%)<br>cat > ron (~58%)<br />
| style="text-align:center" | ron (81.9%)<br>cat (80.9%)<br />
|-<br />
! 4<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
| style="text-align:center" | (~21,000)<br />
| style="text-align:center" | ron > cat (~28%)<br>cat > ron (~57%)<br />
| style="text-align:center" | ron (82.7%)<br>cat (81.7%)<br />
|-<br />
! 5<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
'''First evaluation'''<br />
| style="text-align:center" | (~23,000)<br />
| style="text-align:center" | ron > cat (~26%)<br>cat > ron (~56%)<br />
| style="text-align:center" | ron (83.4%)<br>cat (82.4%)<br />
|-<br />
! 6<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
| style="text-align:center" | (~25,000)<br />
| style="text-align:center" | ron > cat (~25%)<br>cat > ron (~53%)<br />
| style="text-align:center" | ron (84.1%)<br>cat (83%)<br />
|-<br />
! 7<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
| style="text-align:center" | (~27,000)<br />
| style="text-align:center" | ron > cat (~24%)<br>cat > ron (~50%)<br />
| style="text-align:center" | ron (84.7%)<br>cat (83.6%)<br />
|-<br />
! 8<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
| style="text-align:center" | (~29,000)<br />
| style="text-align:center" | ron > cat (~23%)<br>cat > ron (~47%)<br />
| style="text-align:center" | ron (85.3%)<br>cat (84.2%)<br />
|-<br />
! 9<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
'''Second evaluation'''<br />
| style="text-align:center" | (~31,000)<br />
| style="text-align:center" | ron > cat (~22%)<br>cat > ron (~45%)<br />
| style="text-align:center" | ron (85.8%)<br>cat (84.7%)<br />
|-<br />
! 10<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 11<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 12<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|-<br />
! 13<br />
| style="text-align:center" | 6 August - 14 August<br />
|<br />
'''Final evaluation'''<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
! style="text-align:center" |<br />
|}</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=English_and_Catalan&diff=66954
English and Catalan
2018-05-09T21:38:50Z
<p>Marcriera: /* Current status */</p>
<hr />
<div>This is a language pair translating between [[English]] and [[Catalan]]. The pair is currently located in [https://github.com/apertium/apertium-eng-cat trunk].<br />
<br />
==Current status==<br />
<br />
''Last update: 9 May 2018''<br />
<br />
'''Bidix entries:''' 54,830<br />
<br />
'''Bidix paradigms:''' 16<br />
<br />
<br />
'''Transfer rules (English to Catalan):''' 221 (T1X), 60 (T2X), 38 (T3X)<br />
<br />
'''Transfer rules (Catalan to English):''' 97 (T1X), 28 (T2X), 9 (T3X)<br />
<br />
<br />
'''Coverage (English to Catalan):''' 92.50% (Wikipedia)<br />
<br />
'''Coverage (Catalan to English):''' 87.52% (Wikipedia)<br />
<br />
==Dictionary guidelines==<br />
<br />
The current English-Catalan bilingual dictionary is quite big (more than 65,000 entries), so tidiness is essential to ensure future development:<br />
<br />
* Keep entries sorted alphabetically.<br />
* Keep entries grouped by type and tags (do not mix different types of proper nouns together).<br />
* Check the file with apertium-dixtools (to update the number of entries and remove duplicates).<br />
<br />
===Adjectives===<br />
<br />
* Adjectives do not need <code><GD></code> or <code><ND></code> tags for Catalan in the bidix, they are automatically adjusted during transfer.<br />
* English adjectives with comparative and superlative forms (''high'', ''big'', etc.) need the <code><sint></code> tag to be generated properly.<br />
<br />
===Proper nouns===<br />
<br />
* In English, family names (<code><np><cog></code>) have two forms (singular and plural), but in Catalan they only have one (singular-plural), so two entries must be used. Example:<br />
: <code><l>'''Smith'''<np><cog><sg></l><r>'''Smith'''<np><cog><mf><sg></r></code><br />
: <code><l>'''Smith'''<np><cog><pl></l><r>'''Smith'''<np><cog><mf><sg></r></code> (LR-only)<br />
<br />
===Verbs===<br />
<br />
* Catalan verbs tied to a pronoun (''dutxar-se'', ''casar-se'', etc.) need the <code><pron></code> tag to be generated properly.<br />
<br />
==Transfer Rules==<br />
<br />
A detailed list of all chunk, interchunk and postchunk rules and macros is available in the following page: [[English_and_Catalan/Transfer_Rules|Transfer Rules]]<br />
<br />
==Future work==<br />
<br />
===Bidix===<br />
<br />
* Keep expanding the bidix (specially Catalan to English).<br />
* Fix proper noun entries (they do not have gender or number in Catalan).<br />
* Make use of the new apertium-separable module.<br />
* Make use of all the closed categories in the English dictionary.<br />
<br />
===Lexical selection===<br />
<br />
* Add new rules to improve lexical selection.<br />
* Fix odd translations in both directions.<br />
<br />
===Transfer===<br />
<br />
* Rewrite rules related to verb conjugation.<br />
* Add support for word order inversions in English questions.<br />
* Improve support for reflexive verbs and pronouns.<br />
* Move uppercase changes from T1X to T3X.<br />
<br />
====Ambiguous rules====<br />
<br />
{|class=wikitable<br />
! English pattern !! Catalan pattern !! Example input !! Example output<br />
|-<br />
|rowspan=2| det adj nom || det nom adj || The next step || El pas pròxim<br />
|-<br />
|| det adj nom || The next step || El pròxim pas<br />
|-<br />
|rowspan=2| be pp || ser pp || It is done like this. || És fet així.<br />
|-<br />
|| pro-ref vfin || It is done like this. || Es fa així.<br />
|-<br />
|}<br />
<br />
==Archived information==<br />
<br />
<span style="color:red;">'''The information below is out of date and kept only for archival purposes. Read the sections above for documentation on the current English-Catalan pair.'''</span><br />
<br />
===Introducció===<br />
Aquesta pàgina recull els suggeriments de millora al paquet apertium-eng-cat quant a vocabulari (especialment per a unitats multimot), canvis estructurals i lèxics, errades de format, etc. Les files marcades a la primera columna amb blau fosc corresponen a fenòmens que ja han estat tractats.<br />
<br />
===Unitats monoparaula i multiparaula===<br />
<br />
En aquesta taula podeu suggerir unitats monoparaula i multiparaula que haurien de ser als diccionaris d'apertium-eng-cat. Una unitat multiparaula es útil quan la traducció en conjunt de seqüències de més d'un mot és preferible a la traducció del mots per separat. Això pot ajudar a desambiguar una paraula amb més d'un sentit.<br />
<br />
Tenim bàsicament dos tipus d'unitats multiparaula: les que tenen flexió interna (el verb 'trobar a faltar': trobe a faltar, trobes a faltar, etc.) i les que no en tenen (com ara l'adverbi 'de bon grat'). Per a paraules senzilles i unitats multiparaula sense flexió, ens ajudareu més si feu servir la nostra eina d'inserció de vocabulari. <br />
<br />
Per a la resta, podeu emplenar aquesta taula. La columna de 'Forma valenciana' la farem servir quan hi haja una distinció entre variants. Els sentits volen dir que la multiparaula s'ha d'introduir en els dos sentits de la traducció (:), només d'anglés a català (>) o només de català a anglés (<). <br />
<br />
{|class=wikitable<br />
! Castellà (<code>en</code>) || Català (<code>ca</code>) || Forma valenciana || Sentit (:,<,>)<br />
|-<br />
|style="background-color: #bbbbff" |hurt || fer mal || ||:<br />
|-<br />
| prisoner || presoner || ||:<br />
|-<br />
| prisoner || pres || ||<<br />
|-<br />
| prisoner || reclús || ||<<br />
|}<br />
<br />
===Regles de transferència estructural===<br />
<br />
En aquesta secció us demanem que feu un esforç de definir un canvi estructural sistemàtic (tant com siga possible) entre anglés i català. Volem saber quin patró ens trobarem (seqüència d'etiquetes morfològiques acompanyades o no d'un lema específic en llengua origen) i quina acció li hem d'aplicar (seqüència d'etiquetes morfològiques acompanyades o no d'un lema específic en llengua meta). Distingirem entre canvis estructurals que afecten a sintagmes nominals i verbals. Per saber que significa, quins hi ha o com s'escriu un símbol mireu en la página ''[[List of symbols]]''.<br />
<br />
====Anglés → Català====<br />
<br />
'''Sintagmes nominals'''<br />
<br />
{|class=wikitable<br />
! Patró (<code>en</code>) || Acció (<code>ca</code>) || Entrada (<code>en</code>) || Eixida (<code>ca</code>)<br />
|-<br />
|style="background-color: #bbbbff"| adj + n || n + adj || green cow || vaca verda<br />
|-<br />
| || || ||<br />
|}<br />
<br />
'''Sintagmes verbals'''<br />
<br />
{|class=wikitable<br />
! Patró (<code>en</code>) || Acció (<code>ca</code>) || Entrada (<code>en</code>) || Eixida (<code>ca</code>)<br />
|-<br />
|style="background-color: #bbbbff"| prn.tn.p1.sg + will + vblex.inf || vblex.fti.p1.sg || I will go || aniré<br />
|-<br />
| || || ||<br />
|}<br />
<br />
====Català → Anglés====<br />
<br />
<br />
'''Sintagmes nominals'''<br />
<br />
{|class=wikitable<br />
! Patró (<code>ca</code>) || Acció (<code>es</code>) || Entrada (<code>ca</code>) || Eixida (<code>es</code>)<br />
|-<br />
|style="background-color: #bbbbff"| det.pos + n1's + n2 || det.def + n2 + de.pr + det.pos + n1 || my sister's teacher || el mestre de la meua germana<br />
|-<br />
| || || ||<br />
|}<br />
<br />
'''Sintagmes verbals'''<br />
<br />
{|class=wikitable<br />
! Patró (<code>en</code>) || Acció (<code>ca</code>) || Entrada (<code>en</code>) || Eixida (<code>ca</code>)<br />
|-<br />
| || || ||<br />
|-<br />
|}<br />
<br />
<br />
[[Category:English and Catalan|*]]<br />
[[Category:English]]<br />
[[Category:Catalan]]</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=English_and_Catalan&diff=66836
English and Catalan
2018-05-01T22:01:15Z
<p>Marcriera: /* Bidix Guidelines */</p>
<hr />
<div>This is a language pair translating between [[English]] and [[Catalan]]. The pair is currently located in [https://github.com/apertium/apertium-eng-cat trunk].<br />
<br />
==Current status==<br />
<br />
''Last update: 23 Mar 2018''<br />
<br />
'''Bidix entries:''' 65,309<br />
<br />
'''Bidix paradigms:''' 15<br />
<br />
<br />
'''Transfer rules (English to Catalan):''' 221 (T1X), 60 (T2X), 38 (T3X)<br />
<br />
'''Transfer rules (Catalan to English):''' 97 (T1X), 28 (T2X), 9 (T3X)<br />
<br />
<br />
'''Coverage (English to Catalan):''' 92.50% (Wikipedia)<br />
<br />
'''Coverage (Catalan to English):''' 87.52% (Wikipedia)<br />
<br />
==Dictionary guidelines==<br />
<br />
The current English-Catalan bilingual dictionary is quite big (more than 65,000 entries), so tidiness is essential to ensure future development:<br />
<br />
* Keep entries sorted alphabetically.<br />
* Keep entries grouped by type and tags (do not mix different types of proper nouns together).<br />
* Check the file with apertium-dixtools (to update the number of entries and remove duplicates).<br />
<br />
===Adjectives===<br />
<br />
* Adjectives do not need <code><GD></code> or <code><ND></code> tags for Catalan in the bidix, they are automatically adjusted during transfer.<br />
* English adjectives with comparative and superlative forms (''high'', ''big'', etc.) need the <code><sint></code> tag to be generated properly.<br />
<br />
===Proper nouns===<br />
<br />
* In English, family names (<code><np><cog></code>) have two forms (singular and plural), but in Catalan they only have one (singular-plural), so two entries must be used. Example:<br />
: <code><l>'''Smith'''<np><cog><sg></l><r>'''Smith'''<np><cog><mf><sg></r></code><br />
: <code><l>'''Smith'''<np><cog><pl></l><r>'''Smith'''<np><cog><mf><sg></r></code> (LR-only)<br />
<br />
===Verbs===<br />
<br />
* Catalan verbs tied to a pronoun (''dutxar-se'', ''casar-se'', etc.) need the <code><pron></code> tag to be generated properly.<br />
<br />
==Transfer Rules==<br />
<br />
A detailed list of all chunk, interchunk and postchunk rules and macros is available in the following page: [[English_and_Catalan/Transfer_Rules|Transfer Rules]]<br />
<br />
==Future work==<br />
<br />
===Bidix===<br />
<br />
* Keep expanding the bidix (specially Catalan to English).<br />
* Fix proper noun entries (they do not have gender or number in Catalan).<br />
* Make use of the new apertium-separable module.<br />
* Make use of all the closed categories in the English dictionary.<br />
<br />
===Lexical selection===<br />
<br />
* Add new rules to improve lexical selection.<br />
* Fix odd translations in both directions.<br />
<br />
===Transfer===<br />
<br />
* Rewrite rules related to verb conjugation.<br />
* Add support for word order inversions in English questions.<br />
* Improve support for reflexive verbs and pronouns.<br />
* Move uppercase changes from T1X to T3X.<br />
<br />
====Ambiguous rules====<br />
<br />
{|class=wikitable<br />
! English pattern !! Catalan pattern !! Example input !! Example output<br />
|-<br />
|rowspan=2| det adj nom || det nom adj || The next step || El pas pròxim<br />
|-<br />
|| det adj nom || The next step || El pròxim pas<br />
|-<br />
|rowspan=2| be pp || ser pp || It is done like this. || És fet així.<br />
|-<br />
|| pro-ref vfin || It is done like this. || Es fa així.<br />
|-<br />
|}<br />
<br />
==Archived information==<br />
<br />
<span style="color:red;">'''The information below is out of date and kept only for archival purposes. Read the sections above for documentation on the current English-Catalan pair.'''</span><br />
<br />
===Introducció===<br />
Aquesta pàgina recull els suggeriments de millora al paquet apertium-eng-cat quant a vocabulari (especialment per a unitats multimot), canvis estructurals i lèxics, errades de format, etc. Les files marcades a la primera columna amb blau fosc corresponen a fenòmens que ja han estat tractats.<br />
<br />
===Unitats monoparaula i multiparaula===<br />
<br />
En aquesta taula podeu suggerir unitats monoparaula i multiparaula que haurien de ser als diccionaris d'apertium-eng-cat. Una unitat multiparaula es útil quan la traducció en conjunt de seqüències de més d'un mot és preferible a la traducció del mots per separat. Això pot ajudar a desambiguar una paraula amb més d'un sentit.<br />
<br />
Tenim bàsicament dos tipus d'unitats multiparaula: les que tenen flexió interna (el verb 'trobar a faltar': trobe a faltar, trobes a faltar, etc.) i les que no en tenen (com ara l'adverbi 'de bon grat'). Per a paraules senzilles i unitats multiparaula sense flexió, ens ajudareu més si feu servir la nostra eina d'inserció de vocabulari. <br />
<br />
Per a la resta, podeu emplenar aquesta taula. La columna de 'Forma valenciana' la farem servir quan hi haja una distinció entre variants. Els sentits volen dir que la multiparaula s'ha d'introduir en els dos sentits de la traducció (:), només d'anglés a català (>) o només de català a anglés (<). <br />
<br />
{|class=wikitable<br />
! Castellà (<code>en</code>) || Català (<code>ca</code>) || Forma valenciana || Sentit (:,<,>)<br />
|-<br />
|style="background-color: #bbbbff" |hurt || fer mal || ||:<br />
|-<br />
| prisoner || presoner || ||:<br />
|-<br />
| prisoner || pres || ||<<br />
|-<br />
| prisoner || reclús || ||<<br />
|}<br />
<br />
===Regles de transferència estructural===<br />
<br />
En aquesta secció us demanem que feu un esforç de definir un canvi estructural sistemàtic (tant com siga possible) entre anglés i català. Volem saber quin patró ens trobarem (seqüència d'etiquetes morfològiques acompanyades o no d'un lema específic en llengua origen) i quina acció li hem d'aplicar (seqüència d'etiquetes morfològiques acompanyades o no d'un lema específic en llengua meta). Distingirem entre canvis estructurals que afecten a sintagmes nominals i verbals. Per saber que significa, quins hi ha o com s'escriu un símbol mireu en la página ''[[List of symbols]]''.<br />
<br />
====Anglés → Català====<br />
<br />
'''Sintagmes nominals'''<br />
<br />
{|class=wikitable<br />
! Patró (<code>en</code>) || Acció (<code>ca</code>) || Entrada (<code>en</code>) || Eixida (<code>ca</code>)<br />
|-<br />
|style="background-color: #bbbbff"| adj + n || n + adj || green cow || vaca verda<br />
|-<br />
| || || ||<br />
|}<br />
<br />
'''Sintagmes verbals'''<br />
<br />
{|class=wikitable<br />
! Patró (<code>en</code>) || Acció (<code>ca</code>) || Entrada (<code>en</code>) || Eixida (<code>ca</code>)<br />
|-<br />
|style="background-color: #bbbbff"| prn.tn.p1.sg + will + vblex.inf || vblex.fti.p1.sg || I will go || aniré<br />
|-<br />
| || || ||<br />
|}<br />
<br />
====Català → Anglés====<br />
<br />
<br />
'''Sintagmes nominals'''<br />
<br />
{|class=wikitable<br />
! Patró (<code>ca</code>) || Acció (<code>es</code>) || Entrada (<code>ca</code>) || Eixida (<code>es</code>)<br />
|-<br />
|style="background-color: #bbbbff"| det.pos + n1's + n2 || det.def + n2 + de.pr + det.pos + n1 || my sister's teacher || el mestre de la meua germana<br />
|-<br />
| || || ||<br />
|}<br />
<br />
'''Sintagmes verbals'''<br />
<br />
{|class=wikitable<br />
! Patró (<code>en</code>) || Acció (<code>ca</code>) || Entrada (<code>en</code>) || Eixida (<code>ca</code>)<br />
|-<br />
| || || ||<br />
|-<br />
|}<br />
<br />
<br />
[[Category:English and Catalan|*]]<br />
[[Category:English]]<br />
[[Category:Catalan]]</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=Catalan&diff=66835
Catalan
2018-05-01T22:00:38Z
<p>Marcriera: /* Bidix Guidelines */</p>
<hr />
<div>'''Catalan''' ([[Wikipedia:Catalan language]]) is a [[Romance language]]. It is available in Apertium as a standalone analyser/generator ([[Catalan#Apertium-cat|apertium-cat]]) and as a component of several pairs which translate to/from Catalan.<br />
<br />
== Language pairs ==<br />
''See also: [[List of language pairs]]''<br />
<br />
In [[trunk]]:<br />
<br />
{|class=wikitable<br />
! width=200 | Pair name !! width=350 | Languages !! width=130 | Last update<br />
|-<br />
| align=center | <code>[[apertium-arg-cat]]</code> || align=center | Aragonese <-> Catalan || align=center | 17 Aug 2016<br />
|-<br />
| align=center | <code>[[apertium-ca-it]]</code> || align=center | Catalan <-> Italian || align=center | 12 Oct 2014<br />
|-<br />
| align=center | <code>[[apertium-en-ca]]</code> || align=center | English <-> Catalan || align=center | 28 Mar 2016<br />
|-<br />
| align=center | <code>[[apertium-eo-ca]]</code> || align=center | Esperanto <-- Catalan || align=center | 13 Dec 2015<br />
|-<br />
| align=center | <code>[[apertium-fra-cat]]</code> || align=center | French <-> Catalan || align=center | 18 Apr 2017<br />
|-<br />
| align=center | <code>[[apertium-oc-ca]]</code> || align=center | Occitan <-> Catalan || align=center | 13 Dec 2015<br />
|-<br />
| align=center | <code>[[apertium-pt-ca]]</code> || align=center | Portuguese <-> Catalan || align=center | 13 Dec 2015<br />
|-<br />
| align=center | <code>[[apertium-spa-cat]]</code> || align=center | Spanish <-> Catalan || align=center | 01 Apr 2017<br />
|-<br />
|}<br />
<br />
In [[staging]]:<br />
<br />
{|class=wikitable<br />
! width=200 | Pair name !! width=350 | Languages !! width=130 | Last update<br />
|-<br />
| align=center | <code>[[apertium-cat-glg]]</code> || align=center | Catalan <-> Galician || align=center | 18 Nov 2016<br />
|-<br />
| align=center | <code>[[apertium-cat-srd]]</code> || align=center | Catalan <-> Sardinian || align=center | 03 Apr 2017<br />
|-<br />
|}<br />
<br />
In [[nursery]]:<br />
<br />
{|class=wikitable<br />
! width=200 | Pair name !! width=350 | Languages !! width=130 | Last update<br />
|-<br />
| align=center | <code>[[apertium-ca-ro]]</code> || align=center | Catalan <-> Romanian || align=center | 30 Sep 2015<br />
|-<br />
|}<br />
<br />
In [[incubator]]:<br />
<br />
{|class=wikitable<br />
! width=200 | Pair name !! width=350 | Languages !! width=130 | Last update<br />
|-<br />
| align=center | <code>[[apertium-cat-cos]]</code> || align=center | Catalan <-> Corsican || align=center | 05 Nov 2013<br />
|-<br />
| align=center | <code>[[apertium-cat-ina]]</code> || align=center | Catalan <-> Interlingua || align=center | 07 Jan 2016<br />
|-<br />
| align=center | <code>[[apertium-eng-cat]]</code> || align=center | English <-> Catalan || align=center | 24 Jan 2016<br />
|-<br />
| align=center | <code>[[apertium-por-cat]]</code> || align=center | Portuguese <-> Catalan || align=center | 22 Jan 2016<br />
|-<br />
|}<br />
<br />
== Apertium-cat ==<br />
<br />
===Current status===<br />
<br />
''Last update: 28 Aug 2017''<br />
<br />
'''Dix entries:''' 56,588<br />
<br />
'''Dix paradigms:''' 607<br />
<br />
'''Coverage:''' 94.04% (Wikipedia)<br />
<br />
===Dictionary guidelines===<br />
<br />
The current Catalan dictionary is quite big (more than 55,000 entries), so tidiness is essential to ensure future development:<br />
<br />
* Keep entries sorted alphabetically.<br />
* Keep entries grouped by type and tags (do not mix different types of proper nouns together).<br />
* Check the file with apertium-dixtools (to update the number of entries and remove duplicates).<br />
<br />
====Proper nouns====<br />
<br />
* Catalan proper nouns (names, toponyms, acronyms, etc.) should all have gender and number. Do '''not''' remove them!<br />
<br />
===Future work===<br />
<br />
* Add support for proper noun articles (en/na).<br />
* Restore gender and number to proper nouns that still do not have them.<br />
* Tweak entries related to proper nouns with translations (kings, queens, etc.).<br />
<br />
<br />
''For further documentation about Catalan in Apertium, check: [[:Category:Catalan]]''<br />
<br />
[[Category:Catalan]]<br />
[[Category:Languages]]<br />
[[Category:Romance languages]]<br />
[[Category:Documentation in English]]</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=English&diff=66834
English
2018-05-01T22:00:24Z
<p>Marcriera: /* Bidix Guidelines */</p>
<hr />
<div>'''English''' ([[Wikipedia:English language]]) is a West Germanic language. It is available in Apertium as a standalone analyser/generator ([[English#Apertium-eng|apertium-eng]]) and as a component of several pairs which translate to/from English.<br />
<br />
== Language pairs ==<br />
''See also: [[List of language pairs]]''<br />
<br />
In [[trunk]]:<br />
<br />
{|class=wikitable<br />
! width=200 | Pair name !! width=350 | Languages !! width=130 | Last update<br />
|-<br />
| align=center | <code>[[apertium-cy-en]]</code> || align=center | Welsh <-> English || align=center | 13 Dec 2015<br />
|-<br />
| align=center | <code>[[apertium-en-ca]]</code> || align=center | English <-> Catalan || align=center | 28 Mar 2016<br />
|-<br />
| align=center | <code>[[apertium-en-es]]</code> || align=center | English <-> Spanish || align=center | 11 Apr 2017<br />
|-<br />
| align=center | <code>[[apertium-en-gl]]</code> || align=center | English <-> Galician || align=center | 15 Jul 2016<br />
|-<br />
| align=center | <code>[[apertium-eo-en]]</code> || align=center | Esperanto <-> English || align=center | 13 Dec 2015<br />
|-<br />
| align=center | <code>[[apertium-eu-en]]</code> || align=center | Basque --> English || align=center | 13 Dec 2015<br />
|-<br />
| align=center | <code>[[apertium-hbs-eng]]</code> || align=center | Serbo-Croatian <-> English || align=center | 15 Oct 2014<br />
|-<br />
| align=center | <code>[[apertium-isl-eng]]</code> || align=center | Icelandic --> English || align=center | 02 Mar 2016<br />
|-<br />
| align=center | <code>[[apertium-mk-en]]</code> || align=center | Macedonian --> English || align=center | 12 Oct 2014<br />
|-<br />
|}<br />
<br />
In [[staging]]:<br />
<br />
{|class=wikitable<br />
! width=200 | Pair name !! width=350 | Languages !! width=130 | Last update<br />
|-<br />
| align=center | <code>[[apertium-eng-kaz]]</code> || align=center | English <-> Kazakh || align=center | 14 Apr 2017<br />
|-<br />
|}<br />
<br />
In [[nursery]]:<br />
<br />
{|class=wikitable<br />
! width=200 | Pair name !! width=350 | Languages !! width=130 | Last update<br />
|-<br />
| align=center | <code>[[apertium-bg-en]]</code> || align=center | Bulgarian <-> English || align=center | 09 Jun 2014<br />
|-<br />
| align=center | <code>[[apertium-en-pt]]</code> || align=center | English <-> Portuguese || align=center | 09 Jun 2014<br />
|-<br />
| align=center | <code>[[apertium-eng-afr]]</code> || align=center | English <-> Afrikaans || align=center | 18 Nov 2016<br />
|-<br />
| align=center | <code>[[apertium-eng-deu]]</code> || align=center | English <-> German || align=center | 13 Apr 2017<br />
|-<br />
| align=center | <code>[[apertium-eng-hin]]</code> || align=center | English <-> Hindi || align=center | 12 Jan 2017<br />
|-<br />
| align=center | <code>[[apertium-fin-eng]]</code> || align=center | Finnish <-> English || align=center | 07 Jun 2015<br />
|-<br />
| align=center | <code>[[apertium-hye-eng]]</code> || align=center | Armenian --> English || align=center | 22 Jan 2013<br />
|-<br />
| align=center | <code>[[apertium-nor-eng]]</code> || align=center | Norwegian <-> English || align=center | 25 Apr 2016<br />
|-<br />
|}<br />
<br />
In [[incubator]]:<br />
<br />
{|class=wikitable<br />
! width=200 | Pair name !! width=350 | Languages !! width=130 | Last update<br />
|-<br />
| align=center | <code>[[apertium-asm-eng]]</code> || align=center | Assamese <-> English || align=center | 04 Jan 2016<br />
|-<br />
| align=center | <code>[[apertium-bn-en]]</code> || align=center | Bengali <-> English || align=center | 04 Jan 2016<br />
|-<br />
| align=center | <code>[[apertium-ckb-eng]]</code> || align=center | Central Kurdish <-> English || align=center | 13 Oct 2016<br />
|-<br />
| align=center | <code>[[apertium-ell-eng]]</code> || align=center | Modern Greek <-> English || align=center | 29 Sep 2015<br />
|-<br />
| align=center | <code>[[apertium-en-ga]]</code> || align=center | English -?- Irish || align=center | 13 Dec 2016<br />
|-<br />
| align=center | <code>[[apertium-en-it]]</code> || align=center | English <-> Italian || align=center | 26 Jun 2015<br />
|-<br />
| align=center | <code>[[apertium-en-lt]]</code> || align=center | English -?- Lithuanian || align=center | 07 Dec 2010<br />
|-<br />
| align=center | <code>[[apertium-en-lv]]</code> || align=center | English -?- Latvian || align=center | 26 Jun 2015<br />
|-<br />
| align=center | <code>[[apertium-en-mt]]</code> || align=center | English -?- Maltese || align=center | 19 Jun 2011<br />
|-<br />
| align=center | <code>[[apertium-en-nl]]</code> || align=center | English <-> Dutch || align=center | 29 Apr 2011<br />
|-<br />
| align=center | <code>[[apertium-en-pl]]</code> || align=center | English <-> Polish || align=center | 26 Jun 2015<br />
|-<br />
| align=center | <code>[[apertium-en-sq]]</code> || align=center | English --> Albanian || align=center | 31 Aug 2010<br />
|-<br />
| align=center | <code>[[apertium-eng-cat]]</code> || align=center | English <-> Catalan || align=center | 24 Jan 2016<br />
|-<br />
| align=center | <code>[[apertium-eng-ina]]</code> || align=center | English <-> Interlingua || align=center | 13 Jan 2017<br />
|-<br />
| align=center | <code>[[apertium-eng-ita]]</code> || align=center | English <-> Italian || align=center | 08 Jan 2017<br />
|-<br />
| align=center | <code>[[apertium-eng-lvs]]</code> || align=center | English <-> Standard Latvian || align=center | 09 Jun 2014<br />
|-<br />
| align=center | <code>[[apertium-eng-pes]]</code> || align=center | English <-> Iranian Persian || align=center | 11 Aug 2015<br />
|-<br />
| align=center | <code>[[apertium-eng-sco]]</code> || align=center | English <-> Scots || align=center | 01 Apr 2017<br />
|-<br />
| align=center | <code>[[apertium-eng-tel]]</code> || align=center | English <-> Telugu || align=center | 26 Jun 2015<br />
|-<br />
| align=center | <code>[[apertium-fra-eng]]</code> || align=center | French <-> English || align=center | 24 Mar 2017<br />
|-<br />
| align=center | <code>[[apertium-gle-eng]]</code> || align=center | Irish <-> English || align=center | 01 Feb 2016<br />
|-<br />
| align=center | <code>[[apertium-ht-en]]</code> || align=center | Haitian Creole --> English || align=center | 04 Jan 2016<br />
|-<br />
| align=center | <code>[[apertium-hun-eng]]</code> || align=center | Hungarian --> English || align=center | 19 Jan 2016<br />
|-<br />
| align=center | <code>[[apertium-kmr-eng]]</code> || align=center | Kmer ? <-> English || align=center | 06 Mar 2017<br />
|-<br />
| align=center | <code>[[apertium-ky-en]]</code> || align=center | Kyrgyz -?- English || align=center | 29 Jun 2011<br />
|-<br />
| align=center | <code>[[apertium-la-en]]</code> || align=center | Latin -?- English || align=center | 01 Dec 2011<br />
|-<br />
| align=center | <code>[[apertium-lat-eng]]</code> || align=center | Latin <-> English || align=center | 11 Jan 2017<br />
|-<br />
| align=center | <code>[[apertium-mal-eng]]</code> || align=center | Malayalam <-> English || align=center | 04 Jan 2016<br />
|-<br />
| align=center | <code>[[apertium-mar-eng]]</code> || align=center | Marathi --> English || align=center | 12 May 2013<br />
|-<br />
| align=center | <code>[[apertium-mfe-en]]</code> || align=center | Morisyen --> English || align=center | 19 Jun 2010<br />
|-<br />
| align=center | <code>[[apertium-ne-en]]</code> || align=center | Nepali <-> English || align=center | 26 Jun 2015<br />
|-<br />
| align=center | <code>[[apertium-pes-eng]]</code> || align=center | Iranian Persian <-> English || align=center | 11 Mar 2017<br />
|-<br />
| align=center | <code>[[apertium-rus-eng]]</code> || align=center | Russian -?- English || align=center | 18 May 2014<br />
|-<br />
| align=center | <code>[[apertium-sah-eng]]</code> || align=center | Yakut -?- English || align=center | 18 Mar 2015<br />
|-<br />
| align=center | <code>[[apertium-si-en]]</code> || align=center | Sinhala <-> English || align=center | 26 Jun 2015<br />
|-<br />
| align=center | <code>[[apertium-sjo-eng]]</code> || align=center | Xibe --> English || align=center | 09 Nov 2014<br />
|-<br />
| align=center | <code>[[apertium-swa-eng]]</code> || align=center | Swahili -?- English || align=center | 17 Dec 2016<br />
|-<br />
| align=center | <code>[[apertium-swe-eng]]</code> || align=center | Swedish <-> English || align=center | 17 Dec 2016<br />
|-<br />
| align=center | <code>[[apertium-tha-eng]]</code> || align=center | Thai <-> English || align=center | 03 Dec 2016<br />
|-<br />
| align=center | <code>[[apertium-tr-en]]</code> || align=center | Turkish <-- English || align=center | 04 Aug 2011<br />
|-<br />
| align=center | <code>[[apertium-vi-en]]</code> || align=center | Vietnamese --> English || align=center | 24 Oct 2010<br />
|-<br />
|}<br />
<br />
== Apertium-eng ==<br />
<br />
===Current status===<br />
<br />
''Last update: 28 Aug 2017''<br />
<br />
'''Dix entries:''' 54,453<br />
<br />
'''Dix paradigms:''' 377<br />
<br />
'''Coverage:''' 93.55% (Wikipedia)<br />
<br />
===Dictionary guidelines===<br />
<br />
The current English dictionary is quite big (nearly 55,000 entries), so tidiness is essential to ensure future development:<br />
<br />
* Keep entries sorted alphabetically.<br />
* Keep entries grouped by type and tags (do not mix different types of proper nouns together).<br />
* Check the file with apertium-dixtools (to update the number of entries and remove duplicates).<br />
<br />
====Spelling variants====<br />
<br />
The standard spelling variant in Apertium is British English. American English spelling entries are officially supported using <code>v="eng"</code> in the original British entry and <code>v="eng_US"</code> in the American entry, which should be a subform of the first one.<br />
<br />
=== Tagger ===<br />
<br />
Apertium-eng currently uses a tagger trained in a supervised manner using a hand-tagged corpus and a perceptron. The corpus can be found [https://svn.code.sf.net/p/apertium/svn/languages/apertium-eng/texts/consensus here], and should be modified after any tag change in the monolingual dictionary to make sure it matches the current status of the language module. The MTX file for English has no rules defined for now, but it should improve disambiguation after some restrictions are added.<br />
<br />
The tagger can be trained with the perceptron using the following command (see [[Perceptron tagger]]):<br />
<br />
<pre><br />
$ apertium-tagger -xs 10 eng.prob eng.tagged eng.untagged apertium-eng.eng.mtx<br />
</pre><br />
<br />
''Note: in the example above, "10" is the number of iterations, "eng.tagged" is the hand-tagged corpus and "eng.untagged" is the corpus tagged by apertium-tagger.''<br />
<br />
Before making changes to the hand-tagged corpus, make sure you have read the [[Tagging_guidelines_for_English|tagging guidelines]]. A good corpus is the key to a good tagger!<br />
<br />
=== Constraint Grammar ===<br />
<br />
Apertium-eng currently has 38 CG rules. There is a lot of room for disambiguation improvement using CG, but ideally some rules should be moved to the tagger MTX.<br />
<br />
===Future work===<br />
<br />
* Improve support of British/American ortography variants.<br />
<br />
<br />
''For further documentation about English in Apertium, check: [[:Category:English]]''<br />
<br />
[[Category:Languages]]<br />
[[Category:West Germanic languages]]<br />
[[Category:Documentation in English]]</div>
Marcriera
https://wiki.apertium.org/w/index.php?title=User:Marcriera&diff=66833
User:Marcriera
2018-04-30T21:15:42Z
<p>Marcriera: </p>
<hr />
<div>Name: Marc Riera<br />
<br />
Email: [[Special:Emailuser/Marcriera|Send me a message]]<br />
<br />
IRC: marcrierai</div>
Marcriera