https://wiki.apertium.org/w/index.php?title=Tartu_Apertium_Course/Session_2&feed=atom&action=historyTartu Apertium Course/Session 2 - Revision history2024-03-28T08:47:34ZRevision history for this page on the wikiMediaWiki 1.34.1https://wiki.apertium.org/w/index.php?title=Tartu_Apertium_Course/Session_2&diff=54307&oldid=prevFrancis Tyers: /* Contiguity */2015-10-30T10:53:58Z<p><span dir="auto"><span class="autocomment">Contiguity</span></span></p>
<table class="diff diff-contentalign-left" data-mw="interface">
<col class="diff-marker" />
<col class="diff-content" />
<col class="diff-marker" />
<col class="diff-content" />
<tr class="diff-title" lang="en">
<td colspan="2" style="background-color: #fff; color: #222; text-align: center;">← Older revision</td>
<td colspan="2" style="background-color: #fff; color: #222; text-align: center;">Revision as of 10:53, 30 October 2015</td>
</tr><tr>
<td colspan="2" class="diff-lineno">Line 43:</td>
<td colspan="2" class="diff-lineno">Line 43:</td>
</tr>
<tr>
<td class="diff-marker"> </td>
<td style="background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;"><div>Along with internal variation, multiwords may also be contiguous or discontiguous. Contiguous means that the words follow each other, separated by spaces. Most of the above examples are contiguous multiwords. They can also be discontiguous, like phrasal verbs in Swedish and Dutch, examples might be: ''lopen uit de hand'', ''äta upp'' and ''tilføje'':</div></td>
<td class="diff-marker"> </td>
<td style="background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;"><div>Along with internal variation, multiwords may also be contiguous or discontiguous. Contiguous means that the words follow each other, separated by spaces. Most of the above examples are contiguous multiwords. They can also be discontiguous, like phrasal verbs in Swedish and Dutch, examples might be: ''lopen uit de hand'', ''äta upp'' and ''tilføje'':</div></td>
</tr>
<tr>
<td class="diff-marker"> </td>
<td style="background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;"></td>
<td class="diff-marker"> </td>
<td style="background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;"></td>
</tr>
<tr>
<td class="diff-marker">−</td>
<td style="color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>* ''Twee jaar later '''loopt''' de situatie in het land onder de indruk van massabetogingen '''uit de hand'''.''{{slc|<del class="diffchange diffchange-inline">nl</del>}}</div></td>
<td class="diff-marker">+</td>
<td style="color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>* ''Twee jaar later '''loopt''' de situatie in het land onder de indruk van massabetogingen '''uit de hand'''.''{{slc|<ins class="diffchange diffchange-inline">nld</ins>}}</div></td>
</tr>
<tr>
<td class="diff-marker">−</td>
<td style="color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>* ''Alla stygga barnen '''åt''' tigern '''upp'''.''{{slc|<del class="diffchange diffchange-inline">sv</del>}}</div></td>
<td class="diff-marker">+</td>
<td style="color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>* ''Alla stygga barnen '''åt''' tigern '''upp'''.''{{slc|<ins class="diffchange diffchange-inline">swe</ins>}}</div></td>
</tr>
<tr>
<td class="diff-marker">−</td>
<td style="color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div>* '''''Føje''' et ord eller en sætning '''til''' den brugerdefinerede ordbog.''{{slc|<del class="diffchange diffchange-inline">da</del>}}</div></td>
<td class="diff-marker">+</td>
<td style="color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div>* '''''Føje''' et ord eller en sætning '''til''' den brugerdefinerede ordbog.''{{slc|<ins class="diffchange diffchange-inline">dan</ins>}}</div></td>
</tr>
<tr>
<td class="diff-marker"> </td>
<td style="background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;"></td>
<td class="diff-marker"> </td>
<td style="background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;"></td>
</tr>
<tr>
<td class="diff-marker"> </td>
<td style="background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;"><div>==Practice==</div></td>
<td class="diff-marker"> </td>
<td style="background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;"><div>==Practice==</div></td>
</tr>
</table>Francis Tyershttps://wiki.apertium.org/w/index.php?title=Tartu_Apertium_Course/Session_2&diff=54306&oldid=prevFrancis Tyers: Created page with "{{TOCD}} This session aims to give an introduction to some more advanced topics in the Apertium dictionaries. We have seen in session 1 how to make simple dictionary entries,..."2015-10-30T10:53:29Z<p>Created page with "{{TOCD}} This session aims to give an introduction to some more advanced topics in the Apertium dictionaries. We have seen in session 1 how to make simple dictionary entries,..."</p>
<p><b>New page</b></p><div>{{TOCD}}<br />
<br />
This session aims to give an introduction to some more advanced topics in the Apertium dictionaries. We have seen in session 1 how to make simple dictionary entries, and looked at paradigms. In this session we'll make entries for multiwords.<br />
<br />
==Theory==<br />
<br />
When talking about multiwords (or multiword expressions, MWEs), there are a few aspects to take into account. <br />
<br />
===Compositionality===<br />
<br />
{{comment|TODO; find examples}}<br />
<br />
A multi-word expression is compositional when its meaning can be deduced from the meanings of the words that comprise it. However, this is not to say that it will necessarily be compositional in translation. For example:<br />
<br />
* ''Телефонная будка''{{slc|rus}} is compositional, the meaning can be inferred from the meanings of the two words.<br />
* ''Juego de etiquetas''{{slc|spa}} is a multiword expression that is compositional, but the sense that each of the content words has is not the most frequent one. <br />
* ''Итальянская забастовка''{{slc|rus}} is partially non-compositional, the meaning cannot be deduced solely from the meanings of the constituent words.<br />
* ''Kafa çekmek''{{slc|tur}} is non-compositional, the meaning cannot be deduced from the meanings of the constituent words.<br />
<br />
This is important for translation as a compositional multiword might be possible to treat just as two simple words, while a completely non-compositional one has to be fully listed.<br />
<br />
===Variability===<br />
<br />
{{comment|TODO; find examples}}<br />
<br />
Multiwords may allow variation, either for inflection, e.g. French ''vaut la peine, valait la peine'', or by allowing modifying words, Kyrgyz: ''ичим айланат, ичим '''көп''' айланат'' "my stomach hurts", "my stomach hurts '''a lot'''". Multiword verbs may allow passivisation or not.<br />
<br />
They may inflect in only one position, be it: <br />
<br />
* At the end, ''pazartesi gün, pazartesi gün'''ler'''''{{slc|tur}} or<br />
* In the middle ''hôtel de ville, hôtel'''s''' de ville''{{slc|fra}}<br />
<br />
Or on both sides:<br />
<br />
* ''телефонная будка, телефонн'''ой''' будк'''и''''' {{slc|rus}} and <br />
* ''ич айлан-, ич'''им''' айлан'''ат''''' {{slc|kir}}.<br />
* ''курсак ач-, курсаг'''ым''' ач'''ып жатат''''' {{slc|kir}}.<br />
<br />
===Contiguity===<br />
<br />
{{comment|TODO; find examples}}<br />
<br />
Along with internal variation, multiwords may also be contiguous or discontiguous. Contiguous means that the words follow each other, separated by spaces. Most of the above examples are contiguous multiwords. They can also be discontiguous, like phrasal verbs in Swedish and Dutch, examples might be: ''lopen uit de hand'', ''äta upp'' and ''tilføje'':<br />
<br />
* ''Twee jaar later '''loopt''' de situatie in het land onder de indruk van massabetogingen '''uit de hand'''.''{{slc|nl}}<br />
* ''Alla stygga barnen '''åt''' tigern '''upp'''.''{{slc|sv}}<br />
* '''''Føje''' et ord eller en sætning '''til''' den brugerdefinerede ordbog.''{{slc|da}}<br />
<br />
==Practice==<br />
<br />
===Simple multiword===<br />
<br />
===Clitic===<br />
<br />
<br />
<br />
[[Category:Tartu Apertium Course|Session 2]]</div>Francis Tyers