<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.apertium.org/w/index.php?action=history&amp;feed=atom&amp;title=Talk%3ALextor</id>
	<title>Talk:Lextor - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.apertium.org/w/index.php?action=history&amp;feed=atom&amp;title=Talk%3ALextor"/>
	<link rel="alternate" type="text/html" href="https://wiki.apertium.org/w/index.php?title=Talk:Lextor&amp;action=history"/>
	<updated>2026-05-29T23:21:16Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.34.1</generator>
	<entry>
		<id>https://wiki.apertium.org/w/index.php?title=Talk:Lextor&amp;diff=12544&amp;oldid=prev</id>
		<title>Muki987: /* Words and expressions */</title>
		<link rel="alternate" type="text/html" href="https://wiki.apertium.org/w/index.php?title=Talk:Lextor&amp;diff=12544&amp;oldid=prev"/>
		<updated>2009-05-20T07:56:18Z</updated>

		<summary type="html">&lt;p&gt;&lt;span dir=&quot;auto&quot;&gt;&lt;span class=&quot;autocomment&quot;&gt;Words and expressions&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;Revision as of 07:56, 20 May 2009&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 24:&lt;/td&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 24:&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-deleted&quot;&gt;&lt;div&gt;appears, there are words, that indicate, the speech&lt;/div&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-added&quot;&gt;&lt;div&gt;appears, there are words, that indicate, the speech&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-deleted&quot;&gt;&lt;div&gt;is about travelling (horses, motors, trains, way,&lt;/div&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-added&quot;&gt;&lt;div&gt;is about travelling (horses, motors, trains, way,&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-deletedline diff-side-deleted&quot;&gt;&lt;div&gt;&lt;del class=&quot;diffchange diffchange-inline&quot;&gt; &lt;/del&gt;and the like), it is likely, we are talking about travel,&lt;/div&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-addedline diff-side-added&quot;&gt;&lt;div&gt;and the like), it is likely, we are talking about travel,&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-deleted&quot;&gt;&lt;div&gt;if the words indicate sport or working environment&lt;/div&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-added&quot;&gt;&lt;div&gt;if the words indicate sport or working environment&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-deleted&quot;&gt;&lt;div&gt;(training, human leading, success, and the like)&lt;/div&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-context diff-side-added&quot;&gt;&lt;div&gt;(training, human leading, success, and the like)&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Muki987</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.apertium.org/w/index.php?title=Talk:Lextor&amp;diff=12543&amp;oldid=prev</id>
		<title>Muki987: New page: ==Status== Lextor exists, but it is not turned on. It was found that using Lextor did not provide an improvement in translation quality above the &#039;baseline&#039; of just choosing the most frequ...</title>
		<link rel="alternate" type="text/html" href="https://wiki.apertium.org/w/index.php?title=Talk:Lextor&amp;diff=12543&amp;oldid=prev"/>
		<updated>2009-05-20T07:55:57Z</updated>

		<summary type="html">&lt;p&gt;New page: ==Status== Lextor exists, but it is not turned on. It was found that using Lextor did not provide an improvement in translation quality above the &amp;#039;baseline&amp;#039; of just choosing the most frequ...&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;==Status==&lt;br /&gt;
Lextor exists, but it is not turned on.&lt;br /&gt;
It was found that using Lextor did not provide an improvement in&lt;br /&gt;
translation quality above the &amp;#039;baseline&amp;#039; of just choosing the most&lt;br /&gt;
frequent or general translation. This is why it is turned off.&lt;br /&gt;
==Approaches==&lt;br /&gt;
There are many approaches to lexical selection, we&amp;#039;re studying them and&lt;br /&gt;
hope to implement something in the future. Although we don&amp;#039;t have&lt;br /&gt;
anything concrete planned for now.&lt;br /&gt;
&lt;br /&gt;
I think Felipe was thinking of something based on hierarchical decision&lt;br /&gt;
lists, but he might be able to offer a more in depth reply.&lt;br /&gt;
&lt;br /&gt;
==Words and expressions==&lt;br /&gt;
Since properly translated  words and properly &lt;br /&gt;
translated  expressions are the key  to translation&lt;br /&gt;
quality, I think, exactly the modules, that handle&lt;br /&gt;
word and expression recognition and translation &lt;br /&gt;
are the key to final quality.&lt;br /&gt;
&lt;br /&gt;
A human translator has for example the word coach.&lt;br /&gt;
The word can mean trainer or a mean, in that we travel. &lt;br /&gt;
If in the text context (sentence) where the coach&lt;br /&gt;
appears, there are words, that indicate, the speech&lt;br /&gt;
is about travelling (horses, motors, trains, way,&lt;br /&gt;
 and the like), it is likely, we are talking about travel,&lt;br /&gt;
if the words indicate sport or working environment&lt;br /&gt;
(training, human leading, success, and the like)&lt;br /&gt;
we are talking about a trainer. &lt;br /&gt;
&lt;br /&gt;
If it is not possible to find he meaning using just the &lt;br /&gt;
sentence, the whole text must be considered. &lt;br /&gt;
If all fails, than we must  fall back to  the &lt;br /&gt;
statistically most likely meaning.&lt;br /&gt;
&lt;br /&gt;
In fact the above is the only way, I can imagine,&lt;br /&gt;
that works. Therefore I am really curious, how really lextor&lt;br /&gt;
is intended to work, using real example words like&lt;br /&gt;
coach and real example texts. The better things are &lt;br /&gt;
documented, the more the chance, that we get a working&lt;br /&gt;
solution. The wiki is a very good mean for documentation,&lt;br /&gt;
and I suggest to add more throughout documentation&lt;br /&gt;
for word disambiguation with understandable step-by-step&lt;br /&gt;
examples.&lt;br /&gt;
&lt;br /&gt;
It is  possible, that lextor&amp;#039;s way is viable,&lt;br /&gt;
it just needs much larger corpora, that we used to &lt;br /&gt;
give it for training. Maybe the corpus minimum size &lt;br /&gt;
and coverage need to be specified.&lt;br /&gt;
&lt;br /&gt;
You write &amp;quot;run all possible disambiguations&amp;quot;.&lt;br /&gt;
What does that mean in the case of coach?&lt;br /&gt;
Take texts where coach is a trainer, and &lt;br /&gt;
remember all words, than take texts, where coach is &lt;br /&gt;
a mean to travel and notice all words. &lt;br /&gt;
Finally take the text to translate, and &lt;br /&gt;
check all words in it (first just the sentence),&lt;br /&gt;
to which version it is more similar?&lt;br /&gt;
&lt;br /&gt;
However to the above approach needs aligned large &lt;br /&gt;
corporas of both languages. And I can not imagine&lt;br /&gt;
anything else, that works. Therefore my &lt;br /&gt;
request to clarify such questions.&lt;/div&gt;</summary>
		<author><name>Muki987</name></author>
		
	</entry>
</feed>