Difference between revisions of "User talk:Unhammer"

 
(5 intermediate revisions by 3 users not shown)
 
Welcome to the Apertium Wiki! - [[User:Francis Tyers|Francis Tyers]] 22:46, 21 March 2009 (UTC)
 
   
== vislcg3 -w capitalisation option ==
 
   
The vislcg3 -w option already outputs this:
 
<pre>
 
in: JEG/jeg<prn>, out: JEG/JEG<prn>
 
in: JeG/jeg<prn>, out: JeG/JEG<prn>
 
in: jeG/jeg<prn>, out: jeG/jeg<prn>
 
in: Jeg/jeg<prn>, out: Jeg/Jeg<prn>
 
in: jeg/jeg<prn>, out: jeg/jeg<prn>
 
</pre>
 
   
But we can't just look at the first and last character if the lemma is e.g. an acronym; we have to look at the first '''lowercase character''' in the lemma (baseform):
 
# in: bcg-vaksine/BCG-vaksine<n><m><sg><ind> out: bcg-vaksine/BCG-vaksine
 
# in: BCG-vaksine/BCG-vaksine<n><m><sg><ind> out: bcg-vaksine/BCG-vaksine
 
# in: BCG-VAKSINE/BCG-vaksine<n><m><sg><ind> out: bcg-vaksine/BCG-VAKSINE
 
# in: Bcg-vaksine/BCG-vaksine<n><m><sg><ind> out: bcg-vaksine/BCG-vaksine
 
# in: Bcg-Vaksine/BCG-vaksine<n><m><sg><ind> out: bcg-vaksine/BCG-Vaksine
 
   
So in 3 above, the first lowercase character of the lemma is the 'v'. If the input character at _that_ position is uppercase and the final one is too, we uppercase the whole lemma. If it is uppercase while the final one is lowercase, as in 5 above, we capitalise the lemma at that position.
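A minimal Python sketch of the rule described above, to make the cases easier to check. This is not vislcg3 code: the function name `transfer_case` is made up for illustration, and it assumes the surface form and the lemma line up character by character, which holds for the examples here but not for inflected forms in general.

```python
def transfer_case(surface: str, lemma: str) -> str:
    """Copy the casing pattern of `surface` onto `lemma` (hypothetical helper)."""
    # Position of the first lowercase character in the lemma, e.g. the 'v'
    # in "BCG-vaksine"; None if the lemma has no lowercase letters at all.
    pivot = next((i for i, ch in enumerate(lemma) if ch.islower()), None)

    # Nothing to decide on, or the surface form is too short to compare:
    # leave the lemma as it is. (Assumption: index-by-index alignment.)
    if pivot is None or pivot >= len(surface):
        return lemma

    # The surface form is not uppercase at the pivot: keep the lemma's casing.
    if not surface[pivot].isupper():
        return lemma

    # Uppercase at the pivot and at the end: treat the word as all caps.
    if surface[-1].isupper():
        return lemma.upper()

    # Uppercase at the pivot only: capitalise the lemma at that position.
    return lemma[:pivot] + lemma[pivot].upper() + lemma[pivot + 1:]


if __name__ == "__main__":
    for surface in ["bcg-vaksine", "BCG-vaksine", "BCG-VAKSINE",
                    "Bcg-vaksine", "Bcg-Vaksine"]:
        print(surface, "->", transfer_case(surface, "BCG-vaksine"))
```

With a lemma that starts in lowercase, such as `jeg`, the pivot is position 0, so the sketch falls back to the plain first-and-last-character check.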
 

Latest revision as of 08:45, 6 December 2023

Welcome to the Apertium Wiki! - Francis Tyers 22:46, 21 March 2009 (UTC)

Hei! -Ivar Refsdal

== Talk:Liste des paires de langues ==

most of the three-letter-code pairs seem to be missing? (sme-nob, sme-fin, etc.)

and it was for pairs you were working on. Francis Tyers wrote to me about that later. The problem was a wrong character, hard to spot, in a regular expression (l instead of ]). So it didn't work when there was a three-letter code on the left side. See the answer I gave to Francis on my or his discussion page. Bech 16:53, 3 April 2012 (UTC)

== Language is hard – hẘæt! ==

   <spectie> tuõˊlˈlʼjed <-- this is a word
   <Flammie> at least skolt sami doesn't have combining underlines or
             word-internal exclamation marks and vertical bars
   <spectie> yeah, they win by not having word-internal exclamation marks
   <spectie> pretty much ever language apart from armenian wins there >__>