Search results

  • ...etc. By the time you finish you should have a reasonable model of missing unknown words. <match case="Aa" unknown="true"><add-reading tags="np.ant"/></match>
    4 KB (558 words) - 13:07, 26 June 2020
  • ==Adding/fixing unknown words== If you have some words that are unknown in a certain language pair, you can help out by simply writing a list of wo
    3 KB (549 words) - 09:17, 26 May 2021
  • ...words as opposed to "blank" chars. Its main effect is on tokenisation of ''unknown'' words, since non-alphabet characters may still be part of a ''known'' wor ...a word not in the dictionary, but composed of alphabetic chars, we get an unknown-word analysis:
    2 KB (400 words) - 08:52, 28 April 2014
  • Number of tokenised words unknown to analyser: 63730 — 43.1 % of tokens had * unknown to bidix: 112 — 0.1 % of tokens had @
    4 KB (496 words) - 18:27, 19 June 2017
  • ...%BBB% and run it through Apertium's %AAA%-%BBB% translator to identify 50 unknown forms. Add the stems of these forms to the analyser in an appropriate way
    32 KB (4,862 words) - 06:23, 5 December 2019
  • *'''-f --missing-freqs:''' path to sqlite3 database of words that were unknown (requires <code>sudo apt-get install sqlite3</code>) *'''markUnknown=no''' (optional): include this to remove "*" in front of unknown words
    37 KB (5,132 words) - 16:36, 5 June 2020
  • === Unknown ===
    8 KB (1,234 words) - 17:01, 3 December 2020
  • ===Unknown===
    38 KB (6,273 words) - 11:01, 24 December 2020
  • Number of unknown words (marked with a star) in test: 117 Percentage of unknown words: 3,87 %
    6 KB (845 words) - 20:08, 3 October 2011
  • Both [[lttoolbox]] and [[HFST]] have methods for dynamically analysing unknown compound words into their constituent parts. See below for how it's done in ..., and only do compounding if the other methods would give an unknown word. Unknown words are made up of strings of characters from <alphabet>, separated
    16 KB (2,689 words) - 09:07, 6 April 2021
  • Number of unknown words (marked with a star) in test: 117 Percentage of unknown words: 3,87 %
    98 KB (16,331 words) - 20:28, 30 September 2011
  • Note: Reference translation MUST have no unknown-word marks, even for systems that do not mark unknown words with a star.
    6 KB (981 words) - 09:13, 21 November 2021
  • hsb.dix:25: element s: validity error : IDREF attribute n references an unknown ID "nom" hsb.dix:33: element s: validity error : IDREF attribute n references an unknown ID "nom"
    19 KB (3,440 words) - 12:10, 26 September 2016
  • ...interchunk, this is quite easy, as each unknown word has the chunk lemma 'unknown', but it's un- or under-documented how this should be done using apertium-t
    2 KB (209 words) - 11:06, 24 March 2012
  • hsb.dix:25: element s: validity error : IDREF attribute n references an unknown ID "nom" hsb.dix:33: element s: validity error : IDREF attribute n references an unknown ID "nom"
    25 KB (2,260 words) - 18:36, 12 January 2012
  • === Unknown ===
    4 KB (682 words) - 11:14, 16 April 2012
  • ==I get "Unknown option --xpath"==
    5 KB (863 words) - 09:04, 10 October 2017
  • ==How do I see unknown words?==
    2 KB (331 words) - 12:03, 28 February 2017
  • ...high, and compares with commercial systems -- over 95% coverage (around 5 unknown words out of 100 words), and between 3-7% word-error rate (out of 100 words ...final coverage of the system was around 90%, e.g. over a set of corpora 10 unknown words out of 100 on average. The word-error rate was around 17%, meaning th
    12 KB (1,679 words) - 12:00, 31 January 2012
  • ...in extending dictionaries by assigning stems and inflectional paradigms to unknown words] (pp.19-26.). EAMT 2014 – 17th Annual conference of the European As ...sites/default/files/FreeRBMT-2012.pdf#33 Choosing the correct paradigm for unknown words in rule-based machine translation systems]. Third International Works
    33 KB (4,418 words) - 11:52, 29 December 2021
