Difference between revisions of "Northern Sámi and Finnish/Completed tasks"
Latest revision as of 07:15, 10 December 2015
Adding subcategories (Dem, Itg, etc.) to pronouns in Omorfi
Fred Karlsson's constraint grammar for Finnish has been GPL'd, and is available and undergoing conversion to CG3 here: https://victorio.uit.no/langtech/trunk/kt/fin/src
- This should be converted in an Apertium-compatible manner from the start! No using reserved symbols (e.g. <, > and /).
How can we restrict generation of alternative forms in the Sámi generator? In lttoolbox this is done with LR (only analyse)/RL (only generate) markings.
- As follows: the forms that should be analysed but not generated should be marked as such in the source code. The tag for this is +Use/NG. All forms given this tag will be included in the analyser sme.fst but excluded from the generator isme.fst.
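The effect of the +Use/NG tag can be sketched as a simple filter. This is only an illustration of the idea, not the actual GT build rule: it assumes a toy source with one analysis:surface pair per line, and the entries and the function name `strip_no_generate` are invented for the example.

```shell
# Hypothetical helper: drop every entry tagged +Use/NG (fixed-string match),
# leaving only the entries the generator should know about.
strip_no_generate() {
    grep -v -F -e '+Use/NG'
}

# Toy entries, invented for this example (not taken from the real lexicon).
lexicon='forma1+N+Sg+Nom:forma1
forma2+N+Sg+Nom+Use/NG:forma2'

# The analyser would be built from all entries; the generator only from
# the entries that survive the filter.
printf '%s\n' "$lexicon" | strip_no_generate
```

The analyser side needs no filtering at all, which is why a single tag is enough to express the analyse-only restriction.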
hfst-lookup or something similar to _generate_ analyses that come in with ^ and $
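One possible workaround, sketched under assumptions: strip the ^ and $ delimiters and put each analysis on its own line before handing it to a lookup tool. The helper name `unwrap_units` and the sample analyses are invented for illustration, and the sketch does not handle escaped delimiters inside a unit.

```shell
# Hypothetical helper: pull each ^...$ unit out of the input stream and
# print its contents one per line, without the delimiters.
unwrap_units() {
    grep -o '\^[^$]*\$' | sed 's/^\^//; s/\$$//'
}

# Sample analyses are illustrative only.
printf '%s\n' '^olla+V+Inf$ ^talo+N+Sg+Nom$' | unwrap_units
```

The resulting lines could then be piped into hfst-lookup with the generator transducer; the generated forms would still need to be wrapped in ^ and $ again afterwards.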
Can we rig up SVN to pull in the twol file from GT svn directly?
- No, we can't. SVN doesn't allow file externals. -- Francis Tyers
Some tags do not get replaced by the relabel script:
olleet olla+V[GEN=ACT]+Pcp1+Pos+Pl+Nom
- This should be taken care of. Further problems might be due to a missing Multichar_symbol in the omorfi.hlexc file. -- Francis Tyers
Sub-categorise conjunctions into CC/CS ?
Generation with correct case. At the moment the North Sámi generator cannot generate words with initial caps.
Syntax tags should not use > and <. Until these are replaced, the translator should not run the syntax section of the CG (section #5). See the modes.xml file.
- Replaced. To use with the rest of the GT toolchain, use sed 's/→/>/g' | sed 's/←/</g' -- Francis Tyers
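As a quick check of the conversion, the sed pipeline quoted above can be wrapped in a small function and run on a sample line. The function name `to_gt_tags` and the sample tags are assumptions made for this example, not names from the GT or Apertium sources.

```shell
# Convert the arrow characters back to the > and < that the GT tools expect.
to_gt_tags() {
    sed 's/→/>/g' | sed 's/←/</g'
}

# Sample syntax tags, illustrative only.
printf '%s\n' '@→N @←ADVL' | to_gt_tags
```

Both substitutions use the g flag, so every arrow on a line is converted, not just the first.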