Difference between revisions of "Talk:Starting a new language with HFST"
Jump to navigation
Jump to search
(Created page with ' ==Revision 1989== <pre> $ make hfst-lexc apertium-tr-tk.tk.lexc -o tk-tr.lexc.hfst hfst-lexc: warning: Defaulting to foma type (since it has native lexc support); Use command…') |
|||
Line 1: | Line 1: | ||
==Lexicon== |
|||
<pre> |
|||
Multichar_Symbols |
|||
%<n%> ! Существительное |
|||
%<nom%> ! Именительный |
|||
%<pl%> ! Множественное число |
|||
%{A%} ! 'a' или 'е', часть архифонемы |
|||
⚫ | |||
%> ! Граница морфем |
|||
LEXICON Root |
|||
NounStems; |
|||
LEXICON Ninfl |
|||
%<n%>: #; |
|||
%<n%>%<pl%>:%>l%{A%}r #; |
|||
LEXICON NounStems |
|||
maşgala Ninfl;! " семья " |
|||
esger Ninfl;! "солдат" |
|||
</pre> |
|||
==Twol== |
|||
<pre> |
|||
Alphabet |
|||
A B Ç D E Ä F G H I J Ž K L M N |
|||
Ň O Ö P R S Ş T U Ü W Y Ý Z |
|||
a b ç d e ä f g h i j ž k l m n |
|||
ň o ö p r s ş t u ü w y ý z |
|||
%{A%}:a %>:0 ; |
|||
Sets |
|||
Consonant = B Ç D F G H J Ž K L M N Ň P R S Ş T W Z |
|||
b ç d f g h j ž k l m n ň p r s ş t w z ; |
|||
FrontVowel = Ä E I Ö Ü ä e i ö ü ; |
|||
BackVowel = A Y O U a y o u ; |
|||
NonBack = Consonant FrontVowel %> ; |
|||
NonFront = Consonant BackVowel %> ; |
|||
Rules |
|||
"Front harmony in suffixes" |
|||
%{A%}:e <=> FrontVowel: NonBack:* _ ; |
|||
</pre> |
|||
==Makefile== |
|||
<pre> |
|||
all: |
|||
hfst-lexc apertium-tr-tk.tk.lexc -o tk-tr.lexc.hfst |
|||
hfst-twolc -R -i apertium-tr-tk.tk.twol -o tk-tr.twol.hfst |
|||
hfst-compose-intersect -1 tk-tr.lexc.hfst -2 tk-tr.twol.hfst -o tr-tk.autogen.hfst |
|||
hfst-fst2strings tr-tk.autogen.hfst |
|||
</pre> |
|||
==Revisions== |
|||
⚫ | |||
<pre> |
<pre> |
Revision as of 14:03, 23 January 2012
Lexicon
Multichar_Symbols %<n%> ! Существительное %<nom%> ! Именительный %<pl%> ! Множественное число %{A%} ! 'a' или 'е', часть архифонемы %> ! Граница морфем LEXICON Root NounStems; LEXICON Ninfl %<n%>: #; %<n%>%<pl%>:%>l%{A%}r #; LEXICON NounStems maşgala Ninfl;! " семья " esger Ninfl;! "солдат"
Twol
Alphabet A B Ç D E Ä F G H I J Ž K L M N Ň O Ö P R S Ş T U Ü W Y Ý Z a b ç d e ä f g h i j ž k l m n ň o ö p r s ş t u ü w y ý z %{A%}:a %>:0 ; Sets Consonant = B Ç D F G H J Ž K L M N Ň P R S Ş T W Z b ç d f g h j ž k l m n ň p r s ş t w z ; FrontVowel = Ä E I Ö Ü ä e i ö ü ; BackVowel = A Y O U a y o u ; NonBack = Consonant FrontVowel %> ; NonFront = Consonant BackVowel %> ; Rules "Front harmony in suffixes" %{A%}:e <=> FrontVowel: NonBack:* _ ;
Makefile
all: hfst-lexc apertium-tr-tk.tk.lexc -o tk-tr.lexc.hfst hfst-twolc -R -i apertium-tr-tk.tk.twol -o tk-tr.twol.hfst hfst-compose-intersect -1 tk-tr.lexc.hfst -2 tk-tr.twol.hfst -o tr-tk.autogen.hfst hfst-fst2strings tr-tk.autogen.hfst
Revisions
Revision 1990
$ make hfst-lexc apertium-tr-tk.tk.lexc -o tk-tr.lexc.hfst hfst-lexc: warning: Defaulting to foma type (since it has native lexc support); Use command-line option --format to override Root...1, Ninfl...2, NounStems...2 Building lexicon... Determinizing... Minimizing... Done! hfst-twolc -R -i apertium-tr-tk.tk.twol -o tk-tr.twol.hfst Reading input from apertium-tr-tk.tk.twol. Writing output to tk-tr.twol.hfst. syntax error, unexpected SYMBOL_SPACE on line 6: Sets Aborted. make: *** [all] Ошибка 1