Difference between revisions of "Speeding up monodix creation"
Jump to navigation
Jump to search
(New page: This page outlines some ideas for increasing the speed at which monolingual dictionaries (analysers) can be created. ==Extract== ==Tag transfer== Category:Documentation) |
|||
Line 6: | Line 6: | ||
==Tag transfer== |
==Tag transfer== |
||
+ | Try this at some point: |
||
+ | <pre> |
||
+ | <spectie> you have an aligned corpus |
||
+ | <spectie> polish--czech, czech--slovak, danish--swedish |
||
+ | <spectie> and you have an analyser for polish, czech or danish |
||
+ | <spectie> you want to make an analyser for swedish |
||
+ | <spectie> you make templates from the paradigms in the danish analyser |
||
+ | <spectie> tag the danish of the corpus |
||
+ | <spectie> that you have |
||
+ | <spectie> align it with the swedish side |
||
+ | <spectie> then read off the alignments, taking the surface forms from the right side and the tags from the left side |
||
+ | </pre> |
||
[[Category:Documentation]] |
[[Category:Documentation]] |
Revision as of 22:09, 9 April 2008
This page outlines some ideas for increasing the speed at which monolingual dictionaries (analysers) can be created.
Extract
Tag transfer
Try this at some point:
<spectie> you have an aligned corpus <spectie> polish--czech, czech--slovak, danish--swedish <spectie> and you have an analyser for polish, czech or danish <spectie> you want to make an analyser for swedish <spectie> you make templates from the paradigms in the danish analyser <spectie> tag the danish of the corpus <spectie> that you have <spectie> align it with the swedish side <spectie> then read off the alignments, taking the surface forms from the right side and the tags from the left side