Difference between revisions of "Polish and Russian/Project description"
Mary.szmary (talk | contribs) |
Mary.szmary (talk | contribs) |
||
Line 13: | Line 13: | ||
At first, the goal of the project was to achieve 90% coverage of corpora used. It turned out to be a challenging task for three months of work, partly because of the peculiarities of Slavic morphology and morphophonology and the lack of available bilingual electronic dictionaries (and poverty of the latter). As a result, starting from the end of July our main task was to lowering the number of mistakes than on achieving high coverage. |
At first, the goal of the project was to achieve 90% coverage of corpora used. It turned out to be a challenging task for three months of work, partly because of the peculiarities of Slavic morphology and morphophonology and the lack of available bilingual electronic dictionaries (and poverty of the latter). As a result, starting from the end of July our main task was to lowering the number of mistakes than on achieving high coverage. |
||
{|class=wikitable |
{|class=wikitable |
||
Line 22: | Line 21: | ||
! Coverage !! ''Russian'' (%) !! ''Polish'' (%) |
! Coverage !! ''Russian'' (%) !! ''Polish'' (%) |
||
|- |
|- |
||
| '''Raw coverage''' || |
| '''Raw coverage''' || 94.2% || 87.6% |
||
|- |
|- |
||
|} |
|} |
Revision as of 16:51, 23 August 2016
Commitment
The list of all commits: https://apertium.projectjj.com/gsoc2016/maryszmary.html
Monolingual Polish package: https://svn.code.sf.net/p/apertium/svn/languages/apertium-pol/
Monolingual Russian package: https://svn.code.sf.net/p/apertium/svn/languages/apertium-rus/
Bilingual Polish-Russian package: https://svn.code.sf.net/p/apertium/svn/incubator/apertium-pol-rus/
Project description
Statistics
At first, the goal of the project was to achieve 90% coverage of corpora used. It turned out to be a challenging task for three months of work, partly because of the peculiarities of Slavic morphology and morphophonology and the lack of available bilingual electronic dictionaries (and poverty of the latter). As a result, starting from the end of July our main task was to lowering the number of mistakes than on achieving high coverage.
Coverage | Polish → Russian (%) | Russian → Polish (%) |
---|---|---|
Trimmed coverage | 85.1% | 83.8% |
Coverage | Russian (%) | Polish (%) |
Raw coverage | 94.2% | 87.6% |
The number of lemmas in bilingual dictionary: 48,836.
The number of lemmas in Polish dictionary: 10,023.