Difference between revisions of "Polish and Russian/Project description"

From Apertium
Jump to navigation Jump to search
Line 11: Line 11:
   
 
== Statistics ==
 
== Statistics ==
  +
  +
At first, the goal of the project was to achieve 90% coverage of corpora used.
  +
  +
{|class=wikitable
  +
! Coverage !! ''Sicilian → Spanish'' (%) !! ''Spanish → Sicilian'' (%)
  +
|-
  +
|'''Trimmed coverage''' || 83.4% || 83,8%
  +
|-
  +
! Coverage !! ''Sicilian'' (%) !! ''Spanish'' (%)
  +
|-
  +
| '''Raw coverage''' || 85.7% || 91,6%
  +
|-
  +
|}
  +
  +
'''The number of lemmas in bilingual dictionary: ''' 48,836.
  +
  +
'''The number of lemmas in Polish dictionary: ''' 10,023.
   
 
== Future work ==
 
== Future work ==

Revision as of 16:34, 23 August 2016

Commitment

The list of all commits: https://apertium.projectjj.com/gsoc2016/maryszmary.html

Monolingual Polish package: https://svn.code.sf.net/p/apertium/svn/languages/apertium-pol/

Monolingual Russian package: https://svn.code.sf.net/p/apertium/svn/languages/apertium-rus/

Bilingual Polish-Russian package: https://svn.code.sf.net/p/apertium/svn/incubator/apertium-pol-rus/

Project description

Statistics

At first, the goal of the project was to achieve 90% coverage of corpora used.

Coverage Sicilian → Spanish (%) Spanish → Sicilian (%)
Trimmed coverage 83.4% 83,8%
Coverage Sicilian (%) Spanish (%)
Raw coverage 85.7% 91,6%

The number of lemmas in bilingual dictionary: 48,836.

The number of lemmas in Polish dictionary: 10,023.

Future work

Resources