User:Francis Tyers/Sandbox2
Revision as of 15:53, 2 August 2011 by Francis Tyers (talk | contribs)
Corpus: cawiki-20110616-pages-articles.xml.bz2
cleaned with `aq-wikicrp'
1758582 lines
531983 unique analyses
2740 analyses with >1 translation
289 words (lemma+pos) with >1 translation in corpus
712 words in dictionary with >1 translation
1.03 fertility of dictionary over corpus (i.e. total number of word:word translations / total number of words)
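
The fertility ratio defined above can be sketched as follows. This is only an illustration: the function name `fertility`, the (word, translation) pair format and the toy entries are assumptions, not the script or data used to produce the figures on this page, and it counts each word (lemma+pos) once rather than weighting by corpus frequency.

```python
from collections import defaultdict


def fertility(pairs):
    """Return total word:word translations divided by total words.

    `pairs` is an iterable of (source_word, translation) tuples, where
    each source_word is a lemma+pos string. Duplicate pairs are counted
    once. (Hypothetical input format, chosen for illustration.)
    """
    translations = defaultdict(set)
    for source_word, translation in pairs:
        translations[source_word].add(translation)
    if not translations:
        raise ValueError("no translation pairs given")
    total_translations = sum(len(t) for t in translations.values())
    return total_translations / len(translations)


def ambiguous_words(pairs):
    """Return the sorted source words that have more than one translation."""
    translations = defaultdict(set)
    for source_word, translation in pairs:
        translations[source_word].add(translation)
    return sorted(w for w, t in translations.items() if len(t) > 1)


# Made-up toy entries, not taken from the actual dictionary.
toy_pairs = [
    ("a<n>", "x<n>"),
    ("a<n>", "y<n>"),
    ("b<n>", "z<n>"),
]
toy_fertility = fertility(toy_pairs)        # 3 translations / 2 words
toy_ambiguous = ambiguous_words(toy_pairs)  # words with >1 translation
```

A fertility close to 1, as reported above, means that nearly every word seen in the corpus has a single translation in the dictionary.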