Difference between revisions of "Maltese and Hebrew/Final report"
Jump to navigation
Jump to search
Line 2: | Line 2: | ||
==Description== |
==Description== |
||
===Maltese=== |
===Maltese=== |
||
===Hebrew=== |
===Hebrew=== |
||
In comparison, writing he.dix and handling Hebrew generation was fairly easy. Other than my own Hebrew knowledge, this was mostly due to research I've done before GSoC started (for my application). |
|||
We have tweaked some code from the [http://hspell.ivrix.org.il/ hspell] Hebrew spellchecker project, to get most of the open-category terms. |
|||
This way we easily got good enough coverage of nouns, verbs, adjectives, etc. |
|||
For closed-category terms, I added a lot of them at the beginning of the project, and then fixed what was needed as we went alone with the bidix. |
|||
===Bidix=== |
===Bidix=== |
Revision as of 14:27, 25 August 2011
Description
Maltese
Hebrew
In comparison, writing he.dix and handling Hebrew generation was fairly easy. Other than my own Hebrew knowledge, this was mostly due to research I've done before GSoC started (for my application).
We have tweaked some code from the hspell Hebrew spellchecker project, to get most of the open-category terms. This way we easily got good enough coverage of nouns, verbs, adjectives, etc.
For closed-category terms, I added a lot of them at the beginning of the project, and then fixed what was needed as we went alone with the bidix.
Bidix
Transfer rules
Statistics
- Dictionaries
- Coverage
- Maltese Wikipedia ( , std. dev.: )
- Maltese news sites ( , std. dev.: )
- Maltese Scannel corpus ( , std. dev.: )
- Rules
- Error rate