Apertium has moved from SourceForge to GitHub.
If you have any questions, please come and talk to us on #apertium on irc.freenode.net or contact the GitHub migration team.

English and Catalan/Workplan

From Apertium
Jump to: navigation, search

You can find the detailed goals for each week here.

Week Dates Work done Bidix WER / PER Coverage
Post-application period 4 April - 29 May
  • Make personal pronouns work again
  • Fix some modal verbs
  • Write documentation on the current status of transfer rules
35,000

41.15%/29.34% (en-ca) 47.63%/38.5% (eng-cat)

85.9%
1 30 May - 4 June
  • Make toponyms, names and surnames work properly again
  • Reorganise proper nouns in bidix
  • Expand bidix
  • Write documentation on the current status of transfer rules
36,987 (37,000)  ? (45%)  ? (86.3%)
2 5 June - 11 June
  • Expand bidix
  • Port lexical selection rules from apertium-en-es
  • Write documentation on the current status of transfer rules
38,829 (39,000)  ? (43.5%)  ? (86.7%)
3 12 June - 18 June
  • Expand bidix
  • Bidix testvoc to find lemmas with wrong tags
40,841 (41,000)  ? (42%)  ? (87.1%)
4 19 June - 25 June
  • Expand bidix
  • Fix wrong tags in bidix
  • Write documentation on the current status of transfer rules
  • Begin change to proper nouns with gender and number
42,778 (43,000)  ? (40.5%)  ? (87.5%)
5 26 June - 2 July
  • Expand bidix
  • Write documentation on the current status of transfer rules
  • Crossdics (eng-spa-cat)
  • Bidix testvoc to establish future priorities

Deliverable #1

45,003 (45,000) 55.7%/42% (39%) TER: 54% 88.78% (87.8%)
6 3 July - 9 July
  • Expand bidix
  • Write documentation on the current status of transfer rules
  • Crossdics (eng-spa-cat)
  • Tag fixes after testvoc
  • Evaluation of common tagger errors to prepare for tagger training
47,110 (47,000)  ? (38%)  ? (88.1%)
7 10 July - 16 July
  • Expand bidix
  • Tag fixes after testvoc
  • Tagger training preparation (tagged corpora unification)
  • Transfer rule refactoring
48,488 (49,000)  ? (37%)  ? (88.5%)
8 17 July - 23 July
  • Expand bidix
  • Tag fixes after testvoc
  • Tagger training preparation (tagged corpora unification)
  • Transfer rule refactoring
  • Allow English genitives to be generated
  • Start fixing broken proper nouns in Catalan
50,537 (51,000)  ? (36%)  ? (88.8%)
9 24 July - 30 July
  • Expand bidix
  • Tagger training preparation (tagged corpora unification)
  • Transfer rule refactoring

Deliverable #2

53,967 (53,000) 60.8%/42.7% (35%) TER: 60% 91.05% (89.1%)
10 31 July - 6 August
  • Perceptron tagger training
  • Transfer rule refactoring
53,967 (55,000)  ? (34%)  ? (89.4%)
11 7 August - 13 August
  • Expand bidix
  • Transfer rule refactoring
  • New transfer rules (ENG>CAT)
  • Transfer rule documentation automatisation
66,185 (56,500)  ? (33.5%)  ? (89.6%)
12 14 August - 20 August
  • New transfer rules (ENG>CAT)
  • Write documentation
66,185 (58,000)  ? (33%)  ? (89.8%)
13 21 August - 27 August
  • New transfer rules (ENG>CAT)
  • Transfer rule refactoring
  • Contraint Grammar restrictions
  • Write documentation

Final evaluation

66,185 (59,000) 51.1%/33.7% (32.5%) 92.5% (90.0%)
Personal tools