Difference between revisions of "User:Irene/workplan"
Line 28: | Line 28: | ||
6/24: migrated to source forge. added testing set. |
6/24: migrated to source forge. added testing set. |
||
|| |
|| |
||
|- |
|- |
||
| || || |
| || || |
||
| |
|||
|- |
|- |
||
Line 36: | Line 37: | ||
|- |
|- |
||
| 5 || 6/26 - 7/2 || finite-state acceptor for 'take out' |
| 5 || 6/26 - 7/2 || hard-coded finite-state acceptor for 'take out' |
||
|| |
|| |
||
6/26: trying to make the program backtrack when it gets to <ANY_CHAR> or <ANY_TAG> |
6/26: trying to make the program backtrack when it gets to <ANY_CHAR> or <ANY_TAG> <br /> |
||
6/27: debugging & supporting any number of tags |
6/27: debugging & supporting any number of tags <br /> |
||
6/28: successfully reads and prints ^take<vblex><pres><tag1><tag2><tag3><tag4>$ ^the<det><tag1><tag2><tag3><tag4>$ ^thing<n><sg><tag><Tag>$ ^out<adv>$ |
6/28: successfully reads and prints ^take<vblex><pres><tag1><tag2><tag3><tag4>$ ^the<det><tag1><tag2><tag3><tag4>$ ^thing<n><sg><tag><Tag>$ ^out<adv>$ <br /> |
||
6/29: working on being selective about what middle words are accepted |
6/29: working on being selective about what middle words are accepted <br /> |
||
7/1: python prototype for acceptor is pretty much working, just needs to be able to read from corpuses that don't put every sentence on a new line, and to assign numbers to states in a more elegant fashion. <br /> |
|||
6/30: |
|||
|- |
|- |
||
| 6 || 7/3 - 7/9 || finite-state transducer for 'take out' |
| 6 || 7/3 - 7/9 || hard-coded finite-state transducer for 'take out' |
||
|| |
|||
7/3: tried to convert the python script to c++ code. trying to use lttoolbox's FST class. <br /> |
|||
7/4: still trying to convert to c++ and use lttoolbox <br /> |
|||
7/5: |
|||
7/6: |
|||
7/7: |
|||
7/8: |
|||
7/9: |
|||
|- |
|- |
||
Line 54: | Line 63: | ||
|- |
|- |
||
!'''Second evaluation''' !! 7/24 - 7/28 !! |
!'''Second evaluation''' !! 7/24 - 7/28 !! finite-state implementation |
||
|- |
|- |
||
Revision as of 04:48, 5 July 2017
Workplan
Week | Dates | Goals | Progress/Notes | Evaluation |
---|---|---|---|---|
1 | 5/30 - 6/4 | some data, find test corpus | ||
2 | 6/5 - 6/11 | script to bootstrap separable multiwords from dictionaries, set up testing framework, support/preparing data for English separable verbs | ||
3 | 6/12 - 6/18 | preparing data, prototype script set up, read specifications of Lttoolbox API | ||
4 | 6/19 - 6/25 |
6/19: separate out the language-dependent functions in the c++ prototype, work on reordering module for Romance languages (Spanish, Portuguese) |
6/19: Still trying to get FST example to compile on my computer. Worked on the reordering module for English in c++ |
|
First evaluation | 6/26 - 6/30 | testing framework set up + prototype system in Python | ||
5 | 6/26 - 7/2 | hard-coded finite-state acceptor for 'take out' |
6/26: trying to make the program backtrack when it gets to <ANY_CHAR> or <ANY_TAG> | |
6 | 7/3 - 7/9 | hard-coded finite-state transducer for 'take out' |
7/3: tried to convert the python script to c++ code. trying to use lttoolbox's FST class. | |
7 | 7/10 - 7/16 | |||
8 | 7/17 - 7/23 | |||
Second evaluation | 7/24 - 7/28 | finite-state implementation | ||
9 | 7/24 - 7/30 | integration with Apertium: fit module between pre-transfer and lt-proc-b | ||
10 | 7/31 - 8/6 | support for individual language pairs | ||
11 | 8/7 - 8/13 | (cont. support for individual language pairs) | ||
12 | 8/14 - 8/20 | (cont. support for individual language pairs) | ||
13 | 8/21 - 8/27 | (cont. support for individual language pairs) | ||
Final evaluation | 8/29 - 9/5 | finite-state implementation in C++ with lttoolbox |