Difference between revisions of "User:Asfrent/MSc Log"

From Apertium
Jump to navigation Jump to search
Line 4: Line 4:


=== Short term plan / Pendings ===
=== Short term plan / Pendings ===

* Write a DCG for the apertium stream format. '''[5h]'''
{| class="wikitable"
* Research UTF and Prolog.
!Pending
* Write a simple PoS disambiguator that makes a random choice.
!Estimated date
* Set up a repository for the project.
!Notes
* Check licensing of MIL code.
|-
* Design internal representation of the input data.
| Write a DCG for the apertium stream format.
* Design rules.
|
* Implement basic predicates.
|
* Learn rules using MIL.
|-
| Research UTF and Prolog.
|
|
|-
| Write a simple PoS disambiguator that makes a random choice.
|
|
|-
| Set up a repository for the project.
|
|
|-
| Check licensing of MIL code.
|
|
|-
| Design internal representation of the input data.
|
|
|-
| Design rules.
|
|
|-
| Implement basic predicates.
|
|
|-
| Learn rules using MIL.
|
|
|}


=== Questions ===
=== Questions ===

Revision as of 10:34, 13 July 2014

MSc

Plan, questions, stuff

Short term plan / Pendings

Pending Estimated date Notes
Write a DCG for the apertium stream format.
Research UTF and Prolog.
Write a simple PoS disambiguator that makes a random choice.
Set up a repository for the project.
Check licensing of MIL code.
Design internal representation of the input data.
Design rules.
Implement basic predicates.
Learn rules using MIL.

Questions

Log

11.07.2014

  • Read ILP paper from Francis.
  • Got MIL code, did a few tests.
  • Tracked down and downloaded test data from Apertium project for the tagger.
  • Read about tagging, CG and rules.
  • Wrote a Prolog script that reads all the lines from a file.

12.07.2014

  • Started to read CG docs in order to make the design of the data structures.
  • Did a bit of research on Prolog DCG.
  • Wrote stream tokenizer in Prolog.
  • Wrote token splitter in Prolog.