Difference between revisions of "User:Ggregori"

From Apertium
Jump to navigation Jump to search
Line 25: Line 25:
 
*Using UTF with Python: 'codecs' and 'unicodedata' can be some useful modules.
 
*Using UTF with Python: 'codecs' and 'unicodedata' can be some useful modules.
 
*Testing the option 'lt-proc' -b which is going to be the input of my compiler.
 
*Testing the option 'lt-proc' -b which is going to be the input of my compiler.
  +
  +
  +
(02/05 - 08/05): This week I ended all the review/research needed, although I couldn't do all I wanted because I had to travel.
  +
*Ended with the introductory book reviewing NLP and Python.
  +
*Started designing and redefining the compiler's architecture following last year work and selected and did some tests with some modules. Some of the changes or improvements:
  +
**Use of pipes/command-line arguments for the input of the compiler (like the rest of Apertium).
  +
**Configurable logging module for info and debugging purposes (module: logging).
  +
**Refactoring some methods in the expatparser class (e.g. extracting common code of the callback method).
  +
**Create some additional classes in order to add some flexibility (e.g. parent class parser with the common code).
   
   

Revision as of 17:54, 9 May 2011

About me

Name: Gabriel Gregori Manzano

Email/Google chat: Email me

IRC nick: ggregori

GSoC 2011

VM for the transfer module - Application

Github repository: pending...

TODO list

  • Research and experiment with the topics mentioned by my mentor:
    • implementation of UNIX wildcards.
  • Start the work refining last year's architecture (e.g. choosing the modules for logging...).
  • Define and implement some small tests for the compiler.

Weekly reports

(25/04 - 01/05): Basically this week has been dedicated to research/review some topics (some of them suggested by my mentor)

  • I have been reviewing NLP and Python using 'Natural Language Processing with Python' book.
  • I have been looking for a way to represent morphological labels in UCS/UTF and my mentor suggested using negative numbers as in Apertium internals. Anyway, I can worry about this later.
  • Using UTF with Python: 'codecs' and 'unicodedata' can be some useful modules.
  • Testing the option 'lt-proc' -b which is going to be the input of my compiler.


(02/05 - 08/05): This week I ended all the review/research needed, although I couldn't do all I wanted because I had to travel.

  • Ended with the introductory book reviewing NLP and Python.
  • Started designing and redefining the compiler's architecture following last year work and selected and did some tests with some modules. Some of the changes or improvements:
    • Use of pipes/command-line arguments for the input of the compiler (like the rest of Apertium).
    • Configurable logging module for info and debugging purposes (module: logging).
    • Refactoring some methods in the expatparser class (e.g. extracting common code of the callback method).
    • Create some additional classes in order to add some flexibility (e.g. parent class parser with the common code).