User:Gang Chen/GSoC 2013 Progress

From Apertium
Jump to navigation Jump to search

GSOC 2013

I'm working with Apertium for the GSoC 2013, on the project "Sliding Window Part of Speech Tagger for Apertium".

my proposal is here: Proposal

svn repo

https://svn.code.sf.net/p/apertium/svn/branches/apertium-swpost/apertium

General Progress

2013-06-24: LSW tagger working, with rules.

2013-06-20: LSW tagger working, without rules.

2013-06-11: SW tagger working.

2013-05-30: Start.

Detailed svn log


r45280 | elephantgcc | 2013-06-24 14:21:20 +0800 (Mon, 24 Jun 2013) | 1 line

MOD: bugfix, normalize not 0.


r45277 | elephantgcc | 2013-06-24 10:47:03 +0800 (Mon, 24 Jun 2013) | 1 line

MOD: add find_similar_ambiguity_class, fix bug in morpho_stream


r45258 | elephantgcc | 2013-06-23 18:33:52 +0800 (Sun, 23 Jun 2013) | 1 line

MOD: add rule support for light-sw tagger, partly.


r45235 | elephantgcc | 2013-06-22 13:53:59 +0800 (Sat, 22 Jun 2013) | 1 line

MOD: light sliding-window tagger, a basic working version.


r45214 | elephantgcc | 2013-06-21 17:49:14 +0800 (Fri, 21 Jun 2013) | 1 line

ADD: add light-sw tagger, partly.


r45213 | elephantgcc | 2013-06-21 16:19:24 +0800 (Fri, 21 Jun 2013) | 1 line

MOD: bugfix, ZERO define, init and iteration formula.


r45209 | elephantgcc | 2013-06-21 13:01:50 +0800 (Fri, 21 Jun 2013) | 1 line

MOD: refine function names.


r45208 | elephantgcc | 2013-06-21 11:14:45 +0800 (Fri, 21 Jun 2013) | 1 line

MOD: tagger_data write only non-ZERO values.


r45194 | elephantgcc | 2013-06-20 19:32:51 +0800 (Thu, 20 Jun 2013) | 1 line

MOD: style check.


r45193 | elephantgcc | 2013-06-20 18:44:57 +0800 (Thu, 20 Jun 2013) | 1 line

MOD: bugfix, avoid '-nan' parameters.


r45191 | elephantgcc | 2013-06-20 17:07:05 +0800 (Thu, 20 Jun 2013) | 1 line

MOD: add retrain() for sw tagger.


r45189 | elephantgcc | 2013-06-20 16:42:30 +0800 (Thu, 20 Jun 2013) | 1 line

MOD: bugfix, use heap space for 3-dimensional parameters.


r45188 | elephantgcc | 2013-06-20 16:30:03 +0800 (Thu, 20 Jun 2013) | 1 line

MOD: add support for null_flush in sw tagger.


r45176 | elephantgcc | 2013-06-19 19:46:29 +0800 (Wed, 19 Jun 2013) | 1 line

MOD: done with EOS, refine training's and tagging's reading control.


r45161 | elephantgcc | 2013-06-18 22:23:26 +0800 (Tue, 18 Jun 2013) | 1 line

MOD: add switch for debug, eos, null_flush.


r45159 | elephantgcc | 2013-06-18 21:59:48 +0800 (Tue, 18 Jun 2013) | 1 line

MOD: add print_para_matrix() for debugging in sw tagger.


r45116 | elephantgcc | 2013-06-17 19:09:17 +0800 (Mon, 17 Jun 2013) | 1 line

MOD: let sw tagger end when the input word is NULL.


r45031 | elephantgcc | 2013-06-13 10:59:53 +0800 (Thu, 13 Jun 2013) | 1 line

MOD: change initial tag score from 0 to -1.


r45029 | elephantgcc | 2013-06-12 21:52:44 +0800 (Wed, 12 Jun 2013) | 1 line

MOD: add option for sw tagger.


r45017 | elephantgcc | 2013-06-11 21:15:01 +0800 (Tue, 11 Jun 2013) | 1 line

MOD: add judgement 'morpho_stream.getEndOfFile()', first demo(training and tagging) OK.


r45016 | elephantgcc | 2013-06-11 20:27:46 +0800 (Tue, 11 Jun 2013) | 1 line

MOD: clean up debug info in morpho_stream.cc


r45015 | elephantgcc | 2013-06-11 20:17:34 +0800 (Tue, 11 Jun 2013) | 1 line

MOD: clean up debug info in hmm.cc


r45010 | elephantgcc | 2013-06-11 17:44:56 +0800 (Tue, 11 Jun 2013) | 1 line

MOD: clean up for matrix c


r45009 | elephantgcc | 2013-06-11 17:30:09 +0800 (Tue, 11 Jun 2013) | 1 line

MOD: bugfix for strange M and N in tagger.cc, should use 'td.readSWPoST' but not 'td.read'


r44984 | elephantgcc | 2013-06-10 16:07:48 +0800 (Mon, 10 Jun 2013) | 1 line

MOD: training basic version, tagging basic version with bug.


r44809 | elephantgcc | 2013-05-30 14:03:22 +0800 (Thu, 30 May 2013) | 1 line

COPY: copy trunk/apertium to branches/apertium-swpost/apertium