User:Gang Chen

From Apertium
Jump to navigation Jump to search

About me

Name: Gang Chen

Email: pkuchengang@gmail.com

IRC: Gang

GitHub Repo: https://github.com/elephantgcc

GSOC 2013

I'm working with Apertium for the GSoC 2013, on the project "Sliding Window Part of Speech Tagger for Apertium".

my proposal is here: Proposal

svn repo

https://svn.code.sf.net/p/apertium/svn/branches/apertium-swpost/apertium

Current Progress


r45280 | elephantgcc | 2013-06-24 14:21:20 +0800 (Mon, 24 Jun 2013) | 1 line

MOD: bugfix, normalize not 0.


r45277 | elephantgcc | 2013-06-24 10:47:03 +0800 (Mon, 24 Jun 2013) | 1 line

MOD: add find_similar_ambiguity_class, fix bug in morpho_stream


r45258 | elephantgcc | 2013-06-23 18:33:52 +0800 (Sun, 23 Jun 2013) | 1 line

MOD: add rule support for light-sw tagger, partly.


r45235 | elephantgcc | 2013-06-22 13:53:59 +0800 (Sat, 22 Jun 2013) | 1 line

MOD: light sliding-window tagger, a basic working version.


r45214 | elephantgcc | 2013-06-21 17:49:14 +0800 (Fri, 21 Jun 2013) | 1 line

ADD: add light-sw tagger, partly.


r45213 | elephantgcc | 2013-06-21 16:19:24 +0800 (Fri, 21 Jun 2013) | 1 line

MOD: bugfix, ZERO define, init and iteration formula.


r45209 | elephantgcc | 2013-06-21 13:01:50 +0800 (Fri, 21 Jun 2013) | 1 line

MOD: refine function names.


r45208 | elephantgcc | 2013-06-21 11:14:45 +0800 (Fri, 21 Jun 2013) | 1 line

MOD: tagger_data write only non-ZERO values.


r45194 | elephantgcc | 2013-06-20 19:32:51 +0800 (Thu, 20 Jun 2013) | 1 line

MOD: style check.


r45193 | elephantgcc | 2013-06-20 18:44:57 +0800 (Thu, 20 Jun 2013) | 1 line

MOD: bugfix, avoid '-nan' parameters.


r45191 | elephantgcc | 2013-06-20 17:07:05 +0800 (Thu, 20 Jun 2013) | 1 line

MOD: add retrain() for sw tagger.


r45189 | elephantgcc | 2013-06-20 16:42:30 +0800 (Thu, 20 Jun 2013) | 1 line

MOD: bugfix, use heap space for 3-dimensional parameters.


r45188 | elephantgcc | 2013-06-20 16:30:03 +0800 (Thu, 20 Jun 2013) | 1 line

MOD: add support for null_flush in sw tagger.


r45176 | elephantgcc | 2013-06-19 19:46:29 +0800 (Wed, 19 Jun 2013) | 1 line

MOD: done with EOS, refine training's and tagging's reading control.


r45161 | elephantgcc | 2013-06-18 22:23:26 +0800 (Tue, 18 Jun 2013) | 1 line

MOD: add switch for debug, eos, null_flush.


r45159 | elephantgcc | 2013-06-18 21:59:48 +0800 (Tue, 18 Jun 2013) | 1 line

MOD: add print_para_matrix() for debugging in sw tagger.


r45116 | elephantgcc | 2013-06-17 19:09:17 +0800 (Mon, 17 Jun 2013) | 1 line

MOD: let sw tagger end when the input word is NULL.


r45031 | elephantgcc | 2013-06-13 10:59:53 +0800 (Thu, 13 Jun 2013) | 1 line

MOD: change initial tag score from 0 to -1.


r45029 | elephantgcc | 2013-06-12 21:52:44 +0800 (Wed, 12 Jun 2013) | 1 line

MOD: add option for sw tagger.


r45017 | elephantgcc | 2013-06-11 21:15:01 +0800 (Tue, 11 Jun 2013) | 1 line

MOD: add judgement 'morpho_stream.getEndOfFile()', first demo(training and tagging) OK.


r45016 | elephantgcc | 2013-06-11 20:27:46 +0800 (Tue, 11 Jun 2013) | 1 line

MOD: clean up debug info in morpho_stream.cc


r45015 | elephantgcc | 2013-06-11 20:17:34 +0800 (Tue, 11 Jun 2013) | 1 line

MOD: clean up debug info in hmm.cc


r45010 | elephantgcc | 2013-06-11 17:44:56 +0800 (Tue, 11 Jun 2013) | 1 line

MOD: clean up for matrix c


r45009 | elephantgcc | 2013-06-11 17:30:09 +0800 (Tue, 11 Jun 2013) | 1 line

MOD: bugfix for strange M and N in tagger.cc, should use 'td.readSWPoST' but not 'td.read'


r44984 | elephantgcc | 2013-06-10 16:07:48 +0800 (Mon, 10 Jun 2013) | 1 line

MOD: training basic version, tagging basic version with bug.


r44809 | elephantgcc | 2013-05-30 14:03:22 +0800 (Thu, 30 May 2013) | 1 line

COPY: copy trunk/apertium to branches/apertium-swpost/apertium