Difference between revisions of "User:Gang Chen"
Line 9: | Line 9: | ||
'''GitHub Repo:''' https://github.com/elephantgcc |
'''GitHub Repo:''' https://github.com/elephantgcc |
||
== GSOC 2013 == |
|||
I'm working with Apertium for the GSoC 2013, on the project "Sliding Window Part of Speech Tagger for Apertium". |
|||
my proposal is here: [http://wiki.apertium.org/wiki/User:Gang_Chen/GSoC_2013_Application:_%22Sliding_Window_PoS_Tagger%22 Proposal] |
|||
=== svn repo === |
|||
https://svn.code.sf.net/p/apertium/svn/branches/apertium-swpost/apertium |
|||
=== Current Progress === |
|||
------------------------------------------------------------------------ |
|||
r45280 | elephantgcc | 2013-06-24 14:21:20 +0800 (Mon, 24 Jun 2013) | 1 line |
|||
MOD: bugfix, normalize not 0. |
|||
------------------------------------------------------------------------ |
|||
r45277 | elephantgcc | 2013-06-24 10:47:03 +0800 (Mon, 24 Jun 2013) | 1 line |
|||
MOD: add find_similar_ambiguity_class, fix bug in morpho_stream |
|||
------------------------------------------------------------------------ |
|||
r45258 | elephantgcc | 2013-06-23 18:33:52 +0800 (Sun, 23 Jun 2013) | 1 line |
|||
MOD: add rule support for light-sw tagger, partly. |
|||
------------------------------------------------------------------------ |
|||
r45235 | elephantgcc | 2013-06-22 13:53:59 +0800 (Sat, 22 Jun 2013) | 1 line |
|||
MOD: light sliding-window tagger, a basic working version. |
|||
------------------------------------------------------------------------ |
|||
r45214 | elephantgcc | 2013-06-21 17:49:14 +0800 (Fri, 21 Jun 2013) | 1 line |
|||
ADD: add light-sw tagger, partly. |
|||
------------------------------------------------------------------------ |
|||
r45213 | elephantgcc | 2013-06-21 16:19:24 +0800 (Fri, 21 Jun 2013) | 1 line |
|||
MOD: bugfix, ZERO define, init and iteration formula. |
|||
------------------------------------------------------------------------ |
|||
r45209 | elephantgcc | 2013-06-21 13:01:50 +0800 (Fri, 21 Jun 2013) | 1 line |
|||
MOD: refine function names. |
|||
------------------------------------------------------------------------ |
|||
r45208 | elephantgcc | 2013-06-21 11:14:45 +0800 (Fri, 21 Jun 2013) | 1 line |
|||
MOD: tagger_data write only non-ZERO values. |
|||
------------------------------------------------------------------------ |
|||
r45194 | elephantgcc | 2013-06-20 19:32:51 +0800 (Thu, 20 Jun 2013) | 1 line |
|||
MOD: style check. |
|||
------------------------------------------------------------------------ |
|||
r45193 | elephantgcc | 2013-06-20 18:44:57 +0800 (Thu, 20 Jun 2013) | 1 line |
|||
MOD: bugfix, avoid '-nan' parameters. |
|||
------------------------------------------------------------------------ |
|||
r45191 | elephantgcc | 2013-06-20 17:07:05 +0800 (Thu, 20 Jun 2013) | 1 line |
|||
MOD: add retrain() for sw tagger. |
|||
------------------------------------------------------------------------ |
|||
r45189 | elephantgcc | 2013-06-20 16:42:30 +0800 (Thu, 20 Jun 2013) | 1 line |
|||
MOD: bugfix, use heap space for 3-dimensional parameters. |
|||
------------------------------------------------------------------------ |
|||
r45188 | elephantgcc | 2013-06-20 16:30:03 +0800 (Thu, 20 Jun 2013) | 1 line |
|||
MOD: add support for null_flush in sw tagger. |
|||
------------------------------------------------------------------------ |
|||
r45176 | elephantgcc | 2013-06-19 19:46:29 +0800 (Wed, 19 Jun 2013) | 1 line |
|||
MOD: done with EOS, refine training's and tagging's reading control. |
|||
------------------------------------------------------------------------ |
|||
r45161 | elephantgcc | 2013-06-18 22:23:26 +0800 (Tue, 18 Jun 2013) | 1 line |
|||
MOD: add switch for debug, eos, null_flush. |
|||
------------------------------------------------------------------------ |
|||
r45159 | elephantgcc | 2013-06-18 21:59:48 +0800 (Tue, 18 Jun 2013) | 1 line |
|||
MOD: add print_para_matrix() for debugging in sw tagger. |
|||
------------------------------------------------------------------------ |
|||
r45116 | elephantgcc | 2013-06-17 19:09:17 +0800 (Mon, 17 Jun 2013) | 1 line |
|||
MOD: let sw tagger end when the input word is NULL. |
|||
------------------------------------------------------------------------ |
|||
r45031 | elephantgcc | 2013-06-13 10:59:53 +0800 (Thu, 13 Jun 2013) | 1 line |
|||
MOD: change initial tag score from 0 to -1. |
|||
------------------------------------------------------------------------ |
|||
r45029 | elephantgcc | 2013-06-12 21:52:44 +0800 (Wed, 12 Jun 2013) | 1 line |
|||
MOD: add option for sw tagger. |
|||
------------------------------------------------------------------------ |
|||
r45017 | elephantgcc | 2013-06-11 21:15:01 +0800 (Tue, 11 Jun 2013) | 1 line |
|||
MOD: add judgement 'morpho_stream.getEndOfFile()', first demo(training and tagging) OK. |
|||
------------------------------------------------------------------------ |
|||
r45016 | elephantgcc | 2013-06-11 20:27:46 +0800 (Tue, 11 Jun 2013) | 1 line |
|||
MOD: clean up debug info in morpho_stream.cc |
|||
------------------------------------------------------------------------ |
|||
r45015 | elephantgcc | 2013-06-11 20:17:34 +0800 (Tue, 11 Jun 2013) | 1 line |
|||
MOD: clean up debug info in hmm.cc |
|||
------------------------------------------------------------------------ |
|||
r45010 | elephantgcc | 2013-06-11 17:44:56 +0800 (Tue, 11 Jun 2013) | 1 line |
|||
MOD: clean up for matrix c |
|||
------------------------------------------------------------------------ |
|||
r45009 | elephantgcc | 2013-06-11 17:30:09 +0800 (Tue, 11 Jun 2013) | 1 line |
|||
MOD: bugfix for strange M and N in tagger.cc, should use 'td.readSWPoST' but not 'td.read' |
|||
------------------------------------------------------------------------ |
|||
r44984 | elephantgcc | 2013-06-10 16:07:48 +0800 (Mon, 10 Jun 2013) | 1 line |
|||
MOD: training basic version, tagging basic version with bug. |
|||
------------------------------------------------------------------------ |
|||
r44809 | elephantgcc | 2013-05-30 14:03:22 +0800 (Thu, 30 May 2013) | 1 line |
|||
COPY: copy trunk/apertium to branches/apertium-swpost/apertium |
Revision as of 14:22, 25 June 2013
About me
Name: Gang Chen
Email: pkuchengang@gmail.com
IRC: Gang
GitHub Repo: https://github.com/elephantgcc
GSOC 2013
I'm working with Apertium for the GSoC 2013, on the project "Sliding Window Part of Speech Tagger for Apertium".
my proposal is here: Proposal
svn repo
https://svn.code.sf.net/p/apertium/svn/branches/apertium-swpost/apertium
Current Progress
r45280 | elephantgcc | 2013-06-24 14:21:20 +0800 (Mon, 24 Jun 2013) | 1 line
MOD: bugfix, normalize not 0.
r45277 | elephantgcc | 2013-06-24 10:47:03 +0800 (Mon, 24 Jun 2013) | 1 line
MOD: add find_similar_ambiguity_class, fix bug in morpho_stream
r45258 | elephantgcc | 2013-06-23 18:33:52 +0800 (Sun, 23 Jun 2013) | 1 line
MOD: add rule support for light-sw tagger, partly.
r45235 | elephantgcc | 2013-06-22 13:53:59 +0800 (Sat, 22 Jun 2013) | 1 line
MOD: light sliding-window tagger, a basic working version.
r45214 | elephantgcc | 2013-06-21 17:49:14 +0800 (Fri, 21 Jun 2013) | 1 line
ADD: add light-sw tagger, partly.
r45213 | elephantgcc | 2013-06-21 16:19:24 +0800 (Fri, 21 Jun 2013) | 1 line
MOD: bugfix, ZERO define, init and iteration formula.
r45209 | elephantgcc | 2013-06-21 13:01:50 +0800 (Fri, 21 Jun 2013) | 1 line
MOD: refine function names.
r45208 | elephantgcc | 2013-06-21 11:14:45 +0800 (Fri, 21 Jun 2013) | 1 line
MOD: tagger_data write only non-ZERO values.
r45194 | elephantgcc | 2013-06-20 19:32:51 +0800 (Thu, 20 Jun 2013) | 1 line
MOD: style check.
r45193 | elephantgcc | 2013-06-20 18:44:57 +0800 (Thu, 20 Jun 2013) | 1 line
MOD: bugfix, avoid '-nan' parameters.
r45191 | elephantgcc | 2013-06-20 17:07:05 +0800 (Thu, 20 Jun 2013) | 1 line
MOD: add retrain() for sw tagger.
r45189 | elephantgcc | 2013-06-20 16:42:30 +0800 (Thu, 20 Jun 2013) | 1 line
MOD: bugfix, use heap space for 3-dimensional parameters.
r45188 | elephantgcc | 2013-06-20 16:30:03 +0800 (Thu, 20 Jun 2013) | 1 line
MOD: add support for null_flush in sw tagger.
r45176 | elephantgcc | 2013-06-19 19:46:29 +0800 (Wed, 19 Jun 2013) | 1 line
MOD: done with EOS, refine training's and tagging's reading control.
r45161 | elephantgcc | 2013-06-18 22:23:26 +0800 (Tue, 18 Jun 2013) | 1 line
MOD: add switch for debug, eos, null_flush.
r45159 | elephantgcc | 2013-06-18 21:59:48 +0800 (Tue, 18 Jun 2013) | 1 line
MOD: add print_para_matrix() for debugging in sw tagger.
r45116 | elephantgcc | 2013-06-17 19:09:17 +0800 (Mon, 17 Jun 2013) | 1 line
MOD: let sw tagger end when the input word is NULL.
r45031 | elephantgcc | 2013-06-13 10:59:53 +0800 (Thu, 13 Jun 2013) | 1 line
MOD: change initial tag score from 0 to -1.
r45029 | elephantgcc | 2013-06-12 21:52:44 +0800 (Wed, 12 Jun 2013) | 1 line
MOD: add option for sw tagger.
r45017 | elephantgcc | 2013-06-11 21:15:01 +0800 (Tue, 11 Jun 2013) | 1 line
MOD: add judgement 'morpho_stream.getEndOfFile()', first demo(training and tagging) OK.
r45016 | elephantgcc | 2013-06-11 20:27:46 +0800 (Tue, 11 Jun 2013) | 1 line
MOD: clean up debug info in morpho_stream.cc
r45015 | elephantgcc | 2013-06-11 20:17:34 +0800 (Tue, 11 Jun 2013) | 1 line
MOD: clean up debug info in hmm.cc
r45010 | elephantgcc | 2013-06-11 17:44:56 +0800 (Tue, 11 Jun 2013) | 1 line
MOD: clean up for matrix c
r45009 | elephantgcc | 2013-06-11 17:30:09 +0800 (Tue, 11 Jun 2013) | 1 line
MOD: bugfix for strange M and N in tagger.cc, should use 'td.readSWPoST' but not 'td.read'
r44984 | elephantgcc | 2013-06-10 16:07:48 +0800 (Mon, 10 Jun 2013) | 1 line
MOD: training basic version, tagging basic version with bug.
r44809 | elephantgcc | 2013-05-30 14:03:22 +0800 (Thu, 30 May 2013) | 1 line
COPY: copy trunk/apertium to branches/apertium-swpost/apertium