Difference between revisions of "User:Gang Chen"

 

== About me ==

'''Name:''' Gang Chen

'''Email:''' pkuchengang@gmail.com

'''IRC:''' Gang

'''GitHub Repo:''' https://github.com/elephantgcc

== GSOC 2013 ==


I'm working with Apertium for GSoC 2013 on the project "Sliding Window Part of Speech Tagger for Apertium".


=== Proposal ===
My proposal is here: [http://wiki.apertium.org/wiki/User:Gang_Chen/GSoC_2013_Application:_%22Sliding_Window_PoS_Tagger%22 Proposal]

=== svn repo ===
https://svn.code.sf.net/p/apertium/svn/branches/apertium-swpost/apertium

=== Current Progress ===
------------------------------------------------------------------------
r45280 | elephantgcc | 2013-06-24 14:21:20 +0800 (Mon, 24 Jun 2013) | 1 line

MOD: bugfix, normalize not 0.
------------------------------------------------------------------------
r45277 | elephantgcc | 2013-06-24 10:47:03 +0800 (Mon, 24 Jun 2013) | 1 line

MOD: add find_similar_ambiguity_class, fix bug in morpho_stream
------------------------------------------------------------------------
r45258 | elephantgcc | 2013-06-23 18:33:52 +0800 (Sun, 23 Jun 2013) | 1 line

MOD: add rule support for light-sw tagger, partly.
------------------------------------------------------------------------
r45235 | elephantgcc | 2013-06-22 13:53:59 +0800 (Sat, 22 Jun 2013) | 1 line

MOD: light sliding-window tagger, a basic working version.
------------------------------------------------------------------------
r45214 | elephantgcc | 2013-06-21 17:49:14 +0800 (Fri, 21 Jun 2013) | 1 line

ADD: add light-sw tagger, partly.
------------------------------------------------------------------------
r45213 | elephantgcc | 2013-06-21 16:19:24 +0800 (Fri, 21 Jun 2013) | 1 line

MOD: bugfix, ZERO define, init and iteration formula.
------------------------------------------------------------------------
r45209 | elephantgcc | 2013-06-21 13:01:50 +0800 (Fri, 21 Jun 2013) | 1 line

MOD: refine function names.
------------------------------------------------------------------------
r45208 | elephantgcc | 2013-06-21 11:14:45 +0800 (Fri, 21 Jun 2013) | 1 line

MOD: tagger_data write only non-ZERO values.
------------------------------------------------------------------------
r45194 | elephantgcc | 2013-06-20 19:32:51 +0800 (Thu, 20 Jun 2013) | 1 line

MOD: style check.
------------------------------------------------------------------------
r45193 | elephantgcc | 2013-06-20 18:44:57 +0800 (Thu, 20 Jun 2013) | 1 line

MOD: bugfix, avoid '-nan' parameters.
------------------------------------------------------------------------
r45191 | elephantgcc | 2013-06-20 17:07:05 +0800 (Thu, 20 Jun 2013) | 1 line

MOD: add retrain() for sw tagger.
------------------------------------------------------------------------
r45189 | elephantgcc | 2013-06-20 16:42:30 +0800 (Thu, 20 Jun 2013) | 1 line

MOD: bugfix, use heap space for 3-dimensional parameters.
------------------------------------------------------------------------
r45188 | elephantgcc | 2013-06-20 16:30:03 +0800 (Thu, 20 Jun 2013) | 1 line

MOD: add support for null_flush in sw tagger.
------------------------------------------------------------------------
r45176 | elephantgcc | 2013-06-19 19:46:29 +0800 (Wed, 19 Jun 2013) | 1 line

MOD: done with EOS, refine training's and tagging's reading control.
------------------------------------------------------------------------
r45161 | elephantgcc | 2013-06-18 22:23:26 +0800 (Tue, 18 Jun 2013) | 1 line

MOD: add switch for debug, eos, null_flush.
------------------------------------------------------------------------
r45159 | elephantgcc | 2013-06-18 21:59:48 +0800 (Tue, 18 Jun 2013) | 1 line

MOD: add print_para_matrix() for debugging in sw tagger.
------------------------------------------------------------------------
r45116 | elephantgcc | 2013-06-17 19:09:17 +0800 (Mon, 17 Jun 2013) | 1 line

MOD: let sw tagger end when the input word is NULL.
------------------------------------------------------------------------
r45031 | elephantgcc | 2013-06-13 10:59:53 +0800 (Thu, 13 Jun 2013) | 1 line

MOD: change initial tag score from 0 to -1.
------------------------------------------------------------------------
r45029 | elephantgcc | 2013-06-12 21:52:44 +0800 (Wed, 12 Jun 2013) | 1 line

MOD: add option for sw tagger.
------------------------------------------------------------------------
r45017 | elephantgcc | 2013-06-11 21:15:01 +0800 (Tue, 11 Jun 2013) | 1 line

MOD: add judgement 'morpho_stream.getEndOfFile()', first demo(training and tagging) OK.
------------------------------------------------------------------------
r45016 | elephantgcc | 2013-06-11 20:27:46 +0800 (Tue, 11 Jun 2013) | 1 line

MOD: clean up debug info in morpho_stream.cc
------------------------------------------------------------------------
r45015 | elephantgcc | 2013-06-11 20:17:34 +0800 (Tue, 11 Jun 2013) | 1 line

MOD: clean up debug info in hmm.cc
------------------------------------------------------------------------
r45010 | elephantgcc | 2013-06-11 17:44:56 +0800 (Tue, 11 Jun 2013) | 1 line

MOD: clean up for matrix c
------------------------------------------------------------------------
r45009 | elephantgcc | 2013-06-11 17:30:09 +0800 (Tue, 11 Jun 2013) | 1 line

MOD: bugfix for strange M and N in tagger.cc, should use 'td.readSWPoST' but not 'td.read'
------------------------------------------------------------------------
r44984 | elephantgcc | 2013-06-10 16:07:48 +0800 (Mon, 10 Jun 2013) | 1 line

MOD: training basic version, tagging basic version with bug.
------------------------------------------------------------------------
r44809 | elephantgcc | 2013-05-30 14:03:22 +0800 (Thu, 30 May 2013) | 1 line

COPY: copy trunk/apertium to branches/apertium-swpost/apertium

=== SVN ===
https://svn.code.sf.net/p/apertium/svn/branches/apertium-swpost

=== Progress ===
http://wiki.apertium.org/w/index.php?title=User:Gang_Chen/GSoC_2013_Progress

=== Summary ===
http://wiki.apertium.org/wiki/User:Gang_Chen/GSoC_2013_Summary

=== Documentation / Final Report ===
https://docs.google.com/file/d/0BxMmvpeK3ibWN0NjZmEtWnAxdDQ/edit

== Useful tools ==

=== Wikipedia extractor ===

http://wiki.apertium.org/wiki/User:Gang_Chen/Wikipedia_Extractor

Latest revision as of 10:51, 28 May 2018