Difference between revisions of "User:Gang Chen"

From Apertium
 
(23 intermediate revisions by the same user not shown)
 
== About me ==
 
 
'''Name:''' Gang Chen
 
 
'''Email:''' pkuchengang@gmail.com
 
 
'''IRC:''' Gang
 
 
'''GitHub Repo:''' https://github.com/elephantgcc
 
 
 
== GSOC 2013 ==
 
 
I'm working with Apertium for GSoC 2013 on the project "Sliding Window Part of Speech Tagger for Apertium".
 
=== Proposal ===

My proposal is here: [http://wiki.apertium.org/wiki/User:Gang_Chen/GSoC_2013_Application:_%22Sliding_Window_PoS_Tagger%22 Proposal]
 
=== svn repo ===
 
https://svn.code.sf.net/p/apertium/svn/branches/apertium-swpost/apertium
 
 
=== Current Progress ===
 
------------------------------------------------------------------------
 
r45280 | elephantgcc | 2013-06-24 14:21:20 +0800 (Mon, 24 Jun 2013) | 1 line
 
 
MOD: bugfix, normalize not 0.
 
------------------------------------------------------------------------
 
r45277 | elephantgcc | 2013-06-24 10:47:03 +0800 (Mon, 24 Jun 2013) | 1 line
 
 
MOD: add find_similar_ambiguity_class, fix bug in morpho_stream
 
------------------------------------------------------------------------
 
r45258 | elephantgcc | 2013-06-23 18:33:52 +0800 (Sun, 23 Jun 2013) | 1 line
 
 
MOD: add rule support for light-sw tagger, partly.
 
------------------------------------------------------------------------
 
r45235 | elephantgcc | 2013-06-22 13:53:59 +0800 (Sat, 22 Jun 2013) | 1 line
 
 
MOD: light sliding-window tagger, a basic working version.
 
------------------------------------------------------------------------
 
r45214 | elephantgcc | 2013-06-21 17:49:14 +0800 (Fri, 21 Jun 2013) | 1 line
 
 
ADD: add light-sw tagger, partly.
 
------------------------------------------------------------------------
 
r45213 | elephantgcc | 2013-06-21 16:19:24 +0800 (Fri, 21 Jun 2013) | 1 line
 
 
MOD: bugfix, ZERO define, init and iteration formula.
 
------------------------------------------------------------------------
 
r45209 | elephantgcc | 2013-06-21 13:01:50 +0800 (Fri, 21 Jun 2013) | 1 line
 
 
MOD: refine function names.
 
------------------------------------------------------------------------
 
r45208 | elephantgcc | 2013-06-21 11:14:45 +0800 (Fri, 21 Jun 2013) | 1 line
 
 
MOD: tagger_data write only non-ZERO values.
 
------------------------------------------------------------------------
 
r45194 | elephantgcc | 2013-06-20 19:32:51 +0800 (Thu, 20 Jun 2013) | 1 line
 
 
MOD: style check.
 
------------------------------------------------------------------------
 
r45193 | elephantgcc | 2013-06-20 18:44:57 +0800 (Thu, 20 Jun 2013) | 1 line
 
 
MOD: bugfix, avoid '-nan' parameters.
 
------------------------------------------------------------------------
 
r45191 | elephantgcc | 2013-06-20 17:07:05 +0800 (Thu, 20 Jun 2013) | 1 line
 
 
MOD: add retrain() for sw tagger.
 
------------------------------------------------------------------------
 
r45189 | elephantgcc | 2013-06-20 16:42:30 +0800 (Thu, 20 Jun 2013) | 1 line
 
 
MOD: bugfix, use heap space for 3-dimensional parameters.
 
------------------------------------------------------------------------
 
r45188 | elephantgcc | 2013-06-20 16:30:03 +0800 (Thu, 20 Jun 2013) | 1 line
 
 
MOD: add support for null_flush in sw tagger.
 
------------------------------------------------------------------------
 
r45176 | elephantgcc | 2013-06-19 19:46:29 +0800 (Wed, 19 Jun 2013) | 1 line
 
 
MOD: done with EOS, refine training's and tagging's reading control.
 
------------------------------------------------------------------------
 
r45161 | elephantgcc | 2013-06-18 22:23:26 +0800 (Tue, 18 Jun 2013) | 1 line
 
 
MOD: add switch for debug, eos, null_flush.
 
------------------------------------------------------------------------
 
r45159 | elephantgcc | 2013-06-18 21:59:48 +0800 (Tue, 18 Jun 2013) | 1 line
 
 
MOD: add print_para_matrix() for debugging in sw tagger.
 
------------------------------------------------------------------------
 
r45116 | elephantgcc | 2013-06-17 19:09:17 +0800 (Mon, 17 Jun 2013) | 1 line
 
 
MOD: let sw tagger end when the input word is NULL.
 
------------------------------------------------------------------------
 
r45031 | elephantgcc | 2013-06-13 10:59:53 +0800 (Thu, 13 Jun 2013) | 1 line
 
 
MOD: change initial tag score from 0 to -1.
 
------------------------------------------------------------------------
 
r45029 | elephantgcc | 2013-06-12 21:52:44 +0800 (Wed, 12 Jun 2013) | 1 line
 
 
MOD: add option for sw tagger.
 
------------------------------------------------------------------------
 
r45017 | elephantgcc | 2013-06-11 21:15:01 +0800 (Tue, 11 Jun 2013) | 1 line

MOD: add judgement 'morpho_stream.getEndOfFile()', first demo(training and tagging) OK.

------------------------------------------------------------------------

r45016 | elephantgcc | 2013-06-11 20:27:46 +0800 (Tue, 11 Jun 2013) | 1 line

MOD: clean up debug info in morpho_stream.cc

------------------------------------------------------------------------

r45015 | elephantgcc | 2013-06-11 20:17:34 +0800 (Tue, 11 Jun 2013) | 1 line

MOD: clean up debug info in hmm.cc

------------------------------------------------------------------------

r45010 | elephantgcc | 2013-06-11 17:44:56 +0800 (Tue, 11 Jun 2013) | 1 line

MOD: clean up for matrix c

------------------------------------------------------------------------

r45009 | elephantgcc | 2013-06-11 17:30:09 +0800 (Tue, 11 Jun 2013) | 1 line

MOD: bugfix for strange M and N in tagger.cc, should use 'td.readSWPoST' but not 'td.read'

------------------------------------------------------------------------

r44984 | elephantgcc | 2013-06-10 16:07:48 +0800 (Mon, 10 Jun 2013) | 1 line

MOD: training basic version, tagging basic version with bug.

------------------------------------------------------------------------

r44809 | elephantgcc | 2013-05-30 14:03:22 +0800 (Thu, 30 May 2013) | 1 line

COPY: copy trunk/apertium to branches/apertium-swpost/apertium

=== SVN ===

https://svn.code.sf.net/p/apertium/svn/branches/apertium-swpost

=== Progress ===

http://wiki.apertium.org/w/index.php?title=User:Gang_Chen/GSoC_2013_Progress

=== Summary ===

http://wiki.apertium.org/wiki/User:Gang_Chen/GSoC_2013_Summary

=== Documentation / Final Report ===

https://docs.google.com/file/d/0BxMmvpeK3ibWN0NjZmEtWnAxdDQ/edit

== Useful tools ==

=== Wikipedia extractor ===

http://wiki.apertium.org/wiki/User:Gang_Chen/Wikipedia_Extractor
 

Latest revision as of 10:51, 28 May 2018