Difference between revisions of "User:Gang Chen"
Jump to navigation
Jump to search
(23 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
== About me == |
|||
'''Name:''' Gang Chen |
|||
'''Email:''' pkuchengang@gmail.com |
|||
'''IRC:''' Gang |
|||
'''GitHub Repo:''' https://github.com/elephantgcc |
|||
== GSOC 2013 == |
== GSOC 2013 == |
||
I'm working with Apertium for the GSoC 2013, on the project "Sliding Window Part of Speech Tagger for Apertium". |
I'm working with Apertium for the GSoC 2013, on the project "Sliding Window Part of Speech Tagger for Apertium". |
||
=== Proposal === |
|||
my proposal is here: [http://wiki.apertium.org/wiki/User:Gang_Chen/GSoC_2013_Application:_%22Sliding_Window_PoS_Tagger%22 Proposal] |
|||
http://wiki.apertium.org/wiki/User:Gang_Chen/GSoC_2013_Application:_%22Sliding_Window_PoS_Tagger%22 |
|||
=== svn repo === |
|||
https://svn.code.sf.net/p/apertium/svn/branches/apertium-swpost/apertium |
|||
=== Current Progress === |
|||
------------------------------------------------------------------------ |
|||
r45280 | elephantgcc | 2013-06-24 14:21:20 +0800 (Mon, 24 Jun 2013) | 1 line |
|||
MOD: bugfix, normalize not 0. |
|||
------------------------------------------------------------------------ |
|||
r45277 | elephantgcc | 2013-06-24 10:47:03 +0800 (Mon, 24 Jun 2013) | 1 line |
|||
MOD: add find_similar_ambiguity_class, fix bug in morpho_stream |
|||
------------------------------------------------------------------------ |
|||
r45258 | elephantgcc | 2013-06-23 18:33:52 +0800 (Sun, 23 Jun 2013) | 1 line |
|||
MOD: add rule support for light-sw tagger, partly. |
|||
------------------------------------------------------------------------ |
|||
r45235 | elephantgcc | 2013-06-22 13:53:59 +0800 (Sat, 22 Jun 2013) | 1 line |
|||
MOD: light sliding-window tagger, a basic working version. |
|||
------------------------------------------------------------------------ |
|||
r45214 | elephantgcc | 2013-06-21 17:49:14 +0800 (Fri, 21 Jun 2013) | 1 line |
|||
ADD: add light-sw tagger, partly. |
|||
------------------------------------------------------------------------ |
|||
r45213 | elephantgcc | 2013-06-21 16:19:24 +0800 (Fri, 21 Jun 2013) | 1 line |
|||
MOD: bugfix, ZERO define, init and iteration formula. |
|||
------------------------------------------------------------------------ |
|||
r45209 | elephantgcc | 2013-06-21 13:01:50 +0800 (Fri, 21 Jun 2013) | 1 line |
|||
MOD: refine function names. |
|||
------------------------------------------------------------------------ |
|||
r45208 | elephantgcc | 2013-06-21 11:14:45 +0800 (Fri, 21 Jun 2013) | 1 line |
|||
MOD: tagger_data write only non-ZERO values. |
|||
------------------------------------------------------------------------ |
|||
r45194 | elephantgcc | 2013-06-20 19:32:51 +0800 (Thu, 20 Jun 2013) | 1 line |
|||
MOD: style check. |
|||
------------------------------------------------------------------------ |
|||
r45193 | elephantgcc | 2013-06-20 18:44:57 +0800 (Thu, 20 Jun 2013) | 1 line |
|||
MOD: bugfix, avoid '-nan' parameters. |
|||
------------------------------------------------------------------------ |
|||
r45191 | elephantgcc | 2013-06-20 17:07:05 +0800 (Thu, 20 Jun 2013) | 1 line |
|||
MOD: add retrain() for sw tagger. |
|||
------------------------------------------------------------------------ |
|||
r45189 | elephantgcc | 2013-06-20 16:42:30 +0800 (Thu, 20 Jun 2013) | 1 line |
|||
MOD: bugfix, use heap space for 3-dimensional parameters. |
|||
------------------------------------------------------------------------ |
|||
r45188 | elephantgcc | 2013-06-20 16:30:03 +0800 (Thu, 20 Jun 2013) | 1 line |
|||
MOD: add support for null_flush in sw tagger. |
|||
------------------------------------------------------------------------ |
|||
r45176 | elephantgcc | 2013-06-19 19:46:29 +0800 (Wed, 19 Jun 2013) | 1 line |
|||
MOD: done with EOS, refine training's and tagging's reading control. |
|||
------------------------------------------------------------------------ |
|||
r45161 | elephantgcc | 2013-06-18 22:23:26 +0800 (Tue, 18 Jun 2013) | 1 line |
|||
MOD: add switch for debug, eos, null_flush. |
|||
------------------------------------------------------------------------ |
|||
r45159 | elephantgcc | 2013-06-18 21:59:48 +0800 (Tue, 18 Jun 2013) | 1 line |
|||
MOD: add print_para_matrix() for debugging in sw tagger. |
|||
------------------------------------------------------------------------ |
|||
r45116 | elephantgcc | 2013-06-17 19:09:17 +0800 (Mon, 17 Jun 2013) | 1 line |
|||
MOD: let sw tagger end when the input word is NULL. |
|||
------------------------------------------------------------------------ |
|||
r45031 | elephantgcc | 2013-06-13 10:59:53 +0800 (Thu, 13 Jun 2013) | 1 line |
|||
MOD: change initial tag score from 0 to -1. |
|||
------------------------------------------------------------------------ |
|||
r45029 | elephantgcc | 2013-06-12 21:52:44 +0800 (Wed, 12 Jun 2013) | 1 line |
|||
MOD: add option for sw tagger. |
|||
------------------------------------------------------------------------ |
|||
r45017 | elephantgcc | 2013-06-11 21:15:01 +0800 (Tue, 11 Jun 2013) | 1 line |
|||
=== SVN === |
|||
MOD: add judgement 'morpho_stream.getEndOfFile()', first demo(training and tagging) OK. |
|||
https://svn.code.sf.net/p/apertium/svn/branches/apertium-swpost |
|||
------------------------------------------------------------------------ |
|||
r45016 | elephantgcc | 2013-06-11 20:27:46 +0800 (Tue, 11 Jun 2013) | 1 line |
|||
=== Progress === |
|||
MOD: clean up debug info in morpho_stream.cc |
|||
http://wiki.apertium.org/w/index.php?title=User:Gang_Chen/GSoC_2013_Progress |
|||
------------------------------------------------------------------------ |
|||
r45015 | elephantgcc | 2013-06-11 20:17:34 +0800 (Tue, 11 Jun 2013) | 1 line |
|||
=== Summary === |
|||
MOD: clean up debug info in hmm.cc |
|||
http://wiki.apertium.org/wiki/User:Gang_Chen/GSoC_2013_Summary |
|||
------------------------------------------------------------------------ |
|||
r45010 | elephantgcc | 2013-06-11 17:44:56 +0800 (Tue, 11 Jun 2013) | 1 line |
|||
=== Documentation / Final Report === |
|||
MOD: clean up for matrix c |
|||
https://docs.google.com/file/d/0BxMmvpeK3ibWN0NjZmEtWnAxdDQ/edit |
|||
------------------------------------------------------------------------ |
|||
r45009 | elephantgcc | 2013-06-11 17:30:09 +0800 (Tue, 11 Jun 2013) | 1 line |
|||
== Useful tools == |
|||
MOD: bugfix for strange M and N in tagger.cc, should use 'td.readSWPoST' but not 'td.read' |
|||
------------------------------------------------------------------------ |
|||
r44984 | elephantgcc | 2013-06-10 16:07:48 +0800 (Mon, 10 Jun 2013) | 1 line |
|||
=== Wikipedia extractor === |
|||
MOD: training basic version, tagging basic version with bug. |
|||
------------------------------------------------------------------------ |
|||
r44809 | elephantgcc | 2013-05-30 14:03:22 +0800 (Thu, 30 May 2013) | 1 line |
|||
http://wiki.apertium.org/wiki/User:Gang_Chen/Wikipedia_Extractor |
|||
COPY: copy trunk/apertium to branches/apertium-swpost/apertium |
Latest revision as of 10:51, 28 May 2018
Contents
GSOC 2013[edit]
I'm working with Apertium for the GSoC 2013, on the project "Sliding Window Part of Speech Tagger for Apertium".
Proposal[edit]
http://wiki.apertium.org/wiki/User:Gang_Chen/GSoC_2013_Application:_%22Sliding_Window_PoS_Tagger%22
SVN[edit]
https://svn.code.sf.net/p/apertium/svn/branches/apertium-swpost
Progress[edit]
http://wiki.apertium.org/w/index.php?title=User:Gang_Chen/GSoC_2013_Progress
Summary[edit]
http://wiki.apertium.org/wiki/User:Gang_Chen/GSoC_2013_Summary
Documentation / Final Report[edit]
https://docs.google.com/file/d/0BxMmvpeK3ibWN0NjZmEtWnAxdDQ/edit
Useful tools[edit]
Wikipedia extractor[edit]
http://wiki.apertium.org/wiki/User:Gang_Chen/Wikipedia_Extractor