Difference between revisions of "Chunking"

From Apertium
Jump to navigation Jump to search
m (Chunks moved to Chunking)
Line 25: Line 25:
<spectie> but the chunks cannot be recursive
<spectie> but the chunks cannot be recursive
</pre>
</pre>

==External links==
* [http://en.wikipedia.org/wiki/Chunking_(computational_linguistics) wikipedia]
* [http://nltk.org/doc/en/chunk.html Chunking] (Natural Language Toolkit)
* [http://crfchunker.sourceforge.net/ CRFChunker] (Conditional Random Fields English Phrase Chunker)
* [http://jtextpro.sourceforge.net/ JTextPro] (A Java-based Text Processing Toolkit)

Revision as of 15:11, 4 September 2008

     jacobn>	But really I have a big problem about all this "shallow transfer".

	<spectie>	shallow transfer = no parse trees
	<spectie>	basically
	<jimregan2>	yep
     
	<jacobn>	HOW is reordering of the phrase then going to happen!!
     jimregan2> we use chunking

    <jimregan2> first we reorder words in the chunk, then we reorder chunks

	<jacobn>	Pls tell me 'bout it or point to a web page
	<jimregan2>	um
	<jimregan2>	it's easy enough
	<jimregan2>	first, we match phrase patterns
	<jimregan2>	adj+noun
	<jimregan2>	adj+adj+noun
	<jimregan2>	from these, we make a 'pseudo lemma', with a tag containing the type - normally 'SN' (noun phrase) or SV (verb phrase)
	<jimregan2>	then, we translate based on these pseudo words
	<jimregan2>	breaking the language down to its bare essentials, basically
	<jimregan2>	at the moment, I'm taking the 'hard wired' parts of the english to spanish chunker, and adapting it for french
	<jimregan2>	changing 'más' to 'plus' in a macro, etc.

      <spectie> but the chunks cannot be recursive	

External links