Difference between revisions of "One-liners"

From Apertium
Jump to navigation Jump to search
(Created page with '{{TOCD}} ==Useful (mostly) bash one-liners== * Perl regular-expression for removing all tags after the initial: perl -pe 's/(\^[^<]+<[^>]+>)(<\w+>)*\$/\1\$/g;' ^Lemma<V><Pres…')
 
Line 6: Line 6:


^Lemma<V><Pres><Sg>$ -> ^Lemma<V>$
^Lemma<V><Pres><Sg>$ -> ^Lemma<V>$

* Get unknown words from chunked text and sort by frequency: sed 's/\$\W*\^/$\n^/g' | grep '@' | sed 's/><.*/>$/g' | sort -f | uniq -ci | sort -gr

Revision as of 15:34, 13 June 2010

Useful (mostly) bash one-liners

  • Perl regular-expression for removing all tags after the initial: perl -pe 's/(\^[^<]+<[^>]+>)(<\w+>)*\$/\1\$/g;'

^Lemma<V><Pres><Sg>$ -> ^Lemma<V>$

  • Get unknown words from chunked text and sort by frequency: sed 's/\$\W*\^/$\n^/g' | grep '@' | sed 's/><.*/>$/g' | sort -f | uniq -ci | sort -gr