Difference between revisions of "One-liners"
		
		
		
		
		
		
		Jump to navigation
		Jump to search
		
				
		
		
		
		
		
		
		
	
| Line 3: | Line 3: | ||
==Useful (mostly) bash one-liners==  | 
  ==Useful (mostly) bash one-liners==  | 
||
* Perl regular-expression for removing all tags after the initial:    | 
  * Perl regular-expression for removing all tags after the initial:    | 
||
<pre>  | 
|||
perl -pe 's/(\^[^<]+<[^>]+>)(<\w+>)*\$/\1\$/g;'  | 
|||
^Lemma<V><Pres><Sg>$ -> ^Lemma<V>$  | 
  ^Lemma<V><Pres><Sg>$ -> ^Lemma<V>$  | 
||
</pre>  | 
|||
| ⚫ | |||
<pre>  | 
|||
sed 's/\$\W*\^/$\n^/g' | grep '@' | sed 's/><.*/>$/g' |  sort -f | uniq -ci  | sort -gr  | 
|||
</pre>  | 
|||
| ⚫ | |||
<pre>  | 
|||
| ⚫ | |||
sed ':a;N;$!ba;s/\n//g'  | 
|||
</pre>  | 
|||
[[Category:Tools]]  | 
|||
| ⚫ | |||
Revision as of 19:50, 13 June 2010
Contents | 
Useful (mostly) bash one-liners
- Perl regular-expression for removing all tags after the initial:
 
perl -pe 's/(\^[^<]+<[^>]+>)(<\w+>)*\$/\1\$/g;' ^Lemma<V><Pres><Sg>$ -> ^Lemma<V>$
- Get unknown words from chunked text and sort by frequency:
 
sed 's/\$\W*\^/$\n^/g' | grep '@' | sed 's/><.*/>$/g' | sort -f | uniq -ci | sort -gr
- Strip newlines:
 
sed ':a;N;$!ba;s/\n//g'