Talk:Corpus test

From Apertium
Revision as of 23:19, 14 November 2011 by Bech (talk | contribs) (asking for a more clear formulation)

Creation of a corpus

These two lines are not very clear to a non-native English speaker:

  • Grep out all lines with # and @ - this will help you find problems in bidix (@) and target language monodix (#).
  • Pipe through nl -s '. ' to get the right line numbers.

An example would make them clearer. Also, on my computer nl -s does not work, whereas the -n option of grep (and of fgrep and egrep) does print line numbers.

Why not something like:

  • fgrep -n "#" monodix
  • fgrep -n "@" bidix
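To make the suggestion concrete, here is a minimal sketch run on a small sample of translated output. The file name translated.txt and its contents are made up for illustration, and I am assuming the marks to look for appear in the translator's output. grep -F is used as the equivalent of fgrep, since some newer grep versions warn that fgrep is obsolescent.

```shell
# Hypothetical sample of translated output (file name and contents are invented).
cat > translated.txt <<'EOF'
This is a good line.
This is a #bad generation.
Another good line.
An @unknown bidix entry.
EOF

# Lines containing '#' (target-language monodix problems),
# each prefixed with its line number in the file.
grep -F -n '#' translated.txt
# prints: 2:This is a #bad generation.

# Lines containing '@' (bidix problems).
grep -F -n '@' translated.txt
# prints: 4:An @unknown bidix entry.
```

The line numbers printed by -n refer to the file being searched, so no separate numbering step is needed in this variant.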