Difference between revisions of "How to bootstrap a new pair"

From Apertium
Jump to navigation Jump to search
Line 111: Line 111:
==With no existing monolingual packages==
==With no existing monolingual packages==


Do none of the two languagues you're making a pair of already have monolingual modules in https://svn.code.sf.net/p/apertium/svn/languages/ or https://svn.code.sf.net/p/apertium/svn/incubator ?
First make the two monolingual packages:


Then follow this part, replacing LANG1 and LANG2 for the ISO 639-3 codes of your languages:
Then compile the monolingual packages:


First make and compile the new monolingual packages:
<pre>
python3 apertium-init.py LANG1
cd apertium-LANG1
./autogen.sh
make -j
cd ..

python3 apertium-init.py LANG2
cd apertium-LANG2
./autogen.sh
make -j
cd ..
</pre>


Then generate the pair:
Then generate the pair:
<pre>

python3 apertium-init.py LANG1-LANG2
</pre>


Then compile the pair:
Then compile the pair:

<pre>
./autogen.sh --with-lang1=../apertium-LANG1 --with-lang2=../apertium-LANG2
make -j
</pre>


And test:
And test:
<pre>
echo house | apertium -d . LANG1-LANG2
echo Haus | apertium -d . LANG2-LANG1
</pre>

Now you can add words to apertium-LANG1-LANG2.LANG1-LANG2.dix, then test again:

<pre>
make -j
echo house | apertium -d . LANG1-LANG2
echo Haus | apertium -d . LANG2-LANG1
</pre>

If you had to add words to the monolingual dictionaries, you will have to type "make" in those directories first. Alternatively, there is a shortcut from the pair directory: "make langs" should make the monolingual dictionaries even if you're in the pair directory.






Revision as of 07:49, 8 December 2015

Prerequisites

You need to get this installed first:

  • apertium-init
  • apertium/lttoolbox/hfst, see Installation, in particular the prerequisites parts. (You most likely don't need to go all the way to "minimal instlalation from svn", since you should get this stuff from Tino's repositories. If you're on Windows, get the virtualbox)

With two existing monolingual packages

Do the two languagues you're making a pair of already have monolingual modules in https://svn.code.sf.net/p/apertium/svn/languages/ (or perhaps https://svn.code.sf.net/p/apertium/svn/incubator )?

Then follow this part, replacing LANG1 and LANG2 for the ISO 639-3 codes of your languages:


First compile the monolingual packages:

svn co https://svn.code.sf.net/p/apertium/svn/languages/apertium-LANG1
cd apertium-LANG1
./autogen.sh
make -j
cd ..
svn co https://svn.code.sf.net/p/apertium/svn/languages/apertium-LANG2
cd apertium-LANG2
./autogen.sh
make -j
cd ..

Then generate the pair:

python3 apertium-init.py LANG1-LANG2

Then compile the pair:

./autogen.sh --with-lang1=../apertium-LANG1 --with-lang2=../apertium-LANG2
make -j

And test:

echo house | apertium -d . LANG1-LANG2
echo Haus | apertium -d . LANG2-LANG1

Now you can add words to apertium-LANG1-LANG2.LANG1-LANG2.dix, then test again:

make -j
echo house | apertium -d . LANG1-LANG2
echo Haus | apertium -d . LANG2-LANG1

If you had to add words to the monolingual dictionaries, you will have to type "make" in those directories first. Alternatively, there is a shortcut from the pair directory: "make langs" should make the monolingual dictionaries even if you're in the pair directory.

With one existing monolingual package

Does just one of the two languagues you're making a pair of already have a monolingual module in https://svn.code.sf.net/p/apertium/svn/languages/ (or perhaps https://svn.code.sf.net/p/apertium/svn/incubator )?

Then follow this part, replacing LANG1 and LANG2 for the ISO 639-3 codes of your languages; here we assume LANG1 needs to be made from scratch.

First make a new monolingual package:

python3 apertium-init.py LANG1
cd apertium-LANG1
./autogen.sh
make -j
cd ..

Then get and compile the existing monolingual package:

svn co https://svn.code.sf.net/p/apertium/svn/languages/apertium-LANG2
cd apertium-LANG2
./autogen.sh
make -j
cd ..

Then generate the pair:

python3 apertium-init.py LANG1-LANG2

Then compile the pair:

./autogen.sh --with-lang1=../apertium-LANG1 --with-lang2=../apertium-LANG2
make -j

And test:

echo house | apertium -d . LANG1-LANG2
echo Haus | apertium -d . LANG2-LANG1

Now you can add words to apertium-LANG1-LANG2.LANG1-LANG2.dix, then test again:

make -j
echo house | apertium -d . LANG1-LANG2
echo Haus | apertium -d . LANG2-LANG1

If you had to add words to the monolingual dictionaries, you will have to type "make" in those directories first. Alternatively, there is a shortcut from the pair directory: "make langs" should make the monolingual dictionaries even if you're in the pair directory.

With no existing monolingual packages

Do none of the two languagues you're making a pair of already have monolingual modules in https://svn.code.sf.net/p/apertium/svn/languages/ or https://svn.code.sf.net/p/apertium/svn/incubator ?

Then follow this part, replacing LANG1 and LANG2 for the ISO 639-3 codes of your languages:

First make and compile the new monolingual packages:

python3 apertium-init.py LANG1
cd apertium-LANG1
./autogen.sh
make -j
cd ..

python3 apertium-init.py LANG2
cd apertium-LANG2
./autogen.sh
make -j
cd ..

Then generate the pair:

python3 apertium-init.py LANG1-LANG2

Then compile the pair:

./autogen.sh --with-lang1=../apertium-LANG1 --with-lang2=../apertium-LANG2
make -j

And test:

echo house | apertium -d . LANG1-LANG2
echo Haus | apertium -d . LANG2-LANG1

Now you can add words to apertium-LANG1-LANG2.LANG1-LANG2.dix, then test again:

make -j
echo house | apertium -d . LANG1-LANG2
echo Haus | apertium -d . LANG2-LANG1

If you had to add words to the monolingual dictionaries, you will have to type "make" in those directories first. Alternatively, there is a shortcut from the pair directory: "make langs" should make the monolingual dictionaries even if you're in the pair directory.