https://wiki.apertium.org/w/api.php?action=feedcontributions&user=Eirien&feedformat=atom
Apertium - User contributions [en]
2024-03-29T12:58:47Z
User contributions
MediaWiki 1.34.1
https://wiki.apertium.org/w/index.php?title=Install_Apertium_core_using_packaging&diff=73396
Install Apertium core using packaging
2021-04-27T18:26:03Z
<p>Eirien: /* The Apertium package repository */</p>
<hr />
<div>You are using a Debian/Red Hat-based operating system. You want a fast clean install.<br />
<br />
You could use a public repository<ref><br />
=====Using public repositories, such as Debian Stable=====<br />
<span style="color:darkred;">'''Not recommended'''</span>, except for assessment. See the main section on the Apertium package repository.<br />
<br />
Apertium is a mature project, so you will find packages for Apertium in well-known public repositories. Moreover, the Apertium core is well-developed, so often backwards-compatible. If you use one of these package downloads, they will work, and can give you a good idea of what Apertium is and can do.<br />
<br />
In Debian repositories, Apertium core is listed simply as,<br />
<br />
<pre><br />
apertium<br />
</pre><br />
<br />
And can be installed by <code>sudo apt get</code> or similar method. Also, several language translation pairs will be available. For various reasons, the author has successfully developed language pairs using public repository versions of Apertium core.<br />
<br />
There are good reasons not to use these downloads. First, although Apertium is often backwards-compatible, this is not a key aim of the project. The code may break between versions. Next, although Apertium is an older project, it is actively developed. Honest! Although Apertium core does not change often, it has been changed, and may be changed again. And Apertium has found a place, for example, supporting languages under threat. This is work without end. Check the main repository and you will find a steady stream of contributions. And then we can talk about the many projects surrounding Apertium. Despite organisation, key contributors can not always remember what is available, and where it is! And, finally, contributors to Apertium put a lot of effort into maintaining a package system that is up-to-date and ready to go.<br />
<br />
Sometimes there is a good reason not to use the Apertium-provided packages. A workplace may place a restriction on install or rollout. Go ahead, use versions from a public repository. But the Apertium-provided packages are reliable, a key resource, and most people should use them.<br />
</ref> but you will be more up-to-date with the Apertium package repository.<br />
<br />
== The Apertium package repository ==<br />
{{TOCD}}<br />
[[User:Tino Didriksen|Tino Didriksen]] provides a repository of Debian/RPM packages. Downloading packages from there, you can install all prerequisites and Apertium tools. You also get, for optional install; release-level language pairs, service providers, constraint grammar code, and more. All under package management.<br />
<br />
Before we start, a comment from the mail list, from an admin-level contributor,<br />
<br />
<blockquote><br />
If you're going to be developing, I'd say pick nightly. The -release<br />
versions of lttoolbox and apertium might not be up-to-date with what<br />
apertium-init expects.<br />
</blockquote><br />
<br />
<br />
=== Debian-based ===<br />
If you have a Debian or Debian-based system (e.g. Ubuntu, Mint...).<br />
<br />
First, remove any Apertium packages you have installed from operating system repositories. They will be out-of-date, sometimes by years.<br />
<br />
Add the repository,<br />
<br />
<pre><br />
# Pick one:<br />
<br />
# Nightly, unstable, new, almost always use this:<br />
curl -sS https://apertium.projectjj.com/apt/install-nightly.sh | sudo bash<br />
<br />
# Release, stable, old:<br />
curl -sS https://apertium.projectjj.com/apt/install-release.sh | sudo bash<br />
</pre><br />
<br />
<br />
You should see messages. <br />
<br />
Install dev tools,<br />
<br />
<pre><br />
sudo apt-get -f install apertium-all-dev<br />
</pre><br />
<br />
Note: If you get any errors such as uninstallable unmet dependencies, try updating your ubuntu to the latest normal version. <br />
<br />
====About the Debian repository install====<br />
Check the script installed Apertium repository details,<br />
<br />
<pre><br />
apt-cache policy | grep apertium<br />
</pre><br />
<br />
Unfortunately, due to the seamless upgrading of Debian packaging, it is difficult to see which packages the new repository has added, and where. Even Synaptic, the wonder GUI, has no way through. You could try this brute force commandline,<br />
<br />
<pre><br />
find /var/lib/apt/lists/ |grep projectjj.*Packages | xargs grep -h Package<br />
</pre><br />
<br />
Which will, if nothing else, tell you a lot about byways of the Apertium project.<br />
<br />
==== Cleaning up a mixed install ====<br />
If you have installed some of the core tools from source and want to switch to packages, you can remove the source install with something like this. Be extremely careful with getting this right so it doesn't wipe your whole machine. (If Apertium isn't working no matter what you try, then this is a helpful step to take after uninstalling and reinstalling it)<br />
<br />
<pre><br />
# Wipe most files<br />
rm -rfv /usr/local/*/*apertium* /usr/local/*/*/*apertium*<br />
rm -rfv /usr/local/bin/lt-* /usr/local/bin/lrx-* /usr/local/bin/lsx-*<br />
rm -rfv /usr/local/*/*lttoolbox* /usr/local/*/*/*lttoolbox*<br />
rm -rfv /usr/local/*/*hfst* /usr/local/*/*/*hfst*<br />
rm -rfv /usr/local/bin/cg-*<br />
rm -rfv /usr/local/*/*cg3* /usr/local/*/*/*cg3*<br />
<br />
# Wipe any certain conflicting file<br />
grep -h '^Package:' /var/lib/apt/lists/apertium.projectjj.com* \<br />
| awk '{print $2}' | xargs -rn1 dpkg-query -L 2>/dev/null | grep '^/usr' \<br />
| xargs -rn1 '-I{}' bash -c "if [[ -f '{}' ]]; then echo '{}'; fi;" \<br />
| sed 's/\/usr\//\/usr\/local\//;' | xargs rm -fv<br />
</pre><br />
<br />
=== Red Hat-based ===<br />
If you have a RHEL/CentOS/Fedora/OpenSUSE or similar system.<br />
<br />
First, remove any Apertium packages you have installed from operating system repositories. They will be out-of-date, sometimes by years.<br />
<br />
Add the repository,<br />
<br />
<pre><br />
# Pick one<br />
<br />
# Nightly, unstable, new, almost always use this:<br />
curl -sS https://apertium.projectjj.com/rpm/install-nightly.sh | sudo bash<br />
<br />
# Release, stable, old:<br />
curl -sS https://apertium.projectjj.com/rpm/install-release.sh | sudo bash<br />
</pre><br />
<br />
Install dev tools,<br />
<br />
<pre><br />
# RHEL/CentOS:<br />
sudo yum install apertium-all-devel<br />
# Fedora:<br />
sudo dnf install apertium-all-devel<br />
# OpenSUSE:<br />
sudo zypper install apertium-all-devel<br />
</pre><br />
<br />
=== Arch Linux ===<br />
<br />
<pre><br />
curl -sS https://apertium.projectjj.com/archlinux-apertium-all-dev.sh | sudo bash<br />
</pre><br />
<br />
== And now... ==<br />
Maybe you want to test this is working. So download a test language, [[Install language data using packaging]]. Then [[Install quick tests]] can make you sure.<br />
<br />
If you know you need the HFST or CG3 modules, packages are available, or see [[Installation of grammar libraries]]. You may also be interested in the many tips at [[Bash completion]].<br />
<br />
If you are developing, now you can [[Install language data by compiling]] or [[How to bootstrap a new pair]]. If you are not, [[Install language data using packaging]] and go speak many languages :)<br />
<br />
==Notes==<br />
<references/><br />
[[Category:Installation]]<br />
[[Category:Documentation in English]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Task_ideas_for_Google_Code-in&diff=70525
Task ideas for Google Code-in
2019-10-24T19:31:47Z
<p>Eirien: /* Mentors */</p>
<hr />
<div><br />
{{TOCD}}<br />
This is the task ideas page for [https://developers.google.com/open-source/gci/ Google Code-in], here you can find ideas on interesting tasks that will improve your knowledge of Apertium and help you get into the world of open-source development.<br />
<br />
The people column lists people who you should get in contact with to request further information. All tasks are 2 hours maximum estimated amount of time that would be spent on the task by an experienced developer, however:<br />
<br />
<!--# '''this does not include time taken to [[Minimal installation from SVN|install]] / set up apertium (and relevant tools)'''.--><br />
# this is the time expected to take by an experienced developer, you may find that you spend more time on the task because of the learning curve. <br />
<br />
<!--Если ты не понимаешь английский язык или предпочитаешь работать над русским языком или другими языками России, смотри: [[Task ideas for Google Code-in/Russian]]--><br />
'''Categories:'''<br />
<br />
* {{sc|code}}: Tasks related to writing or refactoring code<br />
* {{sc|documentation}}: Tasks related to creating/editing documents and helping others learn more<br />
* {{sc|research}}: Tasks related to community management, outreach/marketting, or studying problems and recommending solutions<br />
* {{sc|quality}}: Tasks related to testing and ensuring code is of high quality.<br />
* {{sc|design}}: Tasks related to user experience research or user interface design and interaction<br />
<br />
'''Clarification of "multiple task" types'''<br />
* multi = number of students who can do a given task<br />
* dup = number of times a student can do the same task<br />
<br />
You can find descriptions of some of the mentors [[List_of_Apertium_mentors | here]].<br />
<br />
== Task ideas ==<br />
<br />
'''The current task ideas here are for 2019.''' See [[Talk:Task ideas for Google Code-in]] for task ideas from previous years.<br />
<br />
<table class="sortable wikitable"><br />
<tr><th>type</th><th>title</th><th>description</th><th>tags</th><th>mentors</th><th>bgnr?</th><th>multi?</th><th>duplicates</th></tr><br />
<br />
{{Taskidea<br />
|type=research, quality, documentation<br />
|title=Adopt a Wiki page<br />
|description=Request an Apertium wiki account and adopt a wiki page by updating and fixing any issues with it. Examples of things to update might be documentation that still refers to our SVN repo (we're on GitHub now), documentation of new features, clarification of unclear things, indicating that a page no longer reflects how things are done, or updating documentation to reflect the current options and defaults of various tools.<br />
|tags=wiki<br />
|mentors=*<br />
|multi=150<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research, code<br />
|title=expand coverage of Kyrgyz to English structural transfer<br />
|description=Find a sentence in Kyrgyz that once the lexical items are added to the bilingual dictionary is not fully (or correctly) parsed by the <tt>kir-eng-transfer</tt> Apertium mode. Determine what rule(s) need(s) to be added (or fixed) to cover this structure, and update <tt>apertium-eng-kir.kir-eng.rtx</tt> accordingly. You will first want to clone and compile [https://github.com/apertium/apertium-eng-kir apertium-eng-kir].<br />
|tags=Kyrgyz, English, recursive transfer, pairs<br />
|mentors=JNW, popcorndude<br />
|multi=150<br />
|dup=10<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|title=Add recursive transfer support to a language pair that doesn't support it<br />
|description=Make a branch of an Apertium language pair that doesn't support recursive transfer and call it "recursive transfer". Add vanilla <tt>.rtx</tt> files for both directions, and modify <tt>Makefile.am</tt> and <tt>modes.xml</tt> so that the branch compiles and runs. See [http://wiki.apertium.org/wiki/Apertium-recursive#Incorporating_Into_a_Pair this page] for instructions on how to do this.<br />
|tags=recursive transfer, pairs<br />
|mentors=JNW, popcorndude<br />
|multi=150<br />
|dup=10<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|title=Add 2 recursive transfer rules to a language pair<br />
|description=Add two recursive transfer rules to an Apertium language pair. These rules consist of, at minimum, a syntactic pattern to match, a phrase to combine them into, and an output pattern ([http://wiki.apertium.org/wiki/Apertium-recursive#Further_Documentation more documentation here]). If the language pair does not support recursive transfer, make sure [http://wiki.apertium.org/wiki/Apertium-recursive#Incorporating_Into_a_Pair to set it up] first.<br />
|tags=recursive transfer, pairs<br />
|mentors=JNW, popcorndude<br />
|multi=150<br />
|dup=20<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW<br />
|title=Use apertium-init to bootstrap a new language pair<br />
|description=Use the [[Apertium-init]] script to bootstrap a new translation pair between two languages which have monolingual modules already in Apertium. To see if a translation pair has already been made, search our repositories on [https://github.com/apertium/ github], and especially ask on IRC. Add 100 common stems to the dictionary. Your submission should be in the form of a repository on github that we can fork to the Apertium organisation.<br />
|tags=languages, bootstrap, dictionaries, translators<br />
|beginner=yes<br />
|multi=25<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW<br />
|title=Use apertium-init to bootstrap a new language module<br />
|description=Use the [[Apertium-init]] script to bootstrap a new language module that doesn't currently exist in Apertium. To see if a language is available, search our repositories on [https://github.com/apertium/ github], and especially ask on IRC. Add enough stems and morphology to the module so that it analyses and generates at least 100 correct forms. Your submission should be in the form of a repository on github that we can fork to the Apertium organisation. [[Task ideas for Google Code-in/Add words from frequency list|Read more about adding stems...]]<br />
|tags=languages, bootstrap, dictionaries<br />
|beginner=yes<br />
|multi=25<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW, sevilay, Unhammer<br />
|title=Write 10 lexical selection for an existing translation pair<br />
|description=Add 10 lexical selection rules to an existing translation pair. Submit your work as a github pull request to that pair. [[Task ideas for Google Code-in/Add lexical-select rules|Read more...]]<br />
|tags=languages, bootstrap, lexical selection, translators<br />
|multi=25<br />
|dup=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW, Unhammer<br />
|title=Write 10 constraint grammar rules for an existing language module<br />
|description=Add 10 constraint grammar rules to an existing language module for a language that you know. Submit your work as a github pull request to that pair. [[Task ideas for Google Code-in/Add constraint-grammar rules|Read more...]]<br />
|tags=languages, bootstrap, constraint grammar<br />
|multi=25<br />
|dup=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=Research<br />
|mentors=anakuz, fotonzade<br />
|title=Syntactic annotation of text<br />
|description=Pick a text of about 200 words and make a syntactic annotation for it according to the Universal Dependencies treebank. UD Annotatrix can be used for visualisation. Consult with your mentor about the language.<br />
|tags=UD, trees, annotation<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=JNW,ftyers, fotonzade<br />
|title=Create a UD-Apertium morphology mapping<br />
|description=Choose a language that has a Universal Dependencies treebank and tabulate a potential set of Apertium morph labels based on the (universal) UD morph labels. See Apertium's [[list of symbols]] and [http://universaldependencies.org/ UD]'s POS and feature tags for the labels.<br />
|tags=morphology, ud, dependencies<br />
|beginner=<br />
|multi=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=JNW, ftyers, fotonzade<br />
|title=Create an Apertium-UD morphology mapping<br />
|description=Choose a language that has an Apertium morphological analyser and adapt it to convert the morphology to UD morphology<br />
|tags=morphology, ud, dependencies<br />
|beginner=<br />
|multi=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|title=Install Apertium and verify that it works<br />
|description=See [[Installation]] for instructions and if you encounter any issues along the way, document them and/or improve the wiki instructions!<br />
|tags=bash<br />
|mentors=ftyers, JNW, Unhammer, anakuz, Josh, fotonzade, sevilay, eirien<br />
|multi=150<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|title=Write a contrastive grammar<br />
|description=Document 6 differences between two (preferably related) languages and where they would need to be addressed in the [[Apertium pipeline]] (morph analysis, transfer, etc). Use a grammar book/resource for inspiration. Each difference should have no fewer than 3 examples. Put your work on the Apertium wiki under [[Language1_and_Language2/Contrastive_grammar]]. See [[Farsi_and_English/Pending_tests]] for an example of a contrastive grammar that a previous GCI student made.<br />
|mentors=Mikel, JNW, Josh, xavivars, fotonzade, sevilay <br />
|tags=wiki, languages<br />
|beginner=yes<br />
|multi=40<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=Mikel, anakuz, xavivars, fotonzade, sevilay, Unhammer, eirien<br />
|tags=xml, dictionaries, svn<br />
|title=Add 200 new entries to the bilingual dictionary of language pair %AAA%-%BBB%<br />
|description=Our translation systems require large lexicons so as to provide production-quality coverage of any input data. This task requires the student to add 200 new words to a bidirectional dictionary. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Grow_bilingual Read more]...<br />
|multi=40<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=Mikel, anakuz, xavivars, fotonzade, ftyers, sevilay, eirien<br />
|tags=xml, dictionaries, svn<br />
|title=Add 500 new entries to the bilingual dictionary of language pair %AAA%-%BBB%<br />
|description=Our translation systems require large lexicons so as to provide production-quality coverage of any input data. This task requires the student to add 500 new words to a bidirectional dictionary. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Grow_bilingual Read more]...<br />
|dup=10<br />
|beginner=no<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=fotonzade, JNW, ftyers, anakuz, xavivars, Mikel, shardulc, sevilay, Unhammer<br />
|tags=xml, dictionaries, svn<br />
|title=Post-edit 100 sentences of any public domain text from %AAA% to %BBB%<br />
|description=Many of our systems benefit from statistical methods used with (ideally public domain) bilingual data. For this task, you need to translate a public domain text from %AAA% to %BBB% using any available machine translation system and clean up the translations yourself manually. Commit the post-edited texts (in plain text format) to an existing (via pull request) or if needed new github repository for the pair in dev/ or texts/ folder. The texts are subject to mentor approval.<br />
|multi=10<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=Mikel, anakuz, xavivars, fotonzade, sevilay<br />
|tags=disambiguation, svn<br />
|title=Disambiguate 500 tokens of text in %AAA%<br />
|description=Run some text through a morphological analyser and disambiguate the output. Discuss with the mentor beforehand to approve the choice of language and text. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Manually_disambiguate_text Read more]...<br />
|multi=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=eirien<br />
|tags=dictionaries<br />
|title=Categorise 100 words from frequency list in %AAA% <br />
|description=Categorise words by frequency into one of the major part-of-speech categories. You will receive a frequency list. Work from top to bottom. At the beginning of each line you should put a letter which categorises the word form by its part-of-speech. For example n for noun, v for verb, etc. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Categorise_words_from_frequency_list Read more]...<br />
|multi=yes<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=eirien<br />
|tags=dictionaries<br />
|title=Categorise 500 words from frequency list in %AAA% <br />
|description=Categorise words by frequency into one of the major part-of-speech categories. You will receive a frequency list. Work from top to bottom. At the beginning of each line you should put a letter which categorises the word form by its part-of-speech. For example n for noun, v for verb, etc. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Categorise_words_from_frequency_list Read more]...<br />
|multi=yes<br />
}}<br />
<br />
<br />
<br />
</table><br />
<br />
==Mentors==<br />
<br />
These are languages that can be substituted for AAA and/or BBB for tasks each mentor is listed to mentor above.<br />
<br />
{|class=wikitable<br />
! Mentor !! Languages<br />
|-<br />
| ftyers || eng, spa, cat, fra, nor, rus, por, swe, tur, gag, aze<br />
|-<br />
| JNW || eng, spa, fra, rus, tur, gag, aze, kaz, kir, kaa, tat, bak, kum, nog, kaa, uzb, uig, crh, khk, yid<br />
|-<br />
| anakuz || grn, spa, por, rus<br />
|-<br />
| fotonzade || eng, tur, aze, uig, tat, crh, kmr, ckb, fas<br />
|-<br />
| xavivars || cat, spa, eng, fra <br />
|-<br />
| unhammer || nno, nob, swe, dan, fao, sme, ovd<br />
|-<br />
| shardulc || eng, fra, mar, hin, urd, kan<br />
|-<br />
| m-alpha || eng, fra, byv<br />
|-<br />
| popcorndude || eng, spa<br />
|-<br />
|sevilay || tur, kaz, aze, tat, gag, eng<br />
|-<br />
| eirien || sah, rus, eng<br />
|}<br />
<br />
== Counts ==<br />
<br />
Last updated by [[User:Eirien|Eirien]] ([[User talk:Eirien|talk]]) 22:29, 24 October 2019 (UTC+3).<br />
<br />
{| class="sortable wikitable"<br />
|-<br />
! scope="col" | Category<br />
! scope="col" | Count<br />
|-<br />
|{{sc|code}}<br />
|align="right"|35<br />
|-<br />
|{{sc|documentation}}<br />
|align="right"|5<br />
|-<br />
|{{sc|research}}<br />
|align="right"|11<br />
|-<br />
|{{sc|quality}}<br />
|align="right"|10<br />
|-<br />
|{{sc|design}}<br />
|align="right"|5<br />
|- class="sortbottom"<br />
!Total<br />
|align="right"|66<br />
|}<br />
<br />
[[Category:Google Code-in]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Task_ideas_for_Google_Code-in&diff=70524
Task ideas for Google Code-in
2019-10-24T19:29:55Z
<p>Eirien: </p>
<hr />
<div><br />
{{TOCD}}<br />
This is the task ideas page for [https://developers.google.com/open-source/gci/ Google Code-in], here you can find ideas on interesting tasks that will improve your knowledge of Apertium and help you get into the world of open-source development.<br />
<br />
The people column lists people who you should get in contact with to request further information. All tasks are 2 hours maximum estimated amount of time that would be spent on the task by an experienced developer, however:<br />
<br />
<!--# '''this does not include time taken to [[Minimal installation from SVN|install]] / set up apertium (and relevant tools)'''.--><br />
# this is the time expected to take by an experienced developer, you may find that you spend more time on the task because of the learning curve. <br />
<br />
<!--Если ты не понимаешь английский язык или предпочитаешь работать над русским языком или другими языками России, смотри: [[Task ideas for Google Code-in/Russian]]--><br />
'''Categories:'''<br />
<br />
* {{sc|code}}: Tasks related to writing or refactoring code<br />
* {{sc|documentation}}: Tasks related to creating/editing documents and helping others learn more<br />
* {{sc|research}}: Tasks related to community management, outreach/marketting, or studying problems and recommending solutions<br />
* {{sc|quality}}: Tasks related to testing and ensuring code is of high quality.<br />
* {{sc|design}}: Tasks related to user experience research or user interface design and interaction<br />
<br />
'''Clarification of "multiple task" types'''<br />
* multi = number of students who can do a given task<br />
* dup = number of times a student can do the same task<br />
<br />
You can find descriptions of some of the mentors [[List_of_Apertium_mentors | here]].<br />
<br />
== Task ideas ==<br />
<br />
'''The current task ideas here are for 2019.''' See [[Talk:Task ideas for Google Code-in]] for task ideas from previous years.<br />
<br />
<table class="sortable wikitable"><br />
<tr><th>type</th><th>title</th><th>description</th><th>tags</th><th>mentors</th><th>bgnr?</th><th>multi?</th><th>duplicates</th></tr><br />
<br />
{{Taskidea<br />
|type=research, quality, documentation<br />
|title=Adopt a Wiki page<br />
|description=Request an Apertium wiki account and adopt a wiki page by updating and fixing any issues with it. Examples of things to update might be documentation that still refers to our SVN repo (we're on GitHub now), documentation of new features, clarification of unclear things, indicating that a page no longer reflects how things are done, or updating documentation to reflect the current options and defaults of various tools.<br />
|tags=wiki<br />
|mentors=*<br />
|multi=150<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research, code<br />
|title=expand coverage of Kyrgyz to English structural transfer<br />
|description=Find a sentence in Kyrgyz that once the lexical items are added to the bilingual dictionary is not fully (or correctly) parsed by the <tt>kir-eng-transfer</tt> Apertium mode. Determine what rule(s) need(s) to be added (or fixed) to cover this structure, and update <tt>apertium-eng-kir.kir-eng.rtx</tt> accordingly. You will first want to clone and compile [https://github.com/apertium/apertium-eng-kir apertium-eng-kir].<br />
|tags=Kyrgyz, English, recursive transfer, pairs<br />
|mentors=JNW, popcorndude<br />
|multi=150<br />
|dup=10<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|title=Add recursive transfer support to a language pair that doesn't support it<br />
|description=Make a branch of an Apertium language pair that doesn't support recursive transfer and call it "recursive transfer". Add vanilla <tt>.rtx</tt> files for both directions, and modify <tt>Makefile.am</tt> and <tt>modes.xml</tt> so that the branch compiles and runs. See [http://wiki.apertium.org/wiki/Apertium-recursive#Incorporating_Into_a_Pair this page] for instructions on how to do this.<br />
|tags=recursive transfer, pairs<br />
|mentors=JNW, popcorndude<br />
|multi=150<br />
|dup=10<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|title=Add 2 recursive transfer rules to a language pair<br />
|description=Add two recursive transfer rules to an Apertium language pair. These rules consist of, at minimum, a syntactic pattern to match, a phrase to combine them into, and an output pattern ([http://wiki.apertium.org/wiki/Apertium-recursive#Further_Documentation more documentation here]). If the language pair does not support recursive transfer, make sure [http://wiki.apertium.org/wiki/Apertium-recursive#Incorporating_Into_a_Pair to set it up] first.<br />
|tags=recursive transfer, pairs<br />
|mentors=JNW, popcorndude<br />
|multi=150<br />
|dup=20<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW<br />
|title=Use apertium-init to bootstrap a new language pair<br />
|description=Use the [[Apertium-init]] script to bootstrap a new translation pair between two languages which have monolingual modules already in Apertium. To see if a translation pair has already been made, search our repositories on [https://github.com/apertium/ github], and especially ask on IRC. Add 100 common stems to the dictionary. Your submission should be in the form of a repository on github that we can fork to the Apertium organisation.<br />
|tags=languages, bootstrap, dictionaries, translators<br />
|beginner=yes<br />
|multi=25<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW<br />
|title=Use apertium-init to bootstrap a new language module<br />
|description=Use the [[Apertium-init]] script to bootstrap a new language module that doesn't currently exist in Apertium. To see if a language is available, search our repositories on [https://github.com/apertium/ github], and especially ask on IRC. Add enough stems and morphology to the module so that it analyses and generates at least 100 correct forms. Your submission should be in the form of a repository on github that we can fork to the Apertium organisation. [[Task ideas for Google Code-in/Add words from frequency list|Read more about adding stems...]]<br />
|tags=languages, bootstrap, dictionaries<br />
|beginner=yes<br />
|multi=25<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW, sevilay, Unhammer<br />
|title=Write 10 lexical selection for an existing translation pair<br />
|description=Add 10 lexical selection rules to an existing translation pair. Submit your work as a github pull request to that pair. [[Task ideas for Google Code-in/Add lexical-select rules|Read more...]]<br />
|tags=languages, bootstrap, lexical selection, translators<br />
|multi=25<br />
|dup=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW, Unhammer<br />
|title=Write 10 constraint grammar rules for an existing language module<br />
|description=Add 10 constraint grammar rules to an existing language module for a language that you know. Submit your work as a github pull request to that pair. [[Task ideas for Google Code-in/Add constraint-grammar rules|Read more...]]<br />
|tags=languages, bootstrap, constraint grammar<br />
|multi=25<br />
|dup=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=Research<br />
|mentors=anakuz, fotonzade<br />
|title=Syntactic annotation of text<br />
|description=Pick a text of about 200 words and make a syntactic annotation for it according to the Universal Dependencies treebank. UD Annotatrix can be used for visualisation. Consult with your mentor about the language.<br />
|tags=UD, trees, annotation<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=JNW,ftyers, fotonzade<br />
|title=Create a UD-Apertium morphology mapping<br />
|description=Choose a language that has a Universal Dependencies treebank and tabulate a potential set of Apertium morph labels based on the (universal) UD morph labels. See Apertium's [[list of symbols]] and [http://universaldependencies.org/ UD]'s POS and feature tags for the labels.<br />
|tags=morphology, ud, dependencies<br />
|beginner=<br />
|multi=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=JNW, ftyers, fotonzade<br />
|title=Create an Apertium-UD morphology mapping<br />
|description=Choose a language that has an Apertium morphological analyser and adapt it to convert the morphology to UD morphology<br />
|tags=morphology, ud, dependencies<br />
|beginner=<br />
|multi=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|title=Install Apertium and verify that it works<br />
|description=See [[Installation]] for instructions and if you encounter any issues along the way, document them and/or improve the wiki instructions!<br />
|tags=bash<br />
|mentors=ftyers, JNW, Unhammer, anakuz, Josh, fotonzade, sevilay, eirien<br />
|multi=150<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|title=Write a contrastive grammar<br />
|description=Document 6 differences between two (preferably related) languages and where they would need to be addressed in the [[Apertium pipeline]] (morph analysis, transfer, etc). Use a grammar book/resource for inspiration. Each difference should have no fewer than 3 examples. Put your work on the Apertium wiki under [[Language1_and_Language2/Contrastive_grammar]]. See [[Farsi_and_English/Pending_tests]] for an example of a contrastive grammar that a previous GCI student made.<br />
|mentors=Mikel, JNW, Josh, xavivars, fotonzade, sevilay <br />
|tags=wiki, languages<br />
|beginner=yes<br />
|multi=40<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=Mikel, anakuz, xavivars, fotonzade, sevilay, Unhammer, eirien<br />
|tags=xml, dictionaries, svn<br />
|title=Add 200 new entries to the bilingual dictionary of language pair %AAA%-%BBB%<br />
|description=Our translation systems require large lexicons so as to provide production-quality coverage of any input data. This task requires the student to add 200 new words to a bidirectional dictionary. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Grow_bilingual Read more]...<br />
|multi=40<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=Mikel, anakuz, xavivars, fotonzade, ftyers, sevilay, eirien<br />
|tags=xml, dictionaries, svn<br />
|title=Add 500 new entries to the bilingual dictionary of language pair %AAA%-%BBB%<br />
|description=Our translation systems require large lexicons so as to provide production-quality coverage of any input data. This task requires the student to add 500 new words to a bidirectional dictionary. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Grow_bilingual Read more]...<br />
|dup=10<br />
|beginner=no<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=fotonzade, JNW, ftyers, anakuz, xavivars, Mikel, shardulc, sevilay, Unhammer<br />
|tags=xml, dictionaries, svn<br />
|title=Post-edit 100 sentences of any public domain text from %AAA% to %BBB%<br />
|description=Many of our systems benefit from statistical methods used with (ideally public domain) bilingual data. For this task, you need to translate a public domain text from %AAA% to %BBB% using any available machine translation system and clean up the translations yourself manually. Commit the post-edited texts (in plain text format) to an existing (via pull request) or if needed new github repository for the pair in dev/ or texts/ folder. The texts are subject to mentor approval.<br />
|multi=10<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=Mikel, anakuz, xavivars, fotonzade, sevilay<br />
|tags=disambiguation, svn<br />
|title=Disambiguate 500 tokens of text in %AAA%<br />
|description=Run some text through a morphological analyser and disambiguate the output. Discuss with the mentor beforehand to approve the choice of language and text. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Manually_disambiguate_text Read more]...<br />
|multi=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=eirien<br />
|tags=dictionaries<br />
|title=Categorise 100 words from frequency list in %AAA% <br />
|description=Categorise words by frequency into one of the major part-of-speech categories. You will receive a frequency list. Work from top to bottom. At the beginning of each line you should put a letter which categorises the word form by its part-of-speech. For example n for noun, v for verb, etc. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Categorise_words_from_frequency_list Read more]...<br />
|multi=yes<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=eirien<br />
|tags=dictionaries<br />
|title=Categorise 500 words from frequency list in %AAA% <br />
|description=Categorise words by frequency into one of the major part-of-speech categories. You will receive a frequency list. Work from top to bottom. At the beginning of each line you should put a letter which categorises the word form by its part-of-speech. For example n for noun, v for verb, etc. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Categorise_words_from_frequency_list Read more]...<br />
|multi=yes<br />
}}<br />
<br />
<br />
<br />
</table><br />
<br />
==Mentors==<br />
<br />
These are languages that can be substituted for AAA and/or BBB for tasks each mentor is listed to mentor above.<br />
<br />
{|class=wikitable<br />
! Mentor !! Languages<br />
|-<br />
| ftyers || eng, spa, cat, fra, nor, rus, por, swe, tur, gag, aze<br />
|-<br />
| JNW || eng, spa, fra, rus, tur, gag, aze, kaz, kir, kaa, tat, bak, kum, nog, kaa, uzb, uig, crh, khk, yid<br />
|-<br />
| anakuz || grn, spa, por, rus<br />
|-<br />
| fotonzade || eng, tur, aze, uig, tat, crh, kmr, ckb, fas<br />
|-<br />
| xavivars || cat, spa, eng, fra <br />
|-<br />
| unhammer || nno, nob, swe, dan, fao, sme, ovd<br />
|-<br />
| shardulc || eng, fra, mar, hin, urd, kan<br />
|-<br />
| m-alpha || eng, fra, byv<br />
|-<br />
| popcorndude || eng, spa<br />
|-<br />
|sevilay || tur, kaz, aze, tat, gag, eng<br />
|-<br />
| eirien || sah, rus, eng, kaz<br />
|}<br />
<br />
== Counts ==<br />
<br />
Last updated by [[User:Eirien|Eirien]] ([[User talk:Eirien|talk]]) 22:29, 24 October 2019 (UTC+3).<br />
<br />
{| class="sortable wikitable"<br />
|-<br />
! scope="col" | Category<br />
! scope="col" | Count<br />
|-<br />
|{{sc|code}}<br />
|align="right"|35<br />
|-<br />
|{{sc|documentation}}<br />
|align="right"|5<br />
|-<br />
|{{sc|research}}<br />
|align="right"|11<br />
|-<br />
|{{sc|quality}}<br />
|align="right"|10<br />
|-<br />
|{{sc|design}}<br />
|align="right"|5<br />
|- class="sortbottom"<br />
!Total<br />
|align="right"|66<br />
|}<br />
<br />
[[Category:Google Code-in]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Task_ideas_for_Google_Code-in&diff=70523
Task ideas for Google Code-in
2019-10-24T19:21:21Z
<p>Eirien: /* Task ideas */</p>
<hr />
<div><br />
{{TOCD}}<br />
This is the task ideas page for [https://developers.google.com/open-source/gci/ Google Code-in], here you can find ideas on interesting tasks that will improve your knowledge of Apertium and help you get into the world of open-source development.<br />
<br />
The people column lists people who you should get in contact with to request further information. All tasks are 2 hours maximum estimated amount of time that would be spent on the task by an experienced developer, however:<br />
<br />
<!--# '''this does not include time taken to [[Minimal installation from SVN|install]] / set up apertium (and relevant tools)'''.--><br />
# this is the time expected to take by an experienced developer, you may find that you spend more time on the task because of the learning curve. <br />
<br />
<!--Если ты не понимаешь английский язык или предпочитаешь работать над русским языком или другими языками России, смотри: [[Task ideas for Google Code-in/Russian]]--><br />
'''Categories:'''<br />
<br />
* {{sc|code}}: Tasks related to writing or refactoring code<br />
* {{sc|documentation}}: Tasks related to creating/editing documents and helping others learn more<br />
* {{sc|research}}: Tasks related to community management, outreach/marketting, or studying problems and recommending solutions<br />
* {{sc|quality}}: Tasks related to testing and ensuring code is of high quality.<br />
* {{sc|design}}: Tasks related to user experience research or user interface design and interaction<br />
<br />
'''Clarification of "multiple task" types'''<br />
* multi = number of students who can do a given task<br />
* dup = number of times a student can do the same task<br />
<br />
You can find descriptions of some of the mentors [[List_of_Apertium_mentors | here]].<br />
<br />
== Task ideas ==<br />
<br />
'''The current task ideas here are for 2019.''' See [[Talk:Task ideas for Google Code-in]] for task ideas from previous years.<br />
<br />
<table class="sortable wikitable"><br />
<tr><th>type</th><th>title</th><th>description</th><th>tags</th><th>mentors</th><th>bgnr?</th><th>multi?</th><th>duplicates</th></tr><br />
<br />
{{Taskidea<br />
|type=research, quality, documentation<br />
|title=Adopt a Wiki page<br />
|description=Request an Apertium wiki account and adopt a wiki page by updating and fixing any issues with it. Examples of things to update might be documentation that still refers to our SVN repo (we're on GitHub now), documentation of new features, clarification of unclear things, indicating that a page no longer reflects how things are done, or updating documentation to reflect the current options and defaults of various tools.<br />
|tags=wiki<br />
|mentors=*<br />
|multi=150<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research, code<br />
|title=expand coverage of Kyrgyz to English structural transfer<br />
|description=Find a sentence in Kyrgyz that once the lexical items are added to the bilingual dictionary is not fully (or correctly) parsed by the <tt>kir-eng-transfer</tt> Apertium mode. Determine what rule(s) need(s) to be added (or fixed) to cover this structure, and update <tt>apertium-eng-kir.kir-eng.rtx</tt> accordingly. You will first want to clone and compile [https://github.com/apertium/apertium-eng-kir apertium-eng-kir].<br />
|tags=Kyrgyz, English, recursive transfer, pairs<br />
|mentors=JNW, popcorndude<br />
|multi=150<br />
|dup=10<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|title=Add recursive transfer support to a language pair that doesn't support it<br />
|description=Make a branch of an Apertium language pair that doesn't support recursive transfer and call it "recursive transfer". Add vanilla <tt>.rtx</tt> files for both directions, and modify <tt>Makefile.am</tt> and <tt>modes.xml</tt> so that the branch compiles and runs. See [http://wiki.apertium.org/wiki/Apertium-recursive#Incorporating_Into_a_Pair this page] for instructions on how to do this.<br />
|tags=recursive transfer, pairs<br />
|mentors=JNW, popcorndude<br />
|multi=150<br />
|dup=10<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|title=Add 2 recursive transfer rules to a language pair<br />
|description=Add two recursive transfer rules to an Apertium language pair. These rules consist of, at minimum, a syntactic pattern to match, a phrase to combine them into, and an output pattern ([http://wiki.apertium.org/wiki/Apertium-recursive#Further_Documentation more documentation here]). If the language pair does not support recursive transfer, make sure [http://wiki.apertium.org/wiki/Apertium-recursive#Incorporating_Into_a_Pair to set it up] first.<br />
|tags=recursive transfer, pairs<br />
|mentors=JNW, popcorndude<br />
|multi=150<br />
|dup=20<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW<br />
|title=Use apertium-init to bootstrap a new language pair<br />
|description=Use the [[Apertium-init]] script to bootstrap a new translation pair between two languages which have monolingual modules already in Apertium. To see if a translation pair has already been made, search our repositories on [https://github.com/apertium/ github], and especially ask on IRC. Add 100 common stems to the dictionary. Your submission should be in the form of a repository on github that we can fork to the Apertium organisation.<br />
|tags=languages, bootstrap, dictionaries, translators<br />
|beginner=yes<br />
|multi=25<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW<br />
|title=Use apertium-init to bootstrap a new language module<br />
|description=Use the [[Apertium-init]] script to bootstrap a new language module that doesn't currently exist in Apertium. To see if a language is available, search our repositories on [https://github.com/apertium/ github], and especially ask on IRC. Add enough stems and morphology to the module so that it analyses and generates at least 100 correct forms. Your submission should be in the form of a repository on github that we can fork to the Apertium organisation. [[Task ideas for Google Code-in/Add words from frequency list|Read more about adding stems...]]<br />
|tags=languages, bootstrap, dictionaries<br />
|beginner=yes<br />
|multi=25<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW, sevilay, Unhammer<br />
|title=Write 10 lexical selection for an existing translation pair<br />
|description=Add 10 lexical selection rules to an existing translation pair. Submit your work as a github pull request to that pair. [[Task ideas for Google Code-in/Add lexical-select rules|Read more...]]<br />
|tags=languages, bootstrap, lexical selection, translators<br />
|multi=25<br />
|dup=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW, Unhammer<br />
|title=Write 10 constraint grammar rules for an existing language module<br />
|description=Add 10 constraint grammar rules to an existing language module for a language that you know. Submit your work as a github pull request to that pair. [[Task ideas for Google Code-in/Add constraint-grammar rules|Read more...]]<br />
|tags=languages, bootstrap, constraint grammar<br />
|multi=25<br />
|dup=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=Research<br />
|mentors=anakuz, fotonzade<br />
|title=Syntactic annotation of text<br />
|description=Pick a text of about 200 words and make a syntactic annotation for it according to the Universal Dependencies treebank. UD Annotatrix can be used for visualisation. Consult with your mentor about the language.<br />
|tags=UD, trees, annotation<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=JNW,ftyers, fotonzade<br />
|title=Create a UD-Apertium morphology mapping<br />
|description=Choose a language that has a Universal Dependencies treebank and tabulate a potential set of Apertium morph labels based on the (universal) UD morph labels. See Apertium's [[list of symbols]] and [http://universaldependencies.org/ UD]'s POS and feature tags for the labels.<br />
|tags=morphology, ud, dependencies<br />
|beginner=<br />
|multi=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=JNW, ftyers, fotonzade<br />
|title=Create an Apertium-UD morphology mapping<br />
|description=Choose a language that has an Apertium morphological analyser and adapt it to convert the morphology to UD morphology<br />
|tags=morphology, ud, dependencies<br />
|beginner=<br />
|multi=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|title=Install Apertium and verify that it works<br />
|description=See [[Installation]] for instructions and if you encounter any issues along the way, document them and/or improve the wiki instructions!<br />
|tags=bash<br />
|mentors=ftyers, JNW, Unhammer, anakuz, Josh, fotonzade, sevilay, eirien<br />
|multi=150<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|title=Write a contrastive grammar<br />
|description=Document 6 differences between two (preferably related) languages and where they would need to be addressed in the [[Apertium pipeline]] (morph analysis, transfer, etc). Use a grammar book/resource for inspiration. Each difference should have no fewer than 3 examples. Put your work on the Apertium wiki under [[Language1_and_Language2/Contrastive_grammar]]. See [[Farsi_and_English/Pending_tests]] for an example of a contrastive grammar that a previous GCI student made.<br />
|mentors=Mikel, JNW, Josh, xavivars, fotonzade, sevilay <br />
|tags=wiki, languages<br />
|beginner=yes<br />
|multi=40<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=Mikel, anakuz, xavivars, fotonzade, sevilay, Unhammer, eirien<br />
|tags=xml, dictionaries, svn<br />
|title=Add 200 new entries to the bilingual dictionary of language pair %AAA%-%BBB%<br />
|description=Our translation systems require large lexicons so as to provide production-quality coverage of any input data. This task requires the student to add 200 new words to a bidirectional dictionary. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Grow_bilingual Read more]...<br />
|multi=40<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=Mikel, anakuz, xavivars, fotonzade, ftyers, sevilay, eirien<br />
|tags=xml, dictionaries, svn<br />
|title=Add 500 new entries to the bilingual dictionary of language pair %AAA%-%BBB%<br />
|description=Our translation systems require large lexicons so as to provide production-quality coverage of any input data. This task requires the student to add 500 new words to a bidirectional dictionary. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Grow_bilingual Read more]...<br />
|dup=10<br />
|beginner=no<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=fotonzade, JNW, ftyers, anakuz, xavivars, Mikel, shardulc, sevilay, Unhammer<br />
|tags=xml, dictionaries, svn<br />
|title=Post-edit 100 sentences of any public domain text from %AAA% to %BBB%<br />
|description=Many of our systems benefit from statistical methods used with (ideally public domain) bilingual data. For this task, you need to translate a public domain text from %AAA% to %BBB% using any available machine translation system and clean up the translations yourself manually. Commit the post-edited texts (in plain text format) to an existing (via pull request) or if needed new github repository for the pair in dev/ or texts/ folder. The texts are subject to mentor approval.<br />
|multi=10<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=Mikel, anakuz, xavivars, fotonzade, sevilay<br />
|tags=disambiguation, svn<br />
|title=Disambiguate 500 tokens of text in %AAA%<br />
|description=Run some text through a morphological analyser and disambiguate the output. Discuss with the mentor beforehand to approve the choice of language and text. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Manually_disambiguate_text Read more]...<br />
|multi=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=eirien<br />
|tags=dictionaries<br />
|title=Categorise words from frequency list in %AAA% <br />
|description=Categorise words by frequency into one of the major part-of-speech categories. You will receive a frequency list. Work from top to bottom. At the beginning of each line you should put a letter which categorises the word form by its part-of-speech. For example n for noun, v for verb, etc. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Categorise_words_from_frequency_list Read more]...<br />
|multi=yes<br />
}}<br />
<br />
<br />
<br />
</table><br />
<br />
==Mentors==<br />
<br />
These are languages that can be substituted for AAA and/or BBB for tasks each mentor is listed to mentor above.<br />
<br />
{|class=wikitable<br />
! Mentor !! Languages<br />
|-<br />
| ftyers || eng, spa, cat, fra, nor, rus, por, swe, tur, gag, aze<br />
|-<br />
| JNW || eng, spa, fra, rus, tur, gag, aze, kaz, kir, kaa, tat, bak, kum, nog, kaa, uzb, uig, crh, khk, yid<br />
|-<br />
| anakuz || grn, spa, por, rus<br />
|-<br />
| fotonzade || eng, tur, aze, uig, tat, crh, kmr, ckb, fas<br />
|-<br />
| xavivars || cat, spa, eng, fra <br />
|-<br />
| unhammer || nno, nob, swe, dan, fao, sme, ovd<br />
|-<br />
| shardulc || eng, fra, mar, hin, urd, kan<br />
|-<br />
| m-alpha || eng, fra, byv<br />
|-<br />
| popcorndude || eng, spa<br />
|-<br />
|sevilay || tur, kaz, aze, tat, gag, eng<br />
|-<br />
| eirien || sah, rus, eng, kaz<br />
|}<br />
<br />
== Counts ==<br />
<br />
Last updated by [[User:Eirien|Eirien]] ([[User talk:Eirien|talk]]) 22:16, 24 October 2019 (UTC+3).<br />
<br />
{| class="sortable wikitable"<br />
|-<br />
! scope="col" | Category<br />
! scope="col" | Count<br />
|-<br />
|{{sc|code}}<br />
|align="right"|35<br />
|-<br />
|{{sc|documentation}}<br />
|align="right"|5<br />
|-<br />
|{{sc|research}}<br />
|align="right"|11<br />
|-<br />
|{{sc|quality}}<br />
|align="right"|10<br />
|-<br />
|{{sc|design}}<br />
|align="right"|5<br />
|- class="sortbottom"<br />
!Total<br />
|align="right"|54<br />
|}<br />
<br />
[[Category:Google Code-in]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Task_ideas_for_Google_Code-in&diff=70522
Task ideas for Google Code-in
2019-10-24T19:19:08Z
<p>Eirien: </p>
<hr />
<div><br />
{{TOCD}}<br />
This is the task ideas page for [https://developers.google.com/open-source/gci/ Google Code-in], here you can find ideas on interesting tasks that will improve your knowledge of Apertium and help you get into the world of open-source development.<br />
<br />
The people column lists people who you should get in contact with to request further information. All tasks are 2 hours maximum estimated amount of time that would be spent on the task by an experienced developer, however:<br />
<br />
<!--# '''this does not include time taken to [[Minimal installation from SVN|install]] / set up apertium (and relevant tools)'''.--><br />
# this is the time expected to take by an experienced developer, you may find that you spend more time on the task because of the learning curve. <br />
<br />
<!--Если ты не понимаешь английский язык или предпочитаешь работать над русским языком или другими языками России, смотри: [[Task ideas for Google Code-in/Russian]]--><br />
'''Categories:'''<br />
<br />
* {{sc|code}}: Tasks related to writing or refactoring code<br />
* {{sc|documentation}}: Tasks related to creating/editing documents and helping others learn more<br />
* {{sc|research}}: Tasks related to community management, outreach/marketting, or studying problems and recommending solutions<br />
* {{sc|quality}}: Tasks related to testing and ensuring code is of high quality.<br />
* {{sc|design}}: Tasks related to user experience research or user interface design and interaction<br />
<br />
'''Clarification of "multiple task" types'''<br />
* multi = number of students who can do a given task<br />
* dup = number of times a student can do the same task<br />
<br />
You can find descriptions of some of the mentors [[List_of_Apertium_mentors | here]].<br />
<br />
== Task ideas ==<br />
<br />
'''The current task ideas here are for 2019.''' See [[Talk:Task ideas for Google Code-in]] for task ideas from previous years.<br />
<br />
<table class="sortable wikitable"><br />
<tr><th>type</th><th>title</th><th>description</th><th>tags</th><th>mentors</th><th>bgnr?</th><th>multi?</th><th>duplicates</th></tr><br />
<br />
{{Taskidea<br />
|type=research, quality, documentation<br />
|title=Adopt a Wiki page<br />
|description=Request an Apertium wiki account and adopt a wiki page by updating and fixing any issues with it. Examples of things to update might be documentation that still refers to our SVN repo (we're on GitHub now), documentation of new features, clarification of unclear things, indicating that a page no longer reflects how things are done, or updating documentation to reflect the current options and defaults of various tools.<br />
|tags=wiki<br />
|mentors=*<br />
|multi=150<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research, code<br />
|title=expand coverage of Kyrgyz to English structural transfer<br />
|description=Find a sentence in Kyrgyz that once the lexical items are added to the bilingual dictionary is not fully (or correctly) parsed by the <tt>kir-eng-transfer</tt> Apertium mode. Determine what rule(s) need(s) to be added (or fixed) to cover this structure, and update <tt>apertium-eng-kir.kir-eng.rtx</tt> accordingly. You will first want to clone and compile [https://github.com/apertium/apertium-eng-kir apertium-eng-kir].<br />
|tags=Kyrgyz, English, recursive transfer, pairs<br />
|mentors=JNW, popcorndude<br />
|multi=150<br />
|dup=10<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|title=Add recursive transfer support to a language pair that doesn't support it<br />
|description=Make a branch of an Apertium language pair that doesn't support recursive transfer and call it "recursive transfer". Add vanilla <tt>.rtx</tt> files for both directions, and modify <tt>Makefile.am</tt> and <tt>modes.xml</tt> so that the branch compiles and runs. See [http://wiki.apertium.org/wiki/Apertium-recursive#Incorporating_Into_a_Pair this page] for instructions on how to do this.<br />
|tags=recursive transfer, pairs<br />
|mentors=JNW, popcorndude<br />
|multi=150<br />
|dup=10<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|title=Add 2 recursive transfer rules to a language pair<br />
|description=Add two recursive transfer rules to an Apertium language pair. These rules consist of, at minimum, a syntactic pattern to match, a phrase to combine them into, and an output pattern ([http://wiki.apertium.org/wiki/Apertium-recursive#Further_Documentation more documentation here]). If the language pair does not support recursive transfer, make sure [http://wiki.apertium.org/wiki/Apertium-recursive#Incorporating_Into_a_Pair to set it up] first.<br />
|tags=recursive transfer, pairs<br />
|mentors=JNW, popcorndude<br />
|multi=150<br />
|dup=20<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW<br />
|title=Use apertium-init to bootstrap a new language pair<br />
|description=Use the [[Apertium-init]] script to bootstrap a new translation pair between two languages which have monolingual modules already in Apertium. To see if a translation pair has already been made, search our repositories on [https://github.com/apertium/ github], and especially ask on IRC. Add 100 common stems to the dictionary. Your submission should be in the form of a repository on github that we can fork to the Apertium organisation.<br />
|tags=languages, bootstrap, dictionaries, translators<br />
|beginner=yes<br />
|multi=25<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW<br />
|title=Use apertium-init to bootstrap a new language module<br />
|description=Use the [[Apertium-init]] script to bootstrap a new language module that doesn't currently exist in Apertium. To see if a language is available, search our repositories on [https://github.com/apertium/ github], and especially ask on IRC. Add enough stems and morphology to the module so that it analyses and generates at least 100 correct forms. Your submission should be in the form of a repository on github that we can fork to the Apertium organisation. [[Task ideas for Google Code-in/Add words from frequency list|Read more about adding stems...]]<br />
|tags=languages, bootstrap, dictionaries<br />
|beginner=yes<br />
|multi=25<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW, sevilay, Unhammer<br />
|title=Write 10 lexical selection for an existing translation pair<br />
|description=Add 10 lexical selection rules to an existing translation pair. Submit your work as a github pull request to that pair. [[Task ideas for Google Code-in/Add lexical-select rules|Read more...]]<br />
|tags=languages, bootstrap, lexical selection, translators<br />
|multi=25<br />
|dup=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW, Unhammer<br />
|title=Write 10 constraint grammar rules for an existing language module<br />
|description=Add 10 constraint grammar rules to an existing language module for a language that you know. Submit your work as a github pull request to that pair. [[Task ideas for Google Code-in/Add constraint-grammar rules|Read more...]]<br />
|tags=languages, bootstrap, constraint grammar<br />
|multi=25<br />
|dup=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=Research<br />
|mentors=anakuz, fotonzade<br />
|title=Syntactic annotation of text<br />
|description=Pick a text of about 200 words and make a syntactic annotation for it according to the Universal Dependencies treebank. UD Annotatrix can be used for visualisation. Consult with your mentor about the language.<br />
|tags=UD, trees, annotation<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=JNW,ftyers, fotonzade<br />
|title=Create a UD-Apertium morphology mapping<br />
|description=Choose a language that has a Universal Dependencies treebank and tabulate a potential set of Apertium morph labels based on the (universal) UD morph labels. See Apertium's [[list of symbols]] and [http://universaldependencies.org/ UD]'s POS and feature tags for the labels.<br />
|tags=morphology, ud, dependencies<br />
|beginner=<br />
|multi=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=JNW, ftyers, fotonzade<br />
|title=Create an Apertium-UD morphology mapping<br />
|description=Choose a language that has an Apertium morphological analyser and adapt it to convert the morphology to UD morphology<br />
|tags=morphology, ud, dependencies<br />
|beginner=<br />
|multi=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|title=Install Apertium and verify that it works<br />
|description=See [[Installation]] for instructions and if you encounter any issues along the way, document them and/or improve the wiki instructions!<br />
|tags=bash<br />
|mentors=ftyers, JNW, Unhammer, anakuz, Josh, fotonzade, sevilay, eirien<br />
|multi=150<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|title=Write a contrastive grammar<br />
|description=Document 6 differences between two (preferably related) languages and where they would need to be addressed in the [[Apertium pipeline]] (morph analysis, transfer, etc). Use a grammar book/resource for inspiration. Each difference should have no fewer than 3 examples. Put your work on the Apertium wiki under [[Language1_and_Language2/Contrastive_grammar]]. See [[Farsi_and_English/Pending_tests]] for an example of a contrastive grammar that a previous GCI student made.<br />
|mentors=Mikel, JNW, Josh, xavivars, fotonzade, sevilay <br />
|tags=wiki, languages<br />
|beginner=yes<br />
|multi=40<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=Mikel, anakuz, xavivars, fotonzade, sevilay, Unhammer, eirien<br />
|tags=xml, dictionaries, svn<br />
|title=Add 200 new entries to the bilingual dictionary of language pair %AAA%-%BBB%<br />
|description=Our translation systems require large lexicons so as to provide production-quality coverage of any input data. This task requires the student to add 200 new words to a bidirectional dictionary. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Grow_bilingual Read more]...<br />
|multi=40<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=Mikel, anakuz, xavivars, fotonzade, ftyers, sevilay, eirien<br />
|tags=xml, dictionaries, svn<br />
|title=Add 500 new entries to the bilingual dictionary of language pair %AAA%-%BBB%<br />
|description=Our translation systems require large lexicons so as to provide production-quality coverage of any input data. This task requires the student to add 500 new words to a bidirectional dictionary. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Grow_bilingual Read more]...<br />
|dup=10<br />
|beginner=no<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=fotonzade, JNW, ftyers, anakuz, xavivars, Mikel, shardulc, sevilay, Unhammer<br />
|tags=xml, dictionaries, svn<br />
|title=Post-edit 100 sentences of any public domain text from %AAA% to %BBB%<br />
|description=Many of our systems benefit from statistical methods used with (ideally public domain) bilingual data. For this task, you need to translate a public domain text from %AAA% to %BBB% using any available machine translation system and clean up the translations yourself manually. Commit the post-edited texts (in plain text format) to an existing (via pull request) or if needed new github repository for the pair in dev/ or texts/ folder. The texts are subject to mentor approval.<br />
|multi=10<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=Mikel, anakuz, xavivars, fotonzade, sevilay<br />
|tags=disambiguation, svn<br />
|title=Disambiguate 500 tokens of text in %AAA%<br />
|description=Run some text through a morphological analyser and disambiguate the output. Discuss with the mentor beforehand to approve the choice of language and text. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Manually_disambiguate_text Read more]...<br />
|multi=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=eirien<br />
|tags=dictionaries<br />
|title=Categorise words from frequency list<br />
|description=Categorise words by frequency into one of the major part-of-speech categories. You will receive a frequency list. Work from top to bottom. At the beginning of each line you should put a letter which categorises the word form by its part-of-speech. For example n for noun, v for verb, etc. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Categorise_words_from_frequency_list Read more]...<br />
|multi=yes<br />
}}<br />
<br />
<br />
<br />
</table><br />
<br />
==Mentors==<br />
<br />
These are languages that can be substituted for AAA and/or BBB for tasks each mentor is listed to mentor above.<br />
<br />
{|class=wikitable<br />
! Mentor !! Languages<br />
|-<br />
| ftyers || eng, spa, cat, fra, nor, rus, por, swe, tur, gag, aze<br />
|-<br />
| JNW || eng, spa, fra, rus, tur, gag, aze, kaz, kir, kaa, tat, bak, kum, nog, kaa, uzb, uig, crh, khk, yid<br />
|-<br />
| anakuz || grn, spa, por, rus<br />
|-<br />
| fotonzade || eng, tur, aze, uig, tat, crh, kmr, ckb, fas<br />
|-<br />
| xavivars || cat, spa, eng, fra <br />
|-<br />
| unhammer || nno, nob, swe, dan, fao, sme, ovd<br />
|-<br />
| shardulc || eng, fra, mar, hin, urd, kan<br />
|-<br />
| m-alpha || eng, fra, byv<br />
|-<br />
| popcorndude || eng, spa<br />
|-<br />
|sevilay || tur, kaz, aze, tat, gag, eng<br />
|-<br />
| eirien || sah, rus, eng, kaz<br />
|}<br />
<br />
== Counts ==<br />
<br />
Last updated by [[User:Eirien|Eirien]] ([[User talk:Eirien|talk]]) 22:16, 24 October 2019 (UTC+3).<br />
<br />
{| class="sortable wikitable"<br />
|-<br />
! scope="col" | Category<br />
! scope="col" | Count<br />
|-<br />
|{{sc|code}}<br />
|align="right"|35<br />
|-<br />
|{{sc|documentation}}<br />
|align="right"|5<br />
|-<br />
|{{sc|research}}<br />
|align="right"|11<br />
|-<br />
|{{sc|quality}}<br />
|align="right"|10<br />
|-<br />
|{{sc|design}}<br />
|align="right"|5<br />
|- class="sortbottom"<br />
!Total<br />
|align="right"|54<br />
|}<br />
<br />
[[Category:Google Code-in]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Task_ideas_for_Google_Code-in&diff=70521
Task ideas for Google Code-in
2019-10-24T19:18:39Z
<p>Eirien: </p>
<hr />
<div><br />
{{TOCD}}<br />
This is the task ideas page for [https://developers.google.com/open-source/gci/ Google Code-in], here you can find ideas on interesting tasks that will improve your knowledge of Apertium and help you get into the world of open-source development.<br />
<br />
The people column lists people who you should get in contact with to request further information. All tasks are 2 hours maximum estimated amount of time that would be spent on the task by an experienced developer, however:<br />
<br />
<!--# '''this does not include time taken to [[Minimal installation from SVN|install]] / set up apertium (and relevant tools)'''.--><br />
# this is the time expected to take by an experienced developer, you may find that you spend more time on the task because of the learning curve. <br />
<br />
<!--Если ты не понимаешь английский язык или предпочитаешь работать над русским языком или другими языками России, смотри: [[Task ideas for Google Code-in/Russian]]--><br />
'''Categories:'''<br />
<br />
* {{sc|code}}: Tasks related to writing or refactoring code<br />
* {{sc|documentation}}: Tasks related to creating/editing documents and helping others learn more<br />
* {{sc|research}}: Tasks related to community management, outreach/marketting, or studying problems and recommending solutions<br />
* {{sc|quality}}: Tasks related to testing and ensuring code is of high quality.<br />
* {{sc|design}}: Tasks related to user experience research or user interface design and interaction<br />
<br />
'''Clarification of "multiple task" types'''<br />
* multi = number of students who can do a given task<br />
* dup = number of times a student can do the same task<br />
<br />
You can find descriptions of some of the mentors [[List_of_Apertium_mentors | here]].<br />
<br />
== Task ideas ==<br />
<br />
'''The current task ideas here are for 2019.''' See [[Talk:Task ideas for Google Code-in]] for task ideas from previous years.<br />
<br />
<table class="sortable wikitable"><br />
<tr><th>type</th><th>title</th><th>description</th><th>tags</th><th>mentors</th><th>bgnr?</th><th>multi?</th><th>duplicates</th></tr><br />
<br />
{{Taskidea<br />
|type=research, quality, documentation<br />
|title=Adopt a Wiki page<br />
|description=Request an Apertium wiki account and adopt a wiki page by updating and fixing any issues with it. Examples of things to update might be documentation that still refers to our SVN repo (we're on GitHub now), documentation of new features, clarification of unclear things, indicating that a page no longer reflects how things are done, or updating documentation to reflect the current options and defaults of various tools.<br />
|tags=wiki<br />
|mentors=*<br />
|multi=150<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research, code<br />
|title=expand coverage of Kyrgyz to English structural transfer<br />
|description=Find a sentence in Kyrgyz that once the lexical items are added to the bilingual dictionary is not fully (or correctly) parsed by the <tt>kir-eng-transfer</tt> Apertium mode. Determine what rule(s) need(s) to be added (or fixed) to cover this structure, and update <tt>apertium-eng-kir.kir-eng.rtx</tt> accordingly. You will first want to clone and compile [https://github.com/apertium/apertium-eng-kir apertium-eng-kir].<br />
|tags=Kyrgyz, English, recursive transfer, pairs<br />
|mentors=JNW, popcorndude<br />
|multi=150<br />
|dup=10<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|title=Add recursive transfer support to a language pair that doesn't support it<br />
|description=Make a branch of an Apertium language pair that doesn't support recursive transfer and call it "recursive transfer". Add vanilla <tt>.rtx</tt> files for both directions, and modify <tt>Makefile.am</tt> and <tt>modes.xml</tt> so that the branch compiles and runs. See [http://wiki.apertium.org/wiki/Apertium-recursive#Incorporating_Into_a_Pair this page] for instructions on how to do this.<br />
|tags=recursive transfer, pairs<br />
|mentors=JNW, popcorndude<br />
|multi=150<br />
|dup=10<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|title=Add 2 recursive transfer rules to a language pair<br />
|description=Add two recursive transfer rules to an Apertium language pair. These rules consist of, at minimum, a syntactic pattern to match, a phrase to combine them into, and an output pattern ([http://wiki.apertium.org/wiki/Apertium-recursive#Further_Documentation more documentation here]). If the language pair does not support recursive transfer, make sure [http://wiki.apertium.org/wiki/Apertium-recursive#Incorporating_Into_a_Pair to set it up] first.<br />
|tags=recursive transfer, pairs<br />
|mentors=JNW, popcorndude<br />
|multi=150<br />
|dup=20<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW<br />
|title=Use apertium-init to bootstrap a new language pair<br />
|description=Use the [[Apertium-init]] script to bootstrap a new translation pair between two languages which have monolingual modules already in Apertium. To see if a translation pair has already been made, search our repositories on [https://github.com/apertium/ github], and especially ask on IRC. Add 100 common stems to the dictionary. Your submission should be in the form of a repository on github that we can fork to the Apertium organisation.<br />
|tags=languages, bootstrap, dictionaries, translators<br />
|beginner=yes<br />
|multi=25<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW<br />
|title=Use apertium-init to bootstrap a new language module<br />
|description=Use the [[Apertium-init]] script to bootstrap a new language module that doesn't currently exist in Apertium. To see if a language is available, search our repositories on [https://github.com/apertium/ github], and especially ask on IRC. Add enough stems and morphology to the module so that it analyses and generates at least 100 correct forms. Your submission should be in the form of a repository on github that we can fork to the Apertium organisation. [[Task ideas for Google Code-in/Add words from frequency list|Read more about adding stems...]]<br />
|tags=languages, bootstrap, dictionaries<br />
|beginner=yes<br />
|multi=25<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW, sevilay, Unhammer<br />
|title=Write 10 lexical selection for an existing translation pair<br />
|description=Add 10 lexical selection rules to an existing translation pair. Submit your work as a github pull request to that pair. [[Task ideas for Google Code-in/Add lexical-select rules|Read more...]]<br />
|tags=languages, bootstrap, lexical selection, translators<br />
|multi=25<br />
|dup=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=code<br />
|mentors=JNW, Unhammer<br />
|title=Write 10 constraint grammar rules for an existing language module<br />
|description=Add 10 constraint grammar rules to an existing language module for a language that you know. Submit your work as a github pull request to that pair. [[Task ideas for Google Code-in/Add constraint-grammar rules|Read more...]]<br />
|tags=languages, bootstrap, constraint grammar<br />
|multi=25<br />
|dup=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=Research<br />
|mentors=anakuz, fotonzade<br />
|title=Syntactic annotation of text<br />
|description=Pick a text of about 200 words and make a syntactic annotation for it according to the Universal Dependencies treebank. UD Annotatrix can be used for visualisation. Consult with your mentor about the language.<br />
|tags=UD, trees, annotation<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=JNW,ftyers, fotonzade<br />
|title=Create a UD-Apertium morphology mapping<br />
|description=Choose a language that has a Universal Dependencies treebank and tabulate a potential set of Apertium morph labels based on the (universal) UD morph labels. See Apertium's [[list of symbols]] and [http://universaldependencies.org/ UD]'s POS and feature tags for the labels.<br />
|tags=morphology, ud, dependencies<br />
|beginner=<br />
|multi=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=JNW, ftyers, fotonzade<br />
|title=Create an Apertium-UD morphology mapping<br />
|description=Choose a language that has an Apertium morphological analyser and adapt it to convert the morphology to UD morphology<br />
|tags=morphology, ud, dependencies<br />
|beginner=<br />
|multi=5<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|title=Install Apertium and verify that it works<br />
|description=See [[Installation]] for instructions and if you encounter any issues along the way, document them and/or improve the wiki instructions!<br />
|tags=bash<br />
|mentors=ftyers, JNW, Unhammer, anakuz, Josh, fotonzade, sevilay, eirien<br />
|multi=150<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|title=Write a contrastive grammar<br />
|description=Document 6 differences between two (preferably related) languages and where they would need to be addressed in the [[Apertium pipeline]] (morph analysis, transfer, etc). Use a grammar book/resource for inspiration. Each difference should have no fewer than 3 examples. Put your work on the Apertium wiki under [[Language1_and_Language2/Contrastive_grammar]]. See [[Farsi_and_English/Pending_tests]] for an example of a contrastive grammar that a previous GCI student made.<br />
|mentors=Mikel, JNW, Josh, xavivars, fotonzade, sevilay <br />
|tags=wiki, languages<br />
|beginner=yes<br />
|multi=40<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=Mikel, anakuz, xavivars, fotonzade, sevilay, Unhammer, eirien<br />
|tags=xml, dictionaries, svn<br />
|title=Add 200 new entries to the bilingual dictionary of language pair %AAA%-%BBB%<br />
|description=Our translation systems require large lexicons so as to provide production-quality coverage of any input data. This task requires the student to add 200 new words to a bidirectional dictionary. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Grow_bilingual Read more]...<br />
|multi=40<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=Mikel, anakuz, xavivars, fotonzade, ftyers, sevilay, eirien<br />
|tags=xml, dictionaries, svn<br />
|title=Add 500 new entries to the bilingual dictionary of language pair %AAA%-%BBB%<br />
|description=Our translation systems require large lexicons so as to provide production-quality coverage of any input data. This task requires the student to add 500 new words to a bidirectional dictionary. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Grow_bilingual Read more]...<br />
|dup=10<br />
|beginner=no<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=fotonzade, JNW, ftyers, anakuz, xavivars, Mikel, shardulc, sevilay, Unhammer<br />
|tags=xml, dictionaries, svn<br />
|title=Post-edit 100 sentences of any public domain text from %AAA% to %BBB%<br />
|description=Many of our systems benefit from statistical methods used with (ideally public domain) bilingual data. For this task, you need to translate a public domain text from %AAA% to %BBB% using any available machine translation system and clean up the translations yourself manually. Commit the post-edited texts (in plain text format) to an existing (via pull request) or if needed new github repository for the pair in dev/ or texts/ folder. The texts are subject to mentor approval.<br />
|multi=10<br />
|beginner=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=quality<br />
|mentors=Mikel, anakuz, xavivars, fotonzade, sevilay<br />
|tags=disambiguation, svn<br />
|title=Disambiguate 500 tokens of text in %AAA%<br />
|description=Run some text through a morphological analyser and disambiguate the output. Discuss with the mentor beforehand to approve the choice of language and text. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Manually_disambiguate_text Read more]...<br />
|multi=yes<br />
}}<br />
<br />
{{Taskidea<br />
|type=research<br />
|mentors=eirien<br />
|tags=dictionaries<br />
|title=Categorise words from frequency list<br />
|description=Categorise words by frequency into one of the major part-of-speech categories. You will receive a frequency list. Work from top to bottom. At the beginning of each line you should put a letter which categorises the word form by its part-of-speech. For example n for noun, v for verb, etc. [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in/Categorise_words_from_frequency_list Read more]...<br />
|multi=yes<br />
}}<br />
<br />
<br />
<br />
</table><br />
<br />
==Mentors==<br />
<br />
These are languages that can be substituted for AAA and/or BBB for tasks each mentor is listed to mentor above.<br />
<br />
{|class=wikitable<br />
! Mentor !! Languages<br />
|-<br />
| ftyers || eng, spa, cat, fra, nor, rus, por, swe, tur, gag, aze<br />
|-<br />
| JNW || eng, spa, fra, rus, tur, gag, aze, kaz, kir, kaa, tat, bak, kum, nog, kaa, uzb, uig, crh, khk, yid<br />
|-<br />
| anakuz || grn, spa, por, rus<br />
|-<br />
| fotonzade || eng, tur, aze, uig, tat, crh, kmr, ckb, fas<br />
|-<br />
| xavivars || cat, spa, eng, fra <br />
|-<br />
| unhammer || nno, nob, swe, dan, fao, sme, ovd<br />
|-<br />
| shardulc || eng, fra, mar, hin, urd, kan<br />
|-<br />
| m-alpha || eng, fra, byv<br />
|-<br />
| popcorndude || eng, spa<br />
|-<br />
|sevilay || tur, kaz, aze, tat, gag, eng<br />
|-<br />
| eirien || sah, rus, eng, kaz<br />
|}<br />
<br />
== Counts ==<br />
<br />
Last updated by [[User:Eirien|Eirien]] ([[User talk:Eirien|talk]]) 22:16, 24 October 2019 (UTC+3).<br />
<br />
{| class="sortable wikitable"<br />
|-<br />
! scope="col" | Category<br />
! scope="col" | Count<br />
|-<br />
|{{sc|code}}<br />
|align="right"|35<br />
|-<br />
|{{sc|documentation}}<br />
|align="right"|5<br />
|-<br />
|{{sc|research}}<br />
|align="right"|11<br />
|-<br />
|{{sc|quality}}<br />
|align="right"|10<br />
|-<br />
|{{sc|design}}<br />
|align="right"|5<br />
|- class="sortbottom"<br />
!Total<br />
|align="right"|53<br />
|}<br />
<br />
[[Category:Google Code-in]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Kazakh_and_Sakha/GSoC2018_report&diff=67525
Kazakh and Sakha/GSoC2018 report
2018-08-14T07:41:48Z
<p>Eirien: /* References */</p>
<hr />
<div>This page serves as a summary of all the work done in the [[Kazakh and Sakha]] pair during Google Summer of Code 2018. The project consisted mainly of building a bilingual bidix and enriching the [[Sakha]] morphological analyzer.<br />
<br />
== Commits ==<br />
My commits can be found [https://apertium.projectjj.com/gsoc2018/eirien/eirien.html here]. You can also download my work as a [https://apertium.projectjj.com/gsoc2018/eirien.zip zip file].<br />
<br />
== Corpora and Coverage ==<br />
Our corpora were [https://dumps.wikimedia.org/kkwiki/20180501/kkwiki-20180501-pages-articles.xml.bz2 Kazakh Wikipedia] and [https://dumps.wikimedia.org/sahwiki/20180501/sahwiki-20180501-pages-articles.xml.bz2 Sakha Wikipedia] articles.<br />
<br />
Mostly work consisted of adding stems to dictionaries. Stems were added from frequency lists.<br />
<br />
30 April - 6 July<br />
<br />
Stems were added to the Kazakh-Sakha bilingual dictionary.<br />
<br />
7 July - 14 August<br />
<br />
We focused on the Sakha morphological analyser to achieve 90% coverage. Stems were added, grammatical reference[1] was reviewed and test files were created.<br />
<br />
===Translator coverage===<br />
<br />
{| class="wikitable"<br />
|-<br />
! Corpus<br />
! Words<br />
! Stems before<br />
! Stems after<br />
! Coverage before<br />
! Coverage after<br />
|-<br />
| Wikipedia<br />
| 10000<br />
| 150<br />
| 2870<br />
| 29.36%<br />
| 70.57%<br />
|}<br />
<br />
===Sakha morphological analyser coverage===<br />
<br />
{| class="wikitable"<br />
|-<br />
! Corpus<br />
! Words<br />
! Stems before<br />
! Stems after<br />
! Coverage before<br />
! Coverage after<br />
|-<br />
| Wikipedia<br />
| 95654<br />
| 4070<br />
| 9015<br />
| 73.16%<br />
| 88.75%<br />
|}<br />
<br />
==References==<br />
# Ubryatova, E.I. (Ed.) (1982), Grammatika sovremennogo yakutskogo literaturnogo yazika, Moscow. <br />
# [https://sakhatyla.ru/ SakhaTyla.Ru - Sakha Dictionary]<br />
# [https://sozdik.kz/ Казахско-русский словарь]<br />
# [https://glosbe.com/ Online dictionary]<br />
<br />
==Future work==<br />
* Add more stems to Sakha monolingual dictionary<br />
* Add more stems to Kazakh-Sakha bilingual dictionary<br />
* Add transfer rules, etc.</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Kazakh_and_Sakha/GSoC2018_report&diff=67524
Kazakh and Sakha/GSoC2018 report
2018-08-14T07:40:58Z
<p>Eirien: /* Corpora and Coverage */</p>
<hr />
<div>This page serves as a summary of all the work done in the [[Kazakh and Sakha]] pair during Google Summer of Code 2018. The project consisted mainly of building a bilingual bidix and enriching the [[Sakha]] morphological analyzer.<br />
<br />
== Commits ==<br />
My commits can be found [https://apertium.projectjj.com/gsoc2018/eirien/eirien.html here]. You can also download my work as a [https://apertium.projectjj.com/gsoc2018/eirien.zip zip file].<br />
<br />
== Corpora and Coverage ==<br />
Our corpora were [https://dumps.wikimedia.org/kkwiki/20180501/kkwiki-20180501-pages-articles.xml.bz2 Kazakh Wikipedia] and [https://dumps.wikimedia.org/sahwiki/20180501/sahwiki-20180501-pages-articles.xml.bz2 Sakha Wikipedia] articles.<br />
<br />
Mostly work consisted of adding stems to dictionaries. Stems were added from frequency lists.<br />
<br />
30 April - 6 July<br />
<br />
Stems were added to the Kazakh-Sakha bilingual dictionary.<br />
<br />
7 July - 14 August<br />
<br />
We focused on the Sakha morphological analyser to achieve 90% coverage. Stems were added, grammatical reference[1] was reviewed and test files were created.<br />
<br />
===Translator coverage===<br />
<br />
{| class="wikitable"<br />
|-<br />
! Corpus<br />
! Words<br />
! Stems before<br />
! Stems after<br />
! Coverage before<br />
! Coverage after<br />
|-<br />
| Wikipedia<br />
| 10000<br />
| 150<br />
| 2870<br />
| 29.36%<br />
| 70.57%<br />
|}<br />
<br />
===Sakha morphological analyser coverage===<br />
<br />
{| class="wikitable"<br />
|-<br />
! Corpus<br />
! Words<br />
! Stems before<br />
! Stems after<br />
! Coverage before<br />
! Coverage after<br />
|-<br />
| Wikipedia<br />
| 95654<br />
| 4070<br />
| 9015<br />
| 73.16%<br />
| 88.75%<br />
|}<br />
<br />
==References==<br />
* Ubryatova, E.I. (Ed.) (1982), Grammatika sovremennogo yakutskogo literaturnogo yazika, Moscow. <br />
* [https://sakhatyla.ru/ SakhaTyla.Ru - Sakha Dictionary]<br />
* [https://sozdik.kz/ Казахско-русский словарь]<br />
* [https://glosbe.com/ Online dictionary]<br />
<br />
==Future work==<br />
* Add more stems to Sakha monolingual dictionary<br />
* Add more stems to Kazakh-Sakha bilingual dictionary<br />
* Add transfer rules, etc.</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Kazakh_and_Sakha/GSoC2018_report&diff=67523
Kazakh and Sakha/GSoC2018 report
2018-08-14T07:40:23Z
<p>Eirien: /* Corpora and Coverage */</p>
<hr />
<div>This page serves as a summary of all the work done in the [[Kazakh and Sakha]] pair during Google Summer of Code 2018. The project consisted mainly of building a bilingual bidix and enriching the [[Sakha]] morphological analyzer.<br />
<br />
== Commits ==<br />
My commits can be found [https://apertium.projectjj.com/gsoc2018/eirien/eirien.html here]. You can also download my work as a [https://apertium.projectjj.com/gsoc2018/eirien.zip zip file].<br />
<br />
== Corpora and Coverage ==<br />
Our corpora were [https://dumps.wikimedia.org/kkwiki/20180501/kkwiki-20180501-pages-articles.xml.bz2 Kazakh Wikipedia] and [https://dumps.wikimedia.org/sahwiki/20180501/sahwiki-20180501-pages-articles.xml.bz2 Sakha Wikipedia] articles.<br />
<br />
Mostly work consisted of adding stems to dictionaries. Stems were added from frequency lists.<br />
<br />
30 April - 6 July<br />
<br />
Stems were added to Kazakh-Sakha bilingual dictionary.<br />
<br />
7 July - 14 August<br />
<br />
We focused on the Sakha morphological analyser to achieve 90% coverage. Stems were added, grammatical reference[1] was reviewed and test files were created.<br />
<br />
===Translator coverage===<br />
<br />
{| class="wikitable"<br />
|-<br />
! Corpus<br />
! Words<br />
! Stems before<br />
! Stems after<br />
! Coverage before<br />
! Coverage after<br />
|-<br />
| Wikipedia<br />
| 10000<br />
| 150<br />
| 2870<br />
| 29.36%<br />
| 70.57%<br />
|}<br />
<br />
===Sakha morphological analyser coverage===<br />
<br />
{| class="wikitable"<br />
|-<br />
! Corpus<br />
! Words<br />
! Stems before<br />
! Stems after<br />
! Coverage before<br />
! Coverage after<br />
|-<br />
| Wikipedia<br />
| 95654<br />
| 4070<br />
| 9015<br />
| 73.16%<br />
| 88.75%<br />
|}<br />
<br />
==References==<br />
* Ubryatova, E.I. (Ed.) (1982), Grammatika sovremennogo yakutskogo literaturnogo yazika, Moscow. <br />
* [https://sakhatyla.ru/ SakhaTyla.Ru - Sakha Dictionary]<br />
* [https://sozdik.kz/ Казахско-русский словарь]<br />
* [https://glosbe.com/ Online dictionary]<br />
<br />
==Future work==<br />
* Add more stems to Sakha monolingual dictionary<br />
* Add more stems to Kazakh-Sakha bilingual dictionary<br />
* Add transfer rules, etc.</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Kazakh_and_Sakha/GSoC2018_report&diff=67522
Kazakh and Sakha/GSoC2018 report
2018-08-14T07:39:21Z
<p>Eirien: /* Corpora and Coverage */</p>
<hr />
<div>This page serves as a summary of all the work done in the [[Kazakh and Sakha]] pair during Google Summer of Code 2018. The project consisted mainly of building a bilingual bidix and enriching the [[Sakha]] morphological analyzer.<br />
<br />
== Commits ==<br />
My commits can be found [https://apertium.projectjj.com/gsoc2018/eirien/eirien.html here]. You can also download my work as a [https://apertium.projectjj.com/gsoc2018/eirien.zip zip file].<br />
<br />
== Corpora and Coverage ==<br />
Our corpora were [https://dumps.wikimedia.org/kkwiki/20180501/kkwiki-20180501-pages-articles.xml.bz2 Kazakh Wikipedia] and [https://dumps.wikimedia.org/sahwiki/20180501/sahwiki-20180501-pages-articles.xml.bz2 Sakha Wikipedia] articles.<br />
<br />
Mostly work consisted of adding stems to dictionaries. Stems were added from frequency lists.<br />
<br />
30 April - 6 July<br />
<br />
Stems were added to Kazakh-Sakha bilingual dictionary.<br />
<br />
7 July - 14 August<br />
<br />
We focused on the Sakha morphological analyser to achieve 90% coverage. Stems were added, grammatical reference was reviewed and test files were created.<br />
<br />
===Translator coverage===<br />
<br />
{| class="wikitable"<br />
|-<br />
! Corpus<br />
! Words<br />
! Stems before<br />
! Stems after<br />
! Coverage before<br />
! Coverage after<br />
|-<br />
| Wikipedia<br />
| 10000<br />
| 150<br />
| 2870<br />
| 29.36%<br />
| 70.57%<br />
|}<br />
<br />
===Sakha morphological analyser coverage===<br />
<br />
{| class="wikitable"<br />
|-<br />
! Corpus<br />
! Words<br />
! Stems before<br />
! Stems after<br />
! Coverage before<br />
! Coverage after<br />
|-<br />
| Wikipedia<br />
| 95654<br />
| 4070<br />
| 9015<br />
| 73.16%<br />
| 88.75%<br />
|}<br />
<br />
==References==<br />
* Ubryatova, E.I. (Ed.) (1982), Grammatika sovremennogo yakutskogo literaturnogo yazika, Moscow. <br />
* [https://sakhatyla.ru/ SakhaTyla.Ru - Sakha Dictionary]<br />
* [https://sozdik.kz/ Казахско-русский словарь]<br />
* [https://glosbe.com/ Online dictionary]<br />
<br />
==Future work==<br />
* Add more stems to Sakha monolingual dictionary<br />
* Add more stems to Kazakh-Sakha bilingual dictionary<br />
* Add transfer rules, etc.</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Kazakh_and_Sakha/GSoC2018_report&diff=67512
Kazakh and Sakha/GSoC2018 report
2018-08-13T18:24:27Z
<p>Eirien: Created page with "This page serves as a summary of all the work done in the Kazakh and Sakha pair during Google Summer of Code 2018. The project consisted mainly of building a bilingual bid..."</p>
<hr />
<div>This page serves as a summary of all the work done in the [[Kazakh and Sakha]] pair during Google Summer of Code 2018. The project consisted mainly of building a bilingual bidix and enriching the [[Sakha]] morphological analyzer.<br />
<br />
== Commits ==<br />
My commits can be found [https://apertium.projectjj.com/gsoc2018/eirien/eirien.html here]. You can also download my work as a [https://apertium.projectjj.com/gsoc2018/eirien.zip zip file].<br />
<br />
== Corpora and Coverage ==<br />
Our corpora were [https://dumps.wikimedia.org/kkwiki/20180501/kkwiki-20180501-pages-articles.xml.bz2 Kazakh Wikipedia] and [https://dumps.wikimedia.org/sahwiki/20180501/sahwiki-20180501-pages-articles.xml.bz2 Sakha Wikipedia] articles.<br />
<br />
===Translator coverage===<br />
<br />
{| class="wikitable"<br />
|-<br />
! Corpus<br />
! Words<br />
! Stems before<br />
! Stems after<br />
! Coverage before<br />
! Coverage after<br />
|-<br />
| Wikipedia<br />
| 10000<br />
| 150<br />
| 2870<br />
| 29.36%<br />
| 70.57%<br />
|}<br />
<br />
===Sakha morphological analyser coverage===<br />
<br />
{| class="wikitable"<br />
|-<br />
! Corpus<br />
! Words<br />
! Stems before<br />
! Stems after<br />
! Coverage before<br />
! Coverage after<br />
|-<br />
| Wikipedia<br />
| 95654<br />
| 4070<br />
| 9015<br />
| 73.16%<br />
| 88.75%<br />
|}<br />
<br />
Mostly work consisted of adding stems to dictionaries. Stems were added from frequency lists.<br />
<br />
==References==<br />
* Ubryatova, E.I. (Ed.) (1982), Grammatika sovremennogo yakutskogo literaturnogo yazika, Moscow. <br />
* [https://sakhatyla.ru/ SakhaTyla.Ru - Sakha Dictionary]<br />
* [https://sozdik.kz/ Казахско-русский словарь]<br />
* [https://glosbe.com/ Online dictionary]<br />
<br />
==Future work==<br />
* Add more stems to Sakha monolingual dictionary<br />
* Add more stems to Kazakh-Sakha bilingual dictionary<br />
* Add transfer rules, etc.</div>
Eirien
https://wiki.apertium.org/w/index.php?title=User:Eirien/GSoC2018_Workplan&diff=67511
User:Eirien/GSoC2018 Workplan
2018-08-13T14:44:08Z
<p>Eirien: /* Workplan */</p>
<hr />
<div>This is a work plan for [[Kazakh and Sakha]] translation pair development during [[Google Summer of Code]] 2018.<br />
<br />
=== Workplan ===<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 35%" | Goals<br />
! style="width: 10%" | Bidix<br />
! style="width: 10%" | WER / PER<br />
! style="width: 10%" | Translator coverage<br />
! style="width: 10%" | Sakha morphological analyser coverage<br />
! style="width: 10%" | Achieved translator<br />
! style="width: 10%" | Achieved mono.<br />
|-<br />
! 1<br />
| style="text-align:center" | 30 April - 6 May<br />
|<br />
* Discussing detailed work plan with mentors<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~150<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~30%<br />
| style="text-align:center" | ~73%<br />
| style="text-align:center" | 29.36<br />
| style="text-align:center" | 73.16<br />
|-<br />
! 2<br />
| style="text-align:center" | 7 May - 13 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~40%<br />
| style="text-align:center" | ~74.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 3<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~50%<br />
| style="text-align:center" | ~75.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 75.64<br />
|-<br />
! 4<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~60%<br />
| style="text-align:center" | ~76.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 76.89<br />
|-<br />
! 5<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~70%<br />
| style="text-align:center" | ~77.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 77.64<br />
|-<br />
! 6<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new stems to dictionaries<br />
* Begin analysis of translation error patterns to prepare rule priority list<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~75%<br />
| style="text-align:center" | ~79%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 78.24<br />
|-<br />
! 7<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new stems to dictionaries<br />
'''Deliverable #1'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~80%<br />
| style="text-align:center" | ~80.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 79.11<br />
|-<br />
! 8<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ~81.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 9<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Finish list of necessary rules by frequency<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~82.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 80.73<br />
|-<br />
! 10<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~83.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 83.4<br />
|-<br />
! 11<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
'''Deliverable #2'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 86.09<br />
|-<br />
! 12<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~86.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 86.65<br />
|-<br />
! 13<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~87.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 87.00<br />
|-<br />
! 14<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~88.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 87.36<br />
|-<br />
! 15<br />
| style="text-align:center" | 6 August - 12 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~89.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 88.75<br />
|-<br />
! 16<br />
| style="text-align:center" | 13 August - 19 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
'''Final evaluation'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~91%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|}</div>
Eirien
https://wiki.apertium.org/w/index.php?title=User:Eirien&diff=67379
User:Eirien
2018-08-07T12:59:02Z
<p>Eirien: </p>
<hr />
<div>My name is Sardana Ivanova. I contribute to [[Apertium-kaz-sah]] and [[Apertium-sah]].<br />
<br />
==Contact Info==<br />
<br />
'''E-mail:''' i.sardana.n@gmail.com<br />
<br />
'''IRC:''' Eirien<br />
<br />
'''Github:''' https://github.com/varie<br />
<br />
'''Timezone:''' UTC/GMT+3<br />
<br />
[[Category:Users|Eirien]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=User:Eirien/GSoC2018_Workplan&diff=67378
User:Eirien/GSoC2018 Workplan
2018-08-07T11:32:30Z
<p>Eirien: /* Workplan */</p>
<hr />
<div>This is a work plan for [[Kazakh and Sakha]] translation pair development during [[Google Summer of Code]] 2018.<br />
<br />
=== Workplan ===<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 35%" | Goals<br />
! style="width: 10%" | Bidix<br />
! style="width: 10%" | WER / PER<br />
! style="width: 10%" | Translator coverage<br />
! style="width: 10%" | Sakha morphological analyser coverage<br />
! style="width: 10%" | Achieved translator<br />
! style="width: 10%" | Achieved mono.<br />
|-<br />
! 1<br />
| style="text-align:center" | 30 April - 6 May<br />
|<br />
* Discussing detailed work plan with mentors<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~150<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~30%<br />
| style="text-align:center" | ~73%<br />
| style="text-align:center" | 29.36<br />
| style="text-align:center" | 73.16<br />
|-<br />
! 2<br />
| style="text-align:center" | 7 May - 13 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~40%<br />
| style="text-align:center" | ~74.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 3<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~50%<br />
| style="text-align:center" | ~75.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 75.64<br />
|-<br />
! 4<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~60%<br />
| style="text-align:center" | ~76.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 76.89<br />
|-<br />
! 5<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~70%<br />
| style="text-align:center" | ~77.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 77.64<br />
|-<br />
! 6<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new stems to dictionaries<br />
* Begin analysis of translation error patterns to prepare rule priority list<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~75%<br />
| style="text-align:center" | ~79%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 78.24<br />
|-<br />
! 7<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new stems to dictionaries<br />
'''Deliverable #1'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~80%<br />
| style="text-align:center" | ~80.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 79.11<br />
|-<br />
! 8<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ~81.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 9<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Finish list of necessary rules by frequency<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~82.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 80.73<br />
|-<br />
! 10<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~83.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 83.4<br />
|-<br />
! 11<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
'''Deliverable #2'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 86.09<br />
|-<br />
! 12<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~86.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 86.65<br />
|-<br />
! 13<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~87.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 87.00<br />
|-<br />
! 14<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~88.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 87.36<br />
|-<br />
! 15<br />
| style="text-align:center" | 6 August - 12 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~89.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 16<br />
| style="text-align:center" | 13 August - 19 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
'''Final evaluation'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~91%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|}</div>
Eirien
https://wiki.apertium.org/w/index.php?title=User:Eirien/GSoC2018_Workplan&diff=67368
User:Eirien/GSoC2018 Workplan
2018-08-01T18:10:19Z
<p>Eirien: /* Workplan */</p>
<hr />
<div>This is a work plan for [[Kazakh and Sakha]] translation pair development during [[Google Summer of Code]] 2018.<br />
<br />
=== Workplan ===<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 35%" | Goals<br />
! style="width: 10%" | Bidix<br />
! style="width: 10%" | WER / PER<br />
! style="width: 10%" | Translator coverage<br />
! style="width: 10%" | Sakha morphological analyser coverage<br />
! style="width: 10%" | Achieved translator<br />
! style="width: 10%" | Achieved mono.<br />
|-<br />
! 1<br />
| style="text-align:center" | 30 April - 6 May<br />
|<br />
* Discussing detailed work plan with mentors<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~150<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~30%<br />
| style="text-align:center" | ~73%<br />
| style="text-align:center" | 29.36<br />
| style="text-align:center" | 73.16<br />
|-<br />
! 2<br />
| style="text-align:center" | 7 May - 13 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~40%<br />
| style="text-align:center" | ~74.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 3<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~50%<br />
| style="text-align:center" | ~75.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 75.64<br />
|-<br />
! 4<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~60%<br />
| style="text-align:center" | ~76.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 76.89<br />
|-<br />
! 5<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~70%<br />
| style="text-align:center" | ~77.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 77.64<br />
|-<br />
! 6<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new stems to dictionaries<br />
* Begin analysis of translation error patterns to prepare rule priority list<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~75%<br />
| style="text-align:center" | ~79%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 78.24<br />
|-<br />
! 7<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new stems to dictionaries<br />
'''Deliverable #1'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~80%<br />
| style="text-align:center" | ~80.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 79.11<br />
|-<br />
! 8<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ~81.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 9<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Finish list of necessary rules by frequency<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~82.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 80.73<br />
|-<br />
! 10<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~83.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 83.4<br />
|-<br />
! 11<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
'''Deliverable #2'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 86.09<br />
|-<br />
! 12<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~86.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 86.65<br />
|-<br />
! 13<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~87.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 87.00<br />
|-<br />
! 14<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~88.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 15<br />
| style="text-align:center" | 6 August - 12 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~89.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 16<br />
| style="text-align:center" | 13 August - 19 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
'''Final evaluation'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~91%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|}</div>
Eirien
https://wiki.apertium.org/w/index.php?title=User:Eirien/GSoC2018_Workplan&diff=67343
User:Eirien/GSoC2018 Workplan
2018-07-23T14:04:14Z
<p>Eirien: /* Workplan */</p>
<hr />
<div>This is a work plan for [[Kazakh and Sakha]] translation pair development during [[Google Summer of Code]] 2018.<br />
<br />
=== Workplan ===<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 35%" | Goals<br />
! style="width: 10%" | Bidix<br />
! style="width: 10%" | WER / PER<br />
! style="width: 10%" | Translator coverage<br />
! style="width: 10%" | Sakha morphological analyser coverage<br />
! style="width: 10%" | Achieved translator<br />
! style="width: 10%" | Achieved mono.<br />
|-<br />
! 1<br />
| style="text-align:center" | 30 April - 6 May<br />
|<br />
* Discussing detailed work plan with mentors<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~150<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~30%<br />
| style="text-align:center" | ~73%<br />
| style="text-align:center" | 29.36<br />
| style="text-align:center" | 73.16<br />
|-<br />
! 2<br />
| style="text-align:center" | 7 May - 13 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~40%<br />
| style="text-align:center" | ~74.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 3<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~50%<br />
| style="text-align:center" | ~75.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 75.64<br />
|-<br />
! 4<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~60%<br />
| style="text-align:center" | ~76.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 76.89<br />
|-<br />
! 5<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~70%<br />
| style="text-align:center" | ~77.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 77.64<br />
|-<br />
! 6<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new stems to dictionaries<br />
* Begin analysis of translation error patterns to prepare rule priority list<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~75%<br />
| style="text-align:center" | ~79%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 78.24<br />
|-<br />
! 7<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new stems to dictionaries<br />
'''Deliverable #1'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~80%<br />
| style="text-align:center" | ~80.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 79.11<br />
|-<br />
! 8<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ~81.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 9<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Finish list of necessary rules by frequency<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~82.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 80.73<br />
|-<br />
! 10<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~83.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 83.4<br />
|-<br />
! 11<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
'''Deliverable #2'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 86.09<br />
|-<br />
! 12<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~86.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 86.65<br />
|-<br />
! 13<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~87.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 14<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~88.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 15<br />
| style="text-align:center" | 6 August - 12 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~89.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 16<br />
| style="text-align:center" | 13 August - 19 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
'''Final evaluation'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~91%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|}</div>
Eirien
https://wiki.apertium.org/w/index.php?title=User:Eirien/GSoC2018_Workplan&diff=67324
User:Eirien/GSoC2018 Workplan
2018-07-16T17:59:24Z
<p>Eirien: /* Workplan */</p>
<hr />
<div>This is a work plan for [[Kazakh and Sakha]] translation pair development during [[Google Summer of Code]] 2018.<br />
<br />
=== Workplan ===<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 35%" | Goals<br />
! style="width: 10%" | Bidix<br />
! style="width: 10%" | WER / PER<br />
! style="width: 10%" | Translator coverage<br />
! style="width: 10%" | Sakha morphological analyser coverage<br />
! style="width: 10%" | Achieved translator<br />
! style="width: 10%" | Achieved mono.<br />
|-<br />
! 1<br />
| style="text-align:center" | 30 April - 6 May<br />
|<br />
* Discussing detailed work plan with mentors<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~150<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~30%<br />
| style="text-align:center" | ~73%<br />
| style="text-align:center" | 29.36<br />
| style="text-align:center" | 73.16<br />
|-<br />
! 2<br />
| style="text-align:center" | 7 May - 13 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~40%<br />
| style="text-align:center" | ~74.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 3<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~50%<br />
| style="text-align:center" | ~75.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 75.64<br />
|-<br />
! 4<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~60%<br />
| style="text-align:center" | ~76.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 76.89<br />
|-<br />
! 5<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~70%<br />
| style="text-align:center" | ~77.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 77.64<br />
|-<br />
! 6<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new stems to dictionaries<br />
* Begin analysis of translation error patterns to prepare rule priority list<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~75%<br />
| style="text-align:center" | ~79%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 78.24<br />
|-<br />
! 7<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new stems to dictionaries<br />
'''Deliverable #1'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~80%<br />
| style="text-align:center" | ~80.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 79.11<br />
|-<br />
! 8<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ~81.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 9<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Finish list of necessary rules by frequency<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~82.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 80.73<br />
|-<br />
! 10<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~83.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 83.4<br />
|-<br />
! 11<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
'''Deliverable #2'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 86.09<br />
|-<br />
! 12<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~86.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 13<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~87.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 14<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~88.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 15<br />
| style="text-align:center" | 6 August - 12 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~89.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 16<br />
| style="text-align:center" | 13 August - 19 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
'''Final evaluation'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~91%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|}</div>
Eirien
https://wiki.apertium.org/w/index.php?title=User:Eirien/GSoC2018_Workplan&diff=67310
User:Eirien/GSoC2018 Workplan
2018-07-09T19:09:14Z
<p>Eirien: /* Workplan */</p>
<hr />
<div>This is a work plan for [[Kazakh and Sakha]] translation pair development during [[Google Summer of Code]] 2018.<br />
<br />
=== Workplan ===<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 35%" | Goals<br />
! style="width: 10%" | Bidix<br />
! style="width: 10%" | WER / PER<br />
! style="width: 10%" | Translator coverage<br />
! style="width: 10%" | Sakha morphological analyser coverage<br />
! style="width: 10%" | Achieved translator<br />
! style="width: 10%" | Achieved mono.<br />
|-<br />
! 1<br />
| style="text-align:center" | 30 April - 6 May<br />
|<br />
* Discussing detailed work plan with mentors<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~150<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~30%<br />
| style="text-align:center" | ~73%<br />
| style="text-align:center" | 29.36<br />
| style="text-align:center" | 73.16<br />
|-<br />
! 2<br />
| style="text-align:center" | 7 May - 13 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~40%<br />
| style="text-align:center" | ~74.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 3<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~50%<br />
| style="text-align:center" | ~75.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 75.64<br />
|-<br />
! 4<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~60%<br />
| style="text-align:center" | ~76.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 76.89<br />
|-<br />
! 5<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~70%<br />
| style="text-align:center" | ~77.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 77.64<br />
|-<br />
! 6<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new stems to dictionaries<br />
* Begin analysis of translation error patterns to prepare rule priority list<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~75%<br />
| style="text-align:center" | ~79%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 78.24<br />
|-<br />
! 7<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new stems to dictionaries<br />
'''Deliverable #1'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~80%<br />
| style="text-align:center" | ~80.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 79.11<br />
|-<br />
! 8<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ~81.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 9<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Finish list of necessary rules by frequency<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~82.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 80.73<br />
|-<br />
! 10<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~83.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 83.4<br />
|-<br />
! 11<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
'''Deliverable #2'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 12<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~86.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 13<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~87.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 14<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~88.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 15<br />
| style="text-align:center" | 6 August - 12 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~89.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 16<br />
| style="text-align:center" | 13 August - 19 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
'''Final evaluation'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~91%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|}</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Apertium-sah&diff=67302
Apertium-sah
2018-07-06T14:45:22Z
<p>Eirien: /* To do */</p>
<hr />
<div>==Current status==<br />
<br />
{{LangStats | lang = sah | corpus1 = wikipedia | corpus2 = kyym | corpus3 = udhr }}<br />
<br />
==To do==<br />
<br />
1. <br />
<br />
<pre>^сөбүлиир/сөбүлээ<v><tv><prc_aor>/сөбүлээ<v><tv><gpr_aor>/сөбүлээ<v><tv><ger_aor><nom>/сөбүлээ<v><tv><aor><p3><sg>$</pre><br />
<br />
should be: сөбүлээ - cөбүл'''үү'''р, үҥкүлээ - үҥкүл'''үү'''р, көрүлээ - көрүл'''үү'''р, ...<br />
<br />
үөскээ - үөск'''үү'''р<br />
<br />
2.<br />
<br />
уһун<v><iv><aor><p3><sg> - уһунар; should be: устар<br />
<br />
таҕыс<v><iv><aor><p3><sg> - таҕыһар; should be: тахсар<br />
<br />
сырыт<v><iv><aor><p3><sg> - сырытар; should be: сылдьар<br />
<br />
3. Contracted forms<br />
<br />
нэһилиэгигэр -> нэһилиэгэр<br />
<br />
олоҕугар -> олоҕор<br />
<br />
түмүгүгэр -> түмүгэр<br />
<br />
өрүтүгэр -> өттүгэр<br />
<br />
үөрэҕигэр -> үөрэҕэр<br />
<br />
суругугар -> суругар<br />
<br />
атаҕыгар -> атаҕар<br />
<br />
бөлөҕүгэр -> бөлөҕөр<br />
<br />
каталогыгар -> каталогар<br />
<br />
4.<br />
<br />
дьахтар<n><pl><nom> дьахта'''рд'''ар<br />
<br />
should be: дьахта'''лл'''ар<br />
<br />
5. where to add частицы: <br />
* эбит, илик<br />
<br />
Оҥоро иликпин "Я ещё не сделал";<br />
<br />
Оҥоро иликкин "Ты ещё не сделал";<br />
<br />
Оҥоро иликпит "Мы ещё не сделали";<br />
<br />
Кэлбит эбиттэр "Они, оказывается, пришли"<br />
<br />
* соҕус<br />
<br />
ичигэс соҕус тепловатый; аҕыйах соҕус довольно мало, маловато; элбэх соҕус довольно много;<br />
<br />
6.<br />
<br />
спорт - N5 спортын - N1 спортун - спордун - which form is correct?<br />
<br />
торт - N5 тортын - N1 тортун - тордун - which form is correct?<br />
<br />
7. where to add?<br />
<br />
оннооҕор - модальное сл. даже; тем более; оннооҕор оҕолор чуумпурбуттар даже дети присмирели;<br />
<br />
бастатан - модальное сл. прежде всего, в первую очередь; бастатан, тэрээһин үлэ туһунан прежде всего об организационной работе<br />
<br />
8.<br />
<br />
сорох:сорох DET-IND ; ! "некоторый"<br />
<br />
сорохтор, сорохторо<br />
<br />
9. ger plus dat<br />
<br />
оҥорорго, барарга<br />
<br />
10. буолуо, буолуон<br />
<br />
11. буоллар<br />
<br />
[[Category:Sakha]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Apertium-sah&diff=67289
Apertium-sah
2018-07-05T08:52:10Z
<p>Eirien: /* To do */</p>
<hr />
<div>==Current status==<br />
<br />
{{LangStats | lang = sah | corpus1 = wikipedia | corpus2 = kyym | corpus3 = udhr }}<br />
<br />
==To do==<br />
<br />
1. <br />
<br />
<pre>^сөбүлиир/сөбүлээ<v><tv><prc_aor>/сөбүлээ<v><tv><gpr_aor>/сөбүлээ<v><tv><ger_aor><nom>/сөбүлээ<v><tv><aor><p3><sg>$</pre><br />
<br />
should be: сөбүлээ - cөбүл'''үү'''р, үҥкүлээ - үҥкүл'''үү'''р, көрүлээ - көрүл'''үү'''р, ...<br />
<br />
үөскээ - үөск'''үү'''р<br />
<br />
2.<br />
<br />
уһун<v><iv><aor><p3><sg> - уһунар; should be: устар<br />
<br />
таҕыс<v><iv><aor><p3><sg> - таҕыһар; should be: тахсар<br />
<br />
сырыт<v><iv><aor><p3><sg> - сырытар; should be: сылдьар<br />
<br />
3. Contracted forms<br />
<br />
нэһилиэгигэр -> нэһилиэгэр<br />
<br />
олоҕугар -> олоҕор<br />
<br />
түмүгүгэр -> түмүгэр<br />
<br />
өрүтүгэр -> өттүгэр<br />
<br />
үөрэҕигэр -> үөрэҕэр<br />
<br />
суругугар -> суругар<br />
<br />
атаҕыгар -> атаҕар<br />
<br />
бөлөҕүгэр -> бөлөҕөр<br />
<br />
каталогыгар -> каталогар<br />
<br />
4.<br />
<br />
дьахтар<n><pl><nom> дьахта'''рд'''ар<br />
<br />
should be: дьахта'''лл'''ар<br />
<br />
5. where to add частицы: <br />
* эбит, илик<br />
<br />
Оҥоро иликпин "Я ещё не сделал";<br />
<br />
Оҥоро иликкин "Ты ещё не сделал";<br />
<br />
Оҥоро иликпит "Мы ещё не сделали";<br />
<br />
Кэлбит эбиттэр "Они, оказывается, пришли"<br />
<br />
* соҕус<br />
<br />
ичигэс соҕус тепловатый; аҕыйах соҕус довольно мало, маловато; элбэх соҕус довольно много;<br />
<br />
6.<br />
<br />
спорт - N5 спортын - N1 спортун - спордун - which form is correct?<br />
<br />
торт - N5 тортын - N1 тортун - тордун - which form is correct?<br />
<br />
7. where to add?<br />
<br />
оннооҕор - модальное сл. даже; тем более; оннооҕор оҕолор чуумпурбуттар даже дети присмирели;<br />
<br />
бастатан - модальное сл. прежде всего, в первую очередь; бастатан, тэрээһин үлэ туһунан прежде всего об организационной работе<br />
<br />
8.<br />
<br />
сорох:сорох DET-IND ; ! "некоторый"<br />
<br />
сорохтор, сорохторо<br />
<br />
[[Category:Sakha]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Apertium-sah&diff=67281
Apertium-sah
2018-07-03T19:56:18Z
<p>Eirien: /* To do */</p>
<hr />
<div>==Current status==<br />
<br />
{{LangStats | lang = sah | corpus1 = wikipedia | corpus2 = kyym | corpus3 = udhr }}<br />
<br />
==To do==<br />
<br />
1. <br />
<br />
<pre>^сөбүлиир/сөбүлээ<v><tv><prc_aor>/сөбүлээ<v><tv><gpr_aor>/сөбүлээ<v><tv><ger_aor><nom>/сөбүлээ<v><tv><aor><p3><sg>$</pre><br />
<br />
should be: сөбүлээ - cөбүл'''үү'''р, үҥкүлээ - үҥкүл'''үү'''р, көрүлээ - көрүл'''үү'''р, ...<br />
<br />
үөскээ - үөск'''үү'''р<br />
<br />
2.<br />
<br />
уһун<v><iv><aor><p3><sg> - уһунар; should be: устар<br />
<br />
таҕыс<v><iv><aor><p3><sg> - таҕыһар; should be: тахсар<br />
<br />
сырыт<v><iv><aor><p3><sg> - сырытар; should be: сылдьар<br />
<br />
3. Contracted forms<br />
<br />
нэһилиэгигэр -> нэһилиэгэр<br />
<br />
олоҕугар -> олоҕор<br />
<br />
түмүгүгэр -> түмүгэр<br />
<br />
өрүтүгэр -> өттүгэр<br />
<br />
үөрэҕигэр -> үөрэҕэр<br />
<br />
суругугар -> суругар<br />
<br />
атаҕыгар -> атаҕар<br />
<br />
бөлөҕүгэр -> бөлөҕөр<br />
<br />
каталогыгар -> каталогар<br />
<br />
4.<br />
<br />
дьахтар<n><pl><nom> дьахта'''рд'''ар<br />
<br />
should be: дьахта'''лл'''ар<br />
<br />
5. where to add частицы: <br />
* эбит, илик<br />
<br />
Оҥоро иликпин "Я ещё не сделал";<br />
<br />
Оҥоро иликкин "Ты ещё не сделал";<br />
<br />
Оҥоро иликпит "Мы ещё не сделали";<br />
<br />
Кэлбит эбиттэр "Они, оказывается, пришли"<br />
<br />
* соҕус<br />
<br />
ичигэс соҕус тепловатый; аҕыйах соҕус довольно мало, маловато; элбэх соҕус довольно много;<br />
<br />
6.<br />
<br />
спорт - N5 спортын - N1 спортун - спордун - which form is correct?<br />
<br />
торт - N5 тортын - N1 тортун - тордун - which form is correct?<br />
<br />
7. where to add?<br />
<br />
оннооҕор - модальное сл. даже; тем более; оннооҕор оҕолор чуумпурбуттар даже дети присмирели;<br />
<br />
8.<br />
<br />
сорох:сорох DET-IND ; ! "некоторый"<br />
<br />
сорохтор, сорохторо<br />
<br />
[[Category:Sakha]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=User:Eirien/GSoC2018_Workplan&diff=67280
User:Eirien/GSoC2018 Workplan
2018-07-03T16:47:08Z
<p>Eirien: /* Workplan */</p>
<hr />
<div>This is a work plan for [[Kazakh and Sakha]] translation pair development during [[Google Summer of Code]] 2018.<br />
<br />
=== Workplan ===<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 35%" | Goals<br />
! style="width: 10%" | Bidix<br />
! style="width: 10%" | WER / PER<br />
! style="width: 10%" | Translator coverage<br />
! style="width: 10%" | Sakha morphological analyser coverage<br />
! style="width: 10%" | Achieved translator<br />
! style="width: 10%" | Achieved mono.<br />
|-<br />
! 1<br />
| style="text-align:center" | 30 April - 6 May<br />
|<br />
* Discussing detailed work plan with mentors<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~150<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~30%<br />
| style="text-align:center" | ~73%<br />
| style="text-align:center" | 29.36<br />
| style="text-align:center" | 73.16<br />
|-<br />
! 2<br />
| style="text-align:center" | 7 May - 13 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~40%<br />
| style="text-align:center" | ~74.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 3<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~50%<br />
| style="text-align:center" | ~75.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 75.64<br />
|-<br />
! 4<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~60%<br />
| style="text-align:center" | ~76.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 76.89<br />
|-<br />
! 5<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~70%<br />
| style="text-align:center" | ~77.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 77.64<br />
|-<br />
! 6<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new stems to dictionaries<br />
* Begin analysis of translation error patterns to prepare rule priority list<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~75%<br />
| style="text-align:center" | ~79%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 78.24<br />
|-<br />
! 7<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new stems to dictionaries<br />
'''Deliverable #1'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~80%<br />
| style="text-align:center" | ~80.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 79.11<br />
|-<br />
! 8<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ~81.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 9<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Finish list of necessary rules by frequency<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~82.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 80.73<br />
|-<br />
! 10<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~83.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 11<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
'''Deliverable #2'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 12<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~86.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 13<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~87.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 14<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~88.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 15<br />
| style="text-align:center" | 6 August - 12 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~89.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 16<br />
| style="text-align:center" | 13 August - 19 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
'''Final evaluation'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~91%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|}</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Apertium-sah&diff=67279
Apertium-sah
2018-07-03T16:44:12Z
<p>Eirien: /* To do */</p>
<hr />
<div>==Current status==<br />
<br />
{{LangStats | lang = sah | corpus1 = wikipedia | corpus2 = kyym | corpus3 = udhr }}<br />
<br />
==To do==<br />
<br />
1. <br />
<br />
<pre>^сөбүлиир/сөбүлээ<v><tv><prc_aor>/сөбүлээ<v><tv><gpr_aor>/сөбүлээ<v><tv><ger_aor><nom>/сөбүлээ<v><tv><aor><p3><sg>$</pre><br />
<br />
should be: сөбүлээ - cөбүл'''үү'''р, үҥкүлээ - үҥкүл'''үү'''р, көрүлээ - көрүл'''үү'''р, ...<br />
<br />
үөскээ - үөск'''үү'''р<br />
<br />
2.<br />
<br />
уһун<v><iv><aor><p3><sg> - уһунар; should be: устар<br />
<br />
таҕыс<v><iv><aor><p3><sg> - таҕыһар; should be: тахсар<br />
<br />
сырыт<v><iv><aor><p3><sg> - сырытар; should be: сылдьар<br />
<br />
3. Contracted forms<br />
<br />
нэһилиэгигэр -> нэһилиэгэр<br />
<br />
олоҕугар -> олоҕор<br />
<br />
түмүгүгэр -> түмүгэр<br />
<br />
өрүтүгэр -> өттүгэр<br />
<br />
үөрэҕигэр -> үөрэҕэр<br />
<br />
суругугар -> суругар<br />
<br />
атаҕыгар -> атаҕар<br />
<br />
бөлөҕүгэр -> бөлөҕөр<br />
<br />
4.<br />
<br />
дьахтар<n><pl><nom> дьахта'''рд'''ар<br />
<br />
should be: дьахта'''лл'''ар<br />
<br />
5. where to add частицы: <br />
* эбит, илик<br />
<br />
Оҥоро иликпин "Я ещё не сделал";<br />
<br />
Оҥоро иликкин "Ты ещё не сделал";<br />
<br />
Оҥоро иликпит "Мы ещё не сделали";<br />
<br />
Кэлбит эбиттэр "Они, оказывается, пришли"<br />
<br />
* соҕус<br />
<br />
ичигэс соҕус тепловатый; аҕыйах соҕус довольно мало, маловато; элбэх соҕус довольно много;<br />
<br />
6.<br />
<br />
спорт - N5 спортын - N1 спортун - спордун - which form is correct?<br />
<br />
торт - N5 тортын - N1 тортун - тордун - which form is correct?<br />
<br />
7. where to add?<br />
<br />
оннооҕор - модальное сл. даже; тем более; оннооҕор оҕолор чуумпурбуттар даже дети присмирели;<br />
<br />
8.<br />
<br />
сорох:сорох DET-IND ; ! "некоторый"<br />
<br />
сорохтор, сорохторо<br />
<br />
[[Category:Sakha]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Apertium-sah&diff=67211
Apertium-sah
2018-06-24T15:51:52Z
<p>Eirien: /* To do */</p>
<hr />
<div>==Current status==<br />
<br />
{{LangStats | lang = sah | corpus1 = wikipedia | corpus2 = kyym | corpus3 = udhr }}<br />
<br />
==To do==<br />
<br />
1. <br />
<br />
<pre>^сөбүлиир/сөбүлээ<v><tv><prc_aor>/сөбүлээ<v><tv><gpr_aor>/сөбүлээ<v><tv><ger_aor><nom>/сөбүлээ<v><tv><aor><p3><sg>$</pre><br />
<br />
should be: сөбүлээ - cөбүл'''үү'''р, үҥкүлээ - үҥкүл'''үү'''р, көрүлээ - көрүл'''үү'''р, ...<br />
<br />
үөскээ - үөск'''үү'''р<br />
<br />
2.<br />
<br />
уһун<v><iv><aor><p3><sg> - уһунар; should be: устар<br />
<br />
таҕыс<v><iv><aor><p3><sg> - таҕыһар; should be: тахсар<br />
<br />
сырыт<v><iv><aor><p3><sg> - сырытар; should be: сылдьар<br />
<br />
3. Contracted forms<br />
<br />
нэһилиэгигэр -> нэһилиэгэр<br />
<br />
олоҕугар -> олоҕор<br />
<br />
түмүгүгэр -> түмүгэр<br />
<br />
өрүтүгэр -> өттүгэр<br />
<br />
үөрэҕигэр -> үөрэҕэр<br />
<br />
суругугар -> суругар<br />
<br />
атаҕыгар -> атаҕар<br />
<br />
4.<br />
<br />
дьахтар<n><pl><nom> дьахта'''рд'''ар<br />
<br />
should be: дьахта'''лл'''ар<br />
<br />
5. where to add частицы: <br />
* эбит, илик<br />
<br />
Оҥоро иликпин "Я ещё не сделал";<br />
<br />
Оҥоро иликкин "Ты ещё не сделал";<br />
<br />
Оҥоро иликпит "Мы ещё не сделали";<br />
<br />
Кэлбит эбиттэр "Они, оказывается, пришли"<br />
<br />
* соҕус<br />
<br />
ичигэс соҕус тепловатый; аҕыйах соҕус довольно мало, маловато; элбэх соҕус довольно много;<br />
<br />
6.<br />
<br />
спорт - N5 спортын - N1 спортун - спордун - which form is correct?<br />
<br />
торт - N5 тортын - N1 тортун - тордун - which form is correct?<br />
<br />
7. where to add?<br />
<br />
оннооҕор - модальное сл. даже; тем более; оннооҕор оҕолор чуумпурбуттар даже дети присмирели;<br />
<br />
8.<br />
<br />
сорох:сорох DET-IND ; ! "некоторый"<br />
<br />
сорохтор, сорохторо<br />
<br />
[[Category:Sakha]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Apertium-sah&diff=67210
Apertium-sah
2018-06-24T15:18:38Z
<p>Eirien: /* To do */</p>
<hr />
<div>==Current status==<br />
<br />
{{LangStats | lang = sah | corpus1 = wikipedia | corpus2 = kyym | corpus3 = udhr }}<br />
<br />
==To do==<br />
<br />
1. <br />
<br />
<pre>^сөбүлиир/сөбүлээ<v><tv><prc_aor>/сөбүлээ<v><tv><gpr_aor>/сөбүлээ<v><tv><ger_aor><nom>/сөбүлээ<v><tv><aor><p3><sg>$</pre><br />
<br />
should be: сөбүлээ - cөбүл'''үү'''р, үҥкүлээ - үҥкүл'''үү'''р, көрүлээ - көрүл'''үү'''р, ...<br />
<br />
үөскээ - үөск'''үү'''р<br />
<br />
2.<br />
<br />
уһун<v><iv><aor><p3><sg> - уһунар; should be: устар<br />
<br />
таҕыс<v><iv><aor><p3><sg> - таҕыһар; should be: тахсар<br />
<br />
сырыт<v><iv><aor><p3><sg> - сырытар; should be: сылдьар<br />
<br />
3. Contracted forms<br />
<br />
нэһилиэгигэр -> нэһилиэгэр<br />
<br />
олоҕугар -> олоҕор<br />
<br />
түмүгүгэр -> түмүгэр<br />
<br />
өрүтүгэр -> өттүгэр<br />
<br />
үөрэҕигэр -> үөрэҕэр<br />
<br />
суругугар -> суругар<br />
<br />
4.<br />
<br />
дьахтар<n><pl><nom> дьахта'''рд'''ар<br />
<br />
should be: дьахта'''лл'''ар<br />
<br />
5. where to add частицы: <br />
* эбит, илик<br />
<br />
Оҥоро иликпин "Я ещё не сделал";<br />
<br />
Оҥоро иликкин "Ты ещё не сделал";<br />
<br />
Оҥоро иликпит "Мы ещё не сделали";<br />
<br />
Кэлбит эбиттэр "Они, оказывается, пришли"<br />
<br />
* соҕус<br />
<br />
ичигэс соҕус тепловатый; аҕыйах соҕус довольно мало, маловато; элбэх соҕус довольно много;<br />
<br />
6.<br />
<br />
спорт - N5 спортын - N1 спортун - спордун - which form is correct?<br />
<br />
торт - N5 тортын - N1 тортун - тордун - which form is correct?<br />
<br />
7. where to add?<br />
<br />
оннооҕор - модальное сл. даже; тем более; оннооҕор оҕолор чуумпурбуттар даже дети присмирели;<br />
<br />
8.<br />
<br />
сорох:сорох DET-IND ; ! "некоторый"<br />
<br />
сорохтор, сорохторо<br />
<br />
[[Category:Sakha]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Apertium-sah&diff=67209
Apertium-sah
2018-06-24T15:03:49Z
<p>Eirien: /* To do */</p>
<hr />
<div>==Current status==<br />
<br />
{{LangStats | lang = sah | corpus1 = wikipedia | corpus2 = kyym | corpus3 = udhr }}<br />
<br />
==To do==<br />
<br />
1. <br />
<br />
<pre>^сөбүлиир/сөбүлээ<v><tv><prc_aor>/сөбүлээ<v><tv><gpr_aor>/сөбүлээ<v><tv><ger_aor><nom>/сөбүлээ<v><tv><aor><p3><sg>$</pre><br />
<br />
should be: сөбүлээ - cөбүл'''үү'''р, үҥкүлээ - үҥкүл'''үү'''р, көрүлээ - көрүл'''үү'''р, ...<br />
<br />
үөскээ - үөск'''үү'''р<br />
<br />
2.<br />
<br />
уһун<v><iv><aor><p3><sg> - уһунар; should be: устар<br />
<br />
таҕыс<v><iv><aor><p3><sg> - таҕыһар; should be: тахсар<br />
<br />
сырыт<v><iv><aor><p3><sg> - сырытар; should be: сылдьар<br />
<br />
3. Contracted forms<br />
<br />
нэһилиэгигэр -> нэһилиэгэр<br />
<br />
олоҕугар -> олоҕор<br />
<br />
түмүгүгэр -> түмүгэр<br />
<br />
өрүтүгэр -> өттүгэр<br />
<br />
үөрэҕигэр -> үөрэҕэр<br />
<br />
4.<br />
<br />
дьахтар<n><pl><nom> дьахта'''рд'''ар<br />
<br />
should be: дьахта'''лл'''ар<br />
<br />
5. where to add частицы: <br />
* эбит, илик<br />
<br />
Оҥоро иликпин "Я ещё не сделал";<br />
<br />
Оҥоро иликкин "Ты ещё не сделал";<br />
<br />
Оҥоро иликпит "Мы ещё не сделали";<br />
<br />
Кэлбит эбиттэр "Они, оказывается, пришли"<br />
<br />
* соҕус<br />
<br />
ичигэс соҕус тепловатый; аҕыйах соҕус довольно мало, маловато; элбэх соҕус довольно много;<br />
<br />
6.<br />
<br />
спорт - N5 спортын - N1 спортун - спордун - which form is correct?<br />
<br />
торт - N5 тортын - N1 тортун - тордун - which form is correct?<br />
<br />
7. where to add?<br />
<br />
оннооҕор - модальное сл. даже; тем более; оннооҕор оҕолор чуумпурбуттар даже дети присмирели;<br />
<br />
8.<br />
<br />
сорох:сорох DET-IND ; ! "некоторый"<br />
<br />
сорохтор, сорохторо<br />
<br />
[[Category:Sakha]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=User:Eirien/GSoC2018_Workplan&diff=67180
User:Eirien/GSoC2018 Workplan
2018-06-18T11:28:43Z
<p>Eirien: /* Workplan */</p>
<hr />
<div>This is a work plan for [[Kazakh and Sakha]] translation pair development during [[Google Summer of Code]] 2018.<br />
<br />
=== Workplan ===<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 35%" | Goals<br />
! style="width: 10%" | Bidix<br />
! style="width: 10%" | WER / PER<br />
! style="width: 10%" | Translator coverage<br />
! style="width: 10%" | Sakha morphological analyser coverage<br />
! style="width: 10%" | Achieved translator<br />
! style="width: 10%" | Achieved mono.<br />
|-<br />
! 1<br />
| style="text-align:center" | 30 April - 6 May<br />
|<br />
* Discussing detailed work plan with mentors<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~150<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~30%<br />
| style="text-align:center" | ~73%<br />
| style="text-align:center" | 29.36<br />
| style="text-align:center" | 73.16<br />
|-<br />
! 2<br />
| style="text-align:center" | 7 May - 13 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~40%<br />
| style="text-align:center" | ~74.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 3<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~50%<br />
| style="text-align:center" | ~75.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 75.64<br />
|-<br />
! 4<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~60%<br />
| style="text-align:center" | ~76.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 76.89<br />
|-<br />
! 5<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~70%<br />
| style="text-align:center" | ~77.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 77.64<br />
|-<br />
! 6<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new stems to dictionaries<br />
* Begin analysis of translation error patterns to prepare rule priority list<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~75%<br />
| style="text-align:center" | ~79%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 78.24<br />
|-<br />
! 7<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new stems to dictionaries<br />
'''Deliverable #1'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~80%<br />
| style="text-align:center" | ~80.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 79.11<br />
|-<br />
! 8<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ~81.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 9<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Finish list of necessary rules by frequency<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~82.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 10<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~83.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 11<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
'''Deliverable #2'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 12<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~86.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 13<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~87.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 14<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~88.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 15<br />
| style="text-align:center" | 6 August - 12 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~89.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 16<br />
| style="text-align:center" | 13 August - 19 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
'''Final evaluation'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~91%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|}</div>
Eirien
https://wiki.apertium.org/w/index.php?title=User:Eirien/GSoC2018_Workplan&diff=67155
User:Eirien/GSoC2018 Workplan
2018-06-14T08:54:27Z
<p>Eirien: /* Workplan */</p>
<hr />
<div>This is a work plan for [[Kazakh and Sakha]] translation pair development during [[Google Summer of Code]] 2018.<br />
<br />
=== Workplan ===<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 35%" | Goals<br />
! style="width: 10%" | Bidix<br />
! style="width: 10%" | WER / PER<br />
! style="width: 10%" | Translator coverage<br />
! style="width: 10%" | Sakha morphological analyser coverage<br />
! style="width: 10%" | Achieved translator<br />
! style="width: 10%" | Achieved mono.<br />
|-<br />
! 1<br />
| style="text-align:center" | 30 April - 6 May<br />
|<br />
* Discussing detailed work plan with mentors<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~150<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~30%<br />
| style="text-align:center" | ~73%<br />
| style="text-align:center" | 29.36<br />
| style="text-align:center" | 73.16<br />
|-<br />
! 2<br />
| style="text-align:center" | 7 May - 13 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~40%<br />
| style="text-align:center" | ~74.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 3<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~50%<br />
| style="text-align:center" | ~75.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 75.64<br />
|-<br />
! 4<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~60%<br />
| style="text-align:center" | ~76.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 76.89<br />
|-<br />
! 5<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~70%<br />
| style="text-align:center" | ~77.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 77.64<br />
|-<br />
! 6<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new stems to dictionaries<br />
* Begin analysis of translation error patterns to prepare rule priority list<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~75%<br />
| style="text-align:center" | ~79%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 78.24<br />
|-<br />
! 7<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new stems to dictionaries<br />
'''Deliverable #1'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~80%<br />
| style="text-align:center" | ~80.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 8<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ~81.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 9<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Finish list of necessary rules by frequency<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~82.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 10<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~83.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 11<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
'''Deliverable #2'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 12<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~86.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 13<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~87.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 14<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~88.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 15<br />
| style="text-align:center" | 6 August - 12 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~89.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 16<br />
| style="text-align:center" | 13 August - 19 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
'''Final evaluation'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~91%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|}</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Apertium-sah&diff=67133
Apertium-sah
2018-06-10T15:33:57Z
<p>Eirien: /* To do */</p>
<hr />
<div>==Current status==<br />
<br />
{{LangStats | lang = sah | corpus1 = wikipedia | corpus2 = kyym | corpus3 = udhr }}<br />
<br />
==To do==<br />
<br />
1. <br />
<br />
<pre>^сөбүлиир/сөбүлээ<v><tv><prc_aor>/сөбүлээ<v><tv><gpr_aor>/сөбүлээ<v><tv><ger_aor><nom>/сөбүлээ<v><tv><aor><p3><sg>$</pre><br />
<br />
should be: сөбүлээ - cөбүл'''үү'''р, үҥкүлээ - үҥкүл'''үү'''р, көрүлээ - көрүл'''үү'''р, ...<br />
<br />
үөскээ - үөск'''үү'''р<br />
<br />
2.<br />
<br />
уһун<v><iv><aor><p3><sg> - уһунар; should be: устар<br />
<br />
таҕыс<v><iv><aor><p3><sg> - таҕыһар; should be: тахсар<br />
<br />
сырыт<v><iv><aor><p3><sg> - сырытар; should be: сылдьар<br />
<br />
3. Contracted forms<br />
<br />
нэһилиэгигэр -> нэһилиэгэр<br />
<br />
олоҕугар -> олоҕор<br />
<br />
түмүгүгэр -> түмүгэр<br />
<br />
өрүтүгэр -> өттүгэр<br />
<br />
үөрэҕигэр -> үөрэҕэр<br />
<br />
4.<br />
<br />
дьахтар<n><pl><nom> дьахта'''рд'''ар<br />
<br />
should be: дьахта'''лл'''ар<br />
<br />
5. where to add частицы: <br />
* эбит, илик<br />
<br />
Оҥоро иликпин "Я ещё не сделал";<br />
<br />
Оҥоро иликкин "Ты ещё не сделал";<br />
<br />
Оҥоро иликпит "Мы ещё не сделали";<br />
<br />
Кэлбит эбиттэр "Они, оказывается, пришли"<br />
<br />
* соҕус<br />
<br />
ичигэс соҕус тепловатый; аҕыйах соҕус довольно мало, маловато; элбэх соҕус довольно много;<br />
<br />
6.<br />
<br />
спорт - N5 спортын - N1 спортун - спордун - which form is correct?<br />
<br />
торт - N5 тортын - N1 тортун - тордун - which form is correct?<br />
<br />
7. where to add?<br />
<br />
оннооҕор - модальное сл. даже; тем более; оннооҕор оҕолор чуумпурбуттар даже дети присмирели;<br />
<br />
[[Category:Sakha]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Apertium-sah&diff=67132
Apertium-sah
2018-06-10T14:38:10Z
<p>Eirien: /* To do */</p>
<hr />
<div>==Current status==<br />
<br />
{{LangStats | lang = sah | corpus1 = wikipedia | corpus2 = kyym | corpus3 = udhr }}<br />
<br />
==To do==<br />
<br />
1. <br />
<br />
<pre>^сөбүлиир/сөбүлээ<v><tv><prc_aor>/сөбүлээ<v><tv><gpr_aor>/сөбүлээ<v><tv><ger_aor><nom>/сөбүлээ<v><tv><aor><p3><sg>$</pre><br />
<br />
should be: сөбүлээ - cөбүл'''үү'''р, үҥкүлээ - үҥкүл'''үү'''р, көрүлээ - көрүл'''үү'''р, ...<br />
<br />
үөскээ - үөск'''үү'''р<br />
<br />
2.<br />
<br />
уһун<v><iv><aor><p3><sg> - уһунар; should be: устар<br />
<br />
таҕыс<v><iv><aor><p3><sg> - таҕыһар; should be: тахсар<br />
<br />
сырыт<v><iv><aor><p3><sg> - сырытар; should be: сылдьар<br />
<br />
3. Contracted forms<br />
<br />
нэһилиэгигэр -> нэһилиэгэр<br />
<br />
олоҕугар -> олоҕор<br />
<br />
түмүгүгэр -> түмүгэр<br />
<br />
өрүтүгэр -> өттүгэр<br />
<br />
үөрэҕигэр -> үөрэҕэр<br />
<br />
4.<br />
<br />
дьахтар<n><pl><nom> дьахта'''рд'''ар<br />
<br />
should be: дьахта'''лл'''ар<br />
<br />
5. where to add частицы: <br />
* эбит, илик<br />
<br />
Оҥоро иликпин "Я ещё не сделал";<br />
<br />
Оҥоро иликкин "Ты ещё не сделал";<br />
<br />
Оҥоро иликпит "Мы ещё не сделали";<br />
<br />
Кэлбит эбиттэр "Они, оказывается, пришли"<br />
<br />
* соҕус<br />
<br />
ичигэс соҕус тепловатый; аҕыйах соҕус довольно мало, маловато; элбэх соҕус довольно много;<br />
<br />
6.<br />
<br />
спорт - N5 спортын - N1 спортун - спордун - which form is correct?<br />
<br />
торт - N5 тортын - N1 тортун - тордун - which form is correct?<br />
<br />
[[Category:Sakha]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=User:Eirien/GSoC2018_Workplan&diff=67114
User:Eirien/GSoC2018 Workplan
2018-06-05T10:33:46Z
<p>Eirien: /* Workplan */</p>
<hr />
<div>This is a work plan for [[Kazakh and Sakha]] translation pair development during [[Google Summer of Code]] 2018.<br />
<br />
=== Workplan ===<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 35%" | Goals<br />
! style="width: 10%" | Bidix<br />
! style="width: 10%" | WER / PER<br />
! style="width: 10%" | Translator coverage<br />
! style="width: 10%" | Sakha morphological analyser coverage<br />
! style="width: 10%" | Achieved translator<br />
! style="width: 10%" | Achieved mono.<br />
|-<br />
! 1<br />
| style="text-align:center" | 30 April - 6 May<br />
|<br />
* Discussing detailed work plan with mentors<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~150<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~30%<br />
| style="text-align:center" | ~73%<br />
| style="text-align:center" | 29.36<br />
| style="text-align:center" | 73.16<br />
|-<br />
! 2<br />
| style="text-align:center" | 7 May - 13 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~40%<br />
| style="text-align:center" | ~74.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 3<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~50%<br />
| style="text-align:center" | ~75.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 75.64<br />
|-<br />
! 4<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~60%<br />
| style="text-align:center" | ~76.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 76.89<br />
|-<br />
! 5<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~70%<br />
| style="text-align:center" | ~77.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 77.64<br />
|-<br />
! 6<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new stems to dictionaries<br />
* Begin analysis of translation error patterns to prepare rule priority list<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~75%<br />
| style="text-align:center" | ~79%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 7<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new stems to dictionaries<br />
'''Deliverable #1'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~80%<br />
| style="text-align:center" | ~80.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 8<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ~81.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 9<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Finish list of necessary rules by frequency<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~82.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 10<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~83.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 11<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
'''Deliverable #2'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 12<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~86.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 13<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~87.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 14<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~88.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 15<br />
| style="text-align:center" | 6 August - 12 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~89.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 16<br />
| style="text-align:center" | 13 August - 19 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
'''Final evaluation'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~91%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|}</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Apertium-sah&diff=67105
Apertium-sah
2018-06-03T15:32:09Z
<p>Eirien: /* To do */</p>
<hr />
<div>==Current status==<br />
<br />
{{LangStats | lang = sah | corpus1 = wikipedia | corpus2 = kyym | corpus3 = udhr }}<br />
<br />
==To do==<br />
<br />
1. <br />
<br />
<pre>^сөбүлиир/сөбүлээ<v><tv><prc_aor>/сөбүлээ<v><tv><gpr_aor>/сөбүлээ<v><tv><ger_aor><nom>/сөбүлээ<v><tv><aor><p3><sg>$</pre><br />
<br />
should be: сөбүлээ - cөбүл'''үү'''р, үҥкүлээ - үҥкүл'''үү'''р, көрүлээ - көрүл'''үү'''р, ...<br />
<br />
үөскээ - үөск'''үү'''р<br />
<br />
2.<br />
<br />
уһун<v><iv><aor><p3><sg> - уһунар; should be: устар<br />
<br />
таҕыс<v><iv><aor><p3><sg> - таҕыһар; should be: тахсар<br />
<br />
сырыт<v><iv><aor><p3><sg> - сырытар; should be: сылдьар<br />
<br />
3. Contracted forms<br />
<br />
нэһилиэгигэр -> нэһилиэгэр<br />
<br />
олоҕугар -> олоҕор<br />
<br />
түмүгүгэр -> түмүгэр<br />
<br />
өрүтүгэр -> өттүгэр<br />
<br />
үөрэҕигэр -> үөрэҕэр<br />
<br />
4.<br />
<br />
дьахтар<n><pl><nom> дьахта'''рд'''ар<br />
<br />
should be: дьахта'''лл'''ар<br />
<br />
5. where to add частицы: эбит, илик?<br />
<br />
Оҥоро иликпин "Я ещё не сделал";<br />
<br />
Оҥоро иликкин "Ты ещё не сделал";<br />
<br />
Оҥоро иликпит "Мы ещё не сделали";<br />
<br />
Кэлбит эбиттэр "Они, оказывается, пришли"<br />
<br />
6.<br />
<br />
спорт - N5 спортын - N1 спортун - спордун - which form is correct?<br />
<br />
торт - N5 тортын - N1 тортун - тордун - which form is correct?<br />
<br />
[[Category:Sakha]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Apertium-sah&diff=67104
Apertium-sah
2018-06-03T14:52:24Z
<p>Eirien: /* To do */</p>
<hr />
<div>==Current status==<br />
<br />
{{LangStats | lang = sah | corpus1 = wikipedia | corpus2 = kyym | corpus3 = udhr }}<br />
<br />
==To do==<br />
<br />
1. <br />
<br />
<pre>^сөбүлиир/сөбүлээ<v><tv><prc_aor>/сөбүлээ<v><tv><gpr_aor>/сөбүлээ<v><tv><ger_aor><nom>/сөбүлээ<v><tv><aor><p3><sg>$</pre><br />
<br />
should be: сөбүлээ - cөбүл'''үү'''р, үҥкүлээ - үҥкүл'''үү'''р, көрүлээ - көрүл'''үү'''р, ...<br />
<br />
үөскээ - үөск'''үү'''р<br />
<br />
2.<br />
<br />
уһун<v><iv><aor><p3><sg> - уһунар; should be: устар<br />
<br />
таҕыс<v><iv><aor><p3><sg> - таҕыһар; should be: тахсар<br />
<br />
сырыт<v><iv><aor><p3><sg> - сырытар; should be: сылдьар<br />
<br />
3. Contracted forms<br />
<br />
нэһилиэгигэр -> нэһилиэгэр<br />
<br />
олоҕугар -> олоҕор<br />
<br />
түмүгүгэр -> түмүгэр<br />
<br />
өрүтүгэр -> өттүгэр<br />
<br />
үөрэҕигэр -> үөрэҕэр<br />
<br />
4.<br />
<br />
дьахтар<n><pl><nom> дьахта'''рд'''ар<br />
<br />
should be: дьахта'''лл'''ар<br />
<br />
5. where to add частицы: эбит, илик?<br />
<br />
Оҥоро иликпин "Я ещё не сделал";<br />
<br />
Оҥоро иликкин "Ты ещё не сделал";<br />
<br />
Оҥоро иликпит "Мы ещё не сделали";<br />
<br />
Кэлбит эбиттэр "Они, оказывается, пришли"<br />
<br />
[[Category:Sakha]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Apertium-sah&diff=67103
Apertium-sah
2018-06-03T14:13:42Z
<p>Eirien: /* To do */</p>
<hr />
<div>==Current status==<br />
<br />
{{LangStats | lang = sah | corpus1 = wikipedia | corpus2 = kyym | corpus3 = udhr }}<br />
<br />
==To do==<br />
<br />
1. <br />
<br />
<pre>^сөбүлиир/сөбүлээ<v><tv><prc_aor>/сөбүлээ<v><tv><gpr_aor>/сөбүлээ<v><tv><ger_aor><nom>/сөбүлээ<v><tv><aor><p3><sg>$</pre><br />
<br />
should be: сөбүлээ - cөбүл'''үү'''р, үҥкүлээ - үҥкүл'''үү'''р, көрүлээ - көрүл'''үү'''р, ...<br />
<br />
үөскээ - үөск'''үү'''р<br />
<br />
2.<br />
<br />
уһун<v><iv><aor><p3><sg> - уһунар; should be: устар<br />
<br />
таҕыс<v><iv><aor><p3><sg> - таҕыһар; should be: тахсар<br />
<br />
сырыт<v><iv><aor><p3><sg> - сырытар; should be: сылдьар<br />
<br />
3. Contracted forms<br />
<br />
нэһилиэгигэр -> нэһилиэгэр<br />
<br />
олоҕугар -> олоҕор<br />
<br />
түмүгүгэр -> түмүгэр<br />
<br />
өрүтүгэр -> өттүгэр<br />
<br />
үөрэҕигэр -> үөрэҕэр<br />
<br />
4.<br />
<br />
дьахтар<n><pl><nom> дьахта'''рд'''ар<br />
<br />
should be: дьахта'''лл'''ар<br />
<br />
[[Category:Sakha]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=User:Eirien/GSoC2018_Workplan&diff=67084
User:Eirien/GSoC2018 Workplan
2018-05-28T11:22:34Z
<p>Eirien: /* Workplan */</p>
<hr />
<div>This is a work plan for [[Kazakh and Sakha]] translation pair development during [[Google Summer of Code]] 2018.<br />
<br />
=== Workplan ===<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 35%" | Goals<br />
! style="width: 10%" | Bidix<br />
! style="width: 10%" | WER / PER<br />
! style="width: 10%" | Translator coverage<br />
! style="width: 10%" | Sakha morphological analyser coverage<br />
! style="width: 10%" | Achieved translator<br />
! style="width: 10%" | Achieved mono.<br />
|-<br />
! 1<br />
| style="text-align:center" | 30 April - 6 May<br />
|<br />
* Discussing detailed work plan with mentors<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~150<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~30%<br />
| style="text-align:center" | ~73%<br />
| style="text-align:center" | 29.36<br />
| style="text-align:center" | 73.16<br />
|-<br />
! 2<br />
| style="text-align:center" | 7 May - 13 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~40%<br />
| style="text-align:center" | ~74.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 3<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~50%<br />
| style="text-align:center" | ~75.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 75.64<br />
|-<br />
! 4<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~60%<br />
| style="text-align:center" | ~76.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 76.89<br />
|-<br />
! 5<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~70%<br />
| style="text-align:center" | ~77.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 6<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new stems to dictionaries<br />
* Begin analysis of translation error patterns to prepare rule priority list<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~75%<br />
| style="text-align:center" | ~79%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 7<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new stems to dictionaries<br />
'''Deliverable #1'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~80%<br />
| style="text-align:center" | ~80.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 8<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ~81.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 9<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Finish list of necessary rules by frequency<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~82.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 10<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~83.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 11<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
'''Deliverable #2'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 12<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~86.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 13<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~87.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 14<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~88.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 15<br />
| style="text-align:center" | 6 August - 12 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~89.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 16<br />
| style="text-align:center" | 13 August - 19 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
'''Final evaluation'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~91%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|}</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Apertium-sah&diff=67069
Apertium-sah
2018-05-25T14:59:36Z
<p>Eirien: /* To do */</p>
<hr />
<div>==Current status==<br />
<br />
{{LangStats | lang = sah | corpus1 = wikipedia | corpus2 = kyym | corpus3 = udhr }}<br />
<br />
==To do==<br />
<br />
1. <br />
<br />
<pre>^сөбүлиир/сөбүлээ<v><tv><prc_aor>/сөбүлээ<v><tv><gpr_aor>/сөбүлээ<v><tv><ger_aor><nom>/сөбүлээ<v><tv><aor><p3><sg>$</pre><br />
<br />
should be: сөбүлээ - cөбүл'''үү'''р, үҥкүлээ - үҥкүл'''үү'''р, көрүлээ - көрүл'''үү'''р, ...<br />
<br />
үөскээ - үөск'''үү'''р<br />
<br />
2.<br />
<br />
уһун<v><iv><aor><p3><sg> - уһунар; should be: устар<br />
<br />
таҕыс<v><iv><aor><p3><sg> - таҕыһар; should be: тахсар<br />
<br />
сырыт<v><iv><aor><p3><sg> - сырытар; should be: сылдьар<br />
<br />
3. Contracted forms<br />
<br />
нэһилиэгигэр -> нэһилиэгэр<br />
<br />
олоҕугар -> олоҕор<br />
<br />
түмүгүгэр -> түмүгэр<br />
<br />
өрүтүгэр -> өттүгэр<br />
<br />
үөрэҕигэр -> үөрэҕэр<br />
<br />
[[Category:Sakha]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=User:Eirien/GSoC2018_Workplan&diff=67055
User:Eirien/GSoC2018 Workplan
2018-05-22T08:37:07Z
<p>Eirien: /* Workplan */</p>
<hr />
<div>This is a work plan for [[Kazakh and Sakha]] translation pair development during [[Google Summer of Code]] 2018.<br />
<br />
=== Workplan ===<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 35%" | Goals<br />
! style="width: 10%" | Bidix<br />
! style="width: 10%" | WER / PER<br />
! style="width: 10%" | Translator coverage<br />
! style="width: 10%" | Sakha morphological analyser coverage<br />
! style="width: 10%" | Achieved translator<br />
! style="width: 10%" | Achieved mono.<br />
|-<br />
! 1<br />
| style="text-align:center" | 30 April - 6 May<br />
|<br />
* Discussing detailed work plan with mentors<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~150<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~30%<br />
| style="text-align:center" | ~73%<br />
| style="text-align:center" | 29.36<br />
| style="text-align:center" | 73.16<br />
|-<br />
! 2<br />
| style="text-align:center" | 7 May - 13 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~40%<br />
| style="text-align:center" | ~74.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 3<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~50%<br />
| style="text-align:center" | ~75.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | 75.64<br />
|-<br />
! 4<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~60%<br />
| style="text-align:center" | ~76.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 5<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~70%<br />
| style="text-align:center" | ~77.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 6<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new stems to dictionaries<br />
* Begin analysis of translation error patterns to prepare rule priority list<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~75%<br />
| style="text-align:center" | ~79%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 7<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new stems to dictionaries<br />
'''Deliverable #1'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~80%<br />
| style="text-align:center" | ~80.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 8<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ~81.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 9<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Finish list of necessary rules by frequency<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~82.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 10<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~83.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 11<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
'''Deliverable #2'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 12<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~86.2%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 13<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~87.4%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 14<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~88.6%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 15<br />
| style="text-align:center" | 6 August - 12 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~89.8%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|-<br />
! 16<br />
| style="text-align:center" | 13 August - 19 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
'''Final evaluation'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~91%<br />
| style="text-align:center" | ?<br />
| style="text-align:center" | ?<br />
|}</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Apertium-sah&diff=66992
Apertium-sah
2018-05-12T10:44:47Z
<p>Eirien: /* To do */</p>
<hr />
<div>==Current status==<br />
<br />
{{LangStats | lang = sah | corpus1 = wikipedia | corpus2 = kyym | corpus3 = udhr }}<br />
<br />
==To do==<br />
<br />
1. <br />
<br />
<pre>^сөбүлиир/сөбүлээ<v><tv><prc_aor>/сөбүлээ<v><tv><gpr_aor>/сөбүлээ<v><tv><ger_aor><nom>/сөбүлээ<v><tv><aor><p3><sg>$</pre><br />
<br />
should be: сөбүлээ - cөбүл'''үү'''р, үҥкүлээ - үҥкүл'''үү'''р, көрүлээ - көрүл'''үү'''р, ...<br />
<br />
үөскээ - үөск'''үү'''р<br />
<br />
2.<br />
<br />
уһун<v><iv><aor><p3><sg> - уһунар; should be: устар<br />
<br />
таҕыс<v><iv><aor><p3><sg> - таҕыһар; should be: тахсар<br />
<br />
сырыт<v><iv><aor><p3><sg> - сырытар; should be: сылдьар<br />
<br />
[[Category:Sakha]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Apertium-sah&diff=66918
Apertium-sah
2018-05-08T09:31:06Z
<p>Eirien: /* To do */</p>
<hr />
<div>==Current status==<br />
<br />
{{LangStats | lang = sah | corpus1 = wikipedia | corpus2 = kyym | corpus3 = udhr }}<br />
<br />
==To do==<br />
<br />
1. <br />
<br />
<pre>^сөбүлиир/сөбүлээ<v><tv><prc_aor>/сөбүлээ<v><tv><gpr_aor>/сөбүлээ<v><tv><ger_aor><nom>/сөбүлээ<v><tv><aor><p3><sg>$</pre><br />
<br />
should be: сөбүлээ - cөбүл'''үү'''р, үҥкүлээ - үҥкүл'''үү'''р, көрүлээ - көрүл'''үү'''р, ...<br />
<br />
2.<br />
<br />
уһун<v><iv><aor><p3><sg> - уһунар; should be: устар<br />
<br />
таҕыс<v><iv><aor><p3><sg> - таҕыһар; should be: тахсар<br />
<br />
сырыт<v><iv><aor><p3><sg> - сырытар; should be: сылдьар<br />
<br />
[[Category:Sakha]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Apertium-sah&diff=66917
Apertium-sah
2018-05-08T08:57:06Z
<p>Eirien: /* To do */</p>
<hr />
<div>==Current status==<br />
<br />
{{LangStats | lang = sah | corpus1 = wikipedia | corpus2 = kyym | corpus3 = udhr }}<br />
<br />
==To do==<br />
<br />
1. <br />
<br />
<pre>^сөбүлиир/сөбүлээ<v><tv><prc_aor>/сөбүлээ<v><tv><gpr_aor>/сөбүлээ<v><tv><ger_aor><nom>/сөбүлээ<v><tv><aor><p3><sg>$</pre><br />
<br />
should be: сөбүлээ - cөбүл'''үү'''р, үҥкүлээ - үҥкүл'''үү'''р, көрүлээ - көрүл'''үү'''р, ...<br />
<br />
2.<br />
<br />
уһун<v><iv><aor><p3><sg> - уһунар; should be: устар<br />
<br />
[[Category:Sakha]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=User:Eirien/GSoC2018_Workplan&diff=66889
User:Eirien/GSoC2018 Workplan
2018-05-06T19:07:19Z
<p>Eirien: /* Workplan */</p>
<hr />
<div>This is a work plan for [[Kazakh and Sakha]] translation pair development during [[Google Summer of Code]] 2018.<br />
<br />
=== Workplan ===<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 35%" | Goals<br />
! style="width: 10%" | Bidix<br />
! style="width: 10%" | WER / PER<br />
! style="width: 10%" | Translator coverage<br />
! style="width: 10%" | Sakha morphological analyser coverage<br />
|-<br />
! 1<br />
| style="text-align:center" | 30 April - 6 May<br />
|<br />
* Discussing detailed work plan with mentors<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~150<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~30%<br />
| style="text-align:center" | ~73%<br />
|-<br />
! 2<br />
| style="text-align:center" | 7 May - 13 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~40%<br />
| style="text-align:center" | ~74.2%<br />
|-<br />
! 3<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~50%<br />
| style="text-align:center" | ~75.4%<br />
|-<br />
! 4<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~60%<br />
| style="text-align:center" | ~76.6%<br />
|-<br />
! 5<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~70%<br />
| style="text-align:center" | ~77.8%<br />
|-<br />
! 6<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new stems to dictionaries<br />
* Begin analysis of translation error patterns to prepare rule priority list<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~75%<br />
| style="text-align:center" | ~79%<br />
|-<br />
! 7<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new stems to dictionaries<br />
'''Deliverable #1'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~80%<br />
| style="text-align:center" | ~80.2%<br />
|-<br />
! 8<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~85%<br />
| style="text-align:center" | ~81.4%<br />
|-<br />
! 9<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Finish list of necessary rules by frequency<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~82.6%<br />
|-<br />
! 10<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~83.8%<br />
|-<br />
! 11<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
'''Deliverable #2'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~85%<br />
|-<br />
! 12<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~86.2%<br />
|-<br />
! 13<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~87.4%<br />
|-<br />
! 14<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~88.6%<br />
|-<br />
! 15<br />
| style="text-align:center" | 6 August - 12 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~89.8%<br />
|-<br />
! 16<br />
| style="text-align:center" | 13 August - 19 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
'''Final evaluation'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~90%<br />
| style="text-align:center" | ~91%<br />
|}</div>
Eirien
https://wiki.apertium.org/w/index.php?title=User:Eirien/GSoC2018_Workplan&diff=66867
User:Eirien/GSoC2018 Workplan
2018-05-06T09:35:38Z
<p>Eirien: /* Workplan */</p>
<hr />
<div>This is a work plan for [[Kazakh and Sakha]] translation pair development during [[Google Summer of Code]] 2018.<br />
<br />
=== Workplan ===<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Goals<br />
! style="width: 10%" | Bidix<br />
! style="width: 10%" | WER / PER<br />
! style="width: 10%" | Coverage<br />
|-<br />
! 1<br />
| style="text-align:center" | 30 April - 6 May<br />
|<br />
* Add new stems to dictionaries<br />
* ?<br />
* ?<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 2<br />
| style="text-align:center" | 7 May - 13 May<br />
|<br />
* Add new stems to dictionaries<br />
* ?<br />
* ?<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 3<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new stems to dictionaries<br />
* ?<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 4<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new stems to dictionaries<br />
* ?<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 5<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 6<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new stems to dictionaries<br />
* Begin analysis of translation error patterns to prepare rule priority list<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 7<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new stems to dictionaries<br />
'''Deliverable #1'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 8<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new stems to dictionaries<br />
* ?<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 9<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Finish list of necessary rules by frequency<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 10<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 11<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
'''Deliverable #2'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 12<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 13<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 14<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 15<br />
| style="text-align:center" | 6 August - 12 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 16<br />
| style="text-align:center" | 13 August - 19 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
'''Final evaluation'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|}</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Apertium-sah&diff=66866
Apertium-sah
2018-05-05T21:59:36Z
<p>Eirien: </p>
<hr />
<div>==Current status==<br />
<br />
{{LangStats | lang = sah | corpus1 = wikipedia | corpus2 = kyym | corpus3 = udhr }}<br />
<br />
==To do==<br />
<br />
1. <br />
<br />
<pre>^сөбүлиир/сөбүлээ<v><tv><prc_aor>/сөбүлээ<v><tv><gpr_aor>/сөбүлээ<v><tv><ger_aor><nom>/сөбүлээ<v><tv><aor><p3><sg>$</pre><br />
<br />
should be: сөбүлээ - cөбүл'''үү'''р, үҥкүлээ - үҥкүл'''үү'''р, көрүлээ - көрүл'''үү'''р, ...<br />
<br />
[[Category:Sakha]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=User:Eirien&diff=66865
User:Eirien
2018-05-05T21:06:24Z
<p>Eirien: </p>
<hr />
<div>My name is Sardana Ivanova. I contribute to [[Apertium-kaz-sah]] and [[Apertium-sah]].<br />
<br />
==Contact Info==<br />
<br />
'''E-mail:''' i.sardana.n@gmail.com<br />
<br />
'''IRC:''' Eirien<br />
<br />
'''Github:''' https://github.com/varie<br />
<br />
'''Timezone:''' UTC/GMT+3</div>
Eirien
https://wiki.apertium.org/w/index.php?title=User:Eirien&diff=66864
User:Eirien
2018-05-05T19:24:04Z
<p>Eirien: Created page with "My name is Sardana Ivanova. I contribute to Kazakh and Sakha and Apertium-sah. ==Contact Info== '''E-mail:''' i.sardana.n@gmail.com '''IRC:''' Eirien '''Github:'''..."</p>
<hr />
<div>My name is Sardana Ivanova. I contribute to [[Kazakh and Sakha]] and [[Apertium-sah]].<br />
<br />
==Contact Info==<br />
<br />
'''E-mail:''' i.sardana.n@gmail.com<br />
<br />
'''IRC:''' Eirien<br />
<br />
'''Github:''' https://github.com/varie<br />
<br />
'''Timezone:''' UTC/GMT+3</div>
Eirien
https://wiki.apertium.org/w/index.php?title=User:Eirien/GSoC2018_Workplan&diff=66863
User:Eirien/GSoC2018 Workplan
2018-05-05T16:14:53Z
<p>Eirien: Created page with "This is a work plan for Kazakh and Sakha translation pair development during Google Summer of Code 2018. === Workplan === {|class="wikitable" ! style="width: 10%" | ..."</p>
<hr />
<div>This is a work plan for [[Kazakh and Sakha]] translation pair development during [[Google Summer of Code]] 2018.<br />
<br />
=== Workplan ===<br />
<br />
{|class="wikitable"<br />
! style="width: 10%" | Week<br />
! style="width: 15%" | Dates<br />
! style="width: 45%" | Goals<br />
! style="width: 10%" | Bidix<br />
! style="width: 10%" | WER / PER<br />
! style="width: 10%" | Coverage<br />
|-<br />
! 1<br />
| style="text-align:center" | 30 April - 6 May<br />
|<br />
* Add new stems to dictionaries<br />
* ?<br />
* ?<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 2<br />
| style="text-align:center" | 7 May - 13 May<br />
|<br />
* Add new stems to dictionaries<br />
* ?<br />
* ?<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 3<br />
| style="text-align:center" | 14 May - 20 May<br />
|<br />
* Add new stems to dictionaries<br />
* ?<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 4<br />
| style="text-align:center" | 21 May - 27 May<br />
|<br />
* Add new stems to dictionaries<br />
* ?<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 5<br />
| style="text-align:center" | 28 May - 3 June<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Add transfer rules<br />
'''Deliverable #1'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 6<br />
| style="text-align:center" | 4 June - 10 June<br />
|<br />
* Add new stems to dictionaries<br />
* Begin analysis of translation error patterns to prepare rule priority list<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 7<br />
| style="text-align:center" | 11 June - 17 June<br />
|<br />
* Add new stems to dictionaries<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 8<br />
| style="text-align:center" | 18 June - 24 June<br />
|<br />
* Add new stems to dictionaries<br />
* ?<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 9<br />
| style="text-align:center" | 25 June - 1 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Finish list of necessary rules by frequency<br />
'''Deliverable #2'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 10<br />
| style="text-align:center" | 2 July - 8 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 11<br />
| style="text-align:center" | 9 July - 15 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 12<br />
| style="text-align:center" | 16 July - 22 July<br />
|<br />
* Add new stems to dictionaries<br />
* Add transfer rules<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 13<br />
| style="text-align:center" | 23 July - 29 July<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
'''Final evaluation'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 14<br />
| style="text-align:center" | 30 July - 5 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
'''Final evaluation'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 15<br />
| style="text-align:center" | 6 August - 12 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
'''Final evaluation'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|-<br />
! 16<br />
| style="text-align:center" | 13 August - 19 August<br />
|<br />
* Add new stems to dictionaries<br />
* Testvoc<br />
* Write documentation<br />
'''Final evaluation'''<br />
| style="text-align:center" | ~?<br />
| style="text-align:center" | ~?%<br />
| style="text-align:center" | ~?%<br />
|}</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Kazakh_and_Sakha&diff=66801
Kazakh and Sakha
2018-04-24T09:55:33Z
<p>Eirien: /* Installation */</p>
<hr />
<div>This is Apertium translation pair for [[Kazakh]] and [[Sakha]]. The pair is currently located in [https://github.com/apertium/apertium-kaz-sah incubator].<br />
<br />
==Installation==<br />
<br />
You will need:<br />
<br />
* [[apertium-kaz]]<br />
* [[apertium-sah]]<br />
* [[apertium-kaz-sah]]<br />
<br />
Installing [[apertium-kaz]]<br />
<pre>$ git clone https://github.com/apertium/apertium-kaz.git<br />
$ cd apertium-kaz<br />
$ ./autogen.sh<br />
$ make</pre><br />
<br />
Installing [[apertium-sah]]<br />
<pre>$ git clone https://github.com/apertium/apertium-sah.git<br />
$ cd apertium-sah<br />
$ ./autogen.sh<br />
$ make</pre><br />
<br />
Installing [[apertium-kaz-sah]]<br />
<pre>$ git clone https://github.com/apertium/apertium-kaz-sah.git<br />
$ cd apertium-kaz-sah<br />
$ ./autogen.sh --with-lang1=../apertium-kaz --with-lang2=../apertium-sah # paths to apertium-kaz and apertium-sah source folders<br />
$ make</pre><br />
<br />
==Development workflow==<br />
<br />
===Transfer rules===<br />
<br />
Example of adding <loc> -> <dat> rule<br />
<br />
In apertium-kaz-sah.kaz-sah.t1x file<br />
<br />
Adding to section <section-def-cats><br />
<pre><def-cat n="nom"><br />
<cat-item tags="n.*"/><br />
</def-cat></pre><br />
<br />
Adding to section <section-def-attrs><br />
<pre><def-attr n="a_cas"><br />
<attr-item tags="nom"/><br />
<attr-item tags="dat"/><br />
<attr-item tags="loc"/><br />
</def-attr></pre><br />
<br />
Adding to section <section-def-macros><br />
<pre><def-macro n="f_default_nom1" npar="1"><br />
<choose><when><test><equal><clip pos="1" side="tl" part="a_cas"/><lit-tag v="loc"/></equal></test><br />
<let><clip pos="1" side="tl" part="a_cas"/><lit-tag v="dat"/></let></when></choose><br />
</def-macro></pre><br />
<br />
Adding to section <section-rules><br />
<pre><rule comment="REGLA: NOUN"><br />
<pattern><br />
<pattern-item n="nom"/><br />
</pattern><br />
<action><br />
<call-macro n="f_default_nom1"><br />
<with-param pos="1"/><br />
</call-macro><br />
<out><br />
<chunk name="n" case="caseFirstWord"><br />
<tags><br />
<tag><lit-tag v="SN"/></tag><br />
</tags><br />
<lu><br />
<clip pos="1" side="tl" part="whole"/><br />
</lu><br />
</chunk><br />
</out><br />
</action><br />
</rule></pre><br />
[[Category:Kazakh and Sakha|*]]<br />
[[Category:Kazakh]]<br />
[[Category:Sakha]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=Kazakh_and_Sakha&diff=66740
Kazakh and Sakha
2018-03-28T20:03:55Z
<p>Eirien: </p>
<hr />
<div>This is Apertium translation pair for [[Kazakh]] and [[Sakha]]. The pair is currently located in [https://github.com/apertium/apertium-kaz-sah incubator].<br />
<br />
==Installation==<br />
<br />
You will need:<br />
<br />
* [[apertium-kaz]]<br />
* [[apertium-sah]]<br />
* [[apertium-kaz-sah]]<br />
<br />
Install [[apertium-kaz]]<br />
<pre>git clone https://github.com/apertium/apertium-kaz.git</pre><br />
<pre>cd apertium-kaz</pre><br />
<pre>./autogen.sh</pre><br />
<pre>make</pre><br />
<br />
Install [[apertium-sah]]<br />
<pre>git clone https://github.com/apertium/apertium-sah.git</pre><br />
<pre>cd apertium-sah</pre><br />
<pre>./autogen.sh</pre><br />
<pre>make</pre><br />
<br />
Install [[apertium-kaz-sah]]<br />
<pre>git clone https://github.com/apertium/apertium-kaz-sah.git</pre><br />
<pre>cd apertium-kaz-sah</pre><br />
<pre>./autogen.sh --with-lang1=../apertium-kaz --with-lang2=../apertium-sah # paths to apertium-kaz and apertium-sah source folders</pre><br />
<pre>make</pre><br />
<br />
==Development workflow==<br />
<br />
===Transfer rules===<br />
<br />
Example of adding <loc> -> <dat> rule<br />
<br />
In apertium-kaz-sah.kaz-sah.t1x file<br />
<br />
Adding to section <section-def-cats><br />
<pre><def-cat n="nom"><br />
<cat-item tags="n.*"/><br />
</def-cat></pre><br />
<br />
Adding to section <section-def-attrs><br />
<pre><def-attr n="a_cas"><br />
<attr-item tags="nom"/><br />
<attr-item tags="dat"/><br />
<attr-item tags="loc"/><br />
</def-attr></pre><br />
<br />
Adding to section <section-def-macros><br />
<pre><def-macro n="f_default_nom1" npar="1"><br />
<choose><when><test><equal><clip pos="1" side="tl" part="a_cas"/><lit-tag v="loc"/></equal></test><br />
<let><clip pos="1" side="tl" part="a_cas"/><lit-tag v="dat"/></let></when></choose><br />
</def-macro></pre><br />
<br />
Adding to section <section-rules><br />
<pre><rule comment="REGLA: NOUN"><br />
<pattern><br />
<pattern-item n="nom"/><br />
</pattern><br />
<action><br />
<call-macro n="f_default_nom1"><br />
<with-param pos="1"/><br />
</call-macro><br />
<out><br />
<chunk name="n" case="caseFirstWord"><br />
<tags><br />
<tag><lit-tag v="SN"/></tag><br />
</tags><br />
<lu><br />
<clip pos="1" side="tl" part="whole"/><br />
</lu><br />
</chunk><br />
</out><br />
</action><br />
</rule></pre><br />
[[Category:Kazakh and Sakha|*]]<br />
[[Category:Kazakh]]<br />
[[Category:Sakha]]</div>
Eirien
https://wiki.apertium.org/w/index.php?title=User:Eirien/Proposal2018&diff=66715
User:Eirien/Proposal2018
2018-03-27T13:01:38Z
<p>Eirien: /* Coding challenge */</p>
<hr />
<div>[[Category: GSoC 2018 student proposals]]<br />
<br />
==Contact Info==<br />
<br />
'''Name:''' Sardana Ivanova<br />
<br />
'''E-mail:''' i.sardana.n@gmail.com<br />
<br />
'''IRC:''' Eirien<br />
<br />
'''Github:''' https://github.com/varie<br />
<br />
'''Location:''' Helsinki, Finland<br />
<br />
'''Timezone:''' UTC/GMT+3<br />
<br />
==Why is it you are interested in machine translation?==<br />
<br />
Since I have degrees both in Philology and Information Technologies, my background naturally lead me to Computational Linguistics and to machine translation in particular. I had experience in Japanese translation and interpretation and in programming so I think that machine translation is a very nice topic where I could combine both skills.<br />
<br />
==Why is it that you are interested in Apertium?==<br />
<br />
I’ve heard about Apertium first time from Roman Yangarber, researcher in University of Helsinki during my exchange semester there. We were interested in adding Sakha language to Revita - language learning system which they are developing. For that purpose we needed Sakha morphological analyser and Roman introduced Apertium to me. Saying “I was surprised” will not fully pass my wonderment that time. Someone is developing Sakha morphological analyser and somewhere so far away from place where Sakha is commonly spoken. We couldn’t find morphological analyser which would fulfill all requirements of Revita [1], so we decided to try to enhance Apertium’s Sakha morphological transducer. And then I thought that adding language pair where one of languages is sakha would be very good contribution to development of Sakha language.<br />
<br />
==Which of the published tasks are you interested in? What do you plan to do?==<br />
<br />
===Title: Apertium translation pair for Kazakh and Sakha===<br />
<br />
I plan to develop Apertium translation pair for Kazakh and Sakha languages, which is currently in a very early development stage.<br />
<br />
===Why Google and Apertium should sponsor it?===<br />
<br />
There are around 450 000 Sakha (Yakut) language speakers in the World. Sakha is considered vulnerable language. Supporting languages so they don’t disappear is very important.<br />
<br />
===How and who it will benefit in society===<br />
<br />
It would benefit society in whole by keeping diversity supporting vulnerable languages and in particular Kazakh and Sakha language speakers. There is no any machine translation system which translates to/from Sakha as far as I know. Creating language pair where one language is Sakha would greatly support Sakha language leading to further development of Sakha language machine translation.<br />
<br />
===Workplan===<br />
<br />
'''Post Application Period'''<br />
Finish coding challenge<br />
Get to know Apertium better: reading documentation and experimenting<br />
<br />
'''Community Bonding Period'''<br />
Discuss strategies and details with mentor<br />
Improve knowledge of Kazakh<br />
<br />
'''Week 1'''<br />
<br />
Add nouns and adjectives to bilingual dictionary<br />
<br />
'''Week 2'''<br />
<br />
Write transfer rules for nouns and adjectives<br />
<br />
'''Week 3'''<br />
<br />
Add verbs and other parts of speech to bilingual dictionary<br />
<br />
Write transfer rules for verbs<br />
<br />
'''Week 4'''<br />
<br />
Run tests<br />
<br />
Update documentation<br />
<br />
Prepare for the first evaluation<br />
<br />
'''Deliverable 1: Bilingual dictionary, basic transfer rules'''<br />
<br />
'''Week 5'''<br />
<br />
Even up nouns and adjectives<br />
<br />
'''Week 6'''<br />
<br />
Even up verbs and other parts of speech<br />
<br />
'''Week 7'''<br />
<br />
Extend bilingual dictionary<br />
<br />
'''Week 8'''<br />
<br />
Run tests<br />
<br />
Update documentation<br />
<br />
Prepare for the second evaluation<br />
<br />
'''Deliverable 2: Improved bilingual dictionary, transfer rules'''<br />
<br />
'''Week 9'''<br />
<br />
'''Week 10'''<br />
<br />
Extend bilingual dictionary<br />
Add multiwords<br />
<br />
Work on transfer rules<br />
<br />
'''Week 11'''<br />
<br />
Run final tests<br />
<br />
Fix issues<br />
<br />
'''Week 12'''<br />
<br />
Brush up the project and documentation<br />
<br />
Prepare for final evaluation<br />
<br />
==Skills==<br />
<br />
I have Master’s degree in Philology: Japanese language and literature and Master’s degree in Fundamental Informatics and Information Technologies. Currently I am a first year Computer Science PhD student in Saint Petersburg State University, Saint Petersburg, Russia. Now I am a visiting researcher in Department of Computer Science in University of Helsinki.<br />
<br />
Besides I have one year C++ working experience in game development company.<br />
<br />
'''Languages:''' Sakha (native), Russian (native), English (advanced), Japanese (intermediate), Finnish (elementary)<br />
<br />
'''Programming skills:''' C++, Python, C, C#, Java<br />
<br />
Apertium is my first shy step into open-source.<br />
<br />
==Non-Summer-of-Code plans for the Summer==<br />
<br />
Maybe a week holiday for Ozora music festival (30 July - 5 August) during which I will be unable to spend more than 10-15 hours for the project. For the rest of summer I will be able to dedicate at least 30 hours a week to the project.<br />
<br />
==Coding challenge==<br />
<br />
https://github.com/apertium/apertium-kaz-sah - coding challenge started for last year GSoC, where I added words to dictionary, but I didn’t manage to upload proposal on time last year, so I would like to try this year :D. It is still in progress.<br />
<br />
==References==<br />
<br />
1. https://revita.cs.helsinki.fi/</div>
Eirien
https://wiki.apertium.org/w/index.php?title=User:Eirien/Proposal2018&diff=66708
User:Eirien/Proposal2018
2018-03-27T11:45:34Z
<p>Eirien: </p>
<hr />
<div>[[Category: GSoC 2018 student proposals]]<br />
<br />
==Contact Info==<br />
<br />
'''Name:''' Sardana Ivanova<br />
<br />
'''E-mail:''' i.sardana.n@gmail.com<br />
<br />
'''IRC:''' Eirien<br />
<br />
'''Github:''' https://github.com/varie<br />
<br />
'''Location:''' Helsinki, Finland<br />
<br />
'''Timezone:''' UTC/GMT+3<br />
<br />
==Why is it you are interested in machine translation?==<br />
<br />
Since I have degrees both in Philology and Information Technologies, my background naturally lead me to Computational Linguistics and to machine translation in particular. I had experience in Japanese translation and interpretation and in programming so I think that machine translation is a very nice topic where I could combine both skills.<br />
<br />
==Why is it that you are interested in Apertium?==<br />
<br />
I’ve heard about Apertium first time from Roman Yangarber, researcher in University of Helsinki during my exchange semester there. We were interested in adding Sakha language to Revita - language learning system which they are developing. For that purpose we needed Sakha morphological analyser and Roman introduced Apertium to me. Saying “I was surprised” will not fully pass my wonderment that time. Someone is developing Sakha morphological analyser and somewhere so far away from place where Sakha is commonly spoken. We couldn’t find morphological analyser which would fulfill all requirements of Revita [1], so we decided to try to enhance Apertium’s Sakha morphological transducer. And then I thought that adding language pair where one of languages is sakha would be very good contribution to development of Sakha language.<br />
<br />
==Which of the published tasks are you interested in? What do you plan to do?==<br />
<br />
===Title: Apertium translation pair for Kazakh and Sakha===<br />
<br />
I plan to develop Apertium translation pair for Kazakh and Sakha languages, which is currently in a very early development stage.<br />
<br />
===Why Google and Apertium should sponsor it?===<br />
<br />
There are around 450 000 Sakha (Yakut) language speakers in the World. Sakha is considered vulnerable language. Supporting languages so they don’t disappear is very important.<br />
<br />
===How and who it will benefit in society===<br />
<br />
It would benefit society in whole by keeping diversity supporting vulnerable languages and in particular Kazakh and Sakha language speakers. There is no any machine translation system which translates to/from Sakha as far as I know. Creating language pair where one language is Sakha would greatly support Sakha language leading to further development of Sakha language machine translation.<br />
<br />
===Workplan===<br />
<br />
'''Post Application Period'''<br />
Finish coding challenge<br />
Get to know Apertium better: reading documentation and experimenting<br />
<br />
'''Community Bonding Period'''<br />
Discuss strategies and details with mentor<br />
Improve knowledge of Kazakh<br />
<br />
'''Week 1'''<br />
<br />
Add nouns and adjectives to bilingual dictionary<br />
<br />
'''Week 2'''<br />
<br />
Write transfer rules for nouns and adjectives<br />
<br />
'''Week 3'''<br />
<br />
Add verbs and other parts of speech to bilingual dictionary<br />
<br />
Write transfer rules for verbs<br />
<br />
'''Week 4'''<br />
<br />
Run tests<br />
<br />
Update documentation<br />
<br />
Prepare for the first evaluation<br />
<br />
'''Deliverable 1: Bilingual dictionary, basic transfer rules'''<br />
<br />
'''Week 5'''<br />
<br />
Even up nouns and adjectives<br />
<br />
'''Week 6'''<br />
<br />
Even up verbs and other parts of speech<br />
<br />
'''Week 7'''<br />
<br />
Extend bilingual dictionary<br />
<br />
'''Week 8'''<br />
<br />
Run tests<br />
<br />
Update documentation<br />
<br />
Prepare for the second evaluation<br />
<br />
'''Deliverable 2: Improved bilingual dictionary, transfer rules'''<br />
<br />
'''Week 9'''<br />
<br />
'''Week 10'''<br />
<br />
Extend bilingual dictionary<br />
Add multiwords<br />
<br />
Work on transfer rules<br />
<br />
'''Week 11'''<br />
<br />
Run final tests<br />
<br />
Fix issues<br />
<br />
'''Week 12'''<br />
<br />
Brush up the project and documentation<br />
<br />
Prepare for final evaluation<br />
<br />
==Skills==<br />
<br />
I have Master’s degree in Philology: Japanese language and literature and Master’s degree in Fundamental Informatics and Information Technologies. Currently I am a first year Computer Science PhD student in Saint Petersburg State University, Saint Petersburg, Russia. Now I am a visiting researcher in Department of Computer Science in University of Helsinki.<br />
<br />
Besides I have one year C++ working experience in game development company.<br />
<br />
'''Languages:''' Sakha (native), Russian (native), English (advanced), Japanese (intermediate), Finnish (elementary)<br />
<br />
'''Programming skills:''' C++, Python, C, C#, Java<br />
<br />
Apertium is my first shy step into open-source.<br />
<br />
==Non-Summer-of-Code plans for the Summer==<br />
<br />
Maybe a week holiday for Ozora music festival (30 July - 5 August) during which I will be unable to spend more than 10-15 hours for the project. For the rest of summer I will be able to dedicate at least 30 hours a week to the project.<br />
<br />
==Coding challenge==<br />
<br />
https://github.com/apertium/apertium-kaz-sah - coding challenge started for last year GSoC, where I added words to dictionary, but I didn’t manage to upload proposal on time last year, so I would like to try this year :D. I will update it as soon as I figure out how to write transfer rules.<br />
<br />
==References==<br />
<br />
1. https://revita.cs.helsinki.fi/</div>
Eirien