Difference between revisions of "Tartu Apertium Course"
Jump to navigation
Jump to search
(8 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
{{TOCD}} |
|||
<center> |
<center> |
||
<span style="font-size: large">Tartu Apertium Course</span> |
<span style="font-size: large">Tartu Apertium Course</span> |
||
Line 60: | Line 61: | ||
==Participants== |
==Participants== |
||
==Papers== |
|||
You can use this as a template for your course report: |
|||
* [http://www.mt-archive.info/10/MTS-2013-Salimzyanov.pdf A free/open-source Kazakh-Tatar machine translation system] |
|||
==Sessions== |
|||
* [[/Session 0|Session 0]] |
|||
* [[/Session 1|Session 1]] |
|||
* [[/Session 2|Session 2]] |
|||
* [[/Session 3|Session 3]] |
|||
* [[/Session 4|Session 4]] |
|||
* [[/Session 5|Session 5]] |
|||
* [[/Session 6|Session 6]] |
|||
* [[/Session 7|Session 7]] |
|||
==Hacks== |
|||
;Multichar symbols: are yours defined ? |
|||
<pre> |
|||
cat apertium-est.est.lexc | grep -o '%<[^>]\+>' | sort -u | tr -d '%' > /tmp/lexc-tags |
|||
hfst-summarise -S est.automorf.hfst | grep -o '<[^>]\+>' | sort -u > /tmp/hfst-tags |
|||
comm /tmp/hfst-tags /tmp/lexc-tags |
|||
</pre> |
|||
==External links== |
==External links== |
||
* [https://courses.cs.ut.ee/2015/rbmt/fall Tartu Ülikool: Rule-based machine translation] |
* [https://courses.cs.ut.ee/2015/rbmt/fall Tartu Ülikool: Rule-based machine translation] |
||
* [http://oahpa.no/sme/gramm/grammatihkka.eng.html Grammar sketch of North Sámi] |
|||
* [https://svn.code.sf.net/p/apertium/svn/branches/courses/2015-tartu/slides Slides] |
|||
* [https://svn.code.sf.net/p/apertium/svn/branches/courses/2015-tartu/practicals/spectre360 Fran's code] (look at tags and word forms) |
|||
[[Category: |
[[Category:Tartu Apertium Course|*]] |
Latest revision as of 19:16, 13 November 2015
Tartu Apertium Course
Tartu, Estonia
2nd October — 13th October
Programme[edit]
- Main: Detailed programme
Week 1 | ||
---|---|---|
Day | Time | Material |
Monday (2nd October) |
10:00 — 12:00 | Introduction |
14:00 — 18:00 | Installation and setup | |
Tuesday (3rd October) |
10:00 — 12:00 | Morphological dictionaries |
16:00 — 18:00 | Morphological dictionaries | |
Thursday (5th October) |
10:00 — 12:00 | Morphological disambiguation |
14:00 — 18:00 | Morphological disambiguation | |
Friday (6th October) |
10:00 — 12:00 | Lexical transfer |
14:00 — 18:00 | Lexical transfer | |
Week 2 | ||
Day | Time | Material |
Monday (9th October) |
10:00 — 12:00 | Lexical selection |
14:00 — 18:00 | Lexical selection | |
Tuesday (10th October) |
10:00 — 12:00 | Structural transfer |
16:00 — 18:00 | Structural transfer | |
Thursday (11th October) |
10:00 — 12:00 | Structural transfer |
14:00 — 18:00 | Data consistency and quality | |
Friday (12th October) |
10:00 — 12:00 | Data consistency and quality |
14:00 — 18:00 | Project planning, question and answer |
Participants[edit]
Papers[edit]
You can use this as a template for your course report:
Sessions[edit]
Hacks[edit]
- Multichar symbols
- are yours defined ?
cat apertium-est.est.lexc | grep -o '%<[^>]\+>' | sort -u | tr -d '%' > /tmp/lexc-tags hfst-summarise -S est.automorf.hfst | grep -o '<[^>]\+>' | sort -u > /tmp/hfst-tags comm /tmp/hfst-tags /tmp/lexc-tags
External links[edit]
- Tartu Ülikool: Rule-based machine translation
- Grammar sketch of North Sámi
- Slides
- Fran's code (look at tags and word forms)