Difference between revisions of "Tartu Apertium Course"
Jump to navigation
Jump to search
| (8 intermediate revisions by the same user not shown) | |||
| Line 1: | Line 1: | ||
{{TOCD}} |
|||
<center> |
<center> |
||
<span style="font-size: large">Tartu Apertium Course</span> |
<span style="font-size: large">Tartu Apertium Course</span> |
||
| Line 60: | Line 61: | ||
==Participants== |
==Participants== |
||
==Papers== |
|||
You can use this as a template for your course report: |
|||
* [http://www.mt-archive.info/10/MTS-2013-Salimzyanov.pdf A free/open-source Kazakh-Tatar machine translation system] |
|||
==Sessions== |
|||
* [[/Session 0|Session 0]] |
|||
* [[/Session 1|Session 1]] |
|||
* [[/Session 2|Session 2]] |
|||
* [[/Session 3|Session 3]] |
|||
* [[/Session 4|Session 4]] |
|||
* [[/Session 5|Session 5]] |
|||
* [[/Session 6|Session 6]] |
|||
* [[/Session 7|Session 7]] |
|||
==Hacks== |
|||
;Multichar symbols: are yours defined ? |
|||
<pre> |
|||
cat apertium-est.est.lexc | grep -o '%<[^>]\+>' | sort -u | tr -d '%' > /tmp/lexc-tags |
|||
hfst-summarise -S est.automorf.hfst | grep -o '<[^>]\+>' | sort -u > /tmp/hfst-tags |
|||
comm /tmp/hfst-tags /tmp/lexc-tags |
|||
</pre> |
|||
==External links== |
==External links== |
||
* [https://courses.cs.ut.ee/2015/rbmt/fall Tartu Ülikool: Rule-based machine translation] |
* [https://courses.cs.ut.ee/2015/rbmt/fall Tartu Ülikool: Rule-based machine translation] |
||
* [http://oahpa.no/sme/gramm/grammatihkka.eng.html Grammar sketch of North Sámi] |
|||
* [https://svn.code.sf.net/p/apertium/svn/branches/courses/2015-tartu/slides Slides] |
|||
* [https://svn.code.sf.net/p/apertium/svn/branches/courses/2015-tartu/practicals/spectre360 Fran's code] (look at tags and word forms) |
|||
[[Category: |
[[Category:Tartu Apertium Course|*]] |
||
Latest revision as of 19:16, 13 November 2015
Tartu Apertium Course
Tartu, Estonia
2nd October — 13th October
Programme[edit]
- Main: Detailed programme
| Week 1 | ||
|---|---|---|
| Day | Time | Material |
| Monday (2nd October) |
10:00 — 12:00 | Introduction |
| 14:00 — 18:00 | Installation and setup | |
| Tuesday (3rd October) |
10:00 — 12:00 | Morphological dictionaries |
| 16:00 — 18:00 | Morphological dictionaries | |
| Thursday (5th October) |
10:00 — 12:00 | Morphological disambiguation |
| 14:00 — 18:00 | Morphological disambiguation | |
| Friday (6th October) |
10:00 — 12:00 | Lexical transfer |
| 14:00 — 18:00 | Lexical transfer | |
| Week 2 | ||
| Day | Time | Material |
| Monday (9th October) |
10:00 — 12:00 | Lexical selection |
| 14:00 — 18:00 | Lexical selection | |
| Tuesday (10th October) |
10:00 — 12:00 | Structural transfer |
| 16:00 — 18:00 | Structural transfer | |
| Thursday (11th October) |
10:00 — 12:00 | Structural transfer |
| 14:00 — 18:00 | Data consistency and quality | |
| Friday (12th October) |
10:00 — 12:00 | Data consistency and quality |
| 14:00 — 18:00 | Project planning, question and answer | |
Participants[edit]
Papers[edit]
You can use this as a template for your course report:
Sessions[edit]
Hacks[edit]
- Multichar symbols
- are yours defined ?
cat apertium-est.est.lexc | grep -o '%<[^>]\+>' | sort -u | tr -d '%' > /tmp/lexc-tags hfst-summarise -S est.automorf.hfst | grep -o '<[^>]\+>' | sort -u > /tmp/hfst-tags comm /tmp/hfst-tags /tmp/lexc-tags
External links[edit]
- Tartu Ülikool: Rule-based machine translation
- Grammar sketch of North Sámi
- Slides
- Fran's code (look at tags and word forms)