Difference between revisions of "Tartu Apertium Course"
Jump to navigation
Jump to search
(4 intermediate revisions by the same user not shown) | |||
Line 74: | Line 74: | ||
* [[/Session 1|Session 1]] |
* [[/Session 1|Session 1]] |
||
* [[/Session 2|Session 2]] |
* [[/Session 2|Session 2]] |
||
* [[/Session 3|Session 3]] |
|||
* [[/Session 4|Session 4]] |
|||
* [[/Session 5|Session 5]] |
|||
* [[/Session 6|Session 6]] |
|||
* [[/Session 7|Session 7]] |
|||
==Hacks== |
|||
;Multichar symbols: are yours defined ? |
|||
<pre> |
|||
cat apertium-est.est.lexc | grep -o '%<[^>]\+>' | sort -u | tr -d '%' > /tmp/lexc-tags |
|||
hfst-summarise -S est.automorf.hfst | grep -o '<[^>]\+>' | sort -u > /tmp/hfst-tags |
|||
comm /tmp/hfst-tags /tmp/lexc-tags |
|||
</pre> |
|||
==External links== |
==External links== |
||
Line 80: | Line 93: | ||
* [https://courses.cs.ut.ee/2015/rbmt/fall Tartu Ülikool: Rule-based machine translation] |
* [https://courses.cs.ut.ee/2015/rbmt/fall Tartu Ülikool: Rule-based machine translation] |
||
* [http://oahpa.no/sme/gramm/grammatihkka.eng.html Grammar sketch of North Sámi] |
* [http://oahpa.no/sme/gramm/grammatihkka.eng.html Grammar sketch of North Sámi] |
||
* [https://svn.code.sf.net/p/apertium/svn/branches/courses/2015-tartu/slides Slides] |
|||
* [https://svn.code.sf.net/p/apertium/svn/branches/courses/2015-tartu/practicals/spectre360 Fran's code] (look at tags and word forms) |
|||
[[Category:Tartu Apertium Course|*]] |
[[Category:Tartu Apertium Course|*]] |
Latest revision as of 19:16, 13 November 2015
Tartu Apertium Course
Tartu, Estonia
2nd October — 13th October
Programme[edit]
- Main: Detailed programme
Week 1 | ||
---|---|---|
Day | Time | Material |
Monday (2nd October) |
10:00 — 12:00 | Introduction |
14:00 — 18:00 | Installation and setup | |
Tuesday (3rd October) |
10:00 — 12:00 | Morphological dictionaries |
16:00 — 18:00 | Morphological dictionaries | |
Thursday (5th October) |
10:00 — 12:00 | Morphological disambiguation |
14:00 — 18:00 | Morphological disambiguation | |
Friday (6th October) |
10:00 — 12:00 | Lexical transfer |
14:00 — 18:00 | Lexical transfer | |
Week 2 | ||
Day | Time | Material |
Monday (9th October) |
10:00 — 12:00 | Lexical selection |
14:00 — 18:00 | Lexical selection | |
Tuesday (10th October) |
10:00 — 12:00 | Structural transfer |
16:00 — 18:00 | Structural transfer | |
Thursday (11th October) |
10:00 — 12:00 | Structural transfer |
14:00 — 18:00 | Data consistency and quality | |
Friday (12th October) |
10:00 — 12:00 | Data consistency and quality |
14:00 — 18:00 | Project planning, question and answer |
Participants[edit]
Papers[edit]
You can use this as a template for your course report:
Sessions[edit]
Hacks[edit]
- Multichar symbols
- are yours defined ?
cat apertium-est.est.lexc | grep -o '%<[^>]\+>' | sort -u | tr -d '%' > /tmp/lexc-tags hfst-summarise -S est.automorf.hfst | grep -o '<[^>]\+>' | sort -u > /tmp/hfst-tags comm /tmp/hfst-tags /tmp/lexc-tags
External links[edit]
- Tartu Ülikool: Rule-based machine translation
- Grammar sketch of North Sámi
- Slides
- Fran's code (look at tags and word forms)