Apertium has moved from SourceForge to GitHub.
If you have any questions, please come and talk to us on
If you have any questions, please come and talk to us on
#apertium
on irc.freenode.net
or contact the GitHub migration team.Uighur and Turkish/Paper
From Apertium
< Uighur and Turkish(Difference between revisions)
(→Non-english Research) |
(→Non-english Research) |
||
Line 33: | Line 33: | ||
* [https://www.jstage.jst.go.jp/article/jnlp1994/8/3/8_3_123/_article/-char/ja/ Conversion process of case particles for Japanese-Uighur machine translation] - There is an english version of this above |
* [https://www.jstage.jst.go.jp/article/jnlp1994/8/3/8_3_123/_article/-char/ja/ Conversion process of case particles for Japanese-Uighur machine translation] - There is an english version of this above |
||
'''Turkish''': |
'''Turkish''': |
||
− | * [https://polen.itu.edu.tr/xmlui/bitstream/handle/11527/527/10459.pdf?sequence=1&isAllowed=y Uygurcadan Türkçeye bilgisayarlı çeviri.] [https://scholar.googleusercontent.com/scholar.bib?q=info:zmL5BQqzH04J:scholar.google.com/&output=citation&scisig=AAGBfm0AAAAAWzsR0fI2a0hOeD13Uc1ofWmW8gYQMlf6&scisf=4&ct=citation&cd=-1&hl=tr BibTex] - We should probably find and cite this, and also look into the other stuff Orhun/Tantuğ/Adalı have been up to. |
||
− | * [https://polen.itu.edu.tr/handle/11527/518 Akraba Ve Bitişken Diller Arasında Bilgisayarlı Çeviri İçin Karma Bir Model] [https://scholar.googleusercontent.com/scholar.bib?q=info:5rSIKuO_cR4J:scholar.google.com/&output=citation&scisig=AAGBfm0AAAAAWzsVAXD0j27q7UXvnKrisWh5Eft5EKd8&scisf=4&ct=citation&cd=-1&hl=tr BibTeX] This also has Turkmen-Turkish as a prototype, lacks lexsel. The prose seems to sacrifice intelligibility for the sake of using words with Turkic roots. <s>Also has a corpus of around a million words that we might be able to get our hands on.</s> Seems to be a turkish corpus |
||
* [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.614.5731&rep=rep1&type=pdf Rule Based Analysis of the Uyghur Nouns] [https://scholar.googleusercontent.com/scholar.bib?q=info:IPwR6tYl7u8J:scholar.google.com/&output=citation&scisig=AAGBfm0AAAAAW0Wr46dRPOwCNffhZ3f09Pmh96Cv0DwQ&scisf=4&ct=citation&cd=-1&hl=tr BibTex] |
* [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.614.5731&rep=rep1&type=pdf Rule Based Analysis of the Uyghur Nouns] [https://scholar.googleusercontent.com/scholar.bib?q=info:IPwR6tYl7u8J:scholar.google.com/&output=citation&scisig=AAGBfm0AAAAAW0Wr46dRPOwCNffhZ3f09Pmh96Cv0DwQ&scisf=4&ct=citation&cd=-1&hl=tr BibTex] |
||
* [https://tez.yok.gov.tr/UlusalTezMerkezi YÖK] Thesis #290286: Description of tatar morphology and a Tatar-Turkish machine translation system / Tatarcanın morfolojisinin tanımlanması ve Tatarca-Türkçe makine çeviri sistemi - ERCAN GÖKGÖZ |
* [https://tez.yok.gov.tr/UlusalTezMerkezi YÖK] Thesis #290286: Description of tatar morphology and a Tatar-Turkish machine translation system / Tatarcanın morfolojisinin tanımlanması ve Tatarca-Türkçe makine çeviri sistemi - ERCAN GÖKGÖZ |
||
* [https://tez.yok.gov.tr/UlusalTezMerkezi YÖK] Thesis #343000: Text conversion system between Turkic dialects / Türk lehçeleri arasında çeviri sistemi - EMEL ALKIM |
* [https://tez.yok.gov.tr/UlusalTezMerkezi YÖK] Thesis #343000: Text conversion system between Turkic dialects / Türk lehçeleri arasında çeviri sistemi - EMEL ALKIM |
||
+ | * [https://tez.yok.gov.tr/UlusalTezMerkezi YÖK] Thesis #318598: Two level Uyghur morphology and Uyghur Turkish machine translation / İki düzeyli Uygur morfolojisi ve Uygur Türkçe makine çevirisi - RÜMEYSA KESKİN |
||
+ | * [https://tez.yok.gov.tr/UlusalTezMerkezi YÖK] Thesis #293849: Uygurcadan Türkçeye bilgisayarlı çeviri - Murat Orhun |
||
+ | * [https://tez.yok.gov.tr/UlusalTezMerkezi YÖK] Thesis #223702: Akraba Ve Bitişken Diller Arasında Bilgisayarlı Çeviri İçin Karma Bir Model - A. Cüneyd Tantuğ |
||
==== Other ==== |
==== Other ==== |
Revision as of 12:09, 13 August 2018
Info related to paper on uig-tur RBMT.
Contents |
TO DO
- Fill out latex template
- Evaluate system on corpora
To write
- Intro
- Evals
Images and Diagrams
Related Work
SEARCH ON CNKI + GOOGLE TRANSLATE
Uyghur-specific
Chinese-Uyghur
- Chinese-Uyghur machine translation system for phrase-based statistical translation BiBTeX
- Phrase-based Chinese-Uyghur/Uyghur-Chinese Statistical Machine Translation BibTeX
- Chinese-uyghur statistical machine translation: The initial explorations - having trouble finding this but it would be good to read BiBTeX
- DESIGNING RELATED TRANSFORMATION AND MATCHING RULES FOR CHINESE-UYGHUR MACHINE TRANSLATION BiBTeX
- Memory-augmented Chinese-Uyghur Neural Machine Translation - This looks interesting, 200K sentences of bilingual data collected, we should contact the authors to see if we can access it BiBTeX
- XMU Neural Machine Translation Online Service - Also has uyghur, web interface here, but unclear wrt details of data/evals BiBTeX
Japanese-Uyghur
- An experiment on Japanese-Uighur machine translation with MOSES
- A Rule Based Approach for Japanese-Uighur Machine Translation System
- Utilizing Agglutinative Features in Japanese-Uighur Machine Translation
- An Experiment on Japanese-Uighur Machine Translation and Its Evaluation
- Translation of Case Suffixes on Japanese-Uighur Machine Translation Inaccessible?
Non-english Research
Japanese: Muhtar Muhsut seems to have worked a lot on japanese-uighur translation in japanese universities.
- A PARAMETERIZED APPROACH TO PROCESSING AUXILIARY VERBS IN JAPANESE- UIGHUR MACHINE TRANSLATION BibTeX
- Paraphrasing Japanese Words to Expand a Japanese - Uighur Bilingual Dictionary
- Conversion process of case particles for Japanese-Uighur machine translation - There is an english version of this above
Turkish:
- Rule Based Analysis of the Uyghur Nouns BibTex
- YÖK Thesis #290286: Description of tatar morphology and a Tatar-Turkish machine translation system / Tatarcanın morfolojisinin tanımlanması ve Tatarca-Türkçe makine çeviri sistemi - ERCAN GÖKGÖZ
- YÖK Thesis #343000: Text conversion system between Turkic dialects / Türk lehçeleri arasında çeviri sistemi - EMEL ALKIM
- YÖK Thesis #318598: Two level Uyghur morphology and Uyghur Turkish machine translation / İki düzeyli Uygur morfolojisi ve Uygur Türkçe makine çevirisi - RÜMEYSA KESKİN
- YÖK Thesis #293849: Uygurcadan Türkçeye bilgisayarlı çeviri - Murat Orhun
- YÖK Thesis #223702: Akraba Ve Bitişken Diller Arasında Bilgisayarlı Çeviri İçin Karma Bir Model - A. Cüneyd Tantuğ
Other
Turkic
- A free/open-source Kazakh-Tatar machine translation system
- Machine translation between turkic languages
- ADD ALTINTAS CRH STUFF
- Hamzaoğlu azeri
- A mt system from turkmen to turkish employing finite state and statistical methods BibTeX
Fun
Tangentially Related
- Uyghur-Chinese Translation Disambiguation Method Research Based on Knowledge Automatic-Acquisition
- Uyghur Language Model with Graphic Structure
- Construction of Uyghur named entity corpus
- Conceptual Design of Uyghur FrameNet
Evaluations
Since there seem to have been a number of prototypes, mostly for Chinese-Uyghur, it might make sense to compare these values with any evaluation those systems may have, and also the Turkic Apertium pairs crh-tur, kaz-tur, kaz-tat and gag-tur.
- Adalı &co seem to have worked a good bit on Uyghur analysis and disambiguation, so that would also be a good thing to compare.
WER
BLEU
Various Potential Corpora
- ELRA Uyghur Text corpus - they're not too eager to provide this
- The development of tagged Uyghur corpus
- Chinese-Uyghur Parallel Corpus Construction and its Application