Difference between revisions of "Main Page"

From Apertium
Jump to navigation Jump to search
(19 intermediate revisions by 7 users not shown)
Line 1: Line 1:
  +
{{Main page header}}
== Contact Information ==
 
  +
<div style="font-size: 95%">
'''Name:''' Mambetniyazov Amirniyaz
 
  +
''[[Apertium:About|Apertium]] is a free/open-source platform for developing rule-based machine translation systems!''
  +
{|style="width:100%; background:#fbfbfb; margin-top:1.2em; border:1px solid #ccc;"
  +
|style="width:50%; vertical-align: top; padding-left:20px; padding-right:20px;"|
   
  +
To see the whole list of general documentation pages written in English, see [[:Category:Documentation in English|documentation in English]].
'''Location:''' Tashkent
 
   
  +
Pour ceux qui sont plus à l'aise avec la langue française, une partie des pages anglaises a été traduite. Voir [[:Category:Documentation en français|documentation en français]].
'''University:''' Tashkent University of Informaton named after Al-Khwarizmi
 
   
  +
==Translators==
'''Email address:''' mr.amirniyaz3010@gmail.com
 
   
  +
The following 51 pairs have released versions and are considered to be stable:
'''IRC:''' Amirniyaz
 
  +
{|style="border: 0px solid #fbfbfb"
  +
|style="width:11%; border: 0px solid #fbfbfb; font-size:95%; background:#fbfbfb; vertical-align: top;"|
  +
*Spanish ⇆ Catalan ([http://www.apertium.org/?id=translatetext&dir=spa-cat spa-cat])
  +
*Spanish ← Romanian ([http://www.apertium.org/?id=translatetext&dir=ro-es es-ro])
  +
*French ⇆ Catalan ([http://www.apertium.org/?id=translatetext&dir=fra-cat fra-cat])
  +
*Occitan ⇆ Catalan ([http://www.apertium.org/?id=translatetext&dir=oc-ca oc-ca])
  +
*English ⇆ Galician ([http://www.apertium.org/?id=translatetext&dir=en-gl en-gl])
  +
*Swedish ⇆ Danish ([http://www.apertium.org/?id=translatetext&dir=swe-dan swe-dan])
  +
*Macedonian → English ([http://www.apertium.org/?id=translatetext&dir=mkd-eng mkd-eng])
  +
*Afrikaans ⇆ Dutch ([http://www.apertium.org/?id=translatetext&dir=afr-nld afr-nld])
  +
*Indonesian ⇆ Malaysian ([http://www.apertium.org/?id=translatetext&dir=id-ms id-ms])
  +
*Icelandic ⇆ Swedish ([http://www.apertium.org/?id=translatetext&dir=isl-swe isl-swe])
  +
*Occitan ⇆ Spanish ([http://www.apertium.org/?id=translatetext&dir=oc-es oc-es])
  +
*Spanish ⇆ Portuguese ([http://www.apertium.org/?id=translatetext&dir=es-pt es-pt])
  +
*English ⇆ Catalan ([http://www.apertium.org/?id=translatetext&dir=eng-cat eng-cat])
  +
*English ⇆ Spanish ([http://www.apertium.org/?id=translatetext&dir=eng-spa eng-spa])
  +
*Sardinian ← Italian ([http://www.apertium.org/?id=translatetext&dir=ita-srd ita-srd])
  +
*Sardinian ← Catalan ([http://www.apertium.org/?id=translatetext&dir=ita-srd cat-srd])
  +
*French → Arpitan ([http://www.apertium.org/?id=translatetext&dir=fra-frp fra-frp])
  +
|style="width:11%; border: 0px solid #fbfbfb; font-size:95%; background:#fbfbfb; vertical-align: top;"|
  +
*English ⇆ Esperanto ([http://apertium.saluton.dk/?id=translatetext&dir=en-eo en-eo])
  +
*Spanish → Asturian ([http://www.apertium.org/?id=translatetext&dir=spa-ast spa-ast])
  +
*Catalan ⇆ Italian ([http://www.apertium.org/?id=translatetext&dir=ita-cat cat-ita])
  +
*Maltese → Arabic ([http://www.apertium.org/?id=translatetext&dir=mlt-ara mlt-ara])
  +
*Serbo-Croatian ⇆ Slovenian ([http://www.apertium.org/?id=translatetext&dir=hbs-slv hbs-slv])
  +
*Danish ⇆ Norwegian ([http://www.apertium.org/?id=translatetext&dir=dan-nor dan-nor])
  +
*Spanish ⇆ Galician ([http://www.apertium.org/?id=translatetext&dir=es-gl es-gl])
  +
*French ⇆ Spanish ([http://www.apertium.org/?id=translatetext&dir=fr-es fr-es])
  +
*French → Occitan ([http://www.apertium.org/?id=translatetext&dir=fra-oci fra-oci])
  +
*Esperanto ← Spanish ([http://www.apertium.org/?id=translatetext&lang=eo&dir=es-eo eo-es])
  +
*Welsh → English ([http://www.apertium.org/?id=translatetext&dir=cym-eng cym-eng])
  +
*Breton → French ([http://www.apertium.org/?id=translatetext&dir=br-fr br-fr])
  +
*Icelandic → English ([http://www.apertium.org/?id=translatetext&dir=isl-eng isl-eng])
  +
*Esperanto ← Catalan ([http://www.apertium.org/?id=translatetext&lang=eo&dir=ca-eo eo-ca])
  +
*North Sámi → Norwegian ([http://www.apertium.org/?id=translatetext&dir=sme-nob sme-nob])
  +
*Crimean Tatar → Turkish ([http://www.apertium.org/?id=translatetext&dir=crh-tur crh-tur])
  +
*Romanian ⇆ Catalan ([http://www.apertium.org/?id=translatetext&dir=ron-cat ron-cat])
  +
|style="width:11%; border: 0px solid #fbfbfb; font-size:95%; background:#fbfbfb; vertical-align: top;"|
  +
*Serbo-Croatian → Macedonian ([http://www.apertium.org/?id=translatetext&dir=hbs-mkd hbs-mkd])
  +
*Serbo-Croatian → English ([http://www.apertium.org/?id=translatetext&dir=hbs-eng hbs-eng])
  +
*Portuguese ⇆ Catalan ([http://www.apertium.org/?id=translatetext&dir=por-cat por-cat])
  +
*Portuguese ⇆ Galician ([http://www.apertium.org/?id=translatetext&dir=pt-gl pt-gl])
  +
*Basque → Spanish ([http://www.apertium.org/?id=translatetext&dir=eu-es eu-es])
  +
*Norwegian Nynorsk ⇆ Bokmål ([http://www.apertium.org/?id=translatetext&dir=nno-nob nno-nob])
  +
*Macedonian ⇆ Bulgarian ([http://www.apertium.org/?id=translatetext&dir=mkd-bul mkd-bul])
  +
*Esperanto ← French ([http://apertium.saluton.dk/?id=translatetext&lang=eo&dir=fr-eo eo-fr])
  +
*Basque → English ([http://www.apertium.org/?id=translatetext&dir=eu-en eu-en])
  +
*Spanish ⇆ Aragonese ([http://www.apertium.org/?id=translatetext&dir=spa-arg spa-arg])
  +
*Kazakh ⇆ Tatar ([http://www.apertium.org/?id=translatetext&dir=kaz-tat kaz-tat])
  +
*Urdu ⇆ Hindi ([http://www.apertium.org/?id=translatetext&dir=urd-hin urd-hin])
  +
*Aragonese ⇆ Catalan ([http://www.apertium.org/?id=translatetext&dir=arg-cat arg-cat])
  +
*Swedish ⇆ Norwegian ([http://www.apertium.org/?id=translatetext&dir=swe-nno swe-nor])
  +
*Belarusian ⇆ Russian ([http://www.apertium.org/?id=translatetext&dir=bel-rus bel-rus])
  +
*Russian ⇆ Ukrainian ([http://www.apertium.org/?id=translatetext&dir=rus-ukr rus-ukr])
  +
*Polish → Silesian ([http://www.apertium.org/?id=translatetext&dir=pol-szl pol-szl])
  +
|-
  +
|}
   
  +
Other pairs currently in development may be found in our [[Using_Git|GitHub repositories]]. You can [[test drive]] release versions [http://apertium.org here].<!--, and all versions (including versions for unreleased language pairs) [http://xixona.dlsi.ua.es/testing/ here] &mdash; this has the latest versions (up to 6 hours old).--> More information can be found on the page: [[List of language pairs]]
'''Timezone:''' GTM+5
 
  +
|}
  +
</div>
   
  +
{|style="width:100%; background:#fbfbfb; margin-top:1.2em; border:1px solid #ccc;"
'''SourceForge:''' m-amirniyaz
 
  +
|style="width:50%; color:#000; vertical-align: top;padding-left:20px; padding-right:20px;"|
   
  +
== How to create a new language pair ==
== Why is it you are interested in machine translation? ==
 
I trust that machine interpretation is a standout amongst the most critical zones for humankind by giving individuals chance to share learning in different dialects. Particularly, later on I am intending to take a shot at machine interpretation of lesser-realized dialects so as to give those language users a chance to secure comparative measure of learning as prominent language holders.
 
== Why is it that you are interested in the Apertium project? ==
 
Apertium offers favorable circumstances for me to provide something valuable for my Karakalpak linguistics. Established on the standards of saving society and legacy through language, Apertium interfaces the domains of the antiquated and present day through advances in machine interpretation stages.
 
   
  +
* '''[[Apertium New Language Pair HOWTO]]''' &mdash; step-by-step description of how to start a new language pair in Apertium.
== Which of the published tasks are you interested in? What do you plan to do? ==
 
  +
** [[Créer une nouvelle paire de langues]]
Adopt a new language pair: Tatar - Karakalpak. Both languages belong to the group Kipchak languages. This nations are close to each other through its historical and cultural heritages. Moreover, I am profoundly interested in classical tatar literature especially the works of Gabdulla Tuqay.
 
  +
** [[Come scrivere una nuova coppia di lingue Apertium]]
== Title ==
 
  +
** [[Amestar un par de llingües nuevu]]
Adopting an unreleased language pair of Tatar -> Karakalpak languages.
 
  +
** [[Uputstvo za novi jezički par za Apertium]]
 
  +
** [[Апертиум, как се създава нова езикова двойка]]
== Reasons why Google and Apertium should sponsor it ==
 
  +
** [[Руководство по созданию новой языковой пары]]
Because this project stimulates the prevalence of Karakalpak language as the number of native speakers are diminishing extensively. I am currently pursuing my Bachelor’s degree on program engineering. I have a legitimate information and involvement with improvement in Python, C++, Kotlin and xml.
 
  +
** [[Jak zrobić nową parę języków]]
== A description of how and who it will benefit in society ==
 
  +
** [[Apertium, Tvorba novej dvojice jazykov]]
Firstly, it would be a great help to Karakalpak language speakers, due to UNESCO it’s status considered as vulnerable. The reason for its lower status is that not enough attention is being given to the projects in Karakalpak language. According to statistics, Tatar language is the second popular language in Russia after russian language. Thus, my project of developing Tatar -> Karakalpak dictionary will help majority of people and it assumed to be in high demand among users.
 
  +
** [[Як створити нову мовну пару‎]]
== Work plan ==
 
  +
** [[Nieuw talenpaar maken]]
=== Community bonding period (May 6 - 27): ===
 
  +
** [[अपर्टियम मे नई भाषा जोडे]]
*Getting closer with Apertium tools and community
 
  +
** [[अपर्टियम मा नयाँ भाषा जोड्नुहोस]]
*Finding the language resources for Karakalpak and Tatar
 
  +
** [[অপের্তিউম নতুন ভাষা জোড়া তে হয় কিভাবে]]
*build a better frequency list of Tatar words
 
  +
** [[Hvordan lage et nytt språkpar i Apertium‎]]
*reading more about the HFST
 
  +
** [[Oprettelse af et nyt sprogpar i Apertium]]
*Doing coding challenge
 
  +
** [[Kiel aldoni novan lingvoparon‎]]
*Begin editing Tatar <-> Karakalpak dictionary
 
  +
** [[Sevel ur c'houblad yezhoù nevez]]
=== Work Period (May 27 - August 19): ===
 
Week 1:
 
   
  +
|style="width:50%; color:#000; vertical-align: top;padding-left:20px; padding-right:20px;"|
*Add nouns and adjectives to bilingual dictionary
 
   
  +
== Documentation (other pages) ==
Week 2:
 
   
  +
{{main|Documentation}}
*Write transfer rules for nouns and adjectives
 
  +
* '''[[Using Git]]''' &mdash; cheatsheet on how to use Git for users and developers.
  +
* '''[[Contributing to an existing pair]]''' &mdash; some pointers for contributing to an existing pair.
  +
** [[Comment contribuer à une paire de langues existante]]
  +
* '''[[Become a language pair developer for Apertium]]''' &mdash; A quick, step-by-step guide for Ubuntu and Debian users on how to add entries to a language pair.
  +
** [[Devenir un développeur de paire de langues pour Apertium]]
  +
* '''[[Frequently Asked Questions]]''' &mdash; what it says in the link.
  +
** [[Questions fréquentes]]
  +
* '''[[Apertium system architecture]]''' &mdash; a brief account of the modules in the pipeline, and examples showing the steps it takes to translate a sentence from one language to another.
   
  +
|}
Week 3:
 
   
  +
__NOTOC__
*Add verbs and other parts of speech to bilingual dictionary
 
  +
__NOEDITSECTION__
   
  +
[[Category:Top-level categories]]
*Write transfer rules for verbs
 
 
Week 4:
 
 
*Run tests
 
 
*Update documentation
 
 
*Prepare for the first evaluation
 
 
'''Deliverable 1:''' Bilingual dictionary, basic transfer rules
 
 
Week 5:
 
 
*Even up nouns and adjectives
 
 
Week 6:
 
 
*Even up verbs and other parts of speech
 
 
Week 7:
 
 
*Extend bilingual dictionary
 
 
Week 8:
 
 
*Run tests
 
 
*Update documentation
 
 
*Prepare for the second evaluation
 
 
'''Deliverable 2:''' Improved bilingual dictionary, transfer rules
 
 
Week 9:
 
 
Week 10:
 
 
*Extend bilingual dictionary Add multiwords
 
 
*Work on transfer rules
 
 
Week 11:
 
 
*Run final tests
 
 
*Fix issues
 
 
Week 12:
 
 
*Brush up the project and documentation
 
 
*Prepare for final evaluation
 
 
== List your skills and give evidence of your qualifications ==
 
I am on the 3th year of Bachelor’s degree in Programm Engineering at the Tashkent University of Information Technology named after Muhammad Al-Khwarizmi.
 
My native language is Karakalpak [Kaa] and I know Tatar language on a good level too, mainly due to their similarity.
 
Programming skills: C, C++, Java, Python, git and xml.
 
== List any non-Summer-of-Code plans you have for the Summer ==
 
I have no non-GSoC plans for the summer and can contribute from 30 to 40 hours a week. However, my school finishes in the middle of June. Therefore, if it is fine I would like to work ~ 20 hours in the first month and in the 2nd & 3rd months I will work ~ 50 hours per week in order to compensate.
 

Revision as of 08:18, 31 December 2020

InstallationResourcesContactDocumentationDevelopmentTools

Gnome-home.png Home PageBugs.png BugsInternet.png WikiGaim.png Chat


Apertium is a free/open-source platform for developing rule-based machine translation systems!

To see the whole list of general documentation pages written in English, see documentation in English.

Pour ceux qui sont plus à l'aise avec la langue française, une partie des pages anglaises a été traduite. Voir documentation en français.

Translators

The following 51 pairs have released versions and are considered to be stable:

Other pairs currently in development may be found in our GitHub repositories. You can test drive release versions here. More information can be found on the page: List of language pairs

How to create a new language pair

Documentation (other pages)

Main article: Documentation