Difference between revisions of "Indirect contribution guide"

From Apertium
Jump to navigation Jump to search
(Fix up some of the tortured prose, and remove some inaccuracies. A contrastive analysis is *not* one of the most difficult things... it's one of the easiest.)
Line 14: Line 14:
   
 
===Create contrastive analyses===
 
===Create contrastive analyses===
  +
  +
A 'contrastive analysis' is a set of example sentences which show the differences and similarities between a pair of languages. In a sense, it's a 'feature corpus' which we can use to develop and test rule hypotheses: if we see that the pattern ''noun + adjective'' becomes ''adjective + noun'', then we have a good basis for building a rule. Think of it as 'raw input to a linguist': when we have a good enough idea of what a pair of languages look like, we later use these analyses to build translation rules.
  +
  +
One thing to note is that, when we see that something happens 9/10 times, or 8/10 times, etc., then we need to expand that exceptional part of the analysis, to get a better idea of what's happening: is it a certain class of words, or just a pure exception?
   
 
====What you must do.====
 
====What you must do.====
  +
The contrastive analyses is most difficult work about translating in "Apertium".<br />
 
 
Your task is to make a set of test sentences in the first language, and translate them to the other. An English translation may e useful in enlisting help, but is not required.
Contrastive analyses is difference between two languages.<br />
 
  +
Your task is to make sentences in the first language and translating to the other.<br />
 
  +
A sample sentence in wiki markup looks like this:
You must also give English translation.<br />
 
  +
<br />
 
 
<pre>* {{test|First language abbreviation|First language.|Second language.|English translation.}}</pre>
You must put it like:<br />
 
  +
* {{test|First language abbreviation|First language.|Second language.|English translation.}}
 
 
Examples:<br />
 
Examples:<br />
 
* {{test|ru|Чашка большая.|Чашата е голямата.|The cup is big.}}
 
* {{test|ru|Чашка большая.|Чашата е голямата.|The cup is big.}}

Revision as of 15:37, 2 January 2011

Many people come to us with a question like "I'm not a programmer/linguist/whatever. Is there any way I can contribute?". This document is intended to show how you can make an "indirect" contribution, by documenting language resources, helping us to build bilingual test sets, translating, promoting, etc.

About This Tutorial

This tutorial will teach you:

  • How to creating contrastive analyses.
  • How to catalog resources.
  • How to convert dictionaries.
  • How to translate.
  • How to help "Apertium" in other ways .

Create contrastive analyses

A 'contrastive analysis' is a set of example sentences which show the differences and similarities between a pair of languages. In a sense, it's a 'feature corpus' which we can use to develop and test rule hypotheses: if we see that the pattern noun + adjective becomes adjective + noun, then we have a good basis for building a rule. Think of it as 'raw input to a linguist': when we have a good enough idea of what a pair of languages look like, we later use these analyses to build translation rules.

One thing to note is that, when we see that something happens 9/10 times, or 8/10 times, etc., then we need to expand that exceptional part of the analysis, to get a better idea of what's happening: is it a certain class of words, or just a pure exception?

What you must do.

Your task is to make a set of test sentences in the first language, and translate them to the other. An English translation may e useful in enlisting help, but is not required.

A sample sentence in wiki markup looks like this:

* {{test|First language abbreviation|First language.|Second language.|English translation.}}

Examples:

  • (ru) Чашка большая. → Чашата е голямата. :: The cup is big.
  • (el) Τι γίνεσαι? → как си? :: How are you?
  • (bg) Вера се оглежда в огледалото. → Вера смотрит на себя в зеркало. :: Vera is looking at herself in the mirror.

You must translate:

  1. Simple syntax
    • Copula
    • Reported speech
    • Clitic placement
  2. Pronouns
    • Personal
    • Demonstrative
    • Relative
    • Possesive
    • Reflexive
    • Interrogative
  3. Nouns
    • General
    • Indefinite and definite forms
    • 1 Noun phrases
    • Indefinite (a, some)
    • Definite (the)
    • Demonstrative (this, that)
    • Quantified (a few, no, all)
  4. Numerals
    • Cardinal
    • Ordinal
  5. Adjective
    • Comparative
    • Superlative
  6. Adverbs
  7. Verbs
    • To be
    • General
    • Indicative mood
      • Present tense
      • Imperfect tense
      • Aorist tense
      • Perfect tense
      • Pluperfect tense
      • Future tense
    • Conditional mood
    • Imperative mood
  8. Questions

If you want you can also add

  1. Interjections
  2. Punctuation marks


You can find an example here:
For Bulgarian and Russian.[1]
For Bulgarian and Greek.[2]

1. http://wiki.apertium.org/wiki/Bulgarian_and_Russian/Pending_tests
2. http://wiki.apertium.org/wiki/Bulgarian_and_Greek/Pending_tests

How can you do it?

It is only easy if you only know the both languages.
If you don't know them it is good idea to read this.
One of the languages must be native to you or you must know it very well.
I will give you some tips, that you can help you with your work.

  1. Ask your friends did they know the language you don't know.