Apertium-eng-srn

From Apertium
Revision as of 21:59, 3 September 2017 by Ethanchi (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Sranan Tongo (lit. "Surinamese tongue") is an English-based creole language spoken as a lingua franca by approximately 500,000 people in Suriname. Shared between the Dutch-, Indigenous-, Javanese-, Hindustani-, and Chinese-speaking communities, Sranan Tongo generally serves as a second language, although around 125,000 Surinamese speak it as a first language. Sranan Tongo's lexicon is a fusion of English, Dutch, Portuguese and Central and West African languages, with rampant use of Dutch loanwords. Although it has no direct relatives, Sranan Tongo displays numerous similarities with Krio, the lingua franca of Sierra Leone—to the point where the two languages are somwhat mutually intelligible—as well as with Atlantic English pidgins to a lesser extent.

This language pair is the first of its kind in existence online. Although much progress has been made in lexicon and grammar, more work is needed to improve grammar in the Sranan to English direction.

One major challenge in Sranan-English translation is that many verbs may also be used as their related nouns or adjectives (and vice versa). To compound this problem, determiners (found before nouns) take very similar forms to personal pronouns (found before verbs), making it difficult to disambiguate words. For example, 'mi singi' may be translated either as 'my song' or 'I sing'; 'den aksi' may mean either 'they ask' or 'the questions.'

Apertium-srn is available on SVN here. Apertium-eng-srn is available on SVN here.

Vocabulary Sources | Rutu fu den Wortu

Vocabulary is taken from SIL's Wortubuku fu Sranan Tongo.

Statistics | Den Statistiek

Sranan stems 3,083
Sranan rlx rules 62
Sranan paradigms 30
English-Sranan stems 5,145
Sranan-English t1x rules 22
Sranan-English t2x rules 3

Sample Translations

  • (srn) Dan fa den ben wakawaka ini a oso a ijskasi bigin degedege. → Then because they wandered inside the house the refrigerator begins wobbling.
  • (srn) Dan mi yere: BAM! A koba fadon tapu a patu nanga okro. → Then I hear: *BAM! The bowl falls on the pot with okra.
  • (srn) Dan a patu kanti fadon tapu mi bakasei. → Then the pot edge falls on my buttocks.
  • (srn) Mi dyompo opo.Mi bari! Mi bari! → I jump up. I shout! I shout!

Progress | Sani na du

  • Turned Sranan Tongo dictionary (pdf) into machine-readable format
  • Cleaned entries
  • Paradigms created for most Sranan lemmas
  • Advanced chunking implemented
  • Different combinations of modal verbs, tense particles, and multiple verbs implemented in chunking
  • Disambiguation constraint grammar entries added

To be done | San musu du

  • English -> Sranan transfer rules—not yet tested
  • Proper handling of adjectives with <sint>
  • Handling of causative structures (e.g. *mi mama taigi mi brada meki a tyari mi go na datra.*)
  • Anthroponyms
  • Proper handling of de as the existential particle when word-final (e.g. datra de -> "the doctor is there," not "the doctor is")
  • Proper handling of multiwords