Apertium-eng-srn

From Apertium
Revision as of 21:26, 3 September 2017 by Ethanchi (talk | contribs) (Created page with "{{TOCD}} Sranan Tongo (lit. "Surinamese tongue") is an English-based creole language spoken as a lingua franca by approximately 500,000 people in Suriname. Shared between th...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Sranan Tongo (lit. "Surinamese tongue") is an English-based creole language spoken as a lingua franca by approximately 500,000 people in Suriname. Shared between the Dutch-, Indigenous-, Javanese-, Hindustani-, and Chinese-speaking communities, Sranan Tongo generally serves as a second language, although around 125,000 Surinamese speak it as a first language. Sranan Tongo's lexicon is a fusion of English, Dutch, Portuguese and Central and West African languages, with rampant use of Dutch loanwords. Although it has no direct relatives, Sranan Tongo displays large similarities with Krio, the lingua franca of Sierra Leone, and more generally with Atlantic English pidgins.

Vocabulary Sources | Rutu fu den Wortu

Vocabulary is taken from SIL's [Wortubuku fu Sranan Tongo].

Statistics | Den Statistiek

Sranan lemmas 3,054
English-Sranan conversions 5,146
Sranan-English chunking rules 23
Sranan-English chunking rules 3

Sample Translations

  • (srn) Dan fa den ben wakawaka ini a oso a ijskasi bigin degedege. → Then because they wandered inside the house the refrigerator begins wobbling.
  • (srn) Dan mi yere: BAM! A koba fadon tapu a patu nanga okro. → Dan mi yere: BAM! A koba fadon tapu a patu nanga okro.\
  • (srn) Dan a patu kanti fadon tapu mi bakasei. → Then the pot edge falls on my buttocks.
  • (srn) Mi dyompo opo. Mi bari! Mi bari! → Mi dyompo opo. Mi bari! Mi bari!

Progress

  • Turned Sranan Tongo dictionary (pdf) into machine-readable format
  • Cleaned entries
  • Paradigms created for most Sranan lemmas
  • Advanced chunking implemented
  • Different combinations of modal verbs, tense particles, and multiple verbs implemented in chunking
  • Disambiguation constraint grammar entries added

To be done|San musu du

  • English -> Sranan transfer rules—not yet tested
  • Proper handling of adjectives with <sint>
  • Handling of causative structures (e.g. *mi mama taigi mi brada meki a tyari mi go na datra.*)
  • Anthroponyms
  • Proper handling of *de* as the existential particle when word-final (e.g. "datra de" -> "the doctor is there," not "the doctor is")
  • Proper handling of multiwords