Difference between revisions of "Apertium-eng-srn"
| Firespeaker (talk | contribs)  | |||
[[Category:Language pairs]]
[[Category:Sranan]]
[[Category:English]]
Revision as of 21:47, 3 September 2017
Sranan Tongo (lit. "Surinamese tongue") is an English-based creole language spoken as a lingua franca by approximately 500,000 people in Suriname. Shared between the Dutch-, Indigenous-, Javanese-, Hindustani-, and Chinese-speaking communities, Sranan Tongo generally serves as a second language, although around 125,000 Surinamese speak it as a first language. Sranan Tongo's lexicon is a fusion of English, Dutch, Portuguese, and Central and West African languages, with heavy use of Dutch loanwords. Although it has no direct relatives, Sranan Tongo shows numerous similarities with Krio, the lingua franca of Sierra Leone, to the point where the two languages are somewhat mutually intelligible. It also resembles the Atlantic English pidgins, though to a lesser extent.
This language pair is the first of its kind available online. Although the lexicon and grammar have progressed considerably, the grammar still needs work in the Sranan-to-English direction.
One major challenge in Sranan-English translation is that many verbs may also be used as their related nouns or adjectives (and vice versa). To compound this problem, determiners (found before nouns) take very similar forms to personal pronouns (found before verbs), making it difficult to disambiguate words. For example, 'mi singi' may be translated either as 'my song' or 'I sing'; 'den aksi' may mean either 'they ask' or 'the questions.'
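The ambiguity described above can be illustrated with a small Python sketch. It is not how the language pair itself is implemented (Apertium uses its own dictionary and rule formats); the toy lexicon, the tag names `det`, `prn`, `n` and `vblex`, and the function `readings` are all assumptions made for illustration, built only from the two examples in the text.

```python
# Toy lexicon built only from the two examples in the text.
# Each first word is either a determiner or a personal pronoun;
# the third field marks a plural determiner. Tags are illustrative.
FIRST = {
    "mi": [("det", "my", False), ("prn", "I", False)],
    "den": [("det", "the", True), ("prn", "they", False)],
}

# Each second word can be read as a noun or as a verb.
SECOND = {
    "singi": {"n": "song", "vblex": "sing"},
    "aksi": {"n": "question", "vblex": "ask"},
}


def readings(phrase):
    """Return both English glosses of a two-word Sranan phrase."""
    first, second = phrase.split()
    out = []
    for tag, gloss, plural in FIRST[first]:
        if tag == "det":
            # Determiner reading: the second word must be a noun.
            noun = SECOND[second]["n"]
            out.append(f"{gloss} {noun}{'s' if plural else ''}")
        else:
            # Pronoun reading: the second word must be a verb.
            out.append(f"{gloss} {SECOND[second]['vblex']}")
    return out


print(readings("mi singi"))
print(readings("den aksi"))
```

Both phrases yield two equally well-formed readings, so the choice has to come from wider context, which is the role the disambiguation rules play.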
Vocabulary Sources | Rutu fu den Wortu
Vocabulary is taken from SIL's Wortubuku fu Sranan Tongo.
Statistics | Den Statistiek
| Sranan lemmas | 3,054 | 
| English-Sranan translations | 5,146 | 
| Sranan-English chunking rules | 23 | 
| Sranan-English interchunk rules | 3 | 
Sample Translations
- (srn) Dan fa den ben wakawaka ini a oso a ijskasi bigin degedege. → Then because they wandered inside the house the refrigerator begins wobbling.
- (srn) Dan mi yere: BAM! A koba fadon tapu a patu nanga okro. → Then I hear: *BAM! The bowl falls on the pot with okra.
- (srn) Dan a patu kanti fadon tapu mi bakasei. → Then the pot edge falls on my buttocks.
- (srn) Mi dyompo opo. Mi bari! Mi bari! → I jump up.I shout! I shout!
Progress | Sani na du
- Turned the Sranan Tongo dictionary (PDF) into a machine-readable format
- Cleaned entries
- Paradigms created for most Sranan lemmas
- Advanced chunking implemented
- Different combinations of modal verbs, tense particles, and multiple verbs implemented in chunking
- Disambiguation constraint grammar entries added
To be done | San musu du
- English -> Sranan transfer rules (not yet tested)
- Proper handling of adjectives with <sint>
- Handling of causative structures (e.g. *mi mama taigi mi brada meki a tyari mi go na datra.*)
- Anthroponyms
- Proper handling of *de* as the existential particle when word-final (e.g. "datra de" -> "the doctor is there," not "the doctor is")
- Proper handling of multiwords
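
As an illustration of the word-final *de* item above, the intended behaviour can be sketched in Python. This is only a sketch of the target output, not the pair's transfer rules: the function `translate_de`, the one-entry lexicon, and the choice to gloss non-final *de* as "is" are assumptions.

```python
def translate_de(tokens):
    """Gloss 'de' as 'is there' only when it is the last token."""
    glosses = {"datra": "the doctor"}  # hypothetical one-entry lexicon
    out = []
    for i, tok in enumerate(tokens):
        if tok == "de":
            # Word-final 'de' is existential; elsewhere assume a plain copula.
            out.append("is there" if i == len(tokens) - 1 else "is")
        else:
            # Unknown words are passed through with a leading asterisk.
            out.append(glosses.get(tok, "*" + tok))
    return " ".join(out)


print(translate_de(["datra", "de"]))
```

A real rule would also need to treat sentence-final punctuation as transparent, which this sketch does not attempt.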

