Siciliano y castellano/Informe final
Commitment
The list of all commits: https://apertium.projectjj.com/gsoc2016/uliana-sentsova.html
Monolingual Sicilian dictionary:
Bilingual Sicilian-Spanish dictionary: https://svn.code.sf.net/p/apertium/svn/incubator/apertium-scn-spa/
Description
1. Sicilian language TODO
2. Project goals
TODO
corpus, coverage, testvoc, pending tests
Constraint grammar
To distinguish coincident forms
A lot of verb forms coincide in Sicilian. The coincidence of verb forms within one verb paradigm occurs fairly often in Sicilian language.
Within one part of speech and between words of different categories.
Thus, for instance, singular forms of regular verbs match its second person imperative, and present conjunctive For example, to distinguish a masculine noun "munnu" y Present Indicative form "munnu" of the verb "munnari", Regular verbs of the 2-nd conjugation have the same forms for Present Indicative of the first and the second person, Present Indicative of the third person singular usually coincides with the Imperative of the second person plural by verbs of the 1-st conjugation. All verbs demonstrate coinciding forms for first, second and third forms of Present Subjunctive.
A good example is the Sicilian noun"cristianu" that not only signifies a person of Christian faith but can also denote a human being in general.
Transfer rules
Transfer rules help to translate correctly syntactic differences between languages that cannot be translated directly. There are 40 transfer rules in total.
- Unlike in Spanish, the synthetic future is no longer in use in Sicilian language, therefore it is replaced by the periphrastic compound forms with common verbs like "jiri", "vèniri" or "aviri".
- The synthetic conditional forms of verbs are normally replaced by indicative or subjunctive forms.
- There are also transfer rules to translate the verb construction with passive and modal meaning.
Statistics
Coverage | Sicilian-castellano (%) | Castellano-siciliano (%) |
---|---|---|
Trimmed coverage | 83.4% | % |
Coverage | Sicilian (%) | Spanish (%) |
Raw coverage' | 85.5% | 91,6% |
The number of lemmas in bilingual dictionary: 11,253.
The number of lemmas in Sicilian dictionary:
Challenging issues
1. Abundance of spelling forms
2. Accent system
3. Pronouns
examples (jardinu / iardinu / giardinu = ‘garden’, palora / parola /paràula /palàura = ‘word’).
cunjùnciri, cognùngiri, conjùngiri, cugnùnciri, cognùncici, coniùngiri, conjùnciri
TODO
Future work
Syntactic properties, more rules, automatic forms merge algorithm TODO
Resources
https://scn.wikipedia.org/wiki/P%C3%A0ggina_principali
https://scn.wiktionary.org/wiki/P%C3%A0ggina_principali
Bonner, Introduction to Sicilian Grammar
El nuovo dizionario siciliano-italiano