Difference between revisions of "Malayalam and English/documentation"

From Apertium
Jump to navigation Jump to search
Line 60: Line 60:
 
mēghaṁ:mēgha N1 ; ! mēghaṁ ! cloud
 
mēghaṁ:mēgha N1 ; ! mēghaṁ ! cloud
 
</pre>
 
</pre>
  +
Currently there are 10 noun paradigms ,N1 N2 ,...N10
  +
General trends in paradigms
  +
LEXICON N1 :- Words ending with anusuvara( ം )
  +
LEXICON N2 :- words ending with the vowel a or i
  +
LEXICON N3 :- words ending with virama or a vowel
  +
LEXICON N4 :- words ending with the vowel a or i
  +
LEXICON N5 :- words ends with virama (eg വീട് )
  +
LEXICON N6 :- for the word പേര്‍ (pēr‍)
  +
LEXICON N7 :-words ends with the vowel u
  +
LEXICON N8 :-
  +
LEXICON N9 :-
  +
LEXICON N10 :-

Revision as of 19:10, 15 August 2014

Malayalam is both agglutinative and inflective language . it belong dravidian language category . In apertium we are trying to implement englaish malayalam pair using hfst .it is described here Starting_a_new_language_with_HFST

Morphotactic using lexc

let's take an example of a noun word , malayalam noun can have 8 inflections , nominative,dative, instrumental, locative ,accusative,vocative and sociative . it can be also classified on the basis on number ,singular and plural , let's declare essential symbols

Multichar_Symbols

%<n%>           ! Noun                        ! നാമം

%<nom%>         ! Nominative                  !

%<acc%>         ! Accusative                  !

%<dat%>         ! Dative                      !

%<soc%>         ! Sociative                   !

%<gen%>         ! Genitive                    !

%<ins%>         ! Instrumental                !

%<loc%>         ! Locative                    !

%<voc%>         ! Vocative                    !

%<sg%>          ! Singular                    !

%<pl%>          ! Plural                      !

Now we have all essential symbols for a noun , let's add an example paradigm

LEXICON Root

Miscellaneous ;
Conjunctions ; 
Postpositions ;
Pronouns ;
Determiners ;
Numerals ;
NominalStems ;
LEXICON N1 

%<n%>%<sg%>%<nom%>:ṁ CLIT-N-NOM ; ! ṁ
%<n%>%<sg%>%<loc%>:%>ttil‍ CLIT-N-LOC ; ! ttil
%<n%>%<sg%>%<acc%>:%>tte CLIT-N-ACC ; ! tte
%<n%>%<sg%>%<gen%>:%>ttinṟe CLIT-N-GEN ; ! ttinṟe
%<n%>%<sg%>%<dat%>:%>ttin CLIT-N ; ! ttin
%<n%>%<sg%>%<dat%>:%>ttinu CLIT-N ; ! ttinu ! debug
!plural
%<n%>%<pl%>%<nom%>:%>ṅṅaḷ‍ CLIT-N-NOM ; ! ṅṅaḷ‍
%<n%>%<pl%>%<acc%>:%>ṅṅaḷe CLIT-N-ACC ; ! ṅṅale
%<n%>%<pl%>%<gen%>:%>ṅṅaḷuṭe CLIT-N-GEN ; ! ṅṅaḷuṭe

and an example word

LEXICON NominalStems
mēghaṁ:mēgha N1 ; ! mēghaṁ ! cloud

Currently there are 10 noun paradigms ,N1 N2 ,...N10 General trends in paradigms LEXICON N1 :- Words ending with anusuvara( ം ) LEXICON N2 :- words ending with the vowel a or i LEXICON N3 :- words ending with virama or a vowel LEXICON N4 :- words ending with the vowel a or i LEXICON N5 :- words ends with virama (eg വീട് ) LEXICON N6 :- for the word പേര്‍ (pēr‍) LEXICON N7 :-words ends with the vowel u LEXICON N8 :- LEXICON N9 :- LEXICON N10 :-