Difference between revisions of "Scottish Gaelic and Irish"
Revision as of 22:50, 7 May 2011
Todo
Irish dictionary
- put the paradigms in 1 entry-per-line format
- noun paradigms
  - some have only one entry -- these are defective? -- e.g. bá__n_m
  - some have three entries -- defective also? -- e.g. band/ia__n_m
- verb paradigms
  - sort the entries so that the order makes sense
  - is there an imperative p1.sg ??? (struck out in this revision)
- adjective paradigms
  - some paradigms have more entries than others, e.g. ca/s__adj has 3, and bré/an__adj has 4
  - comparatives/superlatives seem to be missing
- are some proper nouns marked with common noun paradigms instead of proper noun paradigms ?
  - find out with cat apertium-ga-gd.ga.dix | grep '<e lm="[A-Z]'
- sort the entries in the <section id="main"> by a) part-of-speech, b) alphabetical order
- i think we're missing possessives and demonstratives, quantifiers and perhaps some definite/indefinite pronouns
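
Two of the checks above (how many entries each paradigm has, and which capitalised lemmas use a non-proper-noun paradigm) could be scripted roughly as follows. This is only a sketch: the inline sample stands in for apertium-ga-gd.ga.dix and its inflected forms are made up, the function names are hypothetical, and it assumes proper-noun paradigm names end in `__np`, which should be checked against the real dictionary.

```python
import xml.etree.ElementTree as ET

# Tiny stand-in for apertium-ga-gd.ga.dix. The forms inside the paradigms
# are illustrative only, not taken from the real dictionary.
SAMPLE_DIX = """<dictionary>
  <pardefs>
    <pardef n="bá__n_m">
      <e><p><l/><r><s n="n"/><s n="m"/><s n="sg"/></r></p></e>
    </pardef>
    <pardef n="ca/s__adj">
      <e><p><l>s</l><r>s<s n="adj"/><s n="sg"/></r></p></e>
      <e><p><l>sa</l><r>s<s n="adj"/><s n="pl"/></r></p></e>
      <e><p><l>is</l><r>s<s n="adj"/><s n="gen"/></r></p></e>
    </pardef>
  </pardefs>
  <section id="main" type="standard">
    <e lm="bá"><i>bá</i><par n="bá__n_m"/></e>
    <e lm="Éire"><i>Éire</i><par n="bá__n_m"/></e>
    <e lm="cas"><i>ca</i><par n="ca/s__adj"/></e>
  </section>
</dictionary>"""


def pardef_entry_counts(root):
    """Map each paradigm name to the number of <e> entries directly inside it."""
    return {pd.get("n"): len(pd.findall("e")) for pd in root.iter("pardef")}


def capitalised_with_non_proper_paradigm(root, proper_suffix="__np"):
    """List (lemma, paradigms) for capitalised lemmas in the main section
    whose paradigms do not end in proper_suffix (an assumed convention)."""
    hits = []
    for e in root.findall("./section[@id='main']/e"):
        lm = e.get("lm", "")
        if not lm or not lm[0].isupper():
            continue
        pars = [p.get("n", "") for p in e.iter("par")]
        if not any(name.endswith(proper_suffix) for name in pars):
            hits.append((lm, pars))
    return hits


root = ET.fromstring(SAMPLE_DIX)
counts = pardef_entry_counts(root)
suspects = capitalised_with_non_proper_paradigm(root)

for name, n in sorted(counts.items(), key=lambda kv: kv[1]):
    print(n, name)
for lemma, pars in suspects:
    print("check:", lemma, pars)
```

For the real file, `ET.parse("apertium-ga-gd.ga.dix").getroot()` would replace the `ET.fromstring` call. Note that `str.isupper()` also catches accented capitals such as É, which the `[A-Z]` pattern in the grep command does not.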
Old todo
- Perform an intersection on the monolingual dictionaries. (Making them consistent)
  - We only want stuff in the Irish analyser that we can translate into Scottish Gaelic -- so, for a word to be included, it should be in the Irish monolingual dictionary and the bilingual dictionary, and its translation should be in the Scottish Gaelic monolingual dictionary. Words for which we don't have translations can just be commented out -- or moved to a separate file in dev/
- Add all missing closed categories to the monolingual dictionaries.
- Do some fixing of the bilingual dictionary
  - Some restrictions probably need adding.
  - Some conjunctions are marked "cnj" and not subdivided for "cnjcoo", "cnjsub" etc.
- Make the constraint grammar rules more CG-like
- Write rules to do initial mutations for generation.
- Write some transfer rules.
  - For example to do tenses, number agreement, etc.
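
The dictionary intersection in the first item could be prototyped along these lines. It is a sketch under assumptions: the three inline dictionaries are toy stand-ins for the real files, the function names are hypothetical, and lemmas are compared by their bare text only (part-of-speech tags and multiword entries containing `<b/>` are ignored).

```python
import xml.etree.ElementTree as ET

# Toy stand-ins for the Irish monodix, the bidix and the Scottish Gaelic monodix.
GA_DIX = """<dictionary><section id="main" type="standard">
  <e lm="bá"><i>bá</i><par n="bá__n_m"/></e>
  <e lm="cat"><i>cat</i><par n="cat__n_m"/></e>
  <e lm="madra"><i>madra</i><par n="madra__n_m"/></e>
</section></dictionary>"""

BIDIX = """<dictionary><section id="main" type="standard">
  <e><p><l>cat<s n="n"/></l><r>cat<s n="n"/></r></p></e>
  <e><p><l>madra<s n="n"/></l><r>cù<s n="n"/></r></p></e>
</section></dictionary>"""

GD_DIX = """<dictionary><section id="main" type="standard">
  <e lm="cat"><i>cat</i><par n="cat__n_m"/></e>
</section></dictionary>"""


def monodix_lemmas(xml_text):
    """Collect the lm attribute of every entry that has one."""
    root = ET.fromstring(xml_text)
    return {e.get("lm") for e in root.iter("e") if e.get("lm")}


def bidix_pairs(xml_text):
    """Collect (left, right) lemma pairs; only the text before the first tag is used."""
    root = ET.fromstring(xml_text)
    pairs = set()
    for p in root.iter("p"):
        left, right = p.find("l"), p.find("r")
        if left is not None and right is not None:
            pairs.add(((left.text or "").strip(), (right.text or "").strip()))
    return pairs


def split_translatable(ga_xml, bidix_xml, gd_xml):
    """Return (lemmas to keep, lemmas to comment out or move to dev/)."""
    ga = monodix_lemmas(ga_xml)
    gd = monodix_lemmas(gd_xml)
    translatable = {l for l, r in bidix_pairs(bidix_xml) if l in ga and r in gd}
    return sorted(translatable), sorted(ga - translatable)


keep, park = split_translatable(GA_DIX, BIDIX, GD_DIX)
print("keep:", keep)
print("comment out or move to dev/:", park)
```

In the toy data, "bá" is parked because it has no bidix entry, and "madra" is parked because its translation "cù" is missing from the Scottish Gaelic monodix. A real run would also need to compare part-of-speech tags, since the same lemma can appear with several categories.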
Testing
See also
Notes