Difference between revisions of "Konkani"

From Apertium
Jump to navigation Jump to search
 
(15 intermediate revisions by the same user not shown)
Line 9: Line 9:
 
! width=200 | Pair name !! width=350 | Language !! width=130 | Last update
 
! width=200 | Pair name !! width=350 | Language !! width=130 | Last update
 
|-
 
|-
| align=center | <code>[[apertium-kok-hin]]</code> || align=center | Konkani <-> Hindi || align=center | 24 Jul 2020
+
| align=center | <code>[[apertium-kok-hin]]</code> || align=center | Konkani <-> Hindi || align=center | 23 Oct 2020
 
|-
 
|-
 
|}
 
|}
Line 17: Line 17:
 
===Current status===
 
===Current status===
   
''Last update: 03 Aug 2020''
+
''Last update: 23 Oct 2020''
   
'''Dix entries:''' 65,379
+
'''Dix entries:''' 65,385
   
'''Dix paradigms:''' 88
+
'''Dix paradigms:''' 126
   
 
'''Coverage:''' 93.36% (Wikipedia)
 
'''Coverage:''' 93.36% (Wikipedia)
Line 36: Line 36:
   
 
The standard spelling variant in Apertium is the official Devanagri script.
 
The standard spelling variant in Apertium is the official Devanagri script.
  +
  +
=== Constraint Grammar ===
  +
  +
Apertium-kok currently has 33 CG rules. Still there is a lot of room for disambiguation improvement using CG.
   
 
===Future work===
 
===Future work===
   
  +
* Add more words and paradigms
* Improve the XML dictionary
 

Latest revision as of 19:13, 25 October 2020

Konkani (Wikipedia:Konkani language) is an Indo-Aryan or Indic language. It is available in Apertium as a standalone analyser/generator (apertium-kok) and as a component of pair which translates to/from Konkani.

Language pair[edit]

See also: List of language pairs

In incubator:

Pair name Language Last update
apertium-kok-hin Konkani <-> Hindi 23 Oct 2020

Apertium-kok[edit]

Current status[edit]

Last update: 23 Oct 2020

Dix entries: 65,385

Dix paradigms: 126

Coverage: 93.36% (Wikipedia)

Dictionary guidelines[edit]

The current Konkani dictionary is quite big (nearly 65,000 entries), so tidiness is essential to ensure future development:

  • Keep entries sorted alphabetically.
  • Keep entries grouped by type and tags (do not mix different types of proper nouns together).
  • Check the file with apertium-dixtools (to update the number of entries and remove duplicates).

Spelling variants[edit]

The standard spelling variant in Apertium is the official Devanagri script.

Constraint Grammar[edit]

Apertium-kok currently has 33 CG rules. Still there is a lot of room for disambiguation improvement using CG.

Future work[edit]

  • Add more words and paradigms