Difference between revisions of "Konkani"

From Apertium
 
(4 intermediate revisions by the same user not shown)
Line 9: Line 9:
! width=200 | Pair name !! width=350 | Language !! width=130 | Last update
! width=200 | Pair name !! width=350 | Language !! width=130 | Last update
|-
|-
| align=center | <code>[[apertium-kok-hin]]</code> || align=center | Konkani <-> Hindi || align=center | 18 Oct 2020
| align=center | <code>[[apertium-kok-hin]]</code> || align=center | Konkani <-> Hindi || align=center | 23 Oct 2020
|-
|-
|}
|}
Line 17: Line 17:
===Current status===
===Current status===


''Last update: 18 Oct 2020''
''Last update: 6th Feb 2024''


'''Dix entries:''' 65,383
'''Dix entries:''' 61,169


'''Dix paradigms:''' 90
'''Dix paradigms:''' 169


'''Coverage:''' 93.36% (Wikipedia)
'''Coverage:''' 93.36% (Wikipedia)
Line 39: Line 39:
=== Constraint Grammar ===
=== Constraint Grammar ===


Apertium-kok currently has 26 CG rules. Still there is a lot of room for disambiguation improvement using CG.
Apertium-kok currently has 43 CG rules. Still there is a lot of room for disambiguation improvement using CG.


===Future work===
===Future work===

Latest revision as of 15:53, 6 February 2024

Konkani (Wikipedia:Konkani language) is an Indo-Aryan (Indic) language. It is available in Apertium as a standalone analyser/generator (apertium-kok) and as a component of a language pair that translates to and from Konkani.

Language pair

See also: List of language pairs

In incubator:

Pair name | Language | Last update
apertium-kok-hin | Konkani <-> Hindi | 23 Oct 2020

Apertium-kok

Current status

Last update: 6th Feb 2024

Dix entries: 61,169

Dix paradigms: 169

Coverage: 93.36% (Wikipedia)
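
Coverage here is taken to mean the share of corpus tokens that receive at least one analysis. The page does not say how the figure was computed, so the following is only a minimal sketch of one common approach: it reads analyser output in the Apertium stream format, where an unknown word appears as ^word/*word$. The function name and the sample line (including its tags) are hypothetical, and escaped characters in the stream are ignored for simplicity.

```python
import re

# One lexical unit in Apertium stream format: ^surface/analysis1/analysis2$
# Simplification: escaped '/', '^' and '$' inside a unit are not handled.
LEXICAL_UNIT = re.compile(r"\^([^/$]*)/([^$]*)\$")


def coverage(analysed_text: str) -> float:
    """Return the fraction of lexical units with at least one analysis."""
    units = LEXICAL_UNIT.findall(analysed_text)
    if not units:
        return 0.0
    # An unknown word is emitted as ^word/*word$, so its analysis starts with '*'.
    known = sum(1 for _surface, analyses in units if not analyses.startswith("*"))
    return known / len(units)


# Hypothetical analyser output: two known units and one unknown unit.
sample = "^घर/घर<n><m><sg><nom>$ ^xyz/*xyz$ ^./.<sent>$"
print(f"{coverage(sample):.2%}")
```

In practice the input would be the analyser's output over a corpus such as a Wikipedia dump, rather than a hand-written sample line.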

Dictionary guidelines

The current Konkani dictionary is quite big (more than 61,000 entries), so keeping it tidy is essential for future development:

  • Keep entries sorted alphabetically.
  • Keep entries grouped by type and tags (do not mix different types of proper nouns together).
  • Check the file with apertium-dixtools (to update the number of entries and remove duplicates).
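
The guidelines above can also be spot-checked with a short script. The sketch below is not how apertium-dixtools works; it is an illustrative stand-in that counts entries and paradigms, flags exact duplicate entries, and tests whether lemmas are in order. The sample dictionary, the paradigm name, and the function name are hypothetical, and sorting by Python's default code-point order is an assumption that may differ from the order the maintainers use.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical, heavily reduced monolingual dictionary used only for illustration.
SAMPLE_DIX = """<dictionary>
  <pardefs>
    <pardef n="ghar__n"/>
  </pardefs>
  <section id="main" type="standard">
    <e lm="b"><i>b</i><par n="ghar__n"/></e>
    <e lm="a"><i>a</i><par n="ghar__n"/></e>
    <e lm="a"><i>a</i><par n="ghar__n"/></e>
  </section>
</dictionary>"""


def check_dictionary(dix_xml: str) -> dict:
    """Report entry and paradigm counts, duplicates, and lemma ordering."""
    root = ET.fromstring(dix_xml)
    entries = root.findall("./section/e")      # entries inside pardefs are excluded
    paradigms = root.findall("./pardefs/pardef")

    # Two entries count as duplicates when their serialised XML is identical.
    serialised = [ET.tostring(e, encoding="unicode").strip() for e in entries]
    duplicates = [xml for xml, n in Counter(serialised).items() if n > 1]

    # Assumption: "sorted alphabetically" means code-point order of the lm attribute.
    lemmas = [e.get("lm", "") for e in entries]

    return {
        "entries": len(entries),
        "paradigms": len(paradigms),
        "duplicates": len(duplicates),
        "sorted": lemmas == sorted(lemmas),
    }


print(check_dictionary(SAMPLE_DIX))
```

For a real dictionary file, the XML text would be read from disk and passed to the same function; the counts it reports correspond to the "Dix entries" and "Dix paradigms" figures in the status section.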

Spelling variants

The standard spelling variant in Apertium is the official Devanagari script.

Constraint Grammar

Apertium-kok currently has 43 CG rules. There is still considerable room to improve disambiguation with CG.

Future work

  • Add more words and paradigms