Difference between revisions of "Konkani"

From Apertium
Jump to navigation Jump to search
 
(24 intermediate revisions by the same user not shown)
Line 1: Line 1:
* https://github.com/apertium/apertium-kok
 
 
 
'''Konkani''' ([[Wikipedia:Konkani language]]) is an Indo-Aryan or Indic language. It is available in Apertium as a standalone analyser/generator ([[Konkani#Apertium-kok|apertium-kok]]) and as a component of pair which translates to/from Konkani.
 
'''Konkani''' ([[Wikipedia:Konkani language]]) is an Indo-Aryan or Indic language. It is available in Apertium as a standalone analyser/generator ([[Konkani#Apertium-kok|apertium-kok]]) and as a component of pair which translates to/from Konkani.
   
== Language pairs ==
+
== Language pair ==
  +
''See also: [[List of language pairs]]''
 
   
 
In [[incubator]]:
 
In [[incubator]]:
   
  +
{|class=wikitable
 
! width=200 | Pair name !! width=350 | Language !! width=130 | Last update
 
! width=200 | Pair name !! width=350 | Language !! width=130 | Last update
 
|-
 
|-
| align=center | <code>[[apertium-kok-hin]]</code> || align=center | Modern Konkani <-> Hindi || align=center | 03 Aug 2015
+
| align=center | <code>[[apertium-kok-hin]]</code> || align=center | Konkani <-> Hindi || align=center | 23 Oct 2020
 
|-
 
|-
 
|}
 
|}
  +
  +
== Apertium-kok ==
  +
  +
===Current status===
  +
  +
''Last update: 6th Feb 2024''
  +
  +
'''Dix entries:''' 61,169
  +
  +
'''Dix paradigms:''' 169
  +
  +
'''Coverage:''' 93.36% (Wikipedia)
  +
  +
===Dictionary guidelines===
  +
  +
The current Konkani dictionary is quite big (nearly 65,000 entries), so tidiness is essential to ensure future development:
  +
  +
* Keep entries sorted alphabetically.
  +
* Keep entries grouped by type and tags (do not mix different types of proper nouns together).
  +
* Check the file with apertium-dixtools (to update the number of entries and remove duplicates).
  +
  +
====Spelling variants====
  +
  +
The standard spelling variant in Apertium is the official Devanagri script.
  +
  +
=== Constraint Grammar ===
  +
  +
Apertium-kok currently has 43 CG rules. Still there is a lot of room for disambiguation improvement using CG.
  +
  +
===Future work===
  +
  +
* Add more words and paradigms

Latest revision as of 15:53, 6 February 2024

Konkani (Wikipedia:Konkani language) is an Indo-Aryan or Indic language. It is available in Apertium as a standalone analyser/generator (apertium-kok) and as a component of pair which translates to/from Konkani.

Language pair[edit]

See also: List of language pairs

In incubator:

Pair name Language Last update
apertium-kok-hin Konkani <-> Hindi 23 Oct 2020

Apertium-kok[edit]

Current status[edit]

Last update: 6th Feb 2024

Dix entries: 61,169

Dix paradigms: 169

Coverage: 93.36% (Wikipedia)

Dictionary guidelines[edit]

The current Konkani dictionary is quite big (nearly 65,000 entries), so tidiness is essential to ensure future development:

  • Keep entries sorted alphabetically.
  • Keep entries grouped by type and tags (do not mix different types of proper nouns together).
  • Check the file with apertium-dixtools (to update the number of entries and remove duplicates).

Spelling variants[edit]

The standard spelling variant in Apertium is the official Devanagri script.

Constraint Grammar[edit]

Apertium-kok currently has 43 CG rules. Still there is a lot of room for disambiguation improvement using CG.

Future work[edit]

  • Add more words and paradigms