Catalan

From Apertium
Revision as of 14:23, 5 October 2019 by Hectoralos (talk | contribs) (→‎Language pairs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Catalan (Wikipedia:Catalan language) is a Romance language. It is available in Apertium as a standalone analyser/generator (apertium-cat) and as a component of several pairs which translate to/from Catalan.

Language pairs

See also: List of language pairs

In trunk:

Pair name Languages Last update
apertium-arg-cat Aragonese <-> Catalan 12 Sep 2016
apertium-cat-ita Catalan <-> Italian 02 Sep 2019
apertium-en-ca English <-> Catalan 28 Mar 2016
apertium-eo-ca Esperanto <-- Catalan 13 Dec 2015
apertium-fra-cat French <-> Catalan 18 Apr 2017
apertium-oc-ca Occitan <-> Catalan 13 Dec 2015
apertium-por-cat Portuguese <-> Catalan 16 Sep 2019
apertium-spa-cat Spanish <-> Catalan 01 Apr 2017
apertium-cat-srd Catalan <-> Sardinian 06 Sep 2017

In staging:

Pair name Languages Last update
apertium-cat-glg Catalan <-> Galician 18 Nov 2016

In nursery:

Pair name Languages Last update
apertium-ca-ro Catalan <-> Romanian 30 Sep 2015

In incubator:

Pair name Languages Last update
apertium-cat-cos Catalan <-> Corsican 05 Nov 2013
apertium-cat-ina Catalan <-> Interlingua 07 Jan 2016
apertium-eng-cat English <-> Catalan 24 Jan 2016

Apertium-cat

Current status

Last update: 28 Aug 2017

Dix entries: 56,588

Dix paradigms: 607

Coverage: 94.04% (Wikipedia)

Dictionary guidelines

The current Catalan dictionary is quite big (more than 55,000 entries), so tidiness is essential to ensure future development:

  • Keep entries sorted alphabetically.
  • Keep entries grouped by type and tags (do not mix different types of proper nouns together).
  • Check the file with apertium-dixtools (to update the number of entries and remove duplicates).

Proper nouns

Catalan proper nouns (names, toponyms, acronyms, etc.) should all have gender and number. They were once removed, but they should be specified using the following paradigms:

  • Toponyms <np><top><m><sg>: Iran__np
  • Toponyms <np><top><f><sg>: Àfrica__np
  • Toponyms <np><top><m><pl>: Estats_Units__np
  • Toponyms <np><top><f><pl>: Balears__np
  • Antroponyms <np><ant><m><sg>: Marc__np
  • Antroponyms <np><ant><f><sg>: Maria__np
  • Family names <np><cog><mf><sp>: Saussure__np
  • Others <np><al><m><sg>: Linux__np
  • Others <np><al><f><sg>: Wikipedia__np
  • Others <np><al><m><pl>: Jocs_Olímpics__np
  • Others <np><al><f><pl>: Falles__np
  • Others <np><al><mf><sp>: Honda__np

Future work

  • Add support for proper noun articles (en/na).
  • Restore gender and number to proper nouns that still do not have them.
  • Tweak entries related to proper nouns with translations (kings, queens, etc.).
    • This is already partially done using lexical selection and a specific macro in several language pairs


For further documentation about Catalan in Apertium, check: Category:Catalan