Difference between revisions of "Catalan"
(Created page with "'''Catalan''' (Wikipedia:Catalan language) is a Romance language. It is available in Apertium as a standalone analyser/generator (apertium-cat...")
Hectoralos (talk | contribs)
(11 intermediate revisions by 2 users not shown)
Latest revision as of 14:23, 5 October 2019
Catalan (Wikipedia:Catalan language) is a Romance language. It is available in Apertium as a standalone analyser/generator (apertium-cat) and as a component of several pairs which translate to/from Catalan.
Language pairs
See also: List of language pairs
In trunk:
Pair name | Languages | Last update |
---|---|---|
apertium-arg-cat | Aragonese <-> Catalan | 12 Sep 2016 |
apertium-cat-ita | Catalan <-> Italian | 02 Sep 2019 |
apertium-en-ca | English <-> Catalan | 28 Mar 2016 |
apertium-eo-ca | Esperanto <-- Catalan | 13 Dec 2015 |
apertium-fra-cat | French <-> Catalan | 18 Apr 2017 |
apertium-oc-ca | Occitan <-> Catalan | 13 Dec 2015 |
apertium-por-cat | Portuguese <-> Catalan | 16 Sep 2019 |
apertium-spa-cat | Spanish <-> Catalan | 01 Apr 2017 |
apertium-cat-srd | Catalan <-> Sardinian | 06 Sep 2017 |
In staging:
Pair name | Languages | Last update |
---|---|---|
apertium-cat-glg | Catalan <-> Galician | 18 Nov 2016 |
In nursery:
Pair name | Languages | Last update |
---|---|---|
apertium-ca-ro | Catalan <-> Romanian | 30 Sep 2015 |
In incubator:
Pair name | Languages | Last update |
---|---|---|
apertium-cat-cos | Catalan <-> Corsican | 05 Nov 2013 |
apertium-cat-ina | Catalan <-> Interlingua | 07 Jan 2016 |
apertium-eng-cat | English <-> Catalan | 24 Jan 2016 |
Apertium-cat
Current status
Last update: 28 Aug 2017
Dix entries: 56,588
Dix paradigms: 607
Coverage: 94.04% (Wikipedia)
Dictionary guidelines
The Catalan dictionary is large (more than 55,000 entries), so keeping it tidy is essential for future development:
- Keep entries sorted alphabetically.
- Keep entries grouped by type and tags (do not mix different types of proper nouns together).
- Check the file with apertium-dixtools (to update the number of entries and remove duplicates).
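The first and third guidelines can also be spot-checked with a small script before running apertium-dixtools. The sketch below is only an illustration, not part of the Apertium tooling: it assumes entries use the usual monolingual dictionary layout (`<e lm="..."><i>...</i><par n="..."/></e>`), the sample data is invented, and the accent-stripping sort key is only an approximation of Catalan alphabetical order.

```python
import unicodedata
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical sample section; real entries live in the apertium-cat .dix file.
SAMPLE_DIX = """
<section id="main" type="standard">
  <e lm="Iran"><i>Iran</i><par n="Iran__np"/></e>
  <e lm="Àfrica"><i>Àfrica</i><par n="Àfrica__np"/></e>
  <e lm="Iran"><i>Iran</i><par n="Iran__np"/></e>
</section>
"""


def sort_key(lemma):
    """Approximate alphabetical key: strip diacritics, then ignore case."""
    decomposed = unicodedata.normalize("NFD", lemma)
    return "".join(c for c in decomposed if not unicodedata.combining(c)).casefold()


def entry_keys(section_xml):
    """Return (lemma, paradigm) pairs for every <e> element, in file order."""
    root = ET.fromstring(section_xml.strip())
    keys = []
    for entry in root.iter("e"):
        par = entry.find("par")
        keys.append((entry.get("lm", ""), par.get("n", "") if par is not None else ""))
    return keys


def find_duplicates(keys):
    """Return (lemma, paradigm) pairs that occur more than once."""
    counts = Counter(keys)
    return sorted(key for key, n in counts.items() if n > 1)


def out_of_order(keys):
    """Return adjacent lemma pairs that break ascending alphabetical order."""
    lemmas = [lemma for lemma, _ in keys]
    return [(a, b) for a, b in zip(lemmas, lemmas[1:]) if sort_key(a) > sort_key(b)]


if __name__ == "__main__":
    keys = entry_keys(SAMPLE_DIX)
    print("duplicates:", find_duplicates(keys))
    print("out of order:", out_of_order(keys))
```

A check like this only reports problems; fixing them and updating the entry count is still left to apertium-dixtools, as the guideline says.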
Proper nouns
Catalan proper nouns (names, toponyms, acronyms, etc.) should all carry gender and number tags. These tags were removed at one point, but they should now be specified using the following paradigms:
- Toponyms <np><top><m><sg>: Iran__np
- Toponyms <np><top><f><sg>: Àfrica__np
- Toponyms <np><top><m><pl>: Estats_Units__np
- Toponyms <np><top><f><pl>: Balears__np
- Anthroponyms <np><ant><m><sg>: Marc__np
- Anthroponyms <np><ant><f><sg>: Maria__np
- Family names <np><cog><mf><sp>: Saussure__np
- Others <np><al><m><sg>: Linux__np
- Others <np><al><f><sg>: Wikipedia__np
- Others <np><al><m><pl>: Jocs_Olímpics__np
- Others <np><al><f><pl>: Falles__np
- Others <np><al><mf><sp>: Honda__np
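As an illustration of how the list above could be applied when adding entries, the following sketch maps a tag combination to its paradigm and builds a dictionary entry. The paradigm names come from the list; everything else is an assumption made for the example: the function names are hypothetical, and the entry layout (`<e lm>`, `<i>`, `<b/>` for spaces, `<par n>`) follows the usual Apertium monolingual dictionary format but should be checked against the real apertium-cat file.

```python
from xml.sax.saxutils import escape, quoteattr

# Paradigm names copied from the list above, keyed by (type, gender, number).
PROPER_NOUN_PARADIGMS = {
    ("top", "m", "sg"): "Iran__np",
    ("top", "f", "sg"): "Àfrica__np",
    ("top", "m", "pl"): "Estats_Units__np",
    ("top", "f", "pl"): "Balears__np",
    ("ant", "m", "sg"): "Marc__np",
    ("ant", "f", "sg"): "Maria__np",
    ("cog", "mf", "sp"): "Saussure__np",
    ("al", "m", "sg"): "Linux__np",
    ("al", "f", "sg"): "Wikipedia__np",
    ("al", "m", "pl"): "Jocs_Olímpics__np",
    ("al", "f", "pl"): "Falles__np",
    ("al", "mf", "sp"): "Honda__np",
}


def paradigm_for(np_type, gender, number):
    """Look up the paradigm for a tag combination; fail loudly if none is listed."""
    try:
        return PROPER_NOUN_PARADIGMS[(np_type, gender, number)]
    except KeyError:
        raise ValueError(
            f"no paradigm listed for <np><{np_type}><{gender}><{number}>"
        ) from None


def make_entry(lemma, np_type, gender, number):
    """Build an <e> entry string (assumed layout: whole lemma in <i>, spaces as <b/>)."""
    paradigm = paradigm_for(np_type, gender, number)
    surface = "<b/>".join(escape(part) for part in lemma.split(" "))
    return f"<e lm={quoteattr(lemma)}><i>{surface}</i><par n={quoteattr(paradigm)}/></e>"


if __name__ == "__main__":
    print(make_entry("Barcelona", "top", "f", "sg"))
    print(make_entry("Illes Balears", "top", "f", "pl"))
```

Raising an error for unlisted combinations (for example a plural anthroponym) is a deliberate choice in this sketch: it surfaces cases that need a new paradigm instead of silently producing an entry without gender and number.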
Future work
- Add support for proper noun articles (en/na).
- Restore gender and number to proper nouns that still do not have them.
- Tweak entries related to proper nouns with translations (kings, queens, etc.).
  - This is already partially done using lexical selection and a specific macro in several language pairs.
For further documentation about Catalan in Apertium, check: Category:Catalan