Difference between revisions of "Catalan"
Hectoralos (talk | contribs) |
|||
(8 intermediate revisions by 2 users not shown) | |||
Line 9: | Line 9: | ||
! width=200 | Pair name !! width=350 | Languages !! width=130 | Last update |
! width=200 | Pair name !! width=350 | Languages !! width=130 | Last update |
||
|- |
|- |
||
| align=center | <code>[[apertium-arg-cat]]</code> || align=center | Aragonese <-> Catalan || align=center | |
| align=center | <code>[[apertium-arg-cat]]</code> || align=center | Aragonese <-> Catalan || align=center | 12 Sep 2016 |
||
|- |
|- |
||
| align=center | <code>[[apertium- |
| align=center | <code>[[apertium-cat-ita]]</code> || align=center | Catalan <-> Italian || align=center | 02 Sep 2019 |
||
|- |
|- |
||
| align=center | <code>[[apertium-en-ca]]</code> || align=center | English <-> Catalan || align=center | 28 Mar 2016 |
| align=center | <code>[[apertium-en-ca]]</code> || align=center | English <-> Catalan || align=center | 28 Mar 2016 |
||
Line 21: | Line 21: | ||
| align=center | <code>[[apertium-oc-ca]]</code> || align=center | Occitan <-> Catalan || align=center | 13 Dec 2015 |
| align=center | <code>[[apertium-oc-ca]]</code> || align=center | Occitan <-> Catalan || align=center | 13 Dec 2015 |
||
|- |
|- |
||
| align=center | <code>[[apertium- |
| align=center | <code>[[apertium-por-cat]]</code> || align=center | Portuguese <-> Catalan || align=center | 16 Sep 2019 |
||
|- |
|- |
||
| align=center | <code>[[apertium-spa-cat]]</code> || align=center | Spanish <-> Catalan || align=center | 01 Apr 2017 |
| align=center | <code>[[apertium-spa-cat]]</code> || align=center | Spanish <-> Catalan || align=center | 01 Apr 2017 |
||
⚫ | |||
⚫ | |||
|- |
|- |
||
|} |
|} |
||
Line 33: | Line 35: | ||
|- |
|- |
||
| align=center | <code>[[apertium-cat-glg]]</code> || align=center | Catalan <-> Galician || align=center | 18 Nov 2016 |
| align=center | <code>[[apertium-cat-glg]]</code> || align=center | Catalan <-> Galician || align=center | 18 Nov 2016 |
||
⚫ | |||
⚫ | |||
|- |
|- |
||
|} |
|} |
||
Line 57: | Line 57: | ||
|- |
|- |
||
| align=center | <code>[[apertium-eng-cat]]</code> || align=center | English <-> Catalan || align=center | 24 Jan 2016 |
| align=center | <code>[[apertium-eng-cat]]</code> || align=center | English <-> Catalan || align=center | 24 Jan 2016 |
||
|- |
|||
| align=center | <code>[[apertium-por-cat]]</code> || align=center | Portuguese <-> Catalan || align=center | 22 Jan 2016 |
|||
|- |
|- |
||
|} |
|} |
||
== Apertium-cat == |
== Apertium-cat == |
||
===Current status=== |
|||
''Last update: 28 Aug 2017'' |
|||
'''Dix entries:''' 56,588 |
|||
'''Dix paradigms:''' 607 |
|||
'''Coverage:''' 94.04% (Wikipedia) |
|||
===Dictionary guidelines=== |
|||
The current Catalan dictionary is quite big (more than 55,000 entries), so tidiness is essential to ensure future development: |
|||
* Keep entries sorted alphabetically. |
|||
* Keep entries grouped by type and tags (do not mix different types of proper nouns together). |
|||
* Check the file with apertium-dixtools (to update the number of entries and remove duplicates). |
|||
====Proper nouns==== |
|||
Catalan proper nouns (names, toponyms, acronyms, etc.) should all have gender and number. They were once removed, but they should be specified using the following paradigms: |
|||
* Toponyms <np><top><m><sg>: Iran__np |
|||
* Toponyms <np><top><f><sg>: Àfrica__np |
|||
* Toponyms <np><top><m><pl>: Estats_Units__np |
|||
* Toponyms <np><top><f><pl>: Balears__np |
|||
* Antroponyms <np><ant><m><sg>: Marc__np |
|||
* Antroponyms <np><ant><f><sg>: Maria__np |
|||
* Family names <np><cog><mf><sp>: Saussure__np |
|||
* Others <np><al><m><sg>: Linux__np |
|||
* Others <np><al><f><sg>: Wikipedia__np |
|||
* Others <np><al><m><pl>: Jocs_Olímpics__np |
|||
* Others <np><al><f><pl>: Falles__np |
|||
* Others <np><al><mf><sp>: Honda__np |
|||
===Future work=== |
|||
* Add support for proper noun articles (en/na). |
|||
* Restore gender and number to proper nouns that still do not have them. |
|||
* Tweak entries related to proper nouns with translations (kings, queens, etc.). |
|||
**This is already partially done using lexical selection and a specific macro in several language pairs |
|||
''For further documentation about Catalan in Apertium, check: [[:Category:Catalan]]'' |
|||
[[Category:Catalan]] |
[[Category:Catalan]] |
||
[[Category:Languages]] |
[[Category:Languages]] |
||
[[Category:Romance languages]] |
[[Category:Romance languages]] |
||
[[Category:Documentation in English]] |
Latest revision as of 14:23, 5 October 2019
Catalan (Wikipedia:Catalan language) is a Romance language. It is available in Apertium as a standalone analyser/generator (apertium-cat) and as a component of several pairs which translate to/from Catalan.
Contents
Language pairs[edit]
See also: List of language pairs
In trunk:
Pair name | Languages | Last update |
---|---|---|
apertium-arg-cat |
Aragonese <-> Catalan | 12 Sep 2016 |
apertium-cat-ita |
Catalan <-> Italian | 02 Sep 2019 |
apertium-en-ca |
English <-> Catalan | 28 Mar 2016 |
apertium-eo-ca |
Esperanto <-- Catalan | 13 Dec 2015 |
apertium-fra-cat |
French <-> Catalan | 18 Apr 2017 |
apertium-oc-ca |
Occitan <-> Catalan | 13 Dec 2015 |
apertium-por-cat |
Portuguese <-> Catalan | 16 Sep 2019 |
apertium-spa-cat |
Spanish <-> Catalan | 01 Apr 2017 |
apertium-cat-srd |
Catalan <-> Sardinian | 06 Sep 2017 |
In staging:
Pair name | Languages | Last update |
---|---|---|
apertium-cat-glg |
Catalan <-> Galician | 18 Nov 2016 |
In nursery:
Pair name | Languages | Last update |
---|---|---|
apertium-ca-ro |
Catalan <-> Romanian | 30 Sep 2015 |
In incubator:
Pair name | Languages | Last update |
---|---|---|
apertium-cat-cos |
Catalan <-> Corsican | 05 Nov 2013 |
apertium-cat-ina |
Catalan <-> Interlingua | 07 Jan 2016 |
apertium-eng-cat |
English <-> Catalan | 24 Jan 2016 |
Apertium-cat[edit]
Current status[edit]
Last update: 28 Aug 2017
Dix entries: 56,588
Dix paradigms: 607
Coverage: 94.04% (Wikipedia)
Dictionary guidelines[edit]
The current Catalan dictionary is quite big (more than 55,000 entries), so tidiness is essential to ensure future development:
- Keep entries sorted alphabetically.
- Keep entries grouped by type and tags (do not mix different types of proper nouns together).
- Check the file with apertium-dixtools (to update the number of entries and remove duplicates).
Proper nouns[edit]
Catalan proper nouns (names, toponyms, acronyms, etc.) should all have gender and number. They were once removed, but they should be specified using the following paradigms:
- Toponyms <np><top><m><sg>: Iran__np
- Toponyms <np><top><f><sg>: Àfrica__np
- Toponyms <np><top><m><pl>: Estats_Units__np
- Toponyms <np><top><f><pl>: Balears__np
- Antroponyms <np><ant><m><sg>: Marc__np
- Antroponyms <np><ant><f><sg>: Maria__np
- Family names <np><cog><mf><sp>: Saussure__np
- Others <np><al><m><sg>: Linux__np
- Others <np><al><f><sg>: Wikipedia__np
- Others <np><al><m><pl>: Jocs_Olímpics__np
- Others <np><al><f><pl>: Falles__np
- Others <np><al><mf><sp>: Honda__np
Future work[edit]
- Add support for proper noun articles (en/na).
- Restore gender and number to proper nouns that still do not have them.
- Tweak entries related to proper nouns with translations (kings, queens, etc.).
- This is already partially done using lexical selection and a specific macro in several language pairs
For further documentation about Catalan in Apertium, check: Category:Catalan