Difference between revisions of "User:Youssefsan"

From Apertium
Jump to navigation Jump to search
 
(146 intermediate revisions by 2 users not shown)
Line 1: Line 1:
I am Youssef Oualmakran. I have a four-year degree in translation [[French]]-[[English]] and French-[[Spanish]]. In 2006-2007 I taught [[Dutch]]. In 2007 I taught also English. I am also a power industrial engineer.
I am Youssef Oualmakran.

2012-2013. Currently, I am studing the use of case based reasoning (CBR) for power restoration in smart grids. I aslo studying [[Arabic]] and [[Russian]], as attending [[German]] classes in Girona.

I would like to learn more about:
* languages (see below)
* programming: C++, Python
* artificial intelligence and machine learning
* linguistics

I am a mentor for the Google Code In 2012 (see [[user:youssefsan/GCI 2012]]). On this Wiki, I am a sysop (administrator) since December 2012. You can contact me on my talk page, on IRC (when I am connected), or by email.

I used to work on the Dutch-French pair.
You can email me at '''<youssefsan@gmail.com>'''.

More about me on http://fr.wikipedia.org/wiki/Utilisateur:Youssefsan
More about me on http://fr.wikipedia.org/wiki/Utilisateur:Youssefsan



== Languages ==

* Good knowledge (= able to communicate/understand quite well) in [[French]], [[English]], [[Catalan]], [[Spanish]].
*Advanced intermidiate in [[Dutch]].
*Lower intermediate [[German]].
*Basic (= limited or very limited capabilities): [[Arabic]], [[Russian]], [[Amazigh]]
* Studied somehow (probably very broken to say the least) [[Japanese]], [[Chinese]], [[Latin]]

Interests in:
* Improving [[Arabic]] and [[Russian]]
* Progressing in [[Japanese]], [[Chinese]]
* Starting [[Greek]]
* Knowing more [[romance languages]]: [[Italian]], [[Portuguese]], [[Romanian]]
* (At least) scandinavian languages: [[Swedish]], [[Norvegian]], [[Danish]]
* Amerindian languages: [[Nahuatl]], [[Maya]], ...
* [[Esperanto]]
* Non-Indoeuropean languages: Euskera, [[Finnish]], [[Hungarian]] ....

== Links ==
*[[List of Apertium mentors]]

== New todo ==

* create [[paradigm]]
* add examples and expand [[lemma]]
* Improve [[Monodix basics]]
::''File:Monodix.png'' fix it. Paradigms are missing and other possible errors.
:: https://en.wikipedia.org/wiki/Wikipedia:Tutorial/Formatting



* Update [[Greek]]
* Read [[Task ideas for Google Code-in]]
* [[user:Youssefsan/todo]]: more stuff to do
* [[user:youssefsan/GCI 2012]]: mentorship
* review [[Missing chemical elements]], http://www.google-melange.com/gci/task/view/google/gci2012/7968236
* connect regularly to IRC #Apertium but not too much
* update [[Spanish]] (missing pairs)
* update [[Dutch]] (available pairs, open source tools)
* Look at Apertium work for [[Arabic]] and [[Russian]]
* Ar-mt: still some non Arabic letters there, Hebrew?
* Arabic and final vocals?

* [[Romance languages]] add covered languages ([[List of language pairs]]) and uncovered ([[Wikipedia:Romance language]])

*Have a look at [[Language and pair maintainer]]

* Tamazight
** Introduction à la réalisation d'un dictionnaire Amazigh-Amazigh à base Kabyle www.ummto.dz/IMG/pdf/these_corrigee_Hamek_Brahim.pdf Thesis
** http://arabeclassique.forumactif.com/f140-berbere-berber forum
** research institute on Tamazight (they search collaboration) http://www.itrane-university.net/
** http://www.itrane-university.net/ (click on e-library)

== Less urgent todo ==

Wiki:
* Merge [[monodix]], [[Morphological dictionary]] and [[Monodix basics]]

Errors:
* fr-ca. "Traduction en catalan" --> "Traducció en català" when it should have been "al" instead of "en català"

Update
*[[Contributing to an existing pair]] (remove emails--> mailing list and update)
* [[Romance languages]] add missing languages and pairs ([[Wikipedia:Romance languages]])

Have a look at:
* https://svn.code.sf.net/p/apertium/svn/languages/
* [[Apertium for Dummies]]
* [[Setup for working on morphological dictionaries]]
* http://en.wikipedia.org/wiki/Apertium
*[[Using linguistic resources]]
*[[Verb movement]]
* http://wiki.apertium.eu/index.php/Session_0:_Overview
* [[Курсы машинного перевода для языков России/Session 0]]
* [[Contributing]]

Google Summer of Code:
* Languages with no coverage
** [[Germanic languages]]: [[Frisian]], [[Limburgish]], [[Luxembourgish]]...
** Amerindian: [[Maya]], [[Nahuatl]],...

== Language books ==

English:
* http://www.teachyourself.co.uk/

French:
* http://www.cbsnews.com/video/watch/?id=50142079n
* http://www.fr.fnac.be/

Other
* http://www.editorial.albujayra.com/distribucion/index.php?language=es Based in Spain

== English ==

*aluminium also spelled aluminum

*polysemic English words: No = negation ; = symbol of Nobelium

== Spanish-French ==

'''Missing words in Spanish'''

*copiar (= «copier»)

'''Disambig'''

*servidor (= «serviteur» or «serveur»)

== Test ==
Bonjour. Je suis Youssef Oualmakran, diplômé en traduction. J'étudie actuellement à Bruxelles en Belgique. Je vais vous écrire un petit test pour Apertium, un logiciel de traduction automatique. Ce logiciel est sous licence libre. Si vous avez des questions vous pouvez me contacter par courrier électronique. Bonne journée.

== TODO ==

* Update Apertium article on fr:, make a new article in nl:
** See if there are other outdated articles, or missing articles
* improve the article machine translation on fr:
* Update the article on Framasoft
* Write an article in French Linux magazines

*Write an article on the wiki of Ubuntu https://wiki.ubuntu.com/

*'''High priority''' reverse enegenieer http://ilk.uvt.nl/tadpole/ Morphological analyzer GPL for Dutch

== Notes ==

For xml linguistics data, Apertium uses DTD + xmllint from libxml2-utils

== Links ==

* http://mt-archive.info/ Papers on machine translation
* http://meta.wikimedia.org/wiki/Interwiki_map


To edit the text of Recent change
To edit the text of Recent change
Line 11: Line 158:


*http://wiki.framasoft.info/Notices/Index
*http://wiki.framasoft.info/Notices/Index


*http://xixona.dlsi.ua.es/~ebenimeli/apertium/ Alternative home page

*http://fr.wikipedia.org/wiki/Mod%C3%A8le_de_Markov_cach%C3%A9


== Links ==
== Links ==
Line 20: Line 172:
*http://apertium.sourceforge.net/extending.html
*http://apertium.sourceforge.net/extending.html
*file:///home/youssef/apertium/apertium-documentation-en/documentation.html
*file:///home/youssef/apertium/apertium-documentation-en/documentation.html
*www.sf.net/projects/matxin


== Test ==
== Test ==
Line 30: Line 183:
*http://portal.unesco.org/ci/en/ev.php-URL_ID=24553&URL_DO=DO_TOPIC&URL_SECTION=201.html WLDC
*http://portal.unesco.org/ci/en/ev.php-URL_ID=24553&URL_DO=DO_TOPIC&URL_SECTION=201.html WLDC
*http://dico.isc.cnrs.fr/ dico du cnrs fr, en + synonyms
*http://dico.isc.cnrs.fr/ dico du cnrs fr, en + synonyms
*http://forum.wordreference.com/ languages forums


== Promotion ==
== Promotion ==
Line 36: Line 190:
*http://www.framasoft.net/
*http://www.framasoft.net/
*http://freshmeat.net/
*http://freshmeat.net/
*http://gallery.yahoo.com/dev/e515a802e4ba1af14705442e3ecd8bd3 contact wacky for yahoo messenger plugin
*http://fr.wikipedia.org/wiki/Wikip%C3%A9dia:Promotion/Francophonie/M%C3%A9dias


== other tools ==
== other tools ==
Line 41: Line 197:
*http://www.cs.chalmers.se/~markus/extract/
*http://www.cs.chalmers.se/~markus/extract/
*$ cat <list> | lt-proc nl-fr.automorf.bin | grep '\/\*'
*$ cat <list> | lt-proc nl-fr.automorf.bin | grep '\/\*'
*http://www.otal.umd.edu/uupractice/clir/ CLIR (Cross Language Information Retrieva)
*http://transifex.org/ l10n platform originally for Fedora


== Todo ==
== Todo ==


*http://ilazki.thinkgeek.co.uk/~spectre/apertium-en_GB.po
*http://ilazki.thinkgeek.co.uk/~spectre/apertium-en_GB.po
*http://developer.pidgin.im/query asking new features (making a ticket and call it enhancement)
*youssef@youssef-desktop:~/apertium/webspace$


== Apertium Bugzilla ==
== Apertium Bugzilla ==
Line 53: Line 213:


*http://gtb.inl.nl/?owner=WNT
*http://gtb.inl.nl/?owner=WNT
*http://oase.uci.kun.nl/~ans/ ANS
*http://www.paultenhove.nl/ventura/engine.php?Cmd=see&P_site=352&P_self=34&PMax=list&PSkip= Free bidix on agriculture, etc (tspier2)
*http://ilk.uvt.nl/tadpole/ Morphological analyzer GPL for Dutch
*http://taalunieversum.org/nieuws/2110/ontwikkeling_corpus_geschreven_nederlands_kan_doorgaan Stevin

== Languages stats ==

*http://en.wikipedia.org/wiki/Wikipedia:Multilingual_statistics
:Polish, Japanese and Italian are languages in Wikipedia top10 still missing for Apertium


== Irish ==

* http://www.daltai.com/phrases/phrases004.htm

== NL ==

*http://www.linupedia.nl/wiki/index.php/Hoofdpagina
*http://doc.nl.linux.org/
*http://uitmuntend.de/ nl-de
*http://ote.2meta.com/

== Linguistics ==

*http://ccat.sas.upenn.edu/jod/texts/donatus.4.html Donatus, ~1600 part of speech

== Mapundgun (Chile)==

*Mainly spoken, not written.
*Contreversy about translation from Microsoft
*http://www.liberaciondigital.org/wiki/Equipos_LD_en_Regiones#Temuco

== Mexico ==

*http://www.lenmex.org/

== Po ==

*http://pootle.wordforge.org/doc/en/howto.html

== Esperanto ==

*http://e-d-e.org/?lang=fr 'Europe Démocratie Espéranto' Poltical party
:http://e-d-e.org/rubrique.php3?id_rubrique=14&lang=fr Contact

*http://www.esperanto-gb.org/eab/contact.htm#feedback UK
*http://www.esperanto.ie/english/aboutEAI.htm Ireland

== Occitan ==

*http://occitanet.free.fr/fr/index.html
*http://www.panoccitan.org/association/

== Neo normand ==

*http://groups.google.fr/group/neonormand?msg=subscribe

== XML ==

*http://www.w3schools.com/schema/schema_intro.asp XSD tutorial
*http://lists.xml.org/archives/xml-dev/200106/msg00589.html xml 1.0 vs xml 1.1
*http://java.sun.com/webservices/jaxp/dist/1.1/docs/tutorial/TOC.html#intro java and xml
*http://java.sun.com/xml/
*http://en.wikipedia.org/wiki/JAXP
*http://www.cafeconleche.org/slides/sd2000east/dom/01.html
*http://www.rgagnon.com/javadetails/java-0408.html
*http://snippets.dzone.com/posts/show/4009
*http://www.roseindia.net/xml/dom/ ++
*http://www.ibm.com/developerworks/xml/library/x-jaxp/
*http://www.xml.com/pub/a/2005/07/06/jaxp.html
*http://wiki.java.net/bin/view/Javapedia/JAXP
*https://jaxp.dev.java.net/
*http://developers.sun.com/sw/building/codesamples/dom/doc/DOMSample.java
*http://developers.sun.com/sw/building/codesamples/dom/index.html
*http://java.sun.com/developer/technicalArticles/xml/JavaTechandXML_part3/index.html
*http://www.xquery.com/tutorials/xqj_tutorial/ XQJ

== XQuery ==

*http://en.wikipedia.org/wiki/XQuery#Implementations
*http://saxon.sourceforge.net/ (commercial&free)
*http://www.gnu.org/software/qexo/

== Spanish ==
*http://wikilengua.fundeu.es/index.php/Portada

== German ==

*http://wiki.mcworld.org/index.php/Basic_German Wiki
*http://www.sagen.at/doku/lexika/mundart1811/mundart1811.html Dictionary
*http://www.wordmirror.com/index.php
*dict.leo.org

== Spell checker ==

*http://wiki.services.openoffice.org/wiki/Dictionary#French_.28France.2C_....29
*http://www.gutenberg.eu.org/distributions/6-francais-gutenberg.html
*http://www.abisource.com/projects/enchant/

== Cree ==

*http://adornthetruth.com/nikamo/ Cree language, java software, GPL

== Php/java ==

*http://quercus.caucho.com/

== Irc tips ==

*/msg nickserv ghost youssef password

== Ant ==

*http://ant.apache.org/
*http://wiki.apache.org/ant/FrontPage
*http://en.wikibooks.org/wiki/Programming:Apache_Ant
*http://ideoplex.com/focus/java#ant

== Translation projects ==

*http://www.aspirationtech.org/events/opentranslation

== Misc ==

*http://www.opentaal.org/
*http://www.speling.org/

== Speech synthesis ==

*http://www.cstr.ed.ac.uk/projects/festival/

== Firefox plugin ==

*http://developer.mozilla.org/en/docs/Building_an_Extension
*https://addons.mozilla.org/en-US/firefox/browse/type:1/cat:37/sort:popular

[[Category:Users|Youssefsan]]

== Language tools ==

*http://www.languagetool.org/ grammar spell checker. Rules defined in XML and Java

== MT general ==

*mt-archive.info

== Translation memory==

*http://linuxfr.org/2008/03/03/23787.html
*http://memoireslibres.tuxfamily.org/index.html

== Euroweek ==

*http://www.free-articles-zone.com/author/14090
*http://www.expertarticles.com/article/Reference-Education/Language/Tools-to-Use-for-a-Professional-Translation.html
*http://thetundradrums.com/news/story/1627 Alaskan languages

== Language skills problems ==
*http://news.bbc.co.uk/2/hi/uk_news/england/shropshire/7279463.stm


== Catalan ==

*http://www.assemblee-nationale.fr/13/cra/provisoire/netprovi.asp

==Spanish==

*http://culturitalia.uibk.ac.at/hispanoteca/Foro-preguntas/ARCHIVO-Foro/Buenos días-buenas tardes-noches.htm

==German==
*http://youtube.com/watch?v=Mf5KXzpdqKk


==MT errors ==


*http://adweek.blogs.com/photos/uncategorized/2008/07/10/translateservererror.jpg
*http://china.kylereed.com/Sichuan/FunnySign.jpg

== Language learning ==
*http://shtooka.net/project/fr/ Audio sound for fr
*http://linuxfr.org/2008/08/20/24396.html Openspell: learning African langauges


== Multilingual wiki ==
*http://www.anwiki.com


== Free interfaces on Ubuntu==

*Freespeak
*Youtranslate

Latest revision as of 20:47, 12 December 2013

I am Youssef Oualmakran. I have a four-year degree in translation French-English and French-Spanish. In 2006-2007 I taught Dutch. In 2007 I taught also English. I am also a power industrial engineer.

2012-2013. Currently, I am studing the use of case based reasoning (CBR) for power restoration in smart grids. I aslo studying Arabic and Russian, as attending German classes in Girona.

I would like to learn more about:

  • languages (see below)
  • programming: C++, Python
  • artificial intelligence and machine learning
  • linguistics

I am a mentor for the Google Code In 2012 (see user:youssefsan/GCI 2012). On this Wiki, I am a sysop (administrator) since December 2012. You can contact me on my talk page, on IRC (when I am connected), or by email.

I used to work on the Dutch-French pair. You can email me at <youssefsan@gmail.com>.

More about me on http://fr.wikipedia.org/wiki/Utilisateur:Youssefsan


Languages[edit]

Interests in:

Links[edit]

New todo[edit]

File:Monodix.png fix it. Paradigms are missing and other possible errors.
https://en.wikipedia.org/wiki/Wikipedia:Tutorial/Formatting


Less urgent todo[edit]

Wiki:

Errors:

  • fr-ca. "Traduction en catalan" --> "Traducció en català" when it should have been "al" instead of "en català"

Update

Have a look at:

Google Summer of Code:

Language books[edit]

English:

French:

Other

English[edit]

  • aluminium also spelled aluminum
  • polysemic English words: No = negation ; = symbol of Nobelium

Spanish-French[edit]

Missing words in Spanish

  • copiar (= «copier»)

Disambig

  • servidor (= «serviteur» or «serveur»)

Test[edit]

Bonjour. Je suis Youssef Oualmakran, diplômé en traduction. J'étudie actuellement à Bruxelles en Belgique. Je vais vous écrire un petit test pour Apertium, un logiciel de traduction automatique. Ce logiciel est sous licence libre. Si vous avez des questions vous pouvez me contacter par courrier électronique. Bonne journée.

TODO[edit]

  • Update Apertium article on fr:, make a new article in nl:
    • See if there are other outdated articles, or missing articles
  • improve the article machine translation on fr:
  • Update the article on Framasoft
  • Write an article in French Linux magazines

Notes[edit]

For xml linguistics data, Apertium uses DTD + xmllint from libxml2-utils

Links[edit]

To edit the text of Recent change

See also


Links[edit]

Test[edit]

Linguistics[edit]

Promotion[edit]

other tools[edit]

Todo[edit]

Apertium Bugzilla[edit]

Dutch[edit]

Languages stats[edit]

Polish, Japanese and Italian are languages in Wikipedia top10 still missing for Apertium


Irish[edit]

NL[edit]

Linguistics[edit]

Mapundgun (Chile)[edit]

Mexico[edit]

Po[edit]

Esperanto[edit]

http://e-d-e.org/rubrique.php3?id_rubrique=14&lang=fr Contact

Occitan[edit]

Neo normand[edit]

XML[edit]

XQuery[edit]

Spanish[edit]

German[edit]

Spell checker[edit]

Cree[edit]

Php/java[edit]

Irc tips[edit]

  • /msg nickserv ghost youssef password

Ant[edit]

Translation projects[edit]

Misc[edit]

Speech synthesis[edit]

Firefox plugin[edit]

Language tools[edit]

MT general[edit]

  • mt-archive.info

Translation memory[edit]

Euroweek[edit]

Language skills problems[edit]


Catalan[edit]

Spanish[edit]

German[edit]


MT errors[edit]

Language learning[edit]


Multilingual wiki[edit]


Free interfaces on Ubuntu[edit]

  • Freespeak
  • Youtranslate