Difference between revisions of "User:Youssefsan"
Youssefsan (talk | contribs) m (→Search engine) |
Youssefsan (talk | contribs) |
||
(32 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
I am Youssef Oualmakran. I have a four-year degree in translation French-English and French-Spanish. In 2006-2007 I taught Dutch. In 2007 I taught also English. I am also a power industrial engineer. |
I am Youssef Oualmakran. I have a four-year degree in translation [[French]]-[[English]] and French-[[Spanish]]. In 2006-2007 I taught [[Dutch]]. In 2007 I taught also English. I am also a power industrial engineer. |
||
2012-2013. Currently, I am studing the use of case based reasoning (CBR) for power restoration in smart grids. I aslo studying [[Arabic]] and [[Russian]], as attending [[German]] classes in Girona. |
|||
⚫ | |||
I would like to learn more about: |
|||
* languages (see below) |
|||
* programming: C++, Python |
|||
* artificial intelligence and machine learning |
|||
* linguistics |
|||
I am a mentor for the Google Code In 2012 (see [[user:youssefsan/GCI 2012]]). On this Wiki, I am a sysop (administrator) since December 2012. You can contact me on my talk page, on IRC (when I am connected), or by email. |
|||
⚫ | |||
You can email me at '''<youssefsan@gmail.com>'''. |
You can email me at '''<youssefsan@gmail.com>'''. |
||
More about me on http://fr.wikipedia.org/wiki/Utilisateur:Youssefsan |
More about me on http://fr.wikipedia.org/wiki/Utilisateur:Youssefsan |
||
== Languages == |
|||
* Good knowledge (= able to communicate/understand quite well) in [[French]], [[English]], [[Catalan]], [[Spanish]]. |
|||
*Advanced intermidiate in [[Dutch]]. |
|||
*Lower intermediate [[German]]. |
|||
*Basic (= limited or very limited capabilities): [[Arabic]], [[Russian]], [[Amazigh]] |
|||
* Studied somehow (probably very broken to say the least) [[Japanese]], [[Chinese]], [[Latin]] |
|||
Interests in: |
|||
* Improving [[Arabic]] and [[Russian]] |
|||
* Progressing in [[Japanese]], [[Chinese]] |
|||
* Starting [[Greek]] |
|||
* Knowing more [[romance languages]]: [[Italian]], [[Portuguese]], [[Romanian]] |
|||
* (At least) scandinavian languages: [[Swedish]], [[Norvegian]], [[Danish]] |
|||
* Amerindian languages: [[Nahuatl]], [[Maya]], ... |
|||
* [[Esperanto]] |
|||
* Non-Indoeuropean languages: Euskera, [[Finnish]], [[Hungarian]] .... |
|||
== Links == |
|||
*[[List of Apertium mentors]] |
|||
== New todo == |
== New todo == |
||
* create [[paradigm]] |
|||
* add examples and expand [[lemma]] |
|||
* Improve [[Monodix basics]] |
|||
::''File:Monodix.png'' fix it. Paradigms are missing and other possible errors. |
|||
:: https://en.wikipedia.org/wiki/Wikipedia:Tutorial/Formatting |
|||
* Update [[Greek]] |
|||
* Read [[Task ideas for Google Code-in]] |
* Read [[Task ideas for Google Code-in]] |
||
* [[user:Youssefsan/todo]]: more stuff to do |
* [[user:Youssefsan/todo]]: more stuff to do |
||
Line 13: | Line 54: | ||
* review [[Missing chemical elements]], http://www.google-melange.com/gci/task/view/google/gci2012/7968236 |
* review [[Missing chemical elements]], http://www.google-melange.com/gci/task/view/google/gci2012/7968236 |
||
* connect regularly to IRC #Apertium but not too much |
* connect regularly to IRC #Apertium but not too much |
||
* update [[Spanish]] (missing pairs) |
|||
* download Apertium source code with SVN |
|||
* update [[Dutch]] (available pairs, open source tools) |
|||
* Look at Apertium work for [[Arabic]] and [[Russian]] |
* Look at Apertium work for [[Arabic]] and [[Russian]] |
||
* Ar-mt: still some non Arabic letters there, Hebrew? |
* Ar-mt: still some non Arabic letters there, Hebrew? |
||
* Arabic and final vocals? |
* Arabic and final vocals? |
||
* [[Romance languages]] add covered languages ([[List of language pairs]]) and uncovered ([[Wikipedia:Romance language]]) |
|||
*Have a look at [[Language and pair maintainer]] |
|||
* Tamazight |
|||
** Introduction à la réalisation d'un dictionnaire Amazigh-Amazigh à base Kabyle www.ummto.dz/IMG/pdf/these_corrigee_Hamek_Brahim.pdf Thesis |
|||
** http://arabeclassique.forumactif.com/f140-berbere-berber forum |
|||
** research institute on Tamazight (they search collaboration) http://www.itrane-university.net/ |
|||
** http://www.itrane-university.net/ (click on e-library) |
|||
== Less urgent todo == |
== Less urgent todo == |
||
Wiki: |
|||
* Merge [[monodix]], [[Morphological dictionary]] and [[Monodix basics]] |
|||
Errors: |
|||
* fr-ca. "Traduction en catalan" --> "Traducció en català" when it should have been "al" instead of "en català" |
|||
Update |
|||
*[[Contributing to an existing pair]] (remove emails--> mailing list and update) |
|||
* [[Romance languages]] add missing languages and pairs ([[Wikipedia:Romance languages]]) |
|||
Have a look at: |
Have a look at: |
||
* https://svn.code.sf.net/p/apertium/svn/languages/ |
|||
* [[Apertium for Dummies]] |
|||
* [[Setup for working on morphological dictionaries]] |
|||
* http://en.wikipedia.org/wiki/Apertium |
|||
*[[Using linguistic resources]] |
*[[Using linguistic resources]] |
||
*[[Verb movement]] |
*[[Verb movement]] |
||
* http://wiki.apertium.eu/index.php/Session_0:_Overview |
|||
* [[Курсы машинного перевода для языков России/Session 0]] |
|||
* [[Contributing]] |
|||
Google Summer of Code: |
Google Summer of Code: |
||
Line 29: | Line 97: | ||
** [[Germanic languages]]: [[Frisian]], [[Limburgish]], [[Luxembourgish]]... |
** [[Germanic languages]]: [[Frisian]], [[Limburgish]], [[Luxembourgish]]... |
||
** Amerindian: [[Maya]], [[Nahuatl]],... |
** Amerindian: [[Maya]], [[Nahuatl]],... |
||
== Language books == |
|||
English: |
|||
* http://www.teachyourself.co.uk/ |
|||
French: |
|||
* http://www.cbsnews.com/video/watch/?id=50142079n |
|||
* http://www.fr.fnac.be/ |
|||
Other |
|||
* http://www.editorial.albujayra.com/distribucion/index.php?language=es Based in Spain |
|||
== English == |
== English == |
||
Line 66: | Line 146: | ||
== Links == |
== Links == |
||
* http://mt-archive.info/ Papers on machine translation |
|||
* http://meta.wikimedia.org/wiki/Interwiki_map |
|||
To edit the text of Recent change |
To edit the text of Recent change |
||
Line 222: | Line 305: | ||
*http://www.wordmirror.com/index.php |
*http://www.wordmirror.com/index.php |
||
*dict.leo.org |
*dict.leo.org |
||
== Pasting text == |
|||
*http://rafb.net/paste/ |
|||
== Spell checker == |
== Spell checker == |
Latest revision as of 20:47, 12 December 2013
I am Youssef Oualmakran. I have a four-year degree in translation French-English and French-Spanish. In 2006-2007 I taught Dutch. In 2007 I taught also English. I am also a power industrial engineer.
2012-2013. Currently, I am studing the use of case based reasoning (CBR) for power restoration in smart grids. I aslo studying Arabic and Russian, as attending German classes in Girona.
I would like to learn more about:
- languages (see below)
- programming: C++, Python
- artificial intelligence and machine learning
- linguistics
I am a mentor for the Google Code In 2012 (see user:youssefsan/GCI 2012). On this Wiki, I am a sysop (administrator) since December 2012. You can contact me on my talk page, on IRC (when I am connected), or by email.
I used to work on the Dutch-French pair. You can email me at <youssefsan@gmail.com>.
More about me on http://fr.wikipedia.org/wiki/Utilisateur:Youssefsan
Contents
- 1 Languages
- 2 Links
- 3 New todo
- 4 Less urgent todo
- 5 Language books
- 6 English
- 7 Spanish-French
- 8 Test
- 9 TODO
- 10 Notes
- 11 Links
- 12 Links
- 13 Test
- 14 Linguistics
- 15 Promotion
- 16 other tools
- 17 Todo
- 18 Apertium Bugzilla
- 19 Dutch
- 20 Languages stats
- 21 Irish
- 22 NL
- 23 Linguistics
- 24 Mapundgun (Chile)
- 25 Mexico
- 26 Po
- 27 Esperanto
- 28 Occitan
- 29 Neo normand
- 30 XML
- 31 XQuery
- 32 Spanish
- 33 German
- 34 Spell checker
- 35 Cree
- 36 Php/java
- 37 Irc tips
- 38 Ant
- 39 Translation projects
- 40 Misc
- 41 Speech synthesis
- 42 Firefox plugin
- 43 Language tools
- 44 MT general
- 45 Translation memory
- 46 Euroweek
- 47 Language skills problems
- 48 Catalan
- 49 Spanish
- 50 German
- 51 MT errors
- 52 Language learning
- 53 Multilingual wiki
- 54 Free interfaces on Ubuntu
Languages[edit]
- Good knowledge (= able to communicate/understand quite well) in French, English, Catalan, Spanish.
- Advanced intermidiate in Dutch.
- Lower intermediate German.
- Basic (= limited or very limited capabilities): Arabic, Russian, Amazigh
- Studied somehow (probably very broken to say the least) Japanese, Chinese, Latin
Interests in:
- Improving Arabic and Russian
- Progressing in Japanese, Chinese
- Starting Greek
- Knowing more romance languages: Italian, Portuguese, Romanian
- (At least) scandinavian languages: Swedish, Norvegian, Danish
- Amerindian languages: Nahuatl, Maya, ...
- Esperanto
- Non-Indoeuropean languages: Euskera, Finnish, Hungarian ....
Links[edit]
New todo[edit]
- create paradigm
- add examples and expand lemma
- Improve Monodix basics
- File:Monodix.png fix it. Paradigms are missing and other possible errors.
- https://en.wikipedia.org/wiki/Wikipedia:Tutorial/Formatting
- Update Greek
- Read Task ideas for Google Code-in
- user:Youssefsan/todo: more stuff to do
- user:youssefsan/GCI 2012: mentorship
- review Missing chemical elements, http://www.google-melange.com/gci/task/view/google/gci2012/7968236
- connect regularly to IRC #Apertium but not too much
- update Spanish (missing pairs)
- update Dutch (available pairs, open source tools)
- Look at Apertium work for Arabic and Russian
- Ar-mt: still some non Arabic letters there, Hebrew?
- Arabic and final vocals?
- Romance languages add covered languages (List of language pairs) and uncovered (Wikipedia:Romance language)
- Have a look at Language and pair maintainer
- Tamazight
- Introduction à la réalisation d'un dictionnaire Amazigh-Amazigh à base Kabyle www.ummto.dz/IMG/pdf/these_corrigee_Hamek_Brahim.pdf Thesis
- http://arabeclassique.forumactif.com/f140-berbere-berber forum
- research institute on Tamazight (they search collaboration) http://www.itrane-university.net/
- http://www.itrane-university.net/ (click on e-library)
Less urgent todo[edit]
Wiki:
- Merge monodix, Morphological dictionary and Monodix basics
Errors:
- fr-ca. "Traduction en catalan" --> "Traducció en català" when it should have been "al" instead of "en català"
Update
- Contributing to an existing pair (remove emails--> mailing list and update)
- Romance languages add missing languages and pairs (Wikipedia:Romance languages)
Have a look at:
- https://svn.code.sf.net/p/apertium/svn/languages/
- Apertium for Dummies
- Setup for working on morphological dictionaries
- http://en.wikipedia.org/wiki/Apertium
- Using linguistic resources
- Verb movement
- http://wiki.apertium.eu/index.php/Session_0:_Overview
- Курсы машинного перевода для языков России/Session 0
- Contributing
Google Summer of Code:
- Languages with no coverage
- Germanic languages: Frisian, Limburgish, Luxembourgish...
- Amerindian: Maya, Nahuatl,...
Language books[edit]
English:
French:
Other
English[edit]
- aluminium also spelled aluminum
- polysemic English words: No = negation ; = symbol of Nobelium
Spanish-French[edit]
Missing words in Spanish
- copiar (= «copier»)
Disambig
- servidor (= «serviteur» or «serveur»)
Test[edit]
Bonjour. Je suis Youssef Oualmakran, diplômé en traduction. J'étudie actuellement à Bruxelles en Belgique. Je vais vous écrire un petit test pour Apertium, un logiciel de traduction automatique. Ce logiciel est sous licence libre. Si vous avez des questions vous pouvez me contacter par courrier électronique. Bonne journée.
TODO[edit]
- Update Apertium article on fr:, make a new article in nl:
- See if there are other outdated articles, or missing articles
- improve the article machine translation on fr:
- Update the article on Framasoft
- Write an article in French Linux magazines
- Write an article on the wiki of Ubuntu https://wiki.ubuntu.com/
- High priority reverse enegenieer http://ilk.uvt.nl/tadpole/ Morphological analyzer GPL for Dutch
Notes[edit]
For xml linguistics data, Apertium uses DTD + xmllint from libxml2-utils
Links[edit]
- http://mt-archive.info/ Papers on machine translation
- http://meta.wikimedia.org/wiki/Interwiki_map
To edit the text of Recent change
See also
- http://xixona.dlsi.ua.es/~ebenimeli/apertium/ Alternative home page
Links[edit]
- http://tspier.shtooka.net/Apertium/nl-en/apertium-en-nl.en-nl.dix
- http://ilazki.thinkgeek.co.uk/apertium/index.php
- http://www-igm.univ-mlv.fr/~unitex/ Unitex
- http://en.wikipedia.org/wiki/Evaluation_of_machine_translation
- http://apertium.sourceforge.net/extending.html
- file:///home/youssef/apertium/apertium-documentation-en/documentation.html
- www.sf.net/projects/matxin
Test[edit]
Linguistics[edit]
- http://www.thewldc.org/ WLDC
- http://portal.unesco.org/ci/en/ev.php-URL_ID=24553&URL_DO=DO_TOPIC&URL_SECTION=201.html WLDC
- http://dico.isc.cnrs.fr/ dico du cnrs fr, en + synonyms
- http://forum.wordreference.com/ languages forums
Promotion[edit]
- http://wiki.framasoft.info/PropositionNotice/Apertium
- http://www.framasoft.net/
- http://freshmeat.net/
- http://gallery.yahoo.com/dev/e515a802e4ba1af14705442e3ecd8bd3 contact wacky for yahoo messenger plugin
- http://fr.wikipedia.org/wiki/Wikip%C3%A9dia:Promotion/Francophonie/M%C3%A9dias
other tools[edit]
- http://www.cs.chalmers.se/~markus/extract/
- $ cat <list> | lt-proc nl-fr.automorf.bin | grep '\/\*'
- http://www.otal.umd.edu/uupractice/clir/ CLIR (Cross Language Information Retrieva)
- http://transifex.org/ l10n platform originally for Fedora
Todo[edit]
- http://ilazki.thinkgeek.co.uk/~spectre/apertium-en_GB.po
- http://developer.pidgin.im/query asking new features (making a ticket and call it enhancement)
- youssef@youssef-desktop:~/apertium/webspace$
Apertium Bugzilla[edit]
Dutch[edit]
- http://gtb.inl.nl/?owner=WNT
- http://oase.uci.kun.nl/~ans/ ANS
- http://www.paultenhove.nl/ventura/engine.php?Cmd=see&P_site=352&P_self=34&PMax=list&PSkip= Free bidix on agriculture, etc (tspier2)
- http://ilk.uvt.nl/tadpole/ Morphological analyzer GPL for Dutch
- http://taalunieversum.org/nieuws/2110/ontwikkeling_corpus_geschreven_nederlands_kan_doorgaan Stevin
Languages stats[edit]
- Polish, Japanese and Italian are languages in Wikipedia top10 still missing for Apertium
Irish[edit]
NL[edit]
- http://www.linupedia.nl/wiki/index.php/Hoofdpagina
- http://doc.nl.linux.org/
- http://uitmuntend.de/ nl-de
- http://ote.2meta.com/
Linguistics[edit]
- http://ccat.sas.upenn.edu/jod/texts/donatus.4.html Donatus, ~1600 part of speech
Mapundgun (Chile)[edit]
- Mainly spoken, not written.
- Contreversy about translation from Microsoft
- http://www.liberaciondigital.org/wiki/Equipos_LD_en_Regiones#Temuco
Mexico[edit]
Po[edit]
Esperanto[edit]
- http://e-d-e.org/?lang=fr 'Europe Démocratie Espéranto' Poltical party
- http://www.esperanto-gb.org/eab/contact.htm#feedback UK
- http://www.esperanto.ie/english/aboutEAI.htm Ireland
Occitan[edit]
Neo normand[edit]
XML[edit]
- http://www.w3schools.com/schema/schema_intro.asp XSD tutorial
- http://lists.xml.org/archives/xml-dev/200106/msg00589.html xml 1.0 vs xml 1.1
- http://java.sun.com/webservices/jaxp/dist/1.1/docs/tutorial/TOC.html#intro java and xml
- http://java.sun.com/xml/
- http://en.wikipedia.org/wiki/JAXP
- http://www.cafeconleche.org/slides/sd2000east/dom/01.html
- http://www.rgagnon.com/javadetails/java-0408.html
- http://snippets.dzone.com/posts/show/4009
- http://www.roseindia.net/xml/dom/ ++
- http://www.ibm.com/developerworks/xml/library/x-jaxp/
- http://www.xml.com/pub/a/2005/07/06/jaxp.html
- http://wiki.java.net/bin/view/Javapedia/JAXP
- https://jaxp.dev.java.net/
- http://developers.sun.com/sw/building/codesamples/dom/doc/DOMSample.java
- http://developers.sun.com/sw/building/codesamples/dom/index.html
- http://java.sun.com/developer/technicalArticles/xml/JavaTechandXML_part3/index.html
- http://www.xquery.com/tutorials/xqj_tutorial/ XQJ
XQuery[edit]
- http://en.wikipedia.org/wiki/XQuery#Implementations
- http://saxon.sourceforge.net/ (commercial&free)
- http://www.gnu.org/software/qexo/
Spanish[edit]
German[edit]
- http://wiki.mcworld.org/index.php/Basic_German Wiki
- http://www.sagen.at/doku/lexika/mundart1811/mundart1811.html Dictionary
- http://www.wordmirror.com/index.php
- dict.leo.org
Spell checker[edit]
- http://wiki.services.openoffice.org/wiki/Dictionary#French_.28France.2C_....29
- http://www.gutenberg.eu.org/distributions/6-francais-gutenberg.html
- http://www.abisource.com/projects/enchant/
Cree[edit]
- http://adornthetruth.com/nikamo/ Cree language, java software, GPL
Php/java[edit]
Irc tips[edit]
- /msg nickserv ghost youssef password
Ant[edit]
- http://ant.apache.org/
- http://wiki.apache.org/ant/FrontPage
- http://en.wikibooks.org/wiki/Programming:Apache_Ant
- http://ideoplex.com/focus/java#ant
Translation projects[edit]
Misc[edit]
Speech synthesis[edit]
Firefox plugin[edit]
- http://developer.mozilla.org/en/docs/Building_an_Extension
- https://addons.mozilla.org/en-US/firefox/browse/type:1/cat:37/sort:popular
Language tools[edit]
- http://www.languagetool.org/ grammar spell checker. Rules defined in XML and Java
MT general[edit]
- mt-archive.info
Translation memory[edit]
Euroweek[edit]
- http://www.free-articles-zone.com/author/14090
- http://www.expertarticles.com/article/Reference-Education/Language/Tools-to-Use-for-a-Professional-Translation.html
- http://thetundradrums.com/news/story/1627 Alaskan languages
Language skills problems[edit]
Catalan[edit]
Spanish[edit]
- http://culturitalia.uibk.ac.at/hispanoteca/Foro-preguntas/ARCHIVO-Foro/Buenos días-buenas tardes-noches.htm
German[edit]
MT errors[edit]
- http://adweek.blogs.com/photos/uncategorized/2008/07/10/translateservererror.jpg
- http://china.kylereed.com/Sichuan/FunnySign.jpg
Language learning[edit]
- http://shtooka.net/project/fr/ Audio sound for fr
- http://linuxfr.org/2008/08/20/24396.html Openspell: learning African langauges
Multilingual wiki[edit]
Free interfaces on Ubuntu[edit]
- Freespeak
- Youtranslate