Difference between revisions of "User:AMR-KELEG/GSoC19 Proposal"

From Apertium
Jump to navigation Jump to search
Line 58: Line 58:
 
* and a detailed work plan (including, if possible, a schedule with milestones and deliverables).
 
* and a detailed work plan (including, if possible, a schedule with milestones and deliverables).
   
=== Work plan ===
+
== Work Plan ==
   
  +
{| class="wikitable" border="1"
* Week 1:
 
  +
|-
* Week 2:
 
  +
| Community Bonding
* Week 3:
 
  +
| Communicate with the maintainers and get to know Apertium better.
* Week 4:
 
  +
Solve some issues on Github.
 
  +
|-
* '''Deliverable #1'''
 
 
| Week 1
 
  +
(27 May - 3 June)
* Week 5:
 
  +
| Implement a baseline model for weigthing automata.
* Week 6:
 
  +
|-
* Week 7:
 
* Week 8:
+
| Week 2
  +
(4 June - 10 June)
 
  +
| Develop the first supervised model (Unigram counts).
* '''Deliverable #2'''
 
  +
Write a shell script for generating weights using a tagged corpus.
 
  +
|-
* Week 9:
 
* Week 10:
+
| Week 3
  +
(11 June - 17 June)
* Week 11:
 
  +
| Read, Understand and plan for implementing the publication for the first unsupervised model.
* Week 12:
 
  +
|-
 
 
| Week 4
* '''Project completed'''
 
  +
(18 June - 24 June)
 
  +
| Finalise the first unsupervised model and compare it to the supervised one.
Include time needed to think, to program, to document and to disseminate.
 
  +
|-
 
 
| '''Evaluation 1'''
If you are intending to disseminate to a conference, which conference are you intending to submit to. Make sure
 
  +
'''Deliverables: Two shell scripts for generating weights using both supervised and unsupervised techniques.'''
to factor in time taken to run any experiments/evaluations and write them up in your work plan.
 
  +
|-
 
 
| Week 5
List your skills and give evidence of your qualifications. Tell us what is your current field of study,
 
  +
(29 June - 5 July)
major, etc. Convince us that you can do the work.
 
  +
| Read, Understand and plan for implementing the publication for the second unsupervised model.
 
  +
|-
List any non-Summer-of-Code plans you have for the Summer, especially employment, if you are applying for
 
 
| Week 6
internships, and class-taking. Be specific about schedules and time commitments. we would like to be sure you have
 
  +
(6 July - 12 July)
at least 30 free hours a week to develop for our project.
 
  +
| Implement the second unsupervised model.
 
  +
|-
[[Category:GSoC 2019 student proposals|AMR-KELEG]]
 
 
| Week 7
  +
(13 July - 22 July)
  +
| Read, Understand and plan for implementing the publication for the second unsupervised model.
  +
|-
 
| Week 8
  +
(23 July - 12 July)
  +
| Implement the second unsupervised model.
  +
|-
 
| '''Evaluation 2'''
  +
'''Deliverables: A shell script for using the second unsupervised model and a plan for implementing the third one.'''
  +
|-
 
| Week 9
  +
(27 July - 2 August)
  +
| Implement the third unsupervised model.
  +
|-
 
| Week 10
  +
(3 August - 9 August)
  +
| Solve issues related to the developed models.
  +
|-
 
| Week 11-12
  +
(10 August - 26 August)
  +
| Write the required documentation and merge the code into Apertium's repositories.
  +
|-
  +
| '''Final evaluation'''
  +
|
  +
|}

Revision as of 19:41, 28 March 2019

Personal Information

  • Name: Amr Keleg
  • E-mail address: amr.keleg@eng.asu.edu.eg / amr_mohamed@live.com
  • IRC: AMR-KELEG
  • Location: Cairo, Egypt
  • Timezone: UTC+02
  • Current job: A MSc student and a teacher assistant at Computer and systems department, Faculty of Engineering, Ain Shams university, Cairo, Egypt.

Skills and qualifications

  • Past GSoC participant.
  • Competitive programming.
  • Worked for one year on developing sentiment analysis model for Arabic language.
  • Made several contributions to open source projects
  • Participated in online and on-site competitive programming contest.
  • Completed Udacity's data analysis nanodegree.
  • Good command of git and the GitHub process of contribution.
  • Experienced in using C++ and python.
  • Using Ubuntu for more than 3 years.
  • Basic knowledge of shell scripting.

Coding challenge

Code repository: https://github.com/AMR-KELEG/apertium-unsupervised-weighting-of-automata

Project Information

Why is it that you are interested in Apertium?

Which of the published tasks are you interested in? What do you plan to do?

Include a proposal, including

   * a title,
   * reasons why Google and Apertium should sponsor it,
   * a description of how and who it will benefit in society,
   * and a detailed work plan (including, if possible, a schedule with milestones and deliverables).

Work Plan

Community Bonding Communicate with the maintainers and get to know Apertium better.

Solve some issues on Github.

Week 1

(27 May - 3 June)

Implement a baseline model for weigthing automata.
Week 2

(4 June - 10 June)

Develop the first supervised model (Unigram counts).

Write a shell script for generating weights using a tagged corpus.

Week 3

(11 June - 17 June)

Read, Understand and plan for implementing the publication for the first unsupervised model.
Week 4

(18 June - 24 June)

Finalise the first unsupervised model and compare it to the supervised one.
Evaluation 1

Deliverables: Two shell scripts for generating weights using both supervised and unsupervised techniques.

Week 5

(29 June - 5 July)

Read, Understand and plan for implementing the publication for the second unsupervised model.
Week 6

(6 July - 12 July)

Implement the second unsupervised model.
Week 7

(13 July - 22 July)

Read, Understand and plan for implementing the publication for the second unsupervised model.
Week 8

(23 July - 12 July)

Implement the second unsupervised model.
Evaluation 2

Deliverables: A shell script for using the second unsupervised model and a plan for implementing the third one.

Week 9

(27 July - 2 August)

Implement the third unsupervised model.
Week 10

(3 August - 9 August)

Solve issues related to the developed models.
Week 11-12

(10 August - 26 August)

Write the required documentation and merge the code into Apertium's repositories.
Final evaluation