Difference between revisions of "User:AMR-KELEG/GSoC19 Proposal"

Revision as of 19:41, 28 March 2019

Personal Information

  • Name: Amr Keleg
  • E-mail address: amr.keleg@eng.asu.edu.eg / amr_mohamed@live.com
  • IRC: AMR-KELEG
  • Location: Cairo, Egypt
  • Timezone: UTC+02
  • Current job: MSc student and teaching assistant at the Computer and Systems Department, Faculty of Engineering, Ain Shams University, Cairo, Egypt.

Skills and qualifications

  • Past GSoC participant.
  • Competitive programming.
  • Worked for one year on developing a sentiment analysis model for Arabic.
  • Made several contributions to open source projects.
  • Participated in online and on-site competitive programming contests.
  • Completed Udacity's data analysis nanodegree.
  • Good command of git and the GitHub contribution process.
  • Experienced in using C++ and Python.
  • Have used Ubuntu for more than 3 years.
  • Basic knowledge of shell scripting.

Coding challenge

Code repository: https://github.com/AMR-KELEG/apertium-unsupervised-weighting-of-automata

Project Information

Why is it that you are interested in Apertium?

Which of the published tasks are you interested in? What do you plan to do?

Include a proposal, including

   * a title,
   * reasons why Google and Apertium should sponsor it,
   * a description of how and who it will benefit in society,
   * and a detailed work plan (including, if possible, a schedule with milestones and deliverables).

Work Plan

Community Bonding
  • Communicate with the maintainers and get to know Apertium better.
  • Solve some issues on GitHub.

Week 1 (27 May - 3 June)
  • Implement a baseline model for weighting automata.

Week 2 (4 June - 10 June)
  • Develop the first supervised model (unigram counts).
  • Write a shell script for generating weights using a tagged corpus.

Week 3 (11 June - 17 June)
  • Read, understand, and plan the implementation of the publication for the first unsupervised model.

Week 4 (18 June - 24 June)
  • Finalise the first unsupervised model and compare it to the supervised one.

Evaluation 1
  • Deliverables: Two shell scripts for generating weights using both supervised and unsupervised techniques.

Week 5 (29 June - 5 July)
  • Read, understand, and plan the implementation of the publication for the second unsupervised model.

Week 6 (6 July - 12 July)
  • Implement the second unsupervised model.

Week 7 (13 July - 22 July)
  • Read, understand, and plan the implementation of the publication for the second unsupervised model.

Week 8 (23 July - 26 July)
  • Implement the second unsupervised model.

Evaluation 2
  • Deliverables: A shell script for using the second unsupervised model and a plan for implementing the third one.

Week 9 (27 July - 2 August)
  • Implement the third unsupervised model.

Week 10 (3 August - 9 August)
  • Solve issues related to the developed models.

Week 11-12 (10 August - 26 August)
  • Write the required documentation and merge the code into Apertium's repositories.

Final evaluation
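
As an illustration of the supervised model planned for Week 2 (unigram counts), the following is a minimal sketch, not the proposal's actual implementation. It assumes that each analysis in a tagged corpus is weighted by its negative log relative frequency, with add-k smoothing so that unseen analyses still receive a finite weight. The function name `unigram_weights`, the `smoothing` parameter, and the sample analysis strings are hypothetical and chosen only for this example.

```python
import math
from collections import Counter


def unigram_weights(tagged_analyses, smoothing=1.0):
    """Map each analysis string to a weight of -log(P(analysis)).

    tagged_analyses: iterable of analysis strings from a tagged corpus.
    smoothing: add-k constant (assumed here; must be greater than 0).
    """
    counts = Counter(tagged_analyses)
    # One extra slot in the denominator is reserved for unseen analyses.
    total = sum(counts.values()) + smoothing * (len(counts) + 1)
    weights = {
        analysis: -math.log((count + smoothing) / total)
        for analysis, count in counts.items()
    }
    # Weight given to any analysis that never occurs in the corpus.
    unseen_weight = -math.log(smoothing / total)
    return weights, unseen_weight


# Hypothetical toy corpus of analyses in an Apertium-like tag notation.
corpus = ["dog<n><sg>", "dog<n><sg>", "dog<vblex><inf>", "cat<n><sg>"]
weights, unseen = unigram_weights(corpus)
for analysis, weight in sorted(weights.items(), key=lambda kv: kv[1]):
    print(f"{analysis}\t{weight:.4f}")
```

Lower weights correspond to more frequent analyses, which matches the usual convention of treating weights as costs in a weighted automaton. How such weights would be attached to the transducer itself is left open here.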