User:Dj
I am Dhiraj Lohiya, currently pursuing B.E. (Hons.) Computer Science from B.I.T.S. Pilani, India.
I got interested in the field of Computational Linguistics a year back when I started a project on "Self-improving Phonetic Matching Algorithm" and read a lot of research papers in the field of phonetics, edit distance, n-grams etc. I found the topics really interesting and it was fun working over these topics even for long hours, be it checking out the theory by pronouncing the words/letters/vowels/consonants yourself or be it trying to figure out how phonetic match could contribute to retrieval of better search results or be it how the rules could be dynamically evolved always considering the users right and modifying/appending rules to the algorithm based on certain threshold.
Here, I would also love to add a new language pair to Apertium amongst Hindi, Marathi and English and thus take a dive in the machine translation part as well which is definitely going to be a great learning experience dealing with the nitty-gritty of machine translation.
I'll be working with Kevin Scannell on the project involving statistical diacritic restoration for GSoC 2011. You can read more about my project here.