Ideas for Google Summer of Code/UD Annotatrix

From Apertium
Jump to navigation Jump to search

UD Annotatrix is a tool for annotating Universal Dependencies. The objective of this project is to extend it in useful ways:

Tasks

  • General: Relates to the whole codebase
  • Interface: Relates to the front end graph drawing and buttons/tabs etc. (HTML, CSS, JS)
  • Backend: The JS that does format parsing and validation (JS)
  • Server: The code that runs the server (Python)
Task Days (approx.) General Interface Backend Server Relevant issues
Comprehensive testing framework 5 x
Interface for disambiguation (in CG mode) 1 x
Integrate d3's graphing library and TreeX 5 x
Support for ambiguous sentences 5 x x
Full keyboard support 2 x
Support for enhanced dependencies 5 x x
Integration with GitHub accounts 2 x
Integration with Git 5 x x x
Collaborative editting support 5 x x x
Search functionality 5 x x x
Full logging support 2 x x
Automatic punctuation support 1 x
Active learning 5 x x
Automatic validation 4 x x x(?)
Morphology filling from API 4 x x(?) 122
Decent support for labels (like GitHub) 2 x x
Stuffs
  1. Comprehensive testing framework
    • Find out how the code works, and add tests for functionalities (interface and backend).
  2. Integrate d3's graphing library and TreeX
    • Better graph/edge layouts
  3. Full keyboard support
  4. Collaborative editting support

Coding challenge

  • Download UD Annotatrix
  • Fix one issue in the server version
  • Fix one issue in the client
  • Annotate 20 trees
  • Proof of concept for graphing with d3 (hard-coded values)