Ideas for Google Summer of Code/UD Annotatrix

From Apertium

UD Annotatrix is a tool for annotating Universal Dependencies trees. The objective of this project is to extend it with the features listed below.

Tasks

  • General: Relates to the whole codebase
  • Interface: Relates to the front end graph drawing and buttons/tabs etc. (HTML, CSS, JS)
  • Backend: The JS that does format parsing and validation (JS)
  • Server: The code that runs the server (Python)

Task | Days (approx.) | Areas (General, Interface, Backend, Server; one x per area) | Relevant issues | Status
Comprehensive testing framework | 5 | x | 84 | done
Interface for disambiguation (in CG mode) | 1 | x | 10 |
Integrate d3's graphing library and TreeX | 5 | x | 262 | won't fix
Support for ambiguous sentences | 5 | x x | 72, 90 | bug fixes needed
Full keyboard support | 2 | x | |
Support for enhanced dependencies | 5 | x x | 178, 179 |
Integration with GitHub accounts | 2 | x | 97, 280 (PR) | needs merging
Integration with Git | 5 | x x x | |
Collaborative editing support | 5 | x x x | 241 |
Search functionality | 5 | x x x | |
Full logging support | 2 | x x | 189 |
Automatic punctuation support | 1 | x | 265 | partially done
Active learning | 5 | x x | |
Automatic validation | 4 | x x x(?) | 99 |
Morphology filling from API | 4 | x x(?) | 122 |
Decent support for labels (like GitHub) | 2 | x x | | done

Task details
  1. Comprehensive testing framework
    • Work out how the code is structured, then add tests for its functionality (interface and backend).
  2. Integrate d3's graphing library and TreeX
    • Better graph/edge layouts
  3. Full keyboard support
  4. Collaborative editing support
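
For the testing task, a backend test can start very small. The sketch below does not use UD Annotatrix's real module layout: `parseConllu` and `validateTree` are hypothetical helpers written here so the example is self-contained. It assumes plain 10-column CoNLL-U token lines with integer IDs and heads, so multiword-token ranges and empty nodes are out of scope, and it does not detect cycles.

```javascript
// Hypothetical helpers for illustration; not UD Annotatrix's actual API.

// Parse one CoNLL-U sentence into token objects.
// Comment lines (starting with "#") and blank lines are skipped.
function parseConllu(text) {
  const tokens = [];
  for (const line of text.split("\n")) {
    if (line.trim() === "" || line.startsWith("#")) continue;
    const cols = line.split("\t");
    if (cols.length !== 10) {
      throw new Error("expected 10 tab-separated columns, got " + cols.length);
    }
    tokens.push({
      id: Number(cols[0]),
      form: cols[1],
      upos: cols[3],
      head: Number(cols[6]),
      deprel: cols[7],
    });
  }
  return tokens;
}

// Return a list of problems: there must be exactly one root,
// every head must exist, and no token may be its own head.
function validateTree(tokens) {
  const errors = [];
  const ids = new Set(tokens.map((t) => t.id));
  const roots = tokens.filter((t) => t.head === 0);
  if (roots.length !== 1) {
    errors.push("expected exactly one root, found " + roots.length);
  }
  for (const t of tokens) {
    if (t.head === t.id) {
      errors.push("token " + t.id + " is its own head");
    } else if (t.head !== 0 && !ids.has(t.head)) {
      errors.push("token " + t.id + " has unknown head " + t.head);
    }
  }
  return errors;
}

// A two-token sentence used as a test fixture.
const sample = [
  "# text = Dogs bark",
  "1\tDogs\tdog\tNOUN\t_\t_\t2\tnsubj\t_\t_",
  "2\tbark\tbark\tVERB\t_\t_\t0\troot\t_\t_",
].join("\n");

const parsed = parseConllu(sample);
console.log(parsed.length, validateTree(parsed));
```

Tests of this shape keep the parsing and validation logic separate from the graph drawing, so they can run without a browser.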

Coding challenge

  • Download UD Annotatrix
  • Fix one issue in the server version
  • Fix one issue in the client
  • Annotate 20 trees
  • Proof of concept for graphing with d3 (hard-coded values)
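
The last item can begin with plain geometry before any d3 call is involved. The sketch below hard-codes a three-token sentence and computes one SVG path string per dependency arc. The names `tokenX` and `arcPath`, the spacing constants, and the sentence itself are assumptions made for illustration. In a d3 proof of concept, each string would be set as the `d` attribute of a `path` element.

```javascript
// Hard-coded sentence: id, form, head (0 = root). Illustrative values only.
const tokens = [
  { id: 1, form: "Dogs", head: 2 },
  { id: 2, form: "bark", head: 0 },
  { id: 3, form: "loudly", head: 2 },
];

const SPACING = 100; // horizontal distance between tokens, in pixels (assumed)
const BASELINE = 200; // y coordinate of the token row (assumed)

// x coordinate of the centre of a token, given its 1-based id.
function tokenX(id) {
  return id * SPACING;
}

// SVG path for an arc from head to dependent: a quadratic Bezier curve
// whose control point rises with the distance between the two tokens.
function arcPath(headId, depId) {
  const x1 = tokenX(headId);
  const x2 = tokenX(depId);
  const midX = (x1 + x2) / 2;
  const controlY = BASELINE - Math.abs(x2 - x1) / 2;
  return `M ${x1} ${BASELINE} Q ${midX} ${controlY} ${x2} ${BASELINE}`;
}

// One path per non-root token.
const arcs = tokens
  .filter((t) => t.head !== 0)
  .map((t) => ({ dep: t.form, d: arcPath(t.head, t.id) }));

console.log(arcs);
```

Because the control point's height grows with the distance between tokens, longer arcs are drawn above shorter ones, which reduces overlap in nested structures.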