Difference between revisions of "Ideas for Google Summer of Code/FieldWorks data extraction"

Latest revision as of 16:34, 1 March 2023

FieldWorks stores much of the data we need when building a monodix (monolingual dictionary).

Things we might be able to get:

Write a script that reads a FieldWorks file and outputs the headword and part of speech of each lexicon entry.

Downloading FieldWorks and making up your own data to test this is fine (you'll probably end up doing a lot of it over the course of the project).
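
As a rough starting point, here is a minimal sketch of such a script. It assumes the lexicon has first been exported from FieldWorks as LIFT XML, with entries shaped like `entry/lexical-unit/form/text` for the headword and `sense/grammatical-info` with a `value` attribute for the part of speech. The native FieldWorks project file is laid out differently and would need its own parsing. The function name `extract_entries` and the sample data are made up for illustration.

```python
import xml.etree.ElementTree as ET

# Made-up test data in the shape of a LIFT export (an assumption, not real project data).
SAMPLE_LIFT = """<lift version="0.13">
  <entry id="casa_1">
    <lexical-unit><form lang="es"><text>casa</text></form></lexical-unit>
    <sense id="s1"><grammatical-info value="Noun"/></sense>
  </entry>
  <entry id="correr_1">
    <lexical-unit><form lang="es"><text>correr</text></form></lexical-unit>
    <sense id="s2"><grammatical-info value="Verb"/></sense>
    <sense id="s3"><grammatical-info value="Verb"/></sense>
  </entry>
  <entry id="azul_1">
    <lexical-unit><form lang="es"><text>azul</text></form></lexical-unit>
  </entry>
</lift>"""


def extract_entries(lift_xml):
    """Return a list of (headword, part_of_speech) pairs.

    An entry with several distinct parts of speech yields one pair per
    part of speech; an entry with none yields (headword, None).
    """
    root = ET.fromstring(lift_xml)
    rows = []
    for entry in root.iter("entry"):
        # Take the first form of the lexical unit as the headword.
        headword = entry.findtext("lexical-unit/form/text")
        if headword is None:
            continue  # skip entries without a usable headword

        # Collect the distinct part-of-speech values across all senses.
        tags = []
        for info in entry.iterfind("sense/grammatical-info"):
            value = info.get("value")
            if value and value not in tags:
                tags.append(value)

        if not tags:
            rows.append((headword, None))
        for tag in tags:
            rows.append((headword, tag))
    return rows


if __name__ == "__main__":
    for headword, pos in extract_entries(SAMPLE_LIFT):
        print(f"{headword}\t{pos or '?'}")
```

To run it on a real export, read the file contents and pass them to `extract_entries` in place of `SAMPLE_LIFT`. Entries without a part of speech are kept rather than dropped, since they still need to be categorised by hand before they can go into a monodix.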

@@ Line 22: / Line 22: @@
 ** Links to morphological stuff
 * http://downloads.sil.org/FieldWorks/WW-ConceptualIntro/ConceptualIntroduction.htm
+** Documentation
-** Long list of data we might be able to get
+** There might be data in the parent directory: http://downloads.sil.org/FieldWorks/
 * https://github.com/sillsdev/FieldWorks
 ** FieldWorks internals (might need this to figure out formats, but hopefully not)