Difference between revisions of "Talk:German to English"
Jump to navigation
Jump to search
m ('viele' is the plural of 'ein'?????) |
|||
Line 41: | Line 41: | ||
::<PoS><gender><number><case> - for lack of a better phrase, that's the order of inherency, plus it's easier to work with. Much easier. -- [[User:Jimregan|Jimregan]] 15:39, 19 October 2011 (UTC) |
::<PoS><gender><number><case> - for lack of a better phrase, that's the order of inherency, plus it's easier to work with. Much easier. -- [[User:Jimregan|Jimregan]] 15:39, 19 October 2011 (UTC) |
||
::Also, listing 'viele' as the plural of 'ein' is dubious, and will more than likely cause problems. Treat them as separate words -- [[User:Jimregan|Jimregan]] 15:53, 19 October 2011 (UTC) |
::Also, listing 'viele' as the plural of 'ein' is dubious, and will more than likely cause problems. Treat them as separate words -- [[User:Jimregan|Jimregan]] 15:53, 19 October 2011 (UTC) |
||
::: I started a stub at [[Tag_order]] on this, but it's not very complete. --[[User:Unhammer|unhammer]] 07:05, 20 October 2011 (UTC) |
Revision as of 07:05, 20 October 2011
What's the best approach to start adding entries to the German monodix?
- A good way would be to stat writing a script to download Wiktionary entries for German nouns and converting them into speling format, e.g.
Bett; Bett; sg.nom; n.nt Bett; Bettes; sg.gen; n.nt Bett; Betts; sg.gen; n.nt Bett; Bett; sg.dat; n.nt Bett; Bett; sg.acc; n.nt Bett; Betten; pl.nom; n.nt Bett; Betten; pl.gen; n.nt Bett; Betten; pl.dat; n.nt Bett; Betten; pl.acc; n.nt Haus; Haus; sg.nom; n.nt Haus; Hauses; sg.gen; n.nt Haus; Haus; sg.gen; n.nt Haus; Haus; sg.dat; n.nt Haus; Haus; sg.acc; n.nt Haus; Häuser; pl.nom; n.nt Haus; Häuser; pl.gen; n.nt Haus; Häusern; pl.dat; n.nt Haus; Häuser; pl.acc; n.nt
- There are around 15,000 entries in the category German nouns, so that should be a good start. - Francis Tyers 07:13, 18 October 2011 (UTC)
- Another thing you can do is make lists of closed category words that don't inflect (E.g. prepositions, conjunctions) and also of abbreviations. - Francis Tyers 07:15, 18 October 2011 (UTC)
Francis, what should be the expected order of the symbols in the morphological analysis? Let's say we are analyzing "Apfel", is it <POS><gender><case><number> or <POS><gender><number><case>? I guess it should also output all the possible cases, e.g.:
Apfel<n><m><nom><sg> Apfel<n><m><acc><sg> Apfel<n><m><dat><sg>
- <PoS><gender><number><case> - for lack of a better phrase, that's the order of inherency, plus it's easier to work with. Much easier. -- Jimregan 15:39, 19 October 2011 (UTC)
- Also, listing 'viele' as the plural of 'ein' is dubious, and will more than likely cause problems. Treat them as separate words -- Jimregan 15:53, 19 October 2011 (UTC)