Difference between revisions of "Welsh to English/Regression tests"
		
		
		
		
		
		
		Jump to navigation
		Jump to search
		
				
		
		
		
		
		
		
		
	
| Line 209: | Line 209: | ||
| ;Demonstratives | ;Demonstratives | ||
| ⚫ | |||
| * (cy) ''y llyfr  | * (cy) ''y llyfr yma.'' → this book. | ||
| * (cy) ''y llyfr  | * (cy) ''y llyfr 'ma.'' → this book. | ||
| * (cy) ''y llyfr  | * (cy) ''y llyfr yna.'' → that book. | ||
| ⚫ | |||
| * (cy) ''y llyfr hwn.'' → this book. | * (cy) ''y llyfr hwn.'' → this book. | ||
| * (cy) ''y gath hon.'' → this cat. | * (cy) ''y gath hon.'' → this cat. | ||
Revision as of 10:52, 28 July 2008
To run these tests, run the script regression-tests.sh from the apertium-cy-en language pair in SVN.
- Treatment of 'is' in present tense.
- (en) The boy is in the garden. → mae y bachgen yn yr ardd. (note: yr → 'r is an open bug)
- (cy) mae'r bachgen yn yr ardd. → the boy is in the garden.
- Yep, I know the pattern, the problem is that the post-generator insists on having a ~ before anything that it deals with -- This would mean that we have to have '~' before every vowel, which would be quite difficult. There is another possibility though, if we can't fix that and it would be to just use a plain transliterator to replace:
 
 
- "aeiouwy + space + ~yr + space" with "aeiouwy'r + space"
 
 
 
- Can you think of anything this might catch by accident? or is it a fairly safe search/replace? - Francis Tyers
 
 
- I would go with this in the meantime - I think it's pretty safe.  Note that your rule can act on both 'y' and 'yr'.  The system is:
- consonant + space + y + space + consonant
- consonant + space + yr + space + vowel
- vowel + 'r + space + consonant-or-vowel
 
 
- I would go with this in the meantime - I think it's pretty safe.  Note that your rule can act on both 'y' and 'yr'.  The system is:
 
 
- Prepositional noun phrase should not be a subject
- (cy) cerddodd fo i'r dref → he walked to the town
- (cy) cerddodd i'r dref → [he/she] walked to the town
- (cy) allan i'r cyfarfod → out to the meeting
- (cy) roedd o ar dy lyfr → it was on your book
- (cy) roedd ar dy lyfr → [it] was on your book
- No subject shift with imperative
- (cy) gwasgwch y botwm! → squeeze the button!
- (en) squeeze the button! → gwasgu y botwm! (note: infinitive for imperative is an open bug)
- "yn" as stative
- (cy) yn falch → proud
- (cy) yn hapus → happy
- (cy) tyfodd fo yn fawr → he grew big
- Subject shift for pronouns
- (cy) roedden nhw'n hapus → they were happy
- Number agreement of verb
- (cy) roedd y bechgyn yn hapus → the boys were happy
- (cy) roedd y cwningod yn hapus → the rabbits were happy
- (cy) gwelodd y dyn y llyfr → the man saw the book
- "yn" as "-ing"
- (cy) yn mynd → going
- (cy) yn gweld → seeing
- Conjugation of 'mynd'
- (cy) aeth fo → he went
- Comparative adjectives
- (cy) tyfodd y twnnel yn fwy → the tunnel grew bigger
- (cy) tyfodd y twnnel yn llai → the tunnel grew smaller
- (cy) tyfodd y twnnel yn hirach → the tunnel grew longer
- (cy) tyfodd y twnnel yn uwch → the tunnel grew higher
- Insertion of indefinite determiner
- (cy) daeth Taid â lamp → Grandfather came with a lamp
- (cy) dychwelodd y rheolwr gyda gŵr tew → the manager returned with a fat man
- Verbal nouns / Infinitives
- (cy) roedd y dyn yn gwerthu pethau rhad → the man was selling cheap things
- (cy) roedd fo yn palu → he was digging
- (cy) ceisiodd y dyn agor y bocs → the man tried to open the box
- Synthetic to non-synthetic adjectives
- (en) the nice house → y tŷ neis
- (en) the nicer house → y tŷ neisiach
- (en) the nicest house → y tŷ neisiaf
- (en) the dirty cat → y gath budr
- (en) the dirtier cat → y gath fwy budr
- (en) the dirtiest cat → y gath fwyaf budr
- Yes, the synthetic endings would be nice to sort out, but I don't think there would be a mutation here, so that's not a bug. - Donnek
 
- The synthetic endings there are valid then? - Francis Tyers
 
 
- Yes, in this case you would tend to say "y tŷ neisiaf". - Donnek
 
 
- Hmm. Something odd about these. "y gath fudr" - fem adj has SM. Because the adj is short, you would also tend to use "butrach" and "butraf" in the second two. "peryglus" (dangerous) might be a better adj to use.- Donnek
 
- There are a few adjectives which need to be marked as synthetic in the monodix. Is there likely to be an easy way of doing this? - Francis Tyers
 
- As a rough rule for the time being, I would say you could mark any adj of 2 syllables or less as synthetic, but of course that is not scientific. - Donnek
 
 
- Subordinate ("reported speech") clauses with "bod" + noun
- (cy) clywodd y dyn bod y trên am gyrraedd yn hwyr. → the man heard that the train will arrive late.
- (cy) yr honiadau bod yr AS wedi methu datgan £103,000 o roddion. → the allegations that the MP has failed declare £103,000 of gifts.
- (cy) clywodd y dyn bod y trên ar gyrraedd yn hwyr. → the man heard that the train was about to arrive late.
- Infinitive after "yn lle"
- (cy) yn lle mynd dros y ffordd. → instead of going over the road.
- (en) instead of going over the road → yn lle mynd dros y f/ffordd (note: initial mutation error here needs to be fixed)
- "sydd" / "sy"
- (cy) mae'r dyn yn adeiladu gwesty sy'n darparu llawer o ystafelloedd. → the man is building hotel that provides a lot of rooms.
- (cy) mae'r blaid yn gwneud rhywbeth sy'n cyfrannu at ennill yr etholiad. → the party is doing something that contributes towards winning the election.
- (cy) mae'r dafad yn pori yn y maes sy'n cynnig bwyd da. → the sheep is grazing in the field that offers good food.
- (cy) mae'r defaid yn pori yn y cae sy'n cynnig bwyd da. → the sheep are grazing in the field that offers good food.
- Subordinate ("reported speech") clauses with "bod" + pronoun
- (cy) mae hi'n dweud eu bod nhw wedi mynd → she is saying that they have gone
- (cy) mae hi'n dweud eu bod nhw am mynd → she is saying that they will go
- (cy) mae hi'n dweud eu bod nhw ar mynd → she is saying that they are about to go
- (cy) mae'n amlwg ei fod o'n dweud y gwir → is obvious that he is telling the truth
- (cy) clywodd y dyn ei fod o'n cyrraedd yn hwyr → the man heard that he is arriving late (note: he → it is open bug)
- (cy) ei fod o wedi dod → that he has come
- (cy) ei fod o am adael → that he will leave
- (cy) ei bod hi'n gadael am ddyn arall → that she is leaving for another man
- "cael" - get / have
- (cy) mi aeth o i'r banc i gael pres allan → he went to the bank to get money out
- (cy) mae o'n cael mynd ar ôl y cwbl → he's getting to go after all
- (cy) mi gafodd yr athro radd dda o Brifysgol Bangor → the teacher got a good degree from the University of Bangor
- (cy) mae gan yr athro radd dda o Brifysgol Bangor → the teacher has a good degree from the University of Bangor
- (cy) mi gafodd y dyn ei daro gan gar → the man was struck by a car
- (cy) ei bod hi'n cael perthynas â dyn llawer hŷn → that she is having a relationship with a much older man
- Genitive noun-phrase
- (cy) merch y meddyg → The daughter of the doctor
- "i" + infin
- (cy) aeth y bachgen wedi i'r bws ddod → The boy went after the bus came
- Conjunctive genitive with proper names
- (cy) Flwyddyn neu ddwy ar ôl priodas Elin. → Year or two after the marriage of Elin.
- Elided pr.poss after a vowel
- (cy) â'i gar → With his car
- (cy) â'i char → With her car
- Conjunctive genitive with co-ordination
- (cy) Ond mae fy nhad, mam a thad fy ngŵr wedi talu trethi ac erioed wedi defnyddio'r gwasanaeth iechyd. → But my father, mother and the father of my man is after pay taxes and never after use the service health.
- Fragility of "bod + noun" (1.3.15)
- (cy) Un bore yn yr asembli dywedodd y prifathro bod ymwelwyr tramor yn dod i'r ysgol. → One morning in the assembly the headmaster said that foreign visitors are coming to the school.
- Periphrastic tenses with "bod"
- (cy) mae'r bachgen wedi mynd. → the boy has gone.
- (cy) mae'r bachgen am fynd. → the boy is going to go.
- (cy) roedd y bachgen am fynd. → the boy was going to go.
- (cy) roedd y bachgen wedi mynd. → the boy had gone.
- infin + object pronoun
- (cy) yn ei adael am ddyn arall → leaving him for another man.
- (cy) aeth dy fam i'w ateb a chlywaist ti llais Paul yn dweud nad oeddet ti ddim yn yr ysgol → Your mother went to answer it and you heard the voice of Paul saying nad you were not in the school
- Preverbal particles - negative
- (cy) nid aeth i'r dref → [he/she] did not go to the town
- Preverbal particles - affirmative
- (cy) yr ydych yn mynd → you are going
- (cy) yr ydych chi yn mynd → you are going
- (cy) yr ydych chi'n mynd → you are going
- (cy) rydych chi yn mynd → you are going
- (cy) rydych chi'n mynd → you are going
- Preverbal particles - interrogative
- (cy) a gyrhaeddodd y llythyr? → did the letter arrive?
- (cy) gyrhaeddodd y llythyr? → did the letter arrive?
- (cy) ewch i ofyn a fyddai hi'n dod → go and ask if she will be coming
- (cy) oni phrynodd yr athro'r papur? → didn't the teacher buy the paper?
- (cy) onid aeth y bws heibio'r ysgol? → didn't the bus go past the school?
- (cy) oni welsoch chi? → didn't you see?
- Demonstratives
- (cy) y llyfr yma. → this book.
- (cy) y llyfr 'ma. → this book.
- (cy) y llyfr yna. → that book.
- (cy) y llyfr 'na. → that book.
- (cy) y llyfr hwn. → this book.
- (cy) y gath hon. → this cat.
- (cy) y llyfrau hyn. → these books.
- (cy) y cathod hyn. → these cats.
- (cy) y llyfr hwnnw. → that book.
- (cy) y gath honno. → that book.
- (cy) y llyfrau hynny. → those books.
- (cy) y cathod hynny. → those cats.
- Generic coolness
- (cy) Roedd y Comisiwn yn ymchwilio i'r honiadau bod yr AS wedi methu datgan £103,000 o roddion. → the Commission Was investigating the allegations that the MP has failed to declare £103,000 of gifts.
- (cy) Dywedodd yr heddlu fod y troseddau honedig wedi digwydd rhwng 2003 a 2007 yn Sir Benfro a Sir Gaerfyrddin. → the police Said that the alleged crimes have happened between 2003 and 2007 in Pembrokeshire and Carmarthenshire.
- (cy) Mae'r heddlu hefyd yn ymchwilio i honiadau ei bod hi'n cael perthynas â dyn llawer hŷn. → The police are also investigating its allegations that she is getting relation with a much older man.
- (cy) Ond byddwn ni'n parhau i weithio gyda'n staff, gwirfoddolwyr, cystadleuwyr, cwsmeriaid, partneriaid, a chyrff eraill i wella ein gwasanaeth Cymraeg. → But we will be continuing to work within a staff, volunteers, competitors, customers, partners, and other bodies to improve our Welsh service.

