You too, defend the Breton language with Broudix, « An Drouizig » !


The electronic dictionary

Which spelling system is used by "An Drouizig Difazier" ?

The dictionary contains only words spelled according to the unified spelling system, sometimes also called KLTG Breton or, in Breton, Brezhoneg peurunvan ("completely unified" Breton).

This standard is characterized by the general use of " zh ". For instance, we spell Breizh, the unified form of " Breiz " (KLT) and " Breih " (Gw). Two other examples: we spell evit where others would write " ewid " or " evid ", and enderv where others would write " enderw " or " endero " …

Some frequently encountered dialectal forms, such as àr, teus, meump, …, may nevertheless be present in the dictionary.


The unified Breton alphabet.

The letters of the unified Breton alphabet are the following :

a b ch c'h d e f g h i j k l m n o p r s t u v w y z.

Please note that there is no c (except within the ch and c'h polygraphs), q or x. For obvious reasons, c, q and x are not treated as unknown letters and do not act as separators within words. However, a word containing one of these letters will not be found in the dictionary.
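As an illustration only, the alphabet above can be sketched in Python as a small tokenizer that treats ch and c'h as single letters. The names `split_letters` and `uses_only_breton_letters` are hypothetical helpers, not part of An Drouizig Difazier, and the sketch assumes hyphen-free words written with precomposed accented characters.

```python
import re

# Letters of the unified alphabet and its accented letters, as listed in the text.
LETTERS = set("a b ch c'h d e f g h i j k l m n o p r s t u v w y z".split())
ACCENTED = set("â à é ê ñ ô ù ü û".split())

# Try the polygraphs first so that "c'h" and "ch" are matched as single letters;
# any other character becomes a one-character token.
_TOKEN = re.compile(r"c'h|ch|.", re.DOTALL)


def split_letters(word: str) -> list[str]:
    """Split a word into Breton letters (hypothetical helper)."""
    return _TOKEN.findall(word.lower())


def uses_only_breton_letters(word: str) -> bool:
    """True if every token is a letter of the unified alphabet.

    A lone c, q or x is still a token inside the word (it does not split it),
    but it makes the word fail this check.
    """
    return all(t in LETTERS or t in ACCENTED for t in split_letters(word))


print(split_letters("marc'h"))
print(uses_only_breton_letters("kêrioù"), uses_only_breton_letters("taxi"))
```

Matching the polygraphs before single characters is the design choice that keeps the c of ch and c'h from being reported as a foreign letter.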


Accented letters.

The accented letters of the unified Breton alphabet are the following :

â à é ê ñ ô ù ü û

Examples of words including accented letters :

lâr ; kêrioù ; àr ; brasañ ; é ; kornôg ; skuizh-ôg ; û ; emroüs ; goût

As with any good spell-checker, a word is flagged if its accents are not correct.
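A minimal sketch of an accent-sensitive lookup, assuming a tiny hypothetical word list in place of the real dictionary. It uses Unicode NFC normalization so that an accented letter typed as a base letter plus a combining accent still matches; whether An Drouizig Difazier does this is not stated here.

```python
import unicodedata


def normalize(word: str) -> str:
    """Compose base letters with combining accents (NFC) and lowercase."""
    return unicodedata.normalize("NFC", word).lower()


# Hypothetical mini-dictionary; the real word list is far larger.
DICTIONARY = {normalize(w) for w in ("kêrioù", "brasañ", "kornôg")}


def is_known(word: str) -> bool:
    """Accent-sensitive lookup: 'kerioù' does not match 'kêrioù'."""
    return normalize(word) in DICTIONARY


print(is_known("kêrioù"))   # correct accents
print(is_known("kerioù"))   # missing circumflex, so it would be flagged
```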
 

Handling of the hyphen.

There are five cases :

· The common suffixes :

  • ar re-mañ, an traoù-se, mat-tre, …
  • ar gwenn-ha-du-se, du-hont, …
  • bras-meurbet, fall-spontus, ...

 In this first case, the first word, or more precisely the group of words that precedes the suffix, is tested. If it does not belong to the dictionary, it is flagged.

· The ez- and ent- prefixes :

  • Ez-laouen, ez-vicherel, ez-kalonek, …
  • Ent-yaouank, ent-reoliek, …

· Compound words :

  • botoù-koad, krogen-Sant-Jakez, doare-ober, ...
  • melc'hwed-krogennek, bag-dre-lien, kanod-saveteiñ, ...

 In the second and third cases, the whole word is tested, including its hyphens. The whole word is flagged if it is not recognized as correct.

· Idiomatic constructions :

  • lise-mañ-lise, den-mañ-den, ar c'hig-mañ-kig, ...
  • departamant-ha-departamant, pazenn-ha-pazenn, ...
  • hewelusoc'h-hewelusañ, bihan-bihan, hiraetoc'h-hiraetañ ...

This fourth case is correctly handled.

· " Linked " proper nouns :

  • Ur gejadenn Bush-Blair, Ur match Williams-Williams,
  • An emgav Frañs-Aostralia, Pont-'n-Abad-Sant-Pêr-Kiberen,
  • Ar baeroniezh Sant-Brieg-Aberystwyth,

This fifth case can be problematic.
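The first three cases above can be sketched as follows. This is only an illustration under stated assumptions: the suffix list is taken from the examples given, the word list is hypothetical, and the idiomatic constructions and linked proper nouns of the fourth and fifth cases are not modelled.

```python
# Suffixes taken from the examples in the text; the real list may differ.
SUFFIXES = {"mañ", "se", "hont", "tre", "meurbet", "spontus"}

# Hypothetical mini-dictionary. Compounds and ez-/ent- words are stored whole,
# hyphens included.
DICTIONARY = {"re", "traoù", "mat", "du", "gwenn-ha-du", "botoù-koad", "ez-laouen"}


def part_to_test(token: str) -> str:
    """Return the part of a hyphenated token that is looked up."""
    head, sep, tail = token.rpartition("-")
    if sep and tail.lower() in SUFFIXES:
        return head          # case 1: test what precedes the common suffix
    return token             # cases 2 and 3: test the whole word, hyphens included


def is_flagged(token: str) -> bool:
    return part_to_test(token).lower() not in DICTIONARY


print(part_to_test("gwenn-ha-du-se"))
print(is_flagged("botoù-koad"), is_flagged("botoù-koat"))
```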
 

Handling of the apostrophe.

The apostrophe is a special character in Breton because it belongs to the alphabet. This forces the spell-checking engine to apply special processing that is not required for other languages such as English, French or Spanish.

There are three cases :

· The c'h :

  • marc'h, alarc'h, melc'hwedenn, floc'h, ...
  • c'hwec'h, kreñvoc'h, ...

If a word containing the c'h trigraph is not present in the dictionary, the whole word is flagged, as for any other ordinary word.

· The elision :

  • n'on ket a-du, m'ho peus c'hoant, ma'z, …

In the case of elision, the apostrophe is always attached to the first word. " n'on " consists of the elided word " n' " followed by the full word " on ". The words " n' " and " on " must each be present in the dictionary, and any missing one is flagged.

· The contraction :

  • ane'e for anezhe, de'i for dezhi, …
  • 'peus for az peus,  …

In this last case, the behaviour of the spell-checker is less predictable and actually depends on how likely such forms are to be found in written literature (incidentally, such forms are discouraged in written Breton). 'peus and ane'i are in the dictionary, but rarer or more obscure forms may not be included.
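The three apostrophe cases can be sketched as follows, as a hedged illustration rather than the actual engine. `split_elision` and `flagged_parts` are hypothetical names and the word list is invented for the example. The whole token is looked up first, so that contractions stored as such are accepted, and the apostrophe of c'h never splits a word.

```python
def split_elision(token: str) -> list[str]:
    """Split after each elision apostrophe, keeping it with the first word."""
    lower = token.lower()
    parts, start = [], 0
    for i, ch in enumerate(token):
        if ch != "'":
            continue
        in_trigraph = i > 0 and lower[i - 1] == "c" and lower[i + 1:i + 2] == "h"
        # Do not split inside c'h, nor on a leading or trailing apostrophe.
        if in_trigraph or i == 0 or i == len(token) - 1:
            continue
        parts.append(token[start:i + 1])
        start = i + 1
    parts.append(token[start:])
    return parts


def flagged_parts(token: str, dictionary: set[str]) -> list[str]:
    """Return the parts of a token that are missing from the dictionary."""
    if token.lower() in dictionary:      # e.g. contractions stored whole
        return []
    return [p for p in split_elision(token) if p.lower() not in dictionary]


# Hypothetical mini-dictionary for the example.
WORDS = {"n'", "on", "m'", "ho", "c'hoant", "'peus", "ane'i"}

print(split_elision("n'on"))
print(flagged_parts("n'onn", WORDS))
```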

An important note. Some software replaces the standard apostrophe ', whose ASCII code is 0x27, with other, less common kinds of apostrophe, for example the characters 0x60, 0x91, 0x92 and 0xB4. This is always the case when using Microsoft Word in auto-correction mode. It is also the case with Microsoft PowerPoint. An Drouizig Difazier recognizes these characters and treats them as an ordinary apostrophe.
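A minimal sketch of this kind of apostrophe normalization, assuming the text has already been decoded to Unicode. In the Windows-1252 code page, bytes 0x91 and 0x92 decode to U+2018 and U+2019, while 0x60 and 0xB4 are the grave and acute accent characters. The function name is hypothetical.

```python
# Apostrophe look-alikes mapped to the plain ASCII apostrophe.
APOSTROPHE_VARIANTS = {
    "\u0060": "'",  # grave accent (0x60)
    "\u2018": "'",  # left single quotation mark (0x91 in Windows-1252)
    "\u2019": "'",  # right single quotation mark (0x92 in Windows-1252)
    "\u00b4": "'",  # acute accent (0xB4)
}
_TABLE = str.maketrans(APOSTROPHE_VARIANTS)


def normalize_apostrophes(text: str) -> str:
    """Replace apostrophe variants with the plain apostrophe before lookup."""
    return text.translate(_TABLE)


print(normalize_apostrophes("c\u2019hwec\u2019h"))
```

Normalizing once, before tokenizing, means the c'h and elision rules only ever have to deal with a single apostrophe character.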
 

Handling of capital letters.

Two cases can be discussed here: on the one hand, accented capitals, and on the other hand, the polygraphs, that is to say, ch and c'h :

  • Bro-C'Hall or/and Bro-C'hall ? 
  • CHom or/and Chom ?
  • HAG-EN or/and HAG-EÑ ?
  • Û or/and U ?

An Drouizig Difazier allows the words Û, HAG-EÑ, Bro-C'hall and Chom.
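As a hedged illustration of why the polygraphs need care, Python's built-in `str.title()` capitalizes the letter after an apostrophe and therefore produces the form Bro-C'Hall. The helper below, a hypothetical name that assumes a proper name in which every hyphen-separated part takes a capital, only raises the first character of each part.

```python
def capitalize_parts(name: str) -> str:
    """Capitalize only the first character of each hyphen-separated part."""
    return "-".join(part[:1].upper() + part[1:] for part in name.split("-"))


print("bro-c'hall".title())            # the h after the apostrophe is also raised
print(capitalize_parts("bro-c'hall"))  # only B and C are raised
print("hag-eñ".upper())                # the accent is kept on the capital
```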

Mutation of proper nouns.

The mutation of proper nouns may or may not be written and, when it is written, several spellings exist. For example, historically in Breton literature (e.g. " Buhez ar Sent ") we find the spelling " An Itron Varia ". We also find the mutation spelled in the following way : " An Itron vMaria ".

It has been chosen to apply the mutation to the first letter of any proper noun, as for any common noun, so the dictionary includes all the proper nouns with all their mutated forms :

  •  Karine → da Garine,  Gwenael → da Wenael, etc. 
  •  Gwened → Bro-Wened, Kemper → Bro-Gemper, etc.
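The examples above all show the soft mutation. A simplified sketch of how such mutated forms of a proper noun could be generated is given below; the table covers only the soft mutation (k→g, t→d, p→b, g→c'h, gw→w, d→z, b→v, m→v), the function name is hypothetical, and the way the real dictionary was built is not described here.

```python
# Soft mutation of initial consonants; "gw" must be tried before "g".
SOFT_MUTATION = [
    ("gw", "w"), ("k", "g"), ("t", "d"), ("p", "b"),
    ("g", "c'h"), ("d", "z"), ("b", "v"), ("m", "v"),
]


def soft_mutate(name: str) -> str:
    """Return the soft-mutated form of a name, keeping its initial capital."""
    lower = name.lower()
    for initial, mutated in SOFT_MUTATION:
        if lower.startswith(initial):
            result = mutated + name[len(initial):]
            if name[0].isupper():
                result = result[0].upper() + result[1:]
            return result
    return name  # initials that do not mutate are left unchanged


for name in ("Karine", "Gwenael", "Gwened", "Kemper", "Maria"):
    print(name, "→", soft_mutate(name))
```

Only the first character of the result is capitalized, so a name beginning with G would give a form in C'h rather than C'H, in line with the capitalization chosen for Bro-C'hall.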


How many words in the dictionary ?

There are approximately 350000 word forms in the dictionary: a base of 20000 words plus all their very (very) numerous mutated forms.

· verbs. (17.5%)

  • + conjugated forms.
  • + mutated and negative forms (N'hallan, …).

Number of base words : 3500

· common nouns. (65%)

  • + plural and dual forms.
  • + feminine forms (-ez).
  • + single forms (for collectives).
  • + diminutive forms (-ig, -igoù, -oùigoù, …).
  • + mutated forms.

Number of base words : 8000 (m.) + 4000 (f.) + 300 (pl.) + 80 (d.) = 12380

· adjectives. (13%)

  • + diminutive forms (-ik).
  • + comparative forms (-oc'h).
  • + superlative forms (-añ).
  • + exclamatory forms (-at).
  • + mutated forms.

Number of base words : 2600

· prepositions. (4.5%, including all the remaining categories listed below)

  • + conjugated forms. (warnon, din, evidout, …).
  • + mutated forms.

· interjections.
· adverbs.

  • + mutated forms. (Da belec'h, …)

· pronouns.

  • + elided forms.
  • + mutated forms. (Da betra, …)

· proper nouns (Breton surnames, towns from Brittany,…).

  • + mutated forms.

· conjunctions
· exclamations
· ordinals
· cardinals
· articles
· contracted forms

Number of base words (prepositions and all the categories that follow them) : 900

A total of about 20000 base words,
which yields about 350000 different forms.
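The arithmetic behind these figures can be checked with a few lines of Python. The counts are those listed above; the percentages are computed against the rounded total of 20000, which is an assumption about how the quoted shares were obtained.

```python
# Base-word counts as listed above; "other" groups prepositions and all
# the categories that follow them.
common_nouns = 8000 + 4000 + 300 + 80   # masculine + feminine + plural + dual
base_words = {
    "verbs": 3500,
    "common nouns": common_nouns,
    "adjectives": 2600,
    "other": 900,
}

STATED_TOTAL = 20000    # "about 20000 words"
STATED_FORMS = 350000   # "approximately 350000"

total = sum(base_words.values())
shares = {name: round(100 * count / STATED_TOTAL, 1)
          for name, count in base_words.items()}
forms_per_word = STATED_FORMS / STATED_TOTAL

print(total)            # 19380
print(shares)           # {'verbs': 17.5, 'common nouns': 61.9, 'adjectives': 13.0, 'other': 4.5}
print(forms_per_word)   # 17.5
```

The listed counts add up to 19380, consistent with "about 20000", and each base word yields on average 17.5 forms. The noun share computed this way is 61.9%, a little below the 65% quoted, so the quoted percentages are presumably rounded or based on slightly different counts.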