L10n:Hyphenation Data

We need to collect some information on hyphenation dictionaries. The term "dictionaries" is a bit confusing: this page is about the data that Hyphen uses, not the data for Hunspell. For each dictionary, it'd be good to know:

  • Which language is the hyphenation data for?
  • What's its copyright status?
  • How big is it?

Gerv says: to be included in the source tree, and therefore in builds shipped by Mozilla, a dictionary needs to have a licence compatible with all three of the "MPL 1.1", "LGPL 2.1 or later" and "GPL 2.0 or later". Examples of compatible licensing schemes include:

  • Mozilla tri-licence
  • BSD or MIT-style licences
  • Public domain

Language        Licence                                    Size    Compressed size    Download link
--------------  -----------------------------------------  ------  -----------------  -----------------------
English (US)    TeX licence                                106 kB  40 kB (zip)        Hyphen 2.7 package
Swedish (SE)    GPL 2.0/LGPL 2.1                           65 kB   ??                 Swedish hyphenation
Bulgarian (BG)  GPL 2.0/LGPL 2.1/MPL 1.1                   36 kB   13 kB (zip, v4.3)  BG lang support project
Irish (GA)      GPL 2.0                                    41 kB   20 kB (zip)        OOo hyphenation dic
Czech (cs)      GPL (derived from TeX hyphenation tables)  20 kB   11 kB (zip)        OOo Hyphen dic
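
To fill in the two size columns for a new language, something like the following Python sketch might help. It reports the raw size of a hyphenation file and its size after zip (deflate) compression. The function names, the rounding, and the choice of 1 kB = 1000 bytes are assumptions made for illustration; the figures in the table may have been measured differently, so treat the output as approximate.

```python
import io
import sys
import zipfile
from pathlib import Path


def dictionary_sizes(name: str, data: bytes) -> tuple[int, int]:
    """Return (raw_size, zipped_size) in bytes for one dictionary file.

    The zipped size is that of an in-memory zip archive holding only
    this file, so it includes the zip format's own small overhead.
    """
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        archive.writestr(name, data)
    return len(data), len(buffer.getvalue())


def format_kb(size_in_bytes: int) -> str:
    """Format a byte count as whole kilobytes (assumes 1 kB = 1000 bytes)."""
    return f"{round(size_in_bytes / 1000)} kB"


if __name__ == "__main__":
    # Each command-line argument is a path to a hyphenation data file.
    for arg in sys.argv[1:]:
        path = Path(arg)
        raw, zipped = dictionary_sizes(path.name, path.read_bytes())
        print(f"{path.name}: {format_kb(raw)} raw, {format_kb(zipped)} (zip)")
```

Saved under a hypothetical name such as `dict_sizes.py`, it could be run as `python dict_sizes.py path/to/dictionary-file`, passing one or more hyphenation files as arguments.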