L10n:Hyphentation Data

Revision as of 14:37, 17 May 2011 by Pawell (talk | contribs)

We need to collect some information on hyphenation dictionaries. Dictionaries is a bit confusing, this is about the data that hyphen uses, not the data for hunspell. It'd be good to know:

  • Which language is the hyphenation data for?
  • What's their copyright status?
  • How big are they?

Gerv says: to be included in the source tree, and therefore in builds shipped by Mozilla, a dictionary needs to have a licence compatible with all three of the "MPL 1.1", "LGPL 2.1 or later" and "GPL 2.0 or later". Examples of compatible licensing schemes include:

  • Mozilla tri-licence
  • BSD or MIT-style licences
  • Public domain
Language License Size Size compressed Download link
English (US) LGPL (derived from TeX hyphenation tables) 79 272 38 807 (zip) OOo Hyphen dic : 2002-07-27
Swedish (SE) GPL 2.0/LGPL 2.1 65k ?? ??
Bulgarian (BG) GPL 2.0/LGPL 2.1/MPL 1.1 36 327 13 896 (zip, v4.3) BG lang support project
Czech (cs) GPL (derived from TeX hyphenation tables) 20 kB 11 Kb (zip) OOo Hyphen dic