Skip to content

Commit

Permalink
Add READMEs for dictionaries
Browse files Browse the repository at this point in the history
Fix #52.
  • Loading branch information
liZe committed Aug 19, 2023
1 parent 957198b commit 5d80cf8
Show file tree
Hide file tree
Showing 31 changed files with 2,991 additions and 0 deletions.
12 changes: 12 additions & 0 deletions pyphen/dictionaries/README_hyph_NO.txt
@@ -0,0 +1,12 @@
Myspell hyphenation
-------------------

Language: Norwegian Nynorsk (nn NO)
Langauge: Norwegian Bokm�l (nb NO)
Origin: Generated from the spell-norwegian source v2.0.7
License: GNU General Public license
Author: The spell-norwegian project, <URL:https://alioth.debian.org/projects/spell-norwegian/>

HYPH nn NO nn_NO
HYPH nb NO nb_NO

9 changes: 9 additions & 0 deletions pyphen/dictionaries/README_hyph_be_BY.txt
@@ -0,0 +1,9 @@
############################################################

Belarusian Hyphenation Dictionary
Created by: Aleś Bułojčyk <alex73mail@gmail.com>

Hyphenation rules according to official orthography 2008
License: CC BY-SA 4.0 or LGPLv3

############################################################
16 changes: 16 additions & 0 deletions pyphen/dictionaries/README_hyph_ca.txt
@@ -0,0 +1,16 @@
_______________________________________________________________________________

DICCIONARI DE PARTICIÓ DE MOTS
versió 1.4

Copyright (C) 2013-2018 Jaume Ortolà <jaumeortola@gmail.com> --- Riurau Editors

Llicència (a la vostra elecció):
LGPL v. 3.0 o superior -- http://www.gnu.org/licenses/lgpl-3.0.html
GPL v.3.0 o superior -- http://www.gnu.org/licenses/gpl-3.0.html

Aquests patrons funcionen amb el LibreOffice i OpenOffice.org 3.2+

Més informació:
https://www.softcatala.org/programes/diccionari-catala-de-particio-de-mots/
_______________________________________________________________________________
20 changes: 20 additions & 0 deletions pyphen/dictionaries/README_hyph_cs_CZ.txt
@@ -0,0 +1,20 @@
Hyphenation dictionary
----------------------

Language: Czech (Czech Republic) (cs CZ).
Origin: Based on the TeX hyphenation tables
License: GPL license, 2003
Author: Pavel@Janik.cz (Pavel Janík)

HYPH cs CZ hyph_cs

These patterns were converted from TeX hyphenation patterns by the package
lingucomponent-tools
(http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/oo-cs/lingucomponent-tools/).

The license of original files is GNU GPL (they are both parts of csTeX). My
work on them was to only run the scripts from lingucomponent-tools package
(dual LGPL/SISSL license so it can be integrated).
--
Pavel Janík
2003
10 changes: 10 additions & 0 deletions pyphen/dictionaries/README_hyph_da_DK.txt
@@ -0,0 +1,10 @@
Language: Danish (da DK).
Origin: Based on the TeX hyphenation tables
Created by Frank Jensen (fj@iesd.auc.dk), ca. 1988.
Modified by Preben Randhol (September 12, 1994) to increase portability between different systems
License: GNU LGPL license.
Author: conversion author is Marco Huggenberger<marco@by-night.ch>

This dictionary is based on syllable matching patterns and therefore should be usable under other variations of Danish

HYPH da DK hyph_da_DK
42 changes: 42 additions & 0 deletions pyphen/dictionaries/README_hyph_de.txt
@@ -0,0 +1,42 @@
Hyphenation dictionary "hyph_de_DE.dic"
---------------------------------------

Language: German (de DE)
according to the reform of 2006-08-01 (i.e. reformed or new spelling)

Version: 2017-01-12
New: using the COMPOUND feature for improved hyphenation
New: list with over 69,000 words and compounds by Karl Zeiler

Origin: Based on the TeX hyphenation tables "dehyphn.tex", revision level 31.
http://www.ctan.org/tex-archive/language/hyphenation/dehyphn.tex
The TeX hyphenation tables are released under the LaTeX Project
Public License (LPPL)

License: OpenOffice.org Adaptions of this package are licensed under the
GNU Lesser General Public License (LGPL 2 or later) and are under
Copyright by

Author: conversion author: Marco Huggenberger <marco@by-night.ch>
revised conversion and extensions: Daniel Naber <naber@danielnaber.de>
improvements: Karl Zeiler <karl.zeiler@t-online.de>

Note: This dictionary is based on syllable matching patterns
and thus should be suitable under other variations of German:
HYPH de AT hyph_de_AT
HYPH de CH hyph_de_CH


Trennmuster (hyph_de_DE.dic)
----------------------------

Die Trennmuster (hyph_de_DE.dic) basieren auf den TeX Trennmustern
"dehyphn.tex", revision level 31.
Lizenz der Trennmuster: LPPL. Die Anpassung der Trennmuster an
den in OpenOffice.org benutzten "ALTLinux LibHnj Hyphenator" wurde
mit dem Script substrings.pl durchgef�hrt, das unter
https://www.openoffice.org/lingucomponent/hyphenator.html als Teil
der Datei altlinux_Hyph.zip heruntergeladen werden kann.
Die Original-Trennmuster k�nnen hier heruntergeladen werden:
https://www.ctan.org/tex-archive/language/hyphenation/dehyphn.tex

11 changes: 11 additions & 0 deletions pyphen/dictionaries/README_hyph_el_GR.txt
@@ -0,0 +1,11 @@
Hellenic hyphenation dictionary for OpenOffice.org 1.1.0
--------------------------------------------------------

Language: Greek a.k.a. Hellenic (el GR).
Version: 1.1b

License: LGPL
Author: InterZone <info@interzone.gr>

This dictionary should be usable only for monotonic Greek (not polytonics, neither archaic). There may be some problems with words starting with accented vowels, feedback is welcome. Words in quotes do not hyphenate, but it seems like a problem with OpenOffice and not the hyphenation dictionary.

198 changes: 198 additions & 0 deletions pyphen/dictionaries/README_hyph_en_GB.txt
@@ -0,0 +1,198 @@
hyph_en_GB.dic - British English hyphenation patterns for OpenOffice.org

version 2011-10-07

- remove unnecessary parts for Hyphen 2.8.2

version 2010-03-16

Changes

- forbid hyphenation at 1-character distances from dashes (eg. ad=d-on)
and at the dashes (fix for OpenOffice.org 3.2)
- UTF-8 encoding and corrected hyphenation for words with Unicode f ligatures
(conversion scripts: see Hyphen 2.6)

version 2009-01-23

Changes

- add missing \hyphenation list (how-ever, through-out etc.)
- set correct LEFTHYPHENMIN = 2, RIGHTHYPHENMIN = 3
- handle apostrophes (forbid *can='t, *abaser='s, *o'c=lock etc.)
- set COMPOUNDLEFTHYPHENMIN, COMPOUNDRIGHTHYPHENMIN values

License

BSD-style. Unlimited copying, redistribution and modification of this file
is permitted with this copyright and license information.

British English hyphenation patterns, based on "ukhyphen.tex" Version 1.0a
Created by Dominik Wujastyk and Graham Toal using Frank Liang's PATGEN 1.0,
source: http://ctan.org

See original ukhyphen.tex license in this file, too.

Conversion and modifications by László Németh (nemeth at OOo).

Conversion:

./substrings.pl hyph_en_GB.dic.source /tmp/hyph_en_GB.dic.patterns >/dev/null
cat hyph_en_GB.dic.header /tmp/hyph_en_GB.dic.patterns >hyph_en_GB.dic

hyph_en_GB.dic.header:

ISO8859-1
LEFTHYPHENMIN 2
RIGHTHYPHENMIN 3
COMPOUNDLEFTHYPHENMIN 2
COMPOUNDRIGHTHYPHENMIN 3
1'.
1's.
1't.
NEXTLEVEL

OpenOffice.org ukhyphen patch (hyph_en_GB.dic.source):

--- ukhyphen.tex 2008-12-17 15:37:04.000000000 +0100
+++ hyph_en_GB.dic.source 2008-12-18 10:07:02.000000000 +0100
@@ -52,7 +52,6 @@
%
% These patterns require a value of about 14000 for TeX's pattern memory size.
%
-\patterns{ % just type <return> if you're not using INITEX
.ab4i
.ab3ol
.ace4
@@ -8580,13 +8579,64 @@
z3zie
zzo3
z5zot
-}
-\hyphenation{ % Do NOT make any alterations to this list! --- DW
-uni-ver-sity
-uni-ver-sit-ies
-how-ever
-ma-nu-script
-ma-nu-scripts
-re-ci-pro-city
-through-out
-some-thing}
+.uni5ver5sity.
+.uni5ver5sit5ies.
+.how5ever.
+.ma5nu5script.
+.ma5nu5scripts.
+.re5ci5pro5city.
+.through5out.
+.some5thing.
+4'4
+4a'
+4b'
+4c'
+4d'
+4e'
+4f'
+4g'
+4h'
+4i'
+4j'
+4k'
+4l'
+4m'
+4n'
+4o'
+4p'
+4q'
+4r'
+4s'
+4t'
+4u'
+4v'
+4w'
+4x'
+4y'
+4z'
+'a4
+'b4
+'c4
+'d4
+'e4
+'f4
+'g4
+'h4
+'i4
+'j4
+'k4
+'l4
+'m4
+'n4
+'o4
+'p4
+'q4
+'r4
+'s4
+'t4
+'u4
+'v4
+'w4
+'x4
+'y4
+'z4

Original License

% File: ukhyphen.tex
% TeX hyphenation patterns for UK English

% Unlimited copying and redistribution of this file
% is permitted so long as the file is not modified
% in any way.
%
% Modifications may be made for private purposes (though
% this is discouraged, as it could result in documents
% hyphenating differently on different systems) but if
% such modifications are re-distributed, the modified
% file must not be capable of being confused with the
% original. In particular, this means
%
%(a) the filename (the portion before the extension, if any)
% must not match any of :
%
% UKHYPH UK-HYPH
% UKHYPHEN UK-HYPHEN
% UKHYPHENS UK-HYPHENS
% UKHYPHENATION UK-HYPHENATION
% UKHYPHENISATION UK-HYPHENISATION
% UKHYPHENIZATION UK-HYPHENIZATION
%
% regardless of case, and
%
%(b) the file must contain conditions identical to these,
% except that the modifier/distributor may, if he or she
% wishes, augment the list of proscribed filenames.

% $Log: ukhyph.tex $
% Revision 2.0 1996/09/10 15:04:04 ucgadkw
% o added list of hyphenation exceptions at the end of this file.
%
%
% Version 1.0a. Released 18th October 2005/PT.
%
% Created by Dominik Wujastyk and Graham Toal using Frank Liang's PATGEN 1.0.
% Like the US patterns, these UK patterns correctly hyphenate about 90% of
% the words in the input list, and produce no hyphens not in the list
% (see TeXbook pp. 451--2).
%
% These patterns are based on a file of 114925 British-hyphenated words
% generously made available to Dominik Wujastyk by Oxford University Press.
% This list of words is copyright to the OUP and may not be redistributed.
% The hyphenation break points in the words in the abovementioned file is
% also copyright to the OUP.
%
% We are very grateful to Oxford University Press for allowing us to use
% their list of hyphenated words to produce the following TeX hyphenation
% patterns. This file of hyphenation patterns may be freely distributed.
%
% These patterns require a value of about 14000 for TeX's pattern memory size.
%
59 changes: 59 additions & 0 deletions pyphen/dictionaries/README_hyph_en_US.txt
@@ -0,0 +1,59 @@
hyph_en_US.dic - American English hyphenation patterns for OpenOffice.org

version 2011-10-07

- remove unnecessary parts for the new Hyphen 2.8.2

version 2010-03-16

Changes

- forbid hyphenation at 1-character distances from dashes (eg. ad=d-on)
and at the dashes (fix for OpenOffice.org 3.2)
- set correct LEFTHYPHENMIN = 2, RIGHTHYPHENMIN = 3
- handle apostrophes (forbid *o'=clock etc.)
- set COMPOUNDLEFTHYPHENMIN, COMPOUNDRIGHTHYPHENMIN values
- UTF-8 encoding
- Unicode ligature support

License

BSD-style. Unlimited copying, redistribution and modification of this file
is permitted with this copyright and license information.

See original license in this file.

Conversion and modifications by László Németh (nemeth at OOo).

Based on the plain TeX hyphenation table
(http://tug.ctan.org/text-archive/macros/plain/base/hyphen.tex) and
the TugBoat hyphenation exceptions log in
http://www.ctan.org/tex-archive/info/digests/tugboat/tb0hyf.tex, processed
by the hyphenex.sh script (see in the same directory).

Originally developed and distributed with the Hyphen hyphenation library,
see http://hunspell.sourceforge.net/ for the source files and the conversion
scripts.

Licenses

hyphen.tex:
% The Plain TeX hyphenation tables [NOT TO BE CHANGED IN ANY WAY!]
% Unlimited copying and redistribution of this file are permitted as long
% as this file is not modified. Modifications are permitted, but only if
% the resulting file is not named hyphen.tex.

output of hyphenex.sh:
% Hyphenation exceptions for US English, based on hyphenation exception
% log articles in TUGboat.
%
% Copyright 2007 TeX Users Group.
% You may freely use, modify and/or distribute this file.
%
% This is an automatically generated file. Do not edit!
%
% Please contact the TUGboat editorial staff <tugboat@tug.org>
% for corrections and omissions.

hyph_en_US.txt:
See the previous licenses.

0 comments on commit 5d80cf8

Please sign in to comment.