forked from hunspell/hyphen
-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Caolán McNamara
committed
Mar 4, 2010
0 parents
commit 21127cc
Showing
134 changed files
with
76,646 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,18 @@ | ||
Libhnj was written by Raph Levien <raph at acm dot org>. | ||
|
||
Original Libhnj source with OOo's patches are managed by Rene Engelhard and | ||
Chris Halls at Debian: http://packages.debian.org/stable/libdevel/libhnj-dev | ||
and http://packages.debian.org/unstable/source/libhnj | ||
|
||
This subset of Libhnj was extended by | ||
Peter Novodvorsky <nidd at alt-linux dot org> (OOo integration), | ||
László Németh <nemeth at OOo> (non-standard and compound hyphenation | ||
with Unicode support), | ||
Nanning Buitenhuis <nanning at elvenkind dot com> (substrings.c) | ||
|
||
Write bug reports to László Németh or to the lingu-dev mailing list of | ||
OpenOffice.org (after subscription: http://lingucomponent.openoffice.org) | ||
|
||
--- | ||
Please contact Raph Levien for information about licensing for | ||
proprietary applications. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,17 @@ | ||
GPL 2.0/LGPL 2.1/MPL 1.1 tri-license | ||
|
||
The contents of this software may be used under the terms of | ||
the GNU General Public License Version 2 or later (the "GPL"), or | ||
the GNU Lesser General Public License Version 2.1 or later (the "LGPL", | ||
see COPYING.LGPL) or the Mozilla Public License Version 1.1 or later | ||
(the "MPL", see COPYING.MPL). | ||
|
||
The Plain TeX hyphenation tables "hyphen.tex" by Donald E. Knuth | ||
has a non MPL/LGPL compatible license, but freely redistributable: | ||
"Unlimited copying and redistribution of this file are permitted as long | ||
as this file is not modified. Modifications are permitted, but only if | ||
the resulting file is not named hyphen.tex." | ||
|
||
Software distributed under these licenses is distributed on an "AS IS" basis, | ||
WITHOUT WARRANTY OF ANY KIND, either express or implied. See the licences | ||
for the specific language governing rights and limitations under the licenses. |
Large diffs are not rendered by default.
Oops, something went wrong.
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,170 @@ | ||
2010-02-23 László Németh <nemeth at OOo>: | ||
* hyphen.c: fix lefthyphenmin calculation for UTF-8 encoded input | ||
|
||
* hyphen.c: add Unicode ligature support for hyphenmin calculation | ||
(see also LONG_LIGATURE macro in hyphen.c for conditional | ||
compiling) | ||
|
||
* csutil.c: static struct for encds[] (from OpenOffice.org patch), | ||
(unsigned char)s, wordmin + 5 limit (see hyphen-2.4.patch of OOo) | ||
|
||
* Makefile.am, ooopatch.awk: add OpenOffice.org patch to the en_US | ||
hyphenation dictionary to fix apostrophe handling, see lhmin.test | ||
|
||
* Makefile.am, lig.awk: add Unicode ligature support to en_US | ||
hyphenation dictionary. NOTE: hyphenation within ligatures is not | ||
supported yet because of an implementation problem of | ||
OpenOffice.org: see OOo issue 71608. | ||
|
||
* tests: new tests: lig.* for ligature hyphenation, lhmin.* for | ||
lefthyphenmin calculation for UTF-8 encoded text with diacritics. | ||
|
||
2008-05-01 László Németh <nemeth at OOo>: | ||
* hyphen.c, hyphen.h: compound word hyphenation support by recursive | ||
pattern matching based on two hyphenation pattern sets, see | ||
README.compound. Especially useful for languages with arbitrary | ||
number of compounds (Danish, Dutch, Finnish, German, Hungarian, | ||
Icelandic, Norwegian, Swedish etc.). | ||
- dictionary options for compound word hyphenation: | ||
COMPOUNDLEFTHYPHENMIN: minimal hyphenation distance from the | ||
left compound word boundary | ||
COMPOUNDRIGHTHYPHENMIN: minimal hyphenation distance from the right | ||
compound word boundary | ||
* README.compound: documentation | ||
* tests/compound.*: test data for compound word hyphenation and | ||
COMPOUNDLEFTHYPHENMIN and COMPOUNDRIGHTHYPHENMIN. | ||
|
||
* tests/test.sh: | ||
- add Valgrind debugger support, usage: | ||
make check | ||
VALGRIND=memcheck make check | ||
- fix false return when an error occurred | ||
- fix make distcheck target | ||
|
||
* tests/*.pat, test.sh: using static pattern files processed by | ||
substrings.pl instead of run-time processed patterns. | ||
|
||
* hyphen.c: add default hyphenmin support to the dictionaries: | ||
LEFTHYPHENMIN: minimal hyphenation distance from the left | ||
end of the word | ||
RIGHTHYPHENMIN: minimal hyphenation distance from the right | ||
end of the word. | ||
Problems with the LEFTHYPHENMIN and RIGHTHYPHENMIN and a | ||
possible solution reported by Joan Montané in SF.net Bug ID 1777894. | ||
* tests/settings*.*: test data of LEFTHYPHENMIN and | ||
RIGHTHYPHENMIN. First test (settings.*) is based on the test data | ||
of Joan Montané (SF.net Bug ID 1777894). | ||
|
||
* example.c: changed options: | ||
- old -d (non-standard hyph.) mode is the default now | ||
- -dd (listing possible hyphenations) -> -d | ||
- -o : old (without non-standard hyphenation support) mode | ||
|
||
* Makefile.am: | ||
- remove unused csutil from the shared library (-20 kB and | ||
solve a csutil conflict with Hunspell reported by | ||
Rene Engerhald in SF.net Bug 1939988). | ||
|
||
* substrings.pl : add lefthyphenmin and righthyphemin options: | ||
substrings.pl infile outfile [encoding [lefthyphenmin [righthyphenmin]]] | ||
|
||
* hyph_en_US.dic, Makefile.am: set right default values for | ||
American English, based on the original TeX settings and | ||
American English orthography: | ||
LEFTHYPHENMIN 2 | ||
RIGHTHYPHENMIN 3 | ||
|
||
* README_hyph_en_US.dic: add README for en_US hyphenation patterns | ||
|
||
* tbhyphext.tex: TugBoat hyphenation exception log with | ||
thousand word fixes, source: | ||
http://www.ctan.org/tex-archive/info/digests/tugboat/tb0hyf.tex, | ||
processed by the hyphenex.sh script (see in the same folder). | ||
* tbhyphext.sh: conversion script for tbhyphext.pat. | ||
|
||
2008-02-19 László Németh <nemeth at OOo>: | ||
* hyphen.c: fix unconditional jump in the obsolete | ||
hnj_hyphen_hyphenate() (it was already fixed in the | ||
preferred hnj_hyphen_hyphenate2()). Possible fix for the problem | ||
reported by Rene Engelhard in SourceForge Bug ID 1896207. | ||
|
||
* Makefile.am: add missing $(srcdir)s for make dist | ||
|
||
* NEWS: add NEWS for autoreconf | ||
|
||
2007-11-22 László Németh <nemeth at OOo>: | ||
* hyphen.c: fix a bad condition that introduced in the previous version. | ||
Problem reported by Joan Montané under SourceForge Bug ID 1772381. | ||
|
||
* Makefile.am: rename the library to "hyphen". | ||
|
||
* hyphen.tex: use the last official version and its time stamp. | ||
Source: http://tug.ctan.org/text-archive/macros/plain/base/hyphen.tex | ||
|
||
* tests/*: add make check support | ||
|
||
* doc/tb87nemeth.pdf: TugBoat article about non-standard | ||
hyphenation and its implementation. | ||
|
||
2007-11-12 Caolan McNamara <cmc at OO.o>: | ||
* autoconf/automake/libtoolize it | ||
Which as a side effect makes it fit into the existing | ||
--with-system-altlinuxhyph configure support in OOo to use a system | ||
pre-installed library for OOo hyphenation. | ||
|
||
* make a shared library libhnj.so from it | ||
|
||
* install the hyphen.h header | ||
|
||
* hyphen.patch: document by a make target how to go from the | ||
original hyphen.tex file to the interim hyphen.us to the final | ||
hyph_en_US.dic that OOo uses. (For example, converting | ||
\hyphenate section of hyphen.tex.) | ||
|
||
2007-05-14 László Németh <nemeth at OOo>: | ||
* README: add information about substring.pl conversion and HyFo | ||
Java hyphenation module. | ||
|
||
* README.hyphen: add the following references about hyphenation: | ||
Franklin M. Liang: Word Hy-phen-a-tion by Com-put-er. | ||
Stanford University, 1983. http://www.tug.org/docs/liang. | ||
|
||
László Németh: Automatic non-standard hyphenation in OpenOffice.org, | ||
TUGboat (27), 2006. No. 2., http://hunspell.sourceforge.net/tb87nemeth.pdf | ||
|
||
* README. nonstandard: add information about narrow subpatterns, | ||
and a problem reported by Peter B. West. | ||
|
||
2006-11-27 László Németh <nemeth at OOo>: | ||
* substrings.pl: restore previous version to fix rare non-standard hyphenation | ||
problems reported by Peter B. West, HyFo (Java XSLT formatter) developer | ||
|
||
2006-08-03 László Németh <nemeth at OOo>: | ||
* hyphen.c: fix bad Unicode non-standard hyphenation (reset deleted | ||
break in UTF-8 length conversion code in hnj_hyphen_load()) | ||
* tests/unicode*, Makefile: test for this fix | ||
|
||
* hyphen.c: fix bad hyphen duplication in hyphword output in hnj_hyphenate2() | ||
|
||
* example.c: fix empty input fault in single_hyphenations() | ||
(unsigned return value of strlen() output couldn't be negative in the condition) | ||
|
||
* substrings.pl: shorter version with Nanning Buitenhuis's substrings.pl fix. | ||
|
||
2006-07-28 Nanning Buitenhuis <nanning at elvenkind dot com>: | ||
* substrings.c: faster C version of substrings.pl | ||
- It also fixed a minor bug in combine(): if a sub-pattern is found twice | ||
(or more) in the main pattern, then all occurences were changed instead | ||
of (the correct) last occurence. Only example in hyphen.us is 'tanta3' | ||
|
||
2006-01-27 László Németh <nemeth at OOo>: | ||
* *.{c,h}: add non-standard hyphenation and Unicode support | ||
* README.discretionary: documentation | ||
- add tests/ (see make check) | ||
|
||
2005-10-13 Daniel Naber <dnaber at OOo>: | ||
* example.c: fixed the call to hnj_hyphen_hyphenate() in example.c so that | ||
patterns ending in a dot should now work (Daniel Naber) | ||
|
||
Libhnj was written by Raph Levien <[email protected]> | ||
Adapted to OpenOffice.org by Peter Novodvorsky <[email protected]> |
Oops, something went wrong.