Skip to content

Commit

Permalink
Initia import
Browse files Browse the repository at this point in the history
  • Loading branch information
Caolán McNamara committed Mar 4, 2010
0 parents commit 21127cc
Show file tree
Hide file tree
Showing 134 changed files with 76,646 additions and 0 deletions.
18 changes: 18 additions & 0 deletions AUTHORS
Original file line number Diff line number Diff line change
@@ -0,0 +1,18 @@
Libhnj was written by Raph Levien <raph at acm dot org>.

Original Libhnj source with OOo's patches are managed by Rene Engelhard and
Chris Halls at Debian: http://packages.debian.org/stable/libdevel/libhnj-dev
and http://packages.debian.org/unstable/source/libhnj

This subset of Libhnj was extended by
Peter Novodvorsky <nidd at alt-linux dot org> (OOo integration),
László Németh <nemeth at OOo> (non-standard and compound hyphenation
with Unicode support),
Nanning Buitenhuis <nanning at elvenkind dot com> (substrings.c)

Write bug reports to László Németh or to the lingu-dev mailing list of
OpenOffice.org (after subscription: http://lingucomponent.openoffice.org)

---
Please contact Raph Levien for information about licensing for
proprietary applications.
17 changes: 17 additions & 0 deletions COPYING
Original file line number Diff line number Diff line change
@@ -0,0 +1,17 @@
GPL 2.0/LGPL 2.1/MPL 1.1 tri-license

The contents of this software may be used under the terms of
the GNU General Public License Version 2 or later (the "GPL"), or
the GNU Lesser General Public License Version 2.1 or later (the "LGPL",
see COPYING.LGPL) or the Mozilla Public License Version 1.1 or later
(the "MPL", see COPYING.MPL).

The Plain TeX hyphenation tables "hyphen.tex" by Donald E. Knuth
has a non MPL/LGPL compatible license, but freely redistributable:
"Unlimited copying and redistribution of this file are permitted as long
as this file is not modified. Modifications are permitted, but only if
the resulting file is not named hyphen.tex."

Software distributed under these licenses is distributed on an "AS IS" basis,
WITHOUT WARRANTY OF ANY KIND, either express or implied. See the licences
for the specific language governing rights and limitations under the licenses.
515 changes: 515 additions & 0 deletions COPYING.LGPL

Large diffs are not rendered by default.

470 changes: 470 additions & 0 deletions COPYING.MPL

Large diffs are not rendered by default.

170 changes: 170 additions & 0 deletions ChangeLog
Original file line number Diff line number Diff line change
@@ -0,0 +1,170 @@
2010-02-23 László Németh <nemeth at OOo>:
* hyphen.c: fix lefthyphenmin calculation for UTF-8 encoded input

* hyphen.c: add Unicode ligature support for hyphenmin calculation
(see also LONG_LIGATURE macro in hyphen.c for conditional
compiling)

* csutil.c: static struct for encds[] (from OpenOffice.org patch),
(unsigned char)s, wordmin + 5 limit (see hyphen-2.4.patch of OOo)

* Makefile.am, ooopatch.awk: add OpenOffice.org patch to the en_US
hyphenation dictionary to fix apostrophe handling, see lhmin.test

* Makefile.am, lig.awk: add Unicode ligature support to en_US
hyphenation dictionary. NOTE: hyphenation within ligatures is not
supported yet because of an implementation problem of
OpenOffice.org: see OOo issue 71608.

* tests: new tests: lig.* for ligature hyphenation, lhmin.* for
lefthyphenmin calculation for UTF-8 encoded text with diacritics.

2008-05-01 László Németh <nemeth at OOo>:
* hyphen.c, hyphen.h: compound word hyphenation support by recursive
pattern matching based on two hyphenation pattern sets, see
README.compound. Especially useful for languages with arbitrary
number of compounds (Danish, Dutch, Finnish, German, Hungarian,
Icelandic, Norwegian, Swedish etc.).
- dictionary options for compound word hyphenation:
COMPOUNDLEFTHYPHENMIN: minimal hyphenation distance from the
left compound word boundary
COMPOUNDRIGHTHYPHENMIN: minimal hyphenation distance from the right
compound word boundary
* README.compound: documentation
* tests/compound.*: test data for compound word hyphenation and
COMPOUNDLEFTHYPHENMIN and COMPOUNDRIGHTHYPHENMIN.

* tests/test.sh:
- add Valgrind debugger support, usage:
make check
VALGRIND=memcheck make check
- fix false return when an error occurred
- fix make distcheck target

* tests/*.pat, test.sh: using static pattern files processed by
substrings.pl instead of run-time processed patterns.

* hyphen.c: add default hyphenmin support to the dictionaries:
LEFTHYPHENMIN: minimal hyphenation distance from the left
end of the word
RIGHTHYPHENMIN: minimal hyphenation distance from the right
end of the word.
Problems with the LEFTHYPHENMIN and RIGHTHYPHENMIN and a
possible solution reported by Joan Montané in SF.net Bug ID 1777894.
* tests/settings*.*: test data of LEFTHYPHENMIN and
RIGHTHYPHENMIN. First test (settings.*) is based on the test data
of Joan Montané (SF.net Bug ID 1777894).

* example.c: changed options:
- old -d (non-standard hyph.) mode is the default now
- -dd (listing possible hyphenations) -> -d
- -o : old (without non-standard hyphenation support) mode

* Makefile.am:
- remove unused csutil from the shared library (-20 kB and
solve a csutil conflict with Hunspell reported by
Rene Engerhald in SF.net Bug 1939988).

* substrings.pl : add lefthyphenmin and righthyphemin options:
substrings.pl infile outfile [encoding [lefthyphenmin [righthyphenmin]]]

* hyph_en_US.dic, Makefile.am: set right default values for
American English, based on the original TeX settings and
American English orthography:
LEFTHYPHENMIN 2
RIGHTHYPHENMIN 3

* README_hyph_en_US.dic: add README for en_US hyphenation patterns

* tbhyphext.tex: TugBoat hyphenation exception log with
thousand word fixes, source:
http://www.ctan.org/tex-archive/info/digests/tugboat/tb0hyf.tex,
processed by the hyphenex.sh script (see in the same folder).
* tbhyphext.sh: conversion script for tbhyphext.pat.

2008-02-19 László Németh <nemeth at OOo>:
* hyphen.c: fix unconditional jump in the obsolete
hnj_hyphen_hyphenate() (it was already fixed in the
preferred hnj_hyphen_hyphenate2()). Possible fix for the problem
reported by Rene Engelhard in SourceForge Bug ID 1896207.

* Makefile.am: add missing $(srcdir)s for make dist

* NEWS: add NEWS for autoreconf

2007-11-22 László Németh <nemeth at OOo>:
* hyphen.c: fix a bad condition that introduced in the previous version.
Problem reported by Joan Montané under SourceForge Bug ID 1772381.

* Makefile.am: rename the library to "hyphen".

* hyphen.tex: use the last official version and its time stamp.
Source: http://tug.ctan.org/text-archive/macros/plain/base/hyphen.tex

* tests/*: add make check support

* doc/tb87nemeth.pdf: TugBoat article about non-standard
hyphenation and its implementation.

2007-11-12 Caolan McNamara <cmc at OO.o>:
* autoconf/automake/libtoolize it
Which as a side effect makes it fit into the existing
--with-system-altlinuxhyph configure support in OOo to use a system
pre-installed library for OOo hyphenation.

* make a shared library libhnj.so from it

* install the hyphen.h header

* hyphen.patch: document by a make target how to go from the
original hyphen.tex file to the interim hyphen.us to the final
hyph_en_US.dic that OOo uses. (For example, converting
\hyphenate section of hyphen.tex.)

2007-05-14 László Németh <nemeth at OOo>:
* README: add information about substring.pl conversion and HyFo
Java hyphenation module.

* README.hyphen: add the following references about hyphenation:
Franklin M. Liang: Word Hy-phen-a-tion by Com-put-er.
Stanford University, 1983. http://www.tug.org/docs/liang.

László Németh: Automatic non-standard hyphenation in OpenOffice.org,
TUGboat (27), 2006. No. 2., http://hunspell.sourceforge.net/tb87nemeth.pdf

* README. nonstandard: add information about narrow subpatterns,
and a problem reported by Peter B. West.

2006-11-27 László Németh <nemeth at OOo>:
* substrings.pl: restore previous version to fix rare non-standard hyphenation
problems reported by Peter B. West, HyFo (Java XSLT formatter) developer

2006-08-03 László Németh <nemeth at OOo>:
* hyphen.c: fix bad Unicode non-standard hyphenation (reset deleted
break in UTF-8 length conversion code in hnj_hyphen_load())
* tests/unicode*, Makefile: test for this fix

* hyphen.c: fix bad hyphen duplication in hyphword output in hnj_hyphenate2()

* example.c: fix empty input fault in single_hyphenations()
(unsigned return value of strlen() output couldn't be negative in the condition)

* substrings.pl: shorter version with Nanning Buitenhuis's substrings.pl fix.

2006-07-28 Nanning Buitenhuis <nanning at elvenkind dot com>:
* substrings.c: faster C version of substrings.pl
- It also fixed a minor bug in combine(): if a sub-pattern is found twice
(or more) in the main pattern, then all occurences were changed instead
of (the correct) last occurence. Only example in hyphen.us is 'tanta3'

2006-01-27 László Németh <nemeth at OOo>:
* *.{c,h}: add non-standard hyphenation and Unicode support
* README.discretionary: documentation
- add tests/ (see make check)

2005-10-13 Daniel Naber <dnaber at OOo>:
* example.c: fixed the call to hnj_hyphen_hyphenate() in example.c so that
patterns ending in a dot should now work (Daniel Naber)

Libhnj was written by Raph Levien <[email protected]>
Adapted to OpenOffice.org by Peter Novodvorsky <[email protected]>
Loading

0 comments on commit 21127cc

Please sign in to comment.