2011-03-19 10:44  joostvb

	* [r8969] NEWS: release 0.4.3

2011-03-19 10:44  joostvb

	* [r8968] NEWS: record changes

2011-03-17 15:48  sloot

	* [r8936] src/ucto.cxx: fixed --passthru problem

2011-03-17 15:48  sloot

	* [r8935] tests/testpassthru.ok: test now works

2011-03-17 09:31  sloot

	* [r8896] configure.ac: bumped version after release

2011-03-16 13:41  sloot

	* [r8877] include/ucto/tokenize.h, src/tokenize.cxx,
	  src/unicode.cxx: passes -pedantic

2011-03-16 12:59  sloot

	* [r8871] configure.ac: some cleanup
	  bumped version

2011-03-14 11:12  sloot

	* [r8843] include/ucto/tokenize.h, src/tokenize.cxx, src/ucto.cxx:
	  added (hidden) --passthru option

2011-03-14 11:12  sloot

	* [r8842] tests/passthru.txt, tests/testall, tests/testpassthru,
	  tests/testpassthru.ok: added a passthru test.
	  has t0 be tested :)

2011-03-14 10:07  sloot

	* [r8839] src/tokenize.cxx: initialize your variables, please.

2011-03-14 10:01  sloot

	* [r8838] include/ucto/tokenize.h, src/tokenize.cxx: make compiler
	  more happy

2011-03-14 09:16  sloot

	* [r8837] docs/ucto.1: added description

2011-03-14 09:02  sloot

	* [r8836] docs/ucto.1: smal update

2011-03-12 15:48  joostvb

	* [r8811] ucto.spec: new upstream

2011-03-11 16:53  mvgompel

	* [r8796] src/tokenize.cxx: fixed passthruline (skip=t) bug

2011-03-08 16:01  sloot

	* [r8751] src/tokenize.cxx: fixed regexp and error messag

2011-03-08 15:54  sloot

	* [r8750] config/tokconfig-nl, src/tokenize.cxx: added the
	  possiblity to ste the order of RULES in the config file

2011-03-08 15:52  sloot

	* [r8749] tests/testnormalisation.ok: strange a space has left the
	  building

2011-03-08 13:47  mvgompel

	* [r8748] tests/bug0063.nl.tok.V: fix

2011-03-08 13:47  mvgompel

	* [r8747] tests/bug0063.nl.tok.V: override

2011-03-08 13:42  mvgompel

	* [r8746] config/tokconfig-nl, src/tokenize.cxx,
	  tests/bug0052.nl.tok.V, tests/normalisation.nl.tok.V,
	  tests/test.nl.tok.V: fix bug0063

2011-03-08 12:59  sloot

	* [r8745] tests/bug0063.nl.tok.V, tests/bug0063.nl.txt: added a
	  test for bug63
	  Not sure about the 'correct' solution

2011-03-08 10:02  sloot

	* [r8742] docs/ucto.1: updated man page

2011-03-07 16:05  mvgompel

	* [r8726] src/tokenize.cxx: FoLiA XSL has to be local unfortunately

2011-03-07 14:42  mvgompel

	* [r8721] configure.ac, src/tokenize.cxx: fix

2011-03-07 14:38  mvgompel

	* [r8720] src/tokenize.cxx: fix

2011-03-04 13:26  mvgompel

	* [r8687] logo.svg: added logo

2011-03-04 11:44  mvgompel

	* [r8683] src/tokenize.cxx: fix

2011-03-03 15:23  mvgompel

	* [r8678] src/tokenize.cxx: oops

2011-03-03 15:22  mvgompel

	* [r8677] src/tokenize.cxx: no body in FoLiA anymore

2011-03-01 13:59  sloot

	* [r8637] configure.ac: bumped version. seem a good moment for 0.4

2011-03-01 13:43  sloot

	* [r8636] tests/testoption1.ok, tests/testusage.ok: these tests
	  give a different outcome now.

2011-02-28 15:36  mvgompel

	* [r8620] src/ucto.cxx: oops: the sequel

2011-02-28 15:35  mvgompel

	* [r8619] src/ucto.cxx: oops

2011-02-28 15:32  mvgompel

	* [r8618] include/ucto/tokenize.h, src/tokenize.cxx, src/ucto.cxx:
	  sentence per line input and output: two modes

2011-02-28 15:11  mvgompel

	* [r8617] src/tokenize.cxx, tests/bug0048.nl.tok.V,
	  tests/bug0054.nl.tok.V: Fixed bug 54

2011-02-28 13:46  mvgompel

	* [r8615] src/tokenize.cxx, tests/abbreviations.nl.tok.V,
	  tests/nu.nl.tok.V, tests/test.nl.tok.V: fixes

2011-02-28 13:46  mvgompel

	* [r8614] src/tokenize.cxx: FoLiA improvement

2011-02-27 15:34  joostvb

	* [r8593] ucto.spec:

2011-02-27 15:06  joostvb

	* [r8592] ucto.spec: first shot at ucto rpm

2011-02-27 10:35  joostvb

	* [r8576] fink/ucto.info: split, in order to be able to build frog

2011-02-27 08:45  joostvb

	* [r8565] fink/ucto.info: shlibs stuff. ship it

2011-02-26 15:31  joostvb

	* [r8563] fink, fink/ucto.info: first shot at building ucto for
	  fink/macosx

2011-02-14 13:35  sloot

	* [r8318] src/tokenize.cxx: added experimental code to use the -n
	  option ( output one sentence per line) also to process the input
	  one sentence per line

2011-02-14 13:18  sloot

	* [r8317] tests/bug0054.nl.tok.V, tests/bug0054.nl.txt: testcase
	  for bug0054

2011-02-12 18:09  joostvb

	* [r8299] NEWS, configure.ac: 0.3.6 is released. start 0.3.7

2011-02-12 18:04  joostvb

	* [r8298] NEWS: ship it

2011-02-12 18:00  joostvb

	* [r8297] NEWS: record more changes

2011-02-12 17:42  joostvb

	* [r8296] NEWS: record some changes

2011-02-10 16:59  sloot

	* [r8291] tests/testencoding2: missing file added

2011-02-10 16:33  sloot

	* [r8289] tests/encoding2.nl, tests/testall,
	  tests/testencoding2.ok: added a test for the -e option

2011-02-10 16:18  sloot

	* [r8288] src/tokenize.cxx: ok, accept any encoding string, now.
	  Throw up when not known.

2011-02-10 16:03  sloot

	* [r8286] tests/testoption1.ok, tests/testusage.ok: adapted tests
	  to the reality

2011-02-10 15:43  sloot

	* [r8283] configure.ac, include/ucto/tokenize.h, src/tokenize.cxx,
	  src/ucto.cxx, tests/testnormalisation: added possibility to set
	  the inputEncoding
	  breaks ucto user interface!

2011-02-10 11:51  mvgompel

	* [r8280] configure.ac: version bump

2011-02-10 11:51  mvgompel

	* [r8279] include/ucto/tokenize.h, src/tokenize.cxx,
	  tests/bug0051.nl.tok.V, tests/bug0052.nl.tok.V, tests/test.py:
	  Fixed quote problems bug0052, bug0051

2011-02-09 12:27  mvgompel

	* [r8274] src/tokenize.cxx: debug fix

2011-02-09 12:04  mvgompel

	* [r8273] tests/testfiles1.ok, tests/testfiles2.ok,
	  tests/testoption1.ok, tests/testoption2.ok: fixed tests

2011-02-09 11:55  mvgompel

	* [r8272] src/tokenize.cxx: added more debug

2011-02-09 11:43  mvgompel

	* [r8271] src/ucto.cxx: updated error messages

2011-02-09 11:08  sloot

	* [r8270] src/tokenize.cxx: better error message

2011-02-08 17:14  sloot

	* [r8264] include/ucto/tokenize.h, src/tokenize.cxx: more useful
	  naming

2011-02-08 17:07  sloot

	* [r8263] include/ucto/tokenize.h: ok, this is what is meant.

2011-02-08 16:43  sloot

	* [r8259] tests/bug0051.nl.tok.V: make the test fail again. this is
	  dead wrong

2011-02-08 16:27  sloot

	* [r8258] tests/bug0052.nl.tok.V, tests/bug0052.nl.txt: added a
	  demo for bug52

2011-02-08 10:27  sloot

	* [r8251] include/ucto/tokenize.h, src/tokenize.cxx: fixed bug51.
	  Not realy happy

2011-02-08 10:24  sloot

	* [r8250] tests/bug0051.nl.tok.V: this is what we want

2011-02-08 09:39  sloot

	* [r8249] tests/bug0051.nl.tok.V, tests/bug0051.nl.txt: added a
	  demo of bug51

2011-02-07 16:17  sloot

	* [r8243] tests/units.nl.tok.V, tests/units.nl.txt: New test. OK
	  file is NOT OK :)

2011-02-07 16:09  sloot

	* [r8242] tests/testall: cleanup

2011-02-07 14:13  sloot

	* [r8237] src/tokenize.cxx: bit of cleanup

2011-02-03 15:31  sloot

	* [r8216] src/ucto.cxx: fix memory leak

2011-02-03 15:24  sloot

	* [r8215] tests/test.py, tests/testall: - testall now tests all.
	  - hacked around in python without real knowledge to support
	  Valgrind testing

2011-02-03 13:00  sloot

	* [r8212] include/ucto/tokenize.h, src/tokenize.cxx: removed
	  strange stray '\'

2011-02-03 12:55  sloot

	* [r8211] include/ucto/tokenize.h, src/tokenize.cxx: last "elegant"
	  solution was dead wrong (but passed all tests)
	  this one is much better (and passes all tests)
	  Maybe the tests are wrong?

2011-02-03 11:00  sloot

	* [r8206] src/tokenize.cxx: more elegant ;)

2011-02-03 10:57  sloot

	* [r8205] tests/testoption2.ok: .

2011-02-03 10:00  sloot

	* [r8204] tests/testall, tests/testoption2, tests/testoption2.ok:
	  added a test for invalid combination of options

2011-02-03 09:59  sloot

	* [r8203] src/ucto.cxx: make sure -L and -c don't intermix

2011-02-02 15:42  sloot

	* [r8196] ., docs: props set

2011-02-02 14:14  sloot

	* [r8186] include/ucto/tokenize.h, include/ucto/unicode.h,
	  src/tokenize.cxx, src/ucto.cxx, src/unicode.cxx: ID ==> Id

2011-02-02 14:12  sloot

	* [r8184] include/ucto/unicode.h: tag

2011-02-02 14:11  sloot

	* [r8183] include/ucto/tokenize.h, include/ucto/unicode.h,
	  src/tokenize.cxx, src/ucto.cxx: tags

2011-02-02 14:08  sloot

	* [r8182] include/ucto/tokenize.h, include/ucto/unicode.h,
	  src/Makefile.am, src/tokenize.cxx, src/ucto.cxx, src/unicode.cxx:
	  more copyright

2011-02-02 14:05  sloot

	* [r8181] include/ucto/tokenize.h, src/unicode.cxx: added copyright
	  notice

2011-02-02 11:53  sloot

	* [r8171] include/ucto/unicode.h, src/unicode.cxx: we want these
	  fuction in the std namespace, i think

2011-02-02 11:35  sloot

	* [r8170] include/ucto/tokenize.h, include/ucto/unicode.h,
	  src/tokenize.cxx, src/ucto.cxx, src/unicode.cxx: more clean
	  implementation. hide normalization in a Class

2011-02-02 11:34  sloot

	* [r8169] tests/testoption1.ok, tests/testusage.ok: changed
	  spelling

2011-02-02 10:17  sloot

	* [r8168] include/ucto/Makefile.am, include/ucto/tokenize.h,
	  include/ucto/unicode.h, src/Makefile.am, src/tokenize.cxx,
	  src/unicode.cxx: put some usefull unicode stuff in separate
	  module

2011-02-01 15:41  sloot

	* [r8155] tests/normalisation.nl.tok.V: this one was missing

2011-02-01 15:34  mvgompel

	* [r8154] tests/test.py: nog een poging

2011-02-01 15:30  mvgompel

	* [r8153] tests/test.py: fix

2011-02-01 15:24  mvgompel

	* [r8152] tests/test.py: back geport

2011-02-01 15:12  sloot

	* [r8151] src/tokenize.cxx: oesp

2011-02-01 14:26  sloot

	* [r8148] tests/bug0001.nl.tok.V, tests/bug0001.nl.txt,
	  tests/bug0048.nl.tok.V, tests/bug0048.nl.txt: renamed. use same
	  number as in Bugzilla:
	  https://ilk.uvt.nl/bugzilla3//show_bug.cgi?id=48

2011-02-01 14:15  sloot

	* [r8146] tests/testall, tests/testconf1.ok, tests/testconf2.ok,
	  tests/testfiles1.ok, tests/testlanguage.ok, tests/testusage.ok:
	  Ok, it is a bit harsh, but strip all path info, to make the tests
	  worl "everywhere"

2011-02-01 11:42  sloot

	* [r8136] tests/testall, tests/testoption1.ok, tests/testusage.ok:
	  testresults have changed

2011-02-01 11:11  sloot

	* [r8135] tests/testall, tests/testencoding, tests/testencoding.ok,
	  tests/testnormalisation, tests/testnormalisation.ok: better
	  naming

2011-02-01 11:09  sloot

	* [r8134] tests/normalisation.nl.txt, tests/testencoding,
	  tests/testencoding.ok: added a test for several Normalisation
	  types

2011-02-01 10:45  sloot

	* [r8133] src/tokenize.cxx: now actualy use -e values for
	  normalisation

2011-02-01 10:25  sloot

	* [r8132] include/ucto/tokenize.h, src/tokenize.cxx, src/ucto.cxx:
	  -f option now works

2011-02-01 10:25  sloot

	* [r8131] tests/testall: better test

2011-01-31 16:52  sloot

	* [r8126] config/tokconfig-nl: cleaned up non-dutch eosmarker
	  attempt to add 0x0a (\n) as eosmarker failed

2011-01-31 15:43  sloot

	* [r8123] tests/munten.nl.tok.V, tests/munten.nl.txt: new test
	  added (.V is empty, to gat a nice diff)

2011-01-31 15:12  sloot

	* [r8122] tests/ligaturen.nl.tok.V, tests/ligaturen.nl.txt: test
	  more

2011-01-31 15:04  sloot

	* [r8121] tests/encoding.nl.tok.V: this is the desired result (NFC)

2011-01-31 15:01  sloot

	* [r8120] include/ucto/tokenize.h, src/tokenize.cxx, src/ucto.cxx:
	  more to-the-point naming

2011-01-31 14:47  sloot

	* [r8119] config/tokconfig-nl, include/ucto/tokenize.h,
	  src/tokenize.cxx, src/ucto.cxx: added support for ligature
	  filtering and Unicode normalizing.
	  a bit rough still

2011-01-31 13:40  sloot

	* [r8117] ucto.pc.in: now uses ucto-icu.pc

2011-01-31 10:46  sloot

	* [r8091] tests/bug0001.nl.tok.V, tests/bug0001.nl.txt,
	  tests/bug01.nl.tok.V, tests/bug01.nl.txt: to please Maarten even
	  more

2011-01-31 10:45  sloot

	* [r8090] tests/bug01.nl.tok.V, tests/bug01.nl.txt: zo beter dan
	  Maarten?

2011-01-31 10:32  sloot

	* [r8086] tests/bug01.nl.tok.V, tests/bug01.nl.txt: our first
	  officila bug!

2011-01-27 16:28  joostvb

	* [r8060] NEWS: cosmetics

2011-01-27 16:18  sloot

	* [r8059] tests/test.nl.tok, tests/test.nl.tok.V,
	  tests/test.nl.txt, tests/test.py, tests/testall, tests/testconf1,
	  tests/testconf1.ok, tests/testconf2, tests/testconf2.ok,
	  tests/testfiles1, tests/testfiles1.ok, tests/testfiles2,
	  tests/testfiles2.ok, tests/testlanguage, tests/testlanguage.ok,
	  tests/testoption1, tests/testoption1.ok, tests/testusage,
	  tests/testusage.ok: props set

2011-01-27 16:14  joostvb

	* [r8058] NEWS, configure.ac: release 0.3.4, record changes

2011-01-27 16:13  sloot

	* [r8057] tests/testoption1: props

2011-01-27 16:08  sloot

	* [r8056] tests: propset

2011-01-27 16:08  sloot

	* [r8055] tests: propset

2011-01-27 16:06  sloot

	* [r8054] tests/stuk1.cfg, tests/stuk2.cfg, tests/testall,
	  tests/testoption1, tests/testoption1.ok: more tests
	  forgot some files

2011-01-27 16:06  joostvb

	* [r8053] Makefile.am, configure.ac, icu.pc.in, ucto-icu.pc.in:
	  rename icu.pc to ucto-icu.pc: be sure we wont suffer from
	  filename clashes in the future

2011-01-27 16:01  sloot

	* [r8052] tests/testall, tests/testfiles1, tests/testfiles1.ok,
	  tests/testfiles2, tests/testfiles2.ok: more tests added

2011-01-27 16:00  mvgompel

	* [r8051] include/ucto/tokenize.h, src/tokenize.cxx: Updates in
	  FoLiA support

2011-01-27 15:50  sloot

	* [r8050] tests/testall: better?

2011-01-27 15:46  sloot

	* [r8049] tests/testall, tests/testconf1, tests/testconf1.ok,
	  tests/testconf2, tests/testconf2.ok: more tests

2011-01-27 15:38  sloot

	* [r8047] tests/testall, tests/testlanguage, tests/testlanguage.ok,
	  tests/testusage, tests/testusage.ok: added some scripts to test
	  illegal parameters etc.

2011-01-27 13:51  sloot

	* [r8043] tests/ligaturen.nl.tok.V, tests/ligaturen.nl.txt: een
	  interessante test. De .V is NU wel ok, maar ik vind dat ie fout
	  moet worden ;)

2011-01-27 13:44  sloot

	* [r8042] tests/encoding.nl.tok.V, tests/encoding.nl.txt: added a
	  test with combining vs precombined characters

2011-01-27 12:23  joostvb

	* [r8040] NEWS, configure.ac: release 0.3.3

2011-01-27 12:18  joostvb

	* [r8039] NEWS: record changes and releases

2011-01-27 11:57  joostvb

	* [r8038] config/Makefile.am: tags

2011-01-27 11:57  joostvb

	* [r8037] config/Makefile.am: honor --sysconfdir in ./configure
	  call

2011-01-27 11:56  joostvb

	* [r8036] bootstrap, configure.ac: tags

2011-01-27 11:45  sloot

	* [r8034] include/ucto/tokenize.h: added missing output operator
	  declaration

2011-01-27 11:13  sloot

	* [r8030] configure.ac: bumped version after "release" of 0.3.1

2011-01-26 17:17  sloot

	* [r8019] Makefile.am, configure.ac: install makefile too

2011-01-26 17:08  sloot

	* [r8018] Makefile.am, configure.ac, docs/Makefile.am,
	  include/ucto/tokenize.h: fixed bug, bumped version, added
	  Makefile and such

2011-01-18 15:34  sloot

	* [r7866] src/tokenize.cxx: some formatting, comment and such

2011-01-18 15:09  sloot

	* [r7865] src/tokenize.cxx: leaner ande meaner

2011-01-18 15:04  sloot

	* [r7864] include/ucto/tokenize.h, src/tokenize.cxx: refactoring
	  detectSentenceBounds()
	  tricky!

2011-01-18 14:35  sloot

	* [r7863] src/tokenize.cxx: much simpler. No limit to length of
	  "opening quote" string.

2011-01-18 12:46  sloot

	* [r7854] src/tokenize.cxx: added << operator for TokenRole (and
	  use is)

2011-01-18 12:37  sloot

	* [r7853] include/ucto/tokenize.h, src/tokenize.cxx: get rid of
	  unused function

2011-01-18 12:34  sloot

	* [r7852] include/ucto/tokenize.h: made clear() private.

2011-01-18 12:21  sloot

	* [r7851] include/ucto/tokenize.h, src/tokenize.cxx: make stream
	  parameters a bit more C++ like

2011-01-18 10:57  sloot

	* [r7849] include/ucto/tokenize.h, src/tokenize.cxx: cleaned up
	  TokenRole stuff

2011-01-18 10:32  sloot

	* [r7848] src/tokenize.cxx: refactoring...

2011-01-18 10:05  sloot

	* [r7847] src/tokenize.cxx: some refactoring going on

2011-01-13 16:46  sloot

	* [r7831] src/tokenize.cxx: comment added

2011-01-13 16:45  sloot

	* [r7830] config/tokconfig-de, config/tokconfig-en,
	  config/tokconfig-fr, config/tokconfig-it, config/tokconfig-sv:
	  all configs include the same URL and E-MAIL rules now

2011-01-13 16:34  sloot

	* [r7829] configure.ac: minor version bump

2011-01-13 16:33  sloot

	* [r7828] config/tokconfig-nl, src/tokenize.cxx: ignore sysconfdir
	  when filename contains a '/' ( so for ../rel/ati/ve and
	  /abs/olute/path )

2011-01-13 16:27  sloot

	* [r7827] src/tokenize.cxx: barf louder when no config found

2011-01-13 16:22  sloot

	* [r7826] config/e-mail.rule: nu wel

2011-01-13 16:20  sloot

	* [r7825] config/tokconfig-nl: uninclude

2011-01-13 15:53  sloot

	* [r7824] config/Makefile.am, config/tokconfig-nl: e-mail rule
	  created

2011-01-13 15:48  sloot

	* [r7823] config/Makefile.am, config/tokconfig-nl, config/url.rule,
	  include/ucto/tokenize.h, src/tokenize.cxx, src/ucto.cxx: added
	  simpel include mechanism. Not very robust.
	  Dont forget to add .rule files to config/Makefile.am !

2011-01-13 14:57  mvgompel

	* [r7822] config/tokconfig-nl: fix

2011-01-13 14:41  mvgompel

	* [r7821] src/tokenize.cxx, tests/quotetest_multiline.nl.tok.V,
	  tests/quotetest_multisentence.nl.tok.V,
	  tests/quotetest_onesentence.nl.tok.V, tests/test.nl.tok.V: fixes
	  and test update

2011-01-13 14:32  mvgompel

	* [r7820] include/ucto/tokenize.h, src/tokenize.cxx: added
	  detectQuoteBounds

2011-01-13 14:24  sloot

	* [r7819] src/tokenize.cxx: adde a few more sensible default

2011-01-13 14:23  sloot

	* [r7818] docs/todo: added a file to store useful tips for the
	  upcoming manual

2011-01-13 14:10  mvgompel

	* [r7817] tests/nu.nl.tok.V: added test text from NU.nl

2011-01-13 14:07  mvgompel

	* [r7816] tests/nu.nl.txt, tests/test2.nl.txt: renamed

2011-01-13 14:03  mvgompel

	* [r7815] tests/abbreviations.nl.tok.V, tests/abbreviations.nl.txt,
	  tests/datetime.nl.tok.V, tests/datetime.nl.txt,
	  tests/genitive.nl.tok.V, tests/genitive.nl.txt,
	  tests/multisentenceoneline.nl.tok.V,
	  tests/multisentenceoneline.nl.txt,
	  tests/onesentencemultiline.nl.tok.V,
	  tests/onesentencemultiline.nl.txt, tests/parentheses.nl.tok.V,
	  tests/parentheses.nl.txt, tests/quotetest_nested.nl.tok.V,
	  tests/test2.nl.txt, tests/urls.nl.tok.V, tests/urls.nl.txt: Added
	  lots of tests

2011-01-13 13:52  sloot

	* [r7814] src/tokenize.cxx: overthrown

2011-01-13 13:41  sloot

	* [r7813] src/tokenize.cxx: less exit, more throw

2011-01-13 13:22  sloot

	* [r7811] include/ucto/tokenize.h: added a setter voor the Error
	  logfile
	  otherwise logging/debugging is impossible when ucto is included
	  in a daemon
	  (which as we all now, closes stderr)
	  This was the sole reason for having theErrLog anyway

2011-01-13 12:04  mvgompel

	* [r7810] tests/test.py: fix

2011-01-13 12:02  mvgompel

	* [r7809] tests/test.py: test wrapper done

2011-01-13 12:02  mvgompel

	* [r7808] tests/quotetest_multiline.nl.tok.V,
	  tests/quotetest_multiline.nl.txt,
	  tests/quotetest_multisentence.nl.tok.V,
	  tests/quotetest_multisentence.nl.txt,
	  tests/quotetest_nested.nl.txt,
	  tests/quotetest_nosentence.nl.tok.V,
	  tests/quotetest_nosentence.nl.txt,
	  tests/quotetest_onesentence.nl.tok.V,
	  tests/quotetest_onesentence.nl.txt, tests/test.py,
	  tests/testall.py: added quote tests, updated test script

2011-01-13 12:02  mvgompel

	* [r7807] tests/testall.py: fix

2011-01-13 12:01  sloot

	* [r7806] src/tokenize.cxx, src/ucto.cxx: use our own exceptions

2011-01-13 11:44  mvgompel

	* [r7805] tests/testall.py: fixes

2011-01-13 11:44  mvgompel

	* [r7804] tests/testall.py: added test wrapper

2011-01-13 11:38  sloot

	* [r7803] src/tokenize.cxx: beterst

2011-01-13 11:29  sloot

	* [r7802] src/tokenize.cxx: beterder

2011-01-13 11:26  sloot

	* [r7801] src/tokenize.cxx, src/ucto.cxx: throw more sensible
	  exceptions

2011-01-12 15:52  sloot

	* [r7798] ., m4, src: propset

2011-01-12 15:43  sloot

	* [r7797] config/tokconfig-nl, src/tokenize.cxx: quotation
	  separator is now whitespace (spaces and/or tabs)

2011-01-12 14:49  sloot

	* [r7796] tests/qtest.spec.tok, tests/qtest.spec.tok.V: added
	  changed testresults

2011-01-12 14:48  sloot

	* [r7795] config/tokconfig-nl, src/tokenize.cxx, tests/qtest.spec:
	  better handle the 'multiple opening, single closing' quotation
	  problem
	  added test

2011-01-12 14:24  sloot

	* [r7794] tests/qtest.spec.tok, tests/qtest.spec.tok.V: the test
	  was changed. results are not OK, I think. (nested quotes)

2011-01-12 14:22  sloot

	* [r7793] config/tokconfig-nl, src/tokenize.cxx: first attempt to
	  read QUOTES from config file
	  syntax undecided yet
	  what about reasonable defaults?

2011-01-12 13:37  sloot

	* [r7792] src/tokenize.cxx: started enhancin quote handling

2011-01-12 13:29  mvgompel

	* [r7791] src/tokenize.cxx: small adaptation to FoLiA format

2011-01-11 17:32  sloot

	* [r7788] tests/qtest.spec, tests/test.nl.tok, tests/test.nl.tok.V:
	  test.nl now seems to work
	  added something naughty to qtest.spec

2011-01-11 17:18  mvgompel

	* [r7787] src/tokenize.cxx: trailing space fix

2011-01-11 16:45  sloot

	* [r7786] src/tokenize.cxx: first step in more clean quote handling

2011-01-11 15:59  sloot

	* [r7785] tests/qtest.spec, tests/qtest.spec.tok,
	  tests/qtest.spec.tok.V: added a simple test for 'special' quotes.

2011-01-11 15:27  sloot

	* [r7784] tests/test.nl.tok, tests/test.nl.tok.V: somewaht better
	  results, but still strange!

2011-01-11 15:01  mvgompel

	* [r7783] src/tokenize.cxx: fix

2011-01-11 14:38  mvgompel

	* [r7781] src/tokenize.cxx: oops, forgot )

2011-01-11 14:32  mvgompel

	* [r7780] src/tokenize.cxx: omgekeerd is beter denk ik

2011-01-11 14:32  mvgompel

	* [r7779] src/tokenize.cxx: fixes

2011-01-11 14:29  sloot

	* [r7778] tests/qtest.nl, tests/qtest.nl.tok, tests/qtest.nl.tok.V:
	  added test for quotation

2011-01-11 14:24  mvgompel

	* [r7777] configure.ac, include/ucto/tokenize.h, src/tokenize.cxx:
	  Important bugfixes!

2011-01-11 13:26  mvgompel

	* [r7776] src/tokenize.cxx: extra debug level

2011-01-11 13:02  sloot

	* [r7775] test.nl.txt, tests, tests/test.nl.tok,
	  tests/test.nl.tok.V, tests/test.nl.txt: created a tests
	  directory.
	  added tokenized ans Verbose Tokenized output for comparision

2011-01-11 11:59  sloot

	* [r7774] include/ucto/tokenize.h, src/tokenize.cxx: fixed all
	  warnings.
	  Not very smart yet.

2011-01-11 09:16  sloot

	* [r7773] docs/ucto.1: updatet Man page

2011-01-10 18:03  mvgompel

	* [r7771] include/ucto/tokenize.h, src/tokenize.cxx: removed some
	  obsolete things

2011-01-10 18:01  mvgompel

	* [r7770] src/tokenize.cxx: fixes, quote output in FoLiA works now

2011-01-10 17:45  mvgompel

	* [r7769] src/tokenize.cxx: fixes

2011-01-10 17:33  mvgompel

	* [r7768] include/ucto/tokenize.h, src/tokenize.cxx: bugfixes

2011-01-10 17:15  mvgompel

	* [r7767] include/ucto/tokenize.h, src/tokenize.cxx: improved debug
	  mode + bugfixes

2011-01-10 15:44  mvgompel

	* [r7765] include/ucto/tokenize.h, src/tokenize.cxx: Added a new
	  passthru mode for pre-tokenised text

2011-01-10 14:58  mvgompel

	* [r7762] src/tokenize.cxx: oops 2, removed debug line

2011-01-10 14:57  mvgompel

	* [r7761] src/tokenize.cxx: oops, fixed segfault

2011-01-10 14:29  mvgompel

	* [r7760] configure.ac, include/ucto/tokenize.h, src/tokenize.cxx,
	  src/ucto.cxx: major refactoring. Improved buffering, less
	  unnecessary storing of token/sentence vectors in memory. Improved
	  quote support.

2011-01-10 11:42  mvgompel

	* [r7759] src/tokenize.cxx: set BOS in sentences in quotes

2011-01-10 11:36  mvgompel

	* [r7758] src/tokenize.cxx: fixed EOS detection before punctuation

2011-01-10 11:12  mvgompel

	* [r7756] include/ucto/tokenize.h, src/tokenize.cxx: working on
	  proper output with quote support

2011-01-07 17:49  mvgompel

	* [r7754] src/ucto.cxx: updated some options

2011-01-07 15:54  mvgompel

	* [r7753] configure.ac, include/ucto/tokenize.h: version bump

2011-01-07 15:49  mvgompel

	* [r7752] include/ucto/tokenize.h, src/tokenize.cxx: Ucto now
	  remembers if a token was spaced or not in the original. Enabling
	  ucto to recontruct the original text exactly.

2011-01-07 14:04  mvgompel

	* [r7749] include/ucto/tokenize.h, src/tokenize.cxx: Added quote
	  detection support

2011-01-06 18:25  mvgompel

	* [r7746] src/tokenize.cxx: update?

2011-01-06 18:04  mvgompel

	* [r7745] src/tokenize.cxx: more debug update

2011-01-06 18:02  mvgompel

	* [r7744] src/tokenize.cxx: debug updates

2011-01-06 17:52  mvgompel

	* [r7743] src/ucto.cxx: updated help

2011-01-06 17:49  mvgompel

	* [r7742] src/tokenize.cxx: more sensible debug messages

2011-01-06 17:35  mvgompel

	* [r7741] configure.ac: I think that all changes merit a version
	  bump now

2011-01-06 17:35  mvgompel

	* [r7740] src/tokenize.cxx, test.nl.txt: added my testset

2011-01-06 16:57  mvgompel

	* [r7739] include/ucto/tokenize.h, src/tokenize.cxx: important
	  fixes in paragraph/sentence handling

2011-01-06 15:57  mvgompel

	* [r7738] src/tokenize.cxx: some fixes in sentence/paragraph
	  detection (not enough yet)

2011-01-06 15:31  sloot

	* [r7737] src/ucto.cxx: now we can debug init() too

2011-01-06 15:28  sloot

	* [r7736] src/tokenize.cxx, src/ucto.cxx: fixing

2011-01-06 15:04  sloot

	* [r7735] src/ucto.cxx: refuse to run when inputfile is bad

2011-01-06 14:58  mvgompel

	* [r7734] src/tokenize.cxx: consistency updates

2011-01-06 14:51  mvgompel

	* [r7733] include/ucto/tokenize.h, src/tokenize.cxx: set docid to
	  string, assuming utf-8

2011-01-06 14:46  mvgompel

	* [r7732] include/ucto/tokenize.h, src/tokenize.cxx, src/ucto.cxx:
	  paragraph fix in XML output

2011-01-06 14:29  mvgompel

	* [r7731] include/ucto/tokenize.h, src/tokenize.cxx: fixed
	  beginofsentence bug

2011-01-06 13:55  mvgompel

	* [r7730] src/ucto.cxx: merge fix

2011-01-06 13:50  mvgompel

	* [r7729] include/ucto/tokenize.h, src/tokenize.cxx, src/ucto.cxx:
	  Added preliminary FoLiA XML output support in ucto

2011-01-06 13:50  mvgompel

	* [r7728] include/ucto/tokenize.h, src/tokenize.cxx, src/ucto.cxx:
	  fix

2011-01-06 13:12  sloot

	* [r7727] docs, docs/ucto.1: added a simple 'man' page

2011-01-06 12:53  sloot

	* [r7726] src/ucto.cxx: terrible and destructive bug!

2011-01-06 12:52  sloot

	* [r7725] TODO: to do

2011-01-06 11:42  sloot

	* [r7724] src/ucto.cxx: added al -p switch to disable paragraph
	  detection
	  how do I test that?

2011-01-06 11:14  sloot

	* [r7723] include/ucto/tokenize.h, src/tokenize.cxx, src/ucto.cxx:
	  -d option now works

2011-01-05 15:40  mvgompel

	* [r7716] src/ucto.cxx: version update

2011-01-05 15:30  mvgompel

	* [r7714] configure.ac, src/ucto.cxx: Updated version

2011-01-05 15:07  pberck

	* [r7713] config/tokconfig-sv: Small fix.

2011-01-05 15:05  mvgompel

	* [r7712] include/ucto/tokenize.h, src/tokenize.cxx, src/ucto.cxx:
	  Big API overhaul

2011-01-05 13:43  sloot

	* [r7707] Makefile.am: fix missing ChangeLog message while
	  bootstrapping

2011-01-05 13:36  pberck

	* [r7706] config/Makefile.am, config/tokconfig-se,
	  config/tokconfig-sv: Renamed Swedish tokconfig to tokconfig-sv
	  according to ISO-639-1.

2011-01-05 13:31  pberck

	* [r7705] config/Makefile.am, config/tokconfig-se: First attempt at
	  Swedish tokconfig added.

2010-12-27 12:22  mvgompel

	* [r7615] README: typo

2010-12-27 12:22  mvgompel

	* [r7614] README: updated

2010-12-27 12:22  mvgompel

	* [r7613] README: upaded README

2010-12-25 13:05  joostvb

	* [r7605] config/Makefile.am: use sysconfdir variable, not
	  hardcoded etc/

2010-12-25 12:59  joostvb

	* [r7604] Makefile.am: do not install changelog, packagers take
	  care of that

2010-12-25 12:23  joostvb

	* [r7600] NEWS, configure.ac: release 0.0.1 before releasing 1.0

2010-12-25 12:18  joostvb

	* [r7599] ChangeLog: ChangeLog is autogenerated, see bootstrap

2010-12-16 10:23  sloot

	* [r7443] configure.ac: '==" is illegal, but works on GNU system

2010-12-16 09:18  sloot

	* [r7440] configure.ac, m4/ax_icu_check.m4: adapting pberck's
	  MacOSX adaptations

2010-12-16 09:18  sloot

	* [r7439] COPYING: hmm

2010-12-16 06:50  pberck

	* [r7435] COPYING, m4/ax_icu_check.m4: Fix for compiling on OS X.

2010-12-15 21:19  antalb

	* [r7434] src/Makefile.am: changed Tokenizer to ucto

2010-12-15 17:10  sloot

	* [r7433] config/tokconfig-de, config/tokconfig-en,
	  config/tokconfig-fr, config/tokconfig-it, config/tokconfig-nl:
	  now the EOSMARKERS are realy fixed, i hope.

2010-12-15 16:00  sloot

	* [r7430] include/ucto/tokenize.h, src/tokenize.cxx: hmm, less
	  const, otherwise FrogNG chokes

2010-12-15 15:54  sloot

	* [r7429] src/tokenize.cxx: oesp

2010-12-15 15:53  sloot

	* [r7428] src/tokenize.cxx: used a more clean UCI function to
	  extract unicode numbers to real characters

2010-12-15 15:49  sloot

	* [r7427] config/tokconfig-de, config/tokconfig-fr,
	  config/tokconfig-it: fixed other languages too
	  al lot of duplication, can't we do better?

2010-12-15 15:44  sloot

	* [r7425] config/tokconfig-nl: fixed dutch too

2010-12-15 15:40  sloot

	* [r7422] config/tokconfig-en: two characters were wrong
	  some unicode entries were non-standard

2010-12-15 14:58  mvgompel

	* [r7418] src/Makefile.am, src/Tokenizer.cxx, src/ucto.cxx: renamed
	  Tokenizer --> ucto

2010-12-15 14:19  sloot

	* [r7413] include/ucto/tokenize.h, src/tokenize.cxx: more const
	  clean

2010-12-15 14:02  sloot

	* [r7412] include/ucto/tokenize.h: more C++ like

2010-12-15 13:49  sloot

	* [r7411] include/ucto/tokenize.h, src/tokenize.cxx: some
	  reshuffling. make local things more local

2010-12-15 11:57  sloot

	* [r7408] m4/ax_icu_check.m4: hmpfff, wrong messages

2010-12-15 11:42  sloot

	* [r7406] Makefile.am, configure.ac, icu.pc.in, ucto.pc.in:
	  automagicly create an icu.pc file.
	  ICU doesn't provide one (it will do so starting with 4.6)

2010-12-14 16:59  sloot

	* [r7398] m4/ax_icu_check.m4: added more output fields

2010-12-14 13:58  sloot

	* [r7394] m4/ax_icu_check.m4: another attempt to fix it for MacOSX
	  too

2010-12-13 21:28  mvgompel

	* [r7386] src/Tokenizer.cxx: Added default configuration if no -c
	  is specified, also added -L shortcut

2010-12-13 21:00  mvgompel

	* [r7385] include/ucto/tokenize.h, src/Tokenizer.cxx,
	  src/tokenize.cxx: Added features + fixes

2010-12-13 18:20  mvgompel

	* [r7380] src/Tokenizer.cxx: preparing for extra functions

2010-12-13 17:14  sloot

	* [r7379] src/tokenize.cxx: reverted change. Mac only supports
	  archaic ICU versions

2010-12-13 17:00  sloot

	* [r7378] src/tokenize.cxx: attempt. but does it compile on a mac?

2010-12-13 16:49  sloot

	* [r7377] include/ucto/tokenize.h: unused fuction removed

2010-12-13 16:40  sloot

	* [r7376] src/tokenize.cxx: more standard ICU solution

2010-12-13 15:53  sloot

	* [r7367] include/ucto/tokenize.h, src/Tokenizer.cxx,
	  src/tokenize.cxx: merged with some new stuff from FrogNG

2010-12-13 15:07  sloot

	* [r7364] ., config, include, include/ucto, m4, src: properties set

2010-12-13 14:59  sloot

	* [r7363] Makefile.am, configure.ac, ucto.pc.in: build a pkgconfig
	  file

2010-12-13 13:20  sloot

	* [r7351] COPYING: added

2010-12-13 13:16  sloot

	* [r7350] src/Makefile.am: build an use libucto.so

2010-12-13 12:02  sloot

	* [r7346] include/ucto/Makefile.am, include/ucto/tokenize.h,
	  include/ucto/unicode_utils.h, src/Makefile.am, src/tokenize.cxx,
	  src/unicode_utils.cxx: did some cherry picking in unicode_utils.*
	  and included them in tokenize.*

2010-12-09 16:45  sloot

	* [r7260] config/tokconfig-de, config/tokconfig-en,
	  config/tokconfig-fr, config/tokconfig-it, config/tokconfig-nl:
	  oeps i did it again

2010-12-09 16:44  sloot

	* [r7258] ChangeLog, TODO: oeps³

2010-12-09 16:42  sloot

	* [r7257] config/Makefile.am: oeps²

2010-12-09 16:40  sloot

	* [r7256] bootstrap: oesp

2010-12-09 16:37  sloot

	* [r7255] src/Tokenizer.cxx: let's be verbose

2010-12-09 16:09  sloot

	* [r7254] m4/Makefile.am, src/Makefile.am: needed for build system

2010-12-09 16:07  sloot

	* [r7253] include/ucto/tokenize.h, include/ucto/unicode_utils.h,
	  src/Tokenizer.cxx, src/tokenize.cxx, src/unicode_utils.cxx: it
	  works!
	  inclusive a small demo

2010-12-09 14:21  sloot

	* [r7250] AUTHORS, NEWS, README, m4/ax_icu_check.m4: more essential
	  files

2010-12-09 14:16  sloot

	* [r7249] Makefile.am, configure.ac, include/Makefile.am,
	  include/ucto/Makefile.am, include/ucto/tokenize.h,
	  include/ucto/unicode_utils.h, m4, src/tokenize.cxx,
	  src/unicode_utils.cxx: add some files

2010-12-09 14:14  sloot

	* [r7248] config, include, include/ucto, src: created basic dir
	  structure

2010-12-09 11:59  sloot

	* [r7242] .: new dir

