1This ChangeLog file is no longer maintained - see the git repo history for 2more recent changes: https://xapian.org/bleeding 3 4Fri Sep 25 07:30:55 GMT 2015 Olly Betts <olly@survex.com> 5 6 * NEWS: Update. 7 8Fri Sep 25 04:51:20 GMT 2015 Olly Betts <olly@survex.com> 9 10 * myhtmlparse.cc: Remove unused header. 11 12Fri Sep 25 04:49:49 GMT 2015 Olly Betts <olly@survex.com> 13 14 * loadfile.cc,md5wrap.cc: Stop trying to use O_STREAMING - the patch 15 to implement it was never merged into the Linux kernel, and I can't 16 find any evidence that other platforms implement it. The constant 17 value O_STREAMING used now seems to be used for the part of O_SYNC 18 which isn't covered by O_DSYNC, which seems likely to hurt 19 performance if anything. 20 21Fri Sep 25 04:47:06 GMT 2015 Olly Betts <olly@survex.com> 22 23 * omindex.cc: Use SAMPLE_SIZE in help text rather than literal 512. 24 Add '--title-size' option. 25 26Fri Sep 25 04:40:53 GMT 2015 Olly Betts <olly@survex.com> 27 28 * docs/omegascript.rst,query.cc: Fix documentation of $last to say 29 it's the MSet index *one beyond* the end of the current page. 30 Reported by Andrew Chilton. 31 32Fri Sep 25 04:39:54 GMT 2015 Olly Betts <olly@survex.com> 33 34 * docs/overview.rst: SVG extraction is built-in too. 35 36Fri Sep 25 04:32:04 GMT 2015 Olly Betts <olly@survex.com> 37 38 * docs/cgiparams.rst: Document behaviour if xDB is not set. 39 40Fri Sep 25 04:31:15 GMT 2015 Olly Betts <olly@survex.com> 41 42 * docs/cgiparams.rst,query.cc: If xFILTERS is not set, don't force the 43 first page as that's unhelpful if someone fails to set it in their 44 template. 45 46Fri Sep 25 01:22:11 GMT 2015 Olly Betts <olly@survex.com> 47 48 * configure.ac: Don't provide our own implementation of sleep() under 49 __WIN32__ if there's already one - mingw provides one, and in some 50 situations it seems to clash with ours. Reported to xapian-discuss 51 by John Alveris. 52 53Wed May 20 09:57:45 GMT 2015 Olly Betts <olly@survex.com> 54 55 * NEWS: Bump release date for 1.2.21. 56 57Fri May 15 05:32:35 GMT 2015 Olly Betts <olly@survex.com> 58 59 * NEWS: Another update for 1.2.21. 60 61Fri May 15 05:16:05 GMT 2015 Olly Betts <olly@survex.com> 62 63 * Makefile.am,configure.ac: Use -no-install or -no-fast-install when 64 linking test programs which never get installed, which means libtool 65 can often avoid creating a shell script wrapper. 66 67Thu May 14 14:52:14 GMT 2015 Olly Betts <olly@survex.com> 68 69 * NEWS,configure.ac: Update for 1.2.21. 70 71Thu May 14 10:17:32 GMT 2015 Olly Betts <olly@survex.com> 72 73 * NEWS: Update. 74 75Thu May 14 05:16:08 GMT 2015 Olly Betts <olly@survex.com> 76 77 * omindex.cc: Message tweak. 78 79Thu May 14 05:15:18 GMT 2015 Olly Betts <olly@survex.com> 80 81 * outlookmsg2html.in: Fix handling of message/rfc822 subparts. 82 83Thu May 14 04:16:34 GMT 2015 Olly Betts <olly@survex.com> 84 85 * cdb_hash.cc,md5.cc: Remove 'register' as it's deprecated, and 86 likely to just be ignored by any modern compiler anyway. 87 88Wed May 06 10:08:33 GMT 2015 Olly Betts <olly@survex.com> 89 90 * NEWS: Update. 91 92Wed May 06 07:26:28 GMT 2015 Olly Betts <olly@survex.com> 93 94 * omega.cc: Drop compilation date and time from output - they prevent 95 reproducible builds and the version number is sufficient 96 information. 97 98Wed May 06 07:24:27 GMT 2015 Olly Betts <olly@survex.com> 99 100 * commonhelp.cc,omega.cc,omindex.cc,scriptindex.cc: Add spaces between 101 literal strings and macros which expand to literal strings for C++11 102 compatibility. 103 104Wed May 06 04:56:39 GMT 2015 Olly Betts <olly@survex.com> 105 106 * Makefile.am: Add common/stringutils.cc to urlenctest_SOURCES, needed 107 now urldecode.h uses C_isxdigit(). 108 109Wed May 06 04:53:56 GMT 2015 Olly Betts <olly@survex.com> 110 111 * configfile.cc,htmlparse.cc,myhtmlparse.cc,omega.cc,omindex.cc, 112 query.cc,scriptindex.cc,urldecode.h: Consistently use C_isupper(), 113 C_toupper(), etc as these versions aren't affected by the locale 114 setting, and also allow signed char values (so we don't need to 115 cast the argument to unsigned char). 116 117Tue May 05 14:17:16 GMT 2015 Olly Betts <olly@survex.com> 118 119 * templates/query: When listing matching terms, don't make the commas 120 italic. 121 122Tue May 05 12:43:59 GMT 2015 Olly Betts <olly@survex.com> 123 124 * docs/overview.rst: Document 'E' prefixed boolean terms for filtering 125 by extension (see #668, reported by bramvdh). 126 127Tue May 05 12:42:59 GMT 2015 Olly Betts <olly@survex.com> 128 129 * docs/overview.rst: Whitespace cleanup. 130 131Tue May 05 12:41:52 GMT 2015 Olly Betts <olly@survex.com> 132 133 * templates/xml: Add XML declaration. 134 135Tue May 05 12:40:44 GMT 2015 Olly Betts <olly@survex.com> 136 137 * templates/query: Eliminate blank line before <html>. 138 139Tue May 05 12:39:01 GMT 2015 Olly Betts <olly@survex.com> 140 141 * templates/godmode: Return charset utf-8 in the content-type. 142 143Tue May 05 12:29:46 GMT 2015 Olly Betts <olly@survex.com> 144 145 * docs/encodings.rst: $prettyurl undoes %-encoding of UTF-8 in 1.2.21 146 and later too. 147 148Tue May 05 12:27:14 GMT 2015 Olly Betts <olly@survex.com> 149 150 * docs/encodings.rst: Note that one should ensure that Omega gets sent 151 form submissions encoded in UTF-8. 152 153Tue May 05 12:23:09 GMT 2015 Olly Betts <olly@survex.com> 154 155 * docs/encodings.rst: Discuss encodings of filenames (see #550). 156 157Tue May 05 12:20:00 GMT 2015 Olly Betts <olly@survex.com> 158 159 * urldecode.h,urlenctest.cc: $prettyurl now decodes valid UTF-8 160 sequences. Fixes #550 and #644, reported by catkin and terencz. 161 162Tue May 05 12:14:06 GMT 2015 Olly Betts <olly@survex.com> 163 164 * docs/: Add a document about character encoding, as suggested by 165 James Aylett in #550. 166 167Tue May 05 12:08:39 GMT 2015 Olly Betts <olly@survex.com> 168 169 * urldecode.h,urlenctest.cc: Improve decoding done by $prettyurl - we 170 now leave the query and fragment parts of the URL alone and don't 171 decode an escaped "/" (omindex doesn't create URLs with any of 172 these, so we only risk breaking other URLs which have them), and we 173 decode some additional ASCII characters in the path part: 174 []@!$&'()*+.;= (addresses #550 in part) 175 176Wed Mar 04 00:29:44 GMT 2015 Olly Betts <olly@survex.com> 177 178 * NEWS,configure.ac: Update for 1.2.20. 179 180Wed Mar 04 00:03:35 GMT 2015 Olly Betts <olly@survex.com> 181 182 * Backport change from trunk: 183 * configure.ac: Avoid doing link tests with libmagic in configure as 184 they fail on mingw due to not automatically picking up libraries 185 which libmagic itself depends on. 186 187Sat Feb 14 10:46:47 GMT 2015 Olly Betts <olly@survex.com> 188 189 * NEWS: Update. 190 191Tue Jan 27 04:37:12 GMT 2015 Olly Betts <olly@survex.com> 192 193 * docs/overview.rst,omindex.cc: Interpret a command of "false" in 194 "--filter" as meaning to ignore files with that MIME type. 195 196Sat Jan 24 12:14:33 GMT 2015 Olly Betts <olly@survex.com> 197 198 * NEWS: Update. 199 200Thu Jan 22 01:44:01 GMT 2015 Olly Betts <olly@survex.com> 201 202 * omindex.cc: Ignore extensions .msi and .msp, which are Microsoft 203 installer files, but which libmagic sometimes incorrectly identifies 204 as application/msword. 205 206Fri Dec 19 03:21:13 GMT 2014 Olly Betts <olly@survex.com> 207 208 * docs/cgiparams.rst: Improve wording of docs for SORT parameter. 209 210Tue Jan 06 21:15:14 GMT 2015 Olly Betts <olly@survex.com> 211 212 * configure.ac: Use pkg-config in preference to determine flags needed 213 to compile and link with PCRE, as this will just work when 214 cross-compiling (at least under MXE). 215 216Sun Dec 21 21:54:48 GMT 2014 Olly Betts <olly@survex.com> 217 218 * query.cc: Handle [=0 as [=1. 219 220Tue Dec 16 03:53:25 GMT 2014 Olly Betts <olly@survex.com> 221 222 * configure.ac: Define MINGW_HAS_SECURE_API under mingw to get 223 _putenv_s() declared in stdlib.h. 224 225Wed Dec 03 04:17:18 GMT 2014 Olly Betts <olly@survex.com> 226 227 * templates/xml: Update handling of DATE1, DATE2 and DAYSMINUS which 228 were renamed in 0.6.x and the compatibility aliases removed in 229 1.0.0. 230 231Wed Dec 03 04:15:51 GMT 2014 Olly Betts <olly@survex.com> 232 233 * docs/omegascript.rst: Update documentation references to DATE1, 234 DATE2, and DAYSMINUS which were renamed in 0.6.x and the 235 compatibility aliases removed in 1.0.0. 236 237Wed Dec 03 02:29:37 GMT 2014 Olly Betts <olly@survex.com> 238 239 * omindex.cc: Remove unused '#include <fstream>'. 240 241Wed Dec 03 02:18:51 GMT 2014 Olly Betts <olly@survex.com> 242 243 * diritor.h: Fix get_mtime() to return time_t not off_t. In practice, 244 this probably wouldn't have caused issues until at least 2038. 245 246Fri Nov 28 11:38:56 GMT 2014 Olly Betts <olly@survex.com> 247 248 * configure.ac: Move AC_CANONICAL_HOST before first use of $host_os. 249 In practice this wasn't a problem, as LT_INIT implicitly calls 250 AC_CANONICAL_HOST before this point anyway. 251 252Wed Nov 26 03:55:13 GMT 2014 Olly Betts <olly@survex.com> 253 254 * configure.ac: Enable automake option 'subdir-objects' to avoid 255 warning from newer automake. 256 257Tue Oct 21 00:33:58 GMT 2014 Olly Betts <olly@survex.com> 258 259 * NEWS,configure.ac: Update for 1.2.19. 260 261Mon Oct 20 10:52:38 GMT 2014 Olly Betts <olly@survex.com> 262 263 * NEWS: Update. 264 265Mon Oct 20 10:44:27 GMT 2014 Olly Betts <olly@survex.com> 266 267 * Backport change from trunk: 268 * docs/overview.rst: Note that pdftotext is part of poppler as well as 269 xpdf. (Noted by Paul Wise) 270 271Sun Jun 22 04:32:15 GMT 2014 Olly Betts <olly@survex.com> 272 273 * NEWS,configure.ac: Update for 1.2.18. 274 275Sat Jun 21 00:50:56 GMT 2014 Olly Betts <olly@survex.com> 276 277 * Backport change from trunk: 278 * Makefile.am: Don't compile in unixperm.cc - it isn't currently used, 279 and it fails to build with mingw. (fixes #635) 280 281Thu Jun 19 13:56:02 GMT 2014 Olly Betts <olly@survex.com> 282 283 * NEWS: Update. 284 285Thu Jun 19 13:11:43 GMT 2014 Olly Betts <olly@survex.com> 286 287 * omindex.cc,urlencode.cc,urlencode.h: To improve compatibility with 288 existing databases in cases which are OK in practice, only URL 289 encode the starting URL if it contains a character <= 31 or at least 290 one of '#', '%', ':' or '?'. 291 292Thu Jun 19 08:44:22 GMT 2014 Olly Betts <olly@survex.com> 293 294 * Backport change from trunk: 295 * omindex.cc,urlencode.cc,urlencode.h: URL encode starting URL 296 properly. 297 298Thu Jun 19 08:27:13 GMT 2014 Olly Betts <olly@survex.com> 299 300 * Backport change from trunk: 301 * docs/omegascript.rst: Put ``...`` around Xapian C++ class names. 302 303Thu Jun 19 08:10:17 GMT 2014 Olly Betts <olly@survex.com> 304 305 * Backport change from trunk: 306 * diritor.cc: Fix "applications/msword" to "application/msword" in the 307 fallback code for CDF files. 308 309Thu Jun 19 07:55:11 GMT 2014 Olly Betts <olly@survex.com> 310 311 * Backport change from trunk: 312 * diritor.cc: In fallback for CDF files, compare the extension 313 *without* leading dot. 314 315Thu Jun 19 07:02:12 GMT 2014 Olly Betts <olly@survex.com> 316 317 * Backport change from trunk: 318 * diritor.cc: libmagic can return a second string starting "Composite 319 Document File V2 Document" for the mime-type, so just look for that 320 prefix. And newer libmagic returns "application/CDFV2-corrupt" in 321 these cases, so handle that too. 322 323Fri May 23 13:43:33 GMT 2014 Olly Betts <olly@survex.com> 324 325 * Backport change from trunk: 326 * date.cc,scriptindex.cc: Pass std::string by const reference. 327 328Fri May 23 13:42:02 GMT 2014 Olly Betts <olly@survex.com> 329 330 * Backport change from trunk: 331 * query.cc: Removed unused inline function. 332 333Fri May 23 13:38:33 GMT 2014 Olly Betts <olly@survex.com> 334 335 * Backport change from trunk: 336 * omindex.cc: Report the size limit in the message when we skip a file 337 which exceeds it. 338 339Wed Apr 30 05:44:54 GMT 2014 Olly Betts <olly@survex.com> 340 341 * NEWS: Update. 342 343Mon Apr 14 10:28:51 GMT 2014 Olly Betts <olly@survex.com> 344 345 * Backport change from trunk: 346 * diritor.cc: Work around libmagic returning a MIME content-type of 347 "Composite Document File V2 Document, No summary info". 348 349Mon Apr 14 09:38:51 GMT 2014 Olly Betts <olly@survex.com> 350 351 * Backport change from trunk: 352 * expand.cc: Fix warning when built with GCC 4.7.2 using -Os. 353 354Fri Mar 14 12:12:59 GMT 2014 Olly Betts <olly@survex.com> 355 356 * Backport change from trunk: 357 * Makefile.am,docs/omegascript.rst,expand.cc,expand.h,omega.cc, 358 query.cc: Add support for setting the query expansion scheme to use. 359 360Tue Jan 28 23:50:11 GMT 2014 Olly Betts <olly@survex.com> 361 362 * NEWS,configure.ac: Update for 1.2.17. 363 364Sun Jan 05 22:11:39 GMT 2014 Olly Betts <olly@survex.com> 365 366 * NEWS: Update. 367 368Thu Dec 26 01:21:42 GMT 2013 Olly Betts <olly@survex.com> 369 370 * Backport change from trunk: 371 * Makefile.am,portability/mkdtemp.cc,portability/mkdtemp.h,omindex.cc: 372 Add header with prototype of mkdtemp() to avoid "no previous 373 declaration" warning on platforms which don't have mkdtemp() as 374 standard. 375 376Fri Dec 20 08:00:25 GMT 2013 Olly Betts <olly@survex.com> 377 378 * NEWS: Update. 379 380Fri Dec 20 07:04:32 GMT 2013 Olly Betts <olly@survex.com> 381 382 * Backport change from trunk: 383 * docs/overview.rst: Add Abiword as an example use of --filter, based 384 on patch from Frank J Bruzzaniti (fixes#383). Update unoconv 385 example to talk about LibreOffice instead of OpenOffice. 386 387Tue Dec 03 23:13:48 GMT 2013 Olly Betts <olly@survex.com> 388 389 * NEWS: Final update for 1.2.16. 390 391Tue Dec 03 07:33:40 GMT 2013 Olly Betts <olly@survex.com> 392 393 * Backport change from trunk: 394 * configure.ac: Define __MSVCRT_VERSION__ to 0x0601 on mingw so we get 395 __ftime64() defined in the headers. 396 397Sat Nov 30 20:36:36 GMT 2013 Olly Betts <olly@survex.com> 398 399 * Makefile.am: Need to ship common/safewinsock2.h now. 400 401Fri Nov 29 04:34:18 GMT 2013 Olly Betts <olly@survex.com> 402 403 * Backport change from trunk: 404 * configure.ac: Sync GCC checks with xapian-core. 405 406Fri Nov 29 01:09:01 GMT 2013 Olly Betts <olly@survex.com> 407 408 * configure.ac: Update for 1.2.16. 409 410Thu Oct 31 12:33:31 GMT 2013 Olly Betts <olly@survex.com> 411 412 * NEWS: Update. 413 414Sun Oct 13 23:25:11 GMT 2013 Olly Betts <olly@survex.com> 415 416 * Backport change from trunk: 417 * omindex.cc: Group-readable files which are owner-readable but not 418 world-readable should still get a "readable by owner" term added. 419 Reported by Emmanuel Garette. 420 421Tue Oct 08 01:06:45 GMT 2013 Olly Betts <olly@survex.com> 422 423 * NEWS: Update from ChangeLog. 424 425Sun Oct 06 08:17:57 GMT 2013 Olly Betts <olly@survex.com> 426 427 * Backport change from trunk: 428 * configure.ac: Sync compiler warning flag machinery against 429 xapian-core. The changes are special handling for clang, passing 430 -fshow-column where supported, and handling for new warning flags 431 in GCC 4.6 and 4.7. 432 433Sun Oct 06 08:13:56 GMT 2013 Olly Betts <olly@survex.com> 434 435 * Backport changes from trunk: 436 * xlsxparse.cc: Handle pre-defined numfmtid codes for dates. 437 * omindex.cc,xlsxparse.cc,xlsxparse.h: Fix detection of cells with a 438 date format to work with xlsx files other than my first example. 439 * omindex.cc,xlsxparse.cc,xlsxparse.h: Decode dates for xlsx files. 440 441Sun Oct 06 08:01:59 GMT 2013 Olly Betts <olly@survex.com> 442 443 * Backport change from trunk: 444 * omindex.cc,docs/overview.rst: Ignore 'adm', 'cur', and 'ico' by 445 default. 446 447Sat Oct 05 06:46:32 GMT 2013 Olly Betts <olly@survex.com> 448 449 * Backport change from trunk: 450 * weight.cc: Use "" not <> to include local header weight.h. 451 452Sat Oct 05 06:44:23 GMT 2013 Olly Betts <olly@survex.com> 453 454 * Backport change from trunk: 455 * configure.ac,xapian-omega.spec.in: Compress source tarballs with xz 456 instead of gzip. 457 458Sat Oct 05 06:40:57 GMT 2013 Olly Betts <olly@survex.com> 459 460 * Backport change from trunk: 461 * diritor.h: Mark DirectoryIterator ctor as 'explicit'. 462 463Sat Oct 05 06:37:08 GMT 2013 Olly Betts <olly@survex.com> 464 465 * Backport change from trunk: 466 * datematchdecider.h: Fix filename in comment at top of file. 467 468Sat Oct 05 06:35:19 GMT 2013 Olly Betts <olly@survex.com> 469 470 * Backport change from trunk: 471 * omindex.cc: Fix off-by-one when finding documents to delete which 472 would sometimes cause omindex to fail to delete documents from the 473 database when they weren't refound during an index update. 474 475Sat Oct 05 06:28:56 GMT 2013 Olly Betts <olly@survex.com> 476 477 * Backport change from trunk: 478 * Makefile.am,README,docs/Makefile.am: SVN -> git. 479 480Tue Apr 16 23:41:20 GMT 2013 Olly Betts <olly@survex.com> 481 482 * NEWS,configure.ac: Update for 1.2.15. 483 484Tue Apr 16 10:43:52 GMT 2013 Olly Betts <olly@survex.com> 485 486 * NEWS: Update from ChangeLog. 487 488Wed Mar 27 09:39:12 GMT 2013 Olly Betts <olly@survex.com> 489 490 * Backport change from trunk: 491 * Makefile.am: Don't link utf8convert.cc code into omega CGI. 492 493Thu Mar 14 22:43:50 GMT 2013 Olly Betts <olly@survex.com> 494 495 * NEWS,configure.ac: Update for 1.2.14. 496 497Thu Mar 14 12:34:29 GMT 2013 Olly Betts <olly@survex.com> 498 499 * Backport changes from trunk: 500 * docs/overview.rst: Update to add com to the list of ignored 501 extensions. 502 * omindex.cc: Ignore .com files by default. 503 504Thu Mar 07 09:41:25 GMT 2013 Olly Betts <olly@survex.com> 505 506 * NEWS: Update from ChangeLog. 507 508Sat Feb 16 08:20:36 GMT 2013 Olly Betts <olly@survex.com> 509 510 * Backport change from trunk: 511 * xlsxparse.cc: Correct "max" -> "min" when reserving space for shared 512 strings. This only means we now reserve a more appropriate amount 513 of space to start with. 514 515Wed Jan 09 11:15:03 GMT 2013 Olly Betts <olly@survex.com> 516 517 * NEWS,configure.ac: Update for 1.2.13. 518 519Wed Jan 09 10:10:18 GMT 2013 Olly Betts <olly@survex.com> 520 521 * NEWS: Format new entries. 522 523Tue Jan 08 11:09:38 GMT 2013 Olly Betts <olly@survex.com> 524 525 * Backport change from trunk: 526 * commonhelp.cc: Tweak wording about default to match other options 527 better. 528 529Tue Jan 08 10:58:58 GMT 2013 Olly Betts <olly@survex.com> 530 531 * Backport change from trunk: 532 * omindex.cc: Note default size limit on files to index is unlimited. 533 Update --help to reflect that --sample-size now accepts the same 534 formats as --max-size). 535 536Tue Jan 08 10:41:27 GMT 2013 Olly Betts <olly@survex.com> 537 538 * Backport change from trunk: 539 * omindex.cc: When generating a sample for a CSV file, limit the 540 reserved size to the CSV file size as sample_size could be set 541 really high by the user. 542 543Tue Jan 08 10:31:13 GMT 2013 Olly Betts <olly@survex.com> 544 545 * Backport change from trunk: 546 * diritor.cc: MAGIC_MIME_TYPE was added in 4.22, so note that in the 547 comment about its conditional use. 548 549Tue Jan 08 10:28:07 GMT 2013 Olly Betts <olly@survex.com> 550 551 * Backport change from trunk: 552 * Makefile.am: In automake, INCLUDES is now deprecated in favour of 553 AM_CPPFLAGS so update to use the latter. 554 555Tue Jan 08 10:14:47 GMT 2013 Olly Betts <olly@survex.com> 556 557 * Backport change from trunk: 558 * md5wrap.cc: If we get a read error while calculating the md5 checksum 559 of a file, fail rather than returning the checksum of the file up to 560 that point. 561 562Tue Jan 08 10:13:36 GMT 2013 Olly Betts <olly@survex.com> 563 564 * Backport change from trunk: 565 * omindex.cc: Calculate the md5 from the loaded file contents when 566 indexing SVG and Atom files. Use a const ref to avoid a string 567 copy of the file contents for HTML and uncompressed ABI word. 568 569Wed Nov 21 05:02:18 GMT 2012 Olly Betts <olly@survex.com> 570 571 * NEWS: Update from ChangeLog. 572 573Wed Nov 21 05:00:51 GMT 2012 Olly Betts <olly@survex.com> 574 575 * Backport change from trunk: 576 * omindex.cc: Fix omindex not to segfault when -F option without a ':' 577 is passed. 578 579Sat Nov 10 08:08:20 GMT 2012 Olly Betts <olly@survex.com> 580 581 * NEWS: Update from ChangeLog. 582 583Sat Oct 13 12:24:59 GMT 2012 Olly Betts <olly@survex.com> 584 585 * Backport change from trunk: 586 * Makefile.am,omindex.cc: Replace shell_protect() with 587 append_filename_argument() from common/append_filename_arg.h. 588 Extracting text using external filters now works for filenames 589 containing a newline character. 590 * common/: Update svn:externals to latest version of xapian-core 591 on 1.2 branch to get common/append_filename_arg.h. 592 593Wed Jul 18 10:51:39 GMT 2012 Olly Betts <olly@survex.com> 594 595 * Backport change from trunk: 596 * urldecode.h: Fix to decode escaped character at the end of the 597 string. 598 * urlenctest.cc: Add regression testcase. 599 600Wed Jun 27 12:15:37 GMT 2012 Olly Betts <olly@survex.com> 601 602 * NEWS,configure.ac: Update for 1.2.12. 603 604Tue Jun 26 12:06:15 GMT 2012 Olly Betts <olly@survex.com> 605 606 * NEWS,configure.ac: Update for 1.2.11. 607 608Tue Jun 26 12:04:16 GMT 2012 Olly Betts <olly@survex.com> 609 610 * Backport change from trunk: 611 * configure.ac: Set link_all_deplibs_CXX=no on solaris, like we 612 already do for xapian-core. 613 614Tue Jun 26 08:48:58 GMT 2012 Olly Betts <olly@survex.com> 615 616 * NEWS: Update from ChangeLog. 617 618Fri Jun 22 12:16:30 GMT 2012 Olly Betts <olly@survex.com> 619 620 * Backport changes from trunk: 621 * xlsxparse.cc: Check for "uniquecount" parameter, not "unqiueCount" as 622 we normalise parameter names to lower case. 623 * omindex.cc: unzip extracts files in the order they are in the 624 archive, not the order they are on the command line, so call unzip 625 twice when the order of extraction matters. 626 * Makefile.am,omindex.cc,opendocparse.cc,opendocparse.h,xmlparse.cc: 627 Improve handling of headers and footers on OpenDocument documents. 628 629Fri Jun 22 12:12:56 GMT 2012 Olly Betts <olly@survex.com> 630 631 * Backport change from trunk: 632 * omindex.cc: Tweak previous change. 633 634Fri Jun 22 12:10:34 GMT 2012 Olly Betts <olly@survex.com> 635 636 * Backport changes from trunk: 637 * omindex.cc,xlsxparse.cc,xlsxparse.h: Rework .xlsx parsing to 638 substitute the shared strings into the positions they are used 639 in, so that the sample actually matches what appears in the 640 spreadsheet. 641 * xlsxparse.cc,xlsxparse.h: Subclass XlsxParser directly from 642 HtmlParser. 643 * Makefile.am,omindex.cc,xlsxparse.cc,xlsxparse.h: Index calculated 644 numbers from .xlsx files. 645 646Fri Jun 22 12:03:06 GMT 2012 Olly Betts <olly@survex.com> 647 648 * Backport change from trunk: 649 * omindex.cc: pdftotext outputs a formfeed between each page, which 650 messes up our "empty body" check, so trim any trailing formfeeds 651 before the check. 652 * omindex.cc: Fix the "trim trailing formfeeds" code not to remove one 653 character too many. 654 * omindex.cc: Properly fix the "trim trailing formfeeds" code. 655 656Sat Jun 09 11:50:16 GMT 2012 Olly Betts <olly@survex.com> 657 658 * Backport change from trunk: 659 * Cherry pick changes from Mihai Bivol's GSoC snippets branch: 660 * omindex.cc: Add option for the document sample size. 661 * omindex.cc: Add short option for sample-size 662 * omindex.cc: Make sample-size consistent with max-size 663 664Fri Jun 08 12:48:47 GMT 2012 Olly Betts <olly@survex.com> 665 666 * NEWS: Update from ChangeLog. 667 668Mon May 21 07:28:26 GMT 2012 Olly Betts <olly@survex.com> 669 670 * Backport change from trunk: 671 * htmlparsetest.cc,myhtmlparse.cc,myhtmlparse.h: Change parsing of 672 multiple <body> tags and text outside of <body> to match the 673 behaviour if modern web browsers. (ticket#599) 674 675Mon May 21 06:34:43 GMT 2012 Olly Betts <olly@survex.com> 676 677 * Backport change from trunk: 678 * configure.ac: Set link_all_deplibs_CXX=no on freebsd and openbsd, 679 like we already do for xapian-core. 680 681Wed May 09 10:43:41 GMT 2012 Olly Betts <olly@survex.com> 682 683 * NEWS,configure.ac: Update for 1.2.10. 684 685Wed May 09 08:19:21 GMT 2012 Olly Betts <olly@survex.com> 686 687 * NEWS: Update from ChangeLog. 688 689Wed May 09 01:20:25 GMT 2012 Olly Betts <olly@survex.com> 690 691 * Backport change from trunk: 692 * runfilter.cc: Add cast to rlim_t, required for C++11 compatibility 693 according to new error from GCC 4.7 (reported by Gaurav Arora). 694 695Tue May 08 05:23:11 GMT 2012 Olly Betts <olly@survex.com> 696 697 * Backport changes from trunk: 698 * atomparse.cc: For type="html", use the charset of the XML rather 699 than utf-8. 700 * Makefile.am,atomparse.cc,atomparse.h,overview.rst,omindex.cc: Add 701 support for atom feed files, patch from Mihai Bivol in ticket#595. 702 * Makefile.am,atomparsetest.cc: Add tests for AtomParser. 703 704Tue May 08 04:58:57 GMT 2012 Olly Betts <olly@survex.com> 705 706 * Backport change from trunk: 707 * htmlparse.cc,htmlparsetest.cc: Add support for CDATA to HTML parser. 708 709Tue May 08 04:48:36 GMT 2012 Olly Betts <olly@survex.com> 710 711 * Backport change from trunk: 712 * omindex.cc: Add --max-size option, based on patch from ndaley in 713 ticket#587. 714 715Wed Mar 21 11:21:08 GMT 2012 Olly Betts <olly@survex.com> 716 717 * NEWS: Update from ChangeLog. 718 719Tue Mar 13 10:16:19 GMT 2012 Olly Betts <olly@survex.com> 720 721 * Backport change from trunk: 722 * omindex.cc: If the document with the highest existing docid was 723 updated, we'd previously report it as "added", but now we correctly 724 report it as "updated". 725 726Tue Mar 13 10:12:10 GMT 2012 Olly Betts <olly@survex.com> 727 728 * Backport change from trunk: 729 * omindex.cc: Catch and report std::exception. 730 731Thu Mar 08 02:08:53 GMT 2012 Olly Betts <olly@survex.com> 732 733 * NEWS,configure.ac: Update for 1.2.9. 734 735Tue Mar 06 11:13:03 GMT 2012 Olly Betts <olly@survex.com> 736 737 * Backport changes from trunk: 738 * docs/overview.rst,omindex.cc: More extensions to ignore by default: 739 fon pyd ttf 740 * docs/overview.rst: Wrap over-long line. 741 * docs/overview.rst,omindex.cc: Add more extensions to the default 742 ignore list: bin dat db jar lnk pyc pyo sqlite sqlite3 sqlite-journal 743 tmp 744 745Tue Mar 06 11:10:59 GMT 2012 Olly Betts <olly@survex.com> 746 747 * Backport change from trunk: 748 * docs/overview.rst,htmlparse.cc,htmlparsetest.cc: Add support for 749 ignoring sections bracketed by <!--UdmComment--> and 750 <!--/UdmComment--> like we already do for <!--htdig_noindex-->. 751 Patch from Raphael Geissert. 752 753Wed Feb 29 10:58:43 GMT 2012 Olly Betts <olly@survex.com> 754 755 * NEWS: Update from ChangeLog. 756 757Mon Jan 23 12:48:00 GMT 2012 Olly Betts <olly@survex.com> 758 759 * Backport change from trunk: 760 * docs/overview.rst: Document that libmagic is used to determine 761 the MIME type if the extension isn't known. Partly addresses 762 ticket#569. 763 764Mon Jan 23 12:44:36 GMT 2012 Olly Betts <olly@survex.com> 765 766 * Backport change from trunk: 767 * docs/overview.rst: We now limit time as well as CPU and memory for 768 external filters. 769 770Tue Dec 13 12:42:10 GMT 2011 Olly Betts <olly@survex.com> 771 772 * NEWS: Final update for 1.2.8. 773 774Tue Dec 13 12:38:04 GMT 2011 Olly Betts <olly@survex.com> 775 776 * Backport change from trunk: 777 * docs/omegascript.rst,query.cc,templates/emptydocs,templates/godmode, 778 templates/query,urldecode.h,urlenctest.cc: Add new $prettyurl{} 779 command which undoes RFC3986 URL escaping which doesn't affect 780 semantics in practice. Partly addresses ticket#550. 781 782Tue Dec 13 11:19:56 GMT 2011 Olly Betts <olly@survex.com> 783 784 * NEWS,configure.ac: Update for 1.2.8. 785 786Thu Dec 08 08:27:28 GMT 2011 Olly Betts <olly@survex.com> 787 788 * Backport change from trunk: 789 * omindex.cc: Improve --help output (and man page which is generated 790 from it). Closes bug#572. 791 792Thu Dec 08 04:53:40 GMT 2011 Olly Betts <olly@survex.com> 793 794 * Backport change from trunk: 795 * Makefile.am: Ship new header urldecode.h. 796 797Thu Dec 08 04:46:18 GMT 2011 Olly Betts <olly@survex.com> 798 799 * Backport change from trunk: 800 * Makefile.am,cgiparam.cc,urldecode.h,urlenctest.cc: Add new 801 implementation of URL decoding - the old one didn't handle 802 various corner cases well, and had two cut and pasted variants 803 for handling a input from a C string (GET) or from stdin (POST). 804 Also add a new unit test program to test URL encoding and decoding. 805 Fixes bug#578. 806 807Tue Dec 06 13:30:58 GMT 2011 Olly Betts <olly@survex.com> 808 809 * Backport change from trunk: 810 * scriptindex.cc: If no rules are found in the index script, report an 811 error and give up - this is inevitably the result of a mistake, and 812 adding empty documents to the database isn't helpful. 813 814Tue Dec 06 13:28:32 GMT 2011 Olly Betts <olly@survex.com> 815 816 * Backport change from trunk: 817 * docs/omegascript.rst: Add note to discourage use of percentage 818 scores. 819 * templates/query: Don't show the percentage score in the default 820 template. 821 822Tue Dec 06 13:26:30 GMT 2011 Olly Betts <olly@survex.com> 823 824 * Backport change from trunk: 825 * configure.ac,runfilter.cc: If we don't get any data from a filter 826 for 5 minutes, give up - it has probably ended up blocked 827 indefinitely. 828 829Tue Dec 06 12:51:24 GMT 2011 Olly Betts <olly@survex.com> 830 831 * Backport changes from trunk: 832 * templates/query: HTML escape topterms. 833 * templates/godmode: HTML escape the contents of document values. 834 835Tue Dec 06 12:48:24 GMT 2011 Olly Betts <olly@survex.com> 836 837 * Backport change from trunk: 838 * scriptindex.cc: MyHtmlParser::parse_html() no longer throws bool to 839 stop parsing early, so we no longer need to catch it. 840 841Tue Dec 06 12:45:48 GMT 2011 Olly Betts <olly@survex.com> 842 843 * Backport change from trunk: 844 * configure.ac: Sync changes from xapian-core: Don't pass -Wshadow for 845 GCC < 4.1; don't pass -Wstrict-null-sentinel for GCC 4.0.x; only 846 enable symbol visibility on platforms where it is supported; remove 847 now superfluous check for GCC >= 3. Also, add FIXME for enabling 848 -Woverloaded-virtual. 849 850Tue Dec 06 11:57:58 GMT 2011 Olly Betts <olly@survex.com> 851 852 * Backport change from trunk: 853 * scriptindex.cc: Add link to 854 http://xapian.org/docs/omega/scriptindex.html to --help output (and 855 so also to the man page which is generated from this). 856 857Tue Dec 06 11:30:42 GMT 2011 Olly Betts <olly@survex.com> 858 859 * Backport change from trunk: 860 * omega.cc: If P had trailing spaces, we would remove all but one - 861 fixed to remove all of them! 862 863Thu Sep 01 12:31:59 GMT 2011 Olly Betts <olly@survex.com> 864 865 * Backport change from trunk: 866 * xapian-omega.spec.in: Package outlookmsg2html helper. 867 868Wed Aug 10 06:13:41 GMT 2011 Olly Betts <olly@survex.com> 869 870 * NEWS: Finalise 1.2.7. 871 872Tue Aug 09 09:00:12 GMT 2011 Olly Betts <olly@survex.com> 873 874 * NEWS,configure.ac: Initial update for 1.2.7. 875 876Thu Jun 23 01:01:25 GMT 2011 Olly Betts <olly@survex.com> 877 878 * Backport change from trunk: 879 * query.cc: Improve $version output from "Xapian - xapian-omega 1.2.6" 880 to "xapian-omega 1.2.6". 881 * docs/omegascript.rst: Update example to match (and use less ancient 882 version!) 883 884Thu Jun 23 00:38:18 GMT 2011 Olly Betts <olly@survex.com> 885 886 * Backport change from trunk: 887 * dbi2omega: Remove uninteresting reference to 0.9.4. 888 889Thu Jun 23 00:29:28 GMT 2011 Olly Betts <olly@survex.com> 890 891 * Backport change from trunk: 892 * hashterm.cc: Avoid unnecessary temporary string object. 893 894Thu Jun 23 00:26:13 GMT 2011 Olly Betts <olly@survex.com> 895 896 * Backport change from trunk: 897 * hashterm.cc: Fix comment typo. 898 899Wed Jun 22 16:59:05 GMT 2011 Olly Betts <olly@survex.com> 900 901 * Backport change from trunk: 902 * xapian-omega.spec.in: We're ABI compatible within a release series 903 so make dependency on xapian-core-libs >= rather than =. 904 905Wed Jun 22 16:55:10 GMT 2011 Olly Betts <olly@survex.com> 906 907 * Backport change from trunk: 908 * scriptindex.cc: Avoid unnecessary temporary string object. 909 910Mon Jun 13 08:00:16 GMT 2011 Olly Betts <olly@survex.com> 911 912 * Backport change from trunk: 913 * docs/termprefixes.rst: Expand to document mapping a user prefix to 914 multiple term prefixes. 915 916Mon Jun 13 03:23:47 GMT 2011 Olly Betts <olly@survex.com> 917 918 * docs/overview.rst: Improve documentation of htdig_noindex. 919 920Sun Jun 12 11:52:29 GMT 2011 Olly Betts <olly@survex.com> 921 922 * NEWS: Final update for 1.2.6. 923 924Fri Jun 10 12:02:32 GMT 2011 Olly Betts <olly@survex.com> 925 926 * NEWS,configure.ac: Update in preparation for 1.2.6. 927 928Fri Jun 10 03:28:33 GMT 2011 Olly Betts <olly@survex.com> 929 930 * templates/inc/anyallexactradio: Remove unused duplicate of 931 anyallradio. 932 933Fri Jun 10 03:21:25 GMT 2011 Olly Betts <olly@survex.com> 934 935 * configure.ac,omindex-config.cc,omindex-config.html: Strip out partly 936 written and long untouched omindex-config utility. 937 938Thu Jun 09 14:20:46 GMT 2011 Olly Betts <olly@survex.com> 939 940 * weight.cc: Fix a compiler warning (I failed to note the compiler 941 unfortunately). 942 943Sun May 29 13:00:26 GMT 2011 Olly Betts <olly@survex.com> 944 945 * templates/query: Make search query input type=search. 946 947Sun May 29 12:24:43 GMT 2011 Olly Betts <olly@survex.com> 948 949 * templates/query: Autofocus the search query input (using HTML 950 autofocus attribute with Javascript fallback for older browsers). 951 (ticket#544) 952 953Wed May 25 14:33:18 GMT 2011 Olly Betts <olly@survex.com> 954 955 * docs/omegascript.rst: Correct the documentation of the colours used by 956 $highlight{}. 957 958Fri May 13 05:50:35 GMT 2011 Olly Betts <olly@survex.com> 959 960 * docs/overview.rst: Add using unoconv as more complex example of 961 using --filter (ticket#324). 962 963Wed Apr 20 07:00:56 GMT 2011 Olly Betts <olly@survex.com> 964 965 * NEWS: Fix typo; clarify wording. 966 967Mon Apr 04 13:58:06 GMT 2011 Olly Betts <olly@survex.com> 968 969 * NEWS: Update release date. 970 971Mon Apr 04 13:53:34 GMT 2011 Olly Betts <olly@survex.com> 972 973 * templates/xml: Fix syntax error from recent edit. 974 975Sun Apr 03 10:54:04 GMT 2011 Olly Betts <olly@survex.com> 976 977 * NEWS,configure.ac: Update for 1.2.5. 978 979Sat Apr 02 14:15:32 GMT 2011 Olly Betts <olly@survex.com> 980 981 * templates/query: Use $add{$field{modtime}} to ensure it is numeric. 982 983Sat Apr 02 14:14:06 GMT 2011 Olly Betts <olly@survex.com> 984 985 * templates/godmode: More missing escaping. 986 987Sat Apr 02 14:07:45 GMT 2011 Olly Betts <olly@survex.com> 988 989 * templates/xml: Remove double escaping. 990 991Sat Apr 02 13:58:44 GMT 2011 Olly Betts <olly@survex.com> 992 993 * templates/query: More escaping fixes. 994 995Sat Apr 02 13:55:03 GMT 2011 Olly Betts <olly@survex.com> 996 997 * templates/emptydocs,templates/opensearch,templates/xml: More missing 998 escaping. 999 1000Sat Apr 02 12:34:42 GMT 2011 Olly Betts <olly@survex.com> 1001 1002 * templates/query: Add missing escaping. 1003 1004Sat Apr 02 11:48:43 GMT 2011 Olly Betts <olly@survex.com> 1005 1006 * templates/godmode: Add missing escaping. 1007 1008Sat Apr 02 10:34:58 GMT 2011 Olly Betts <olly@survex.com> 1009 1010 * templates/xml: Remove support for undocumented HILITECLASS CGI 1011 variable. There's no evidence I can find using Google code search 1012 or web search that this has been used anywhere, and it's problematic 1013 to escape properly. 1014 1015Sat Mar 26 14:51:36 GMT 2011 Olly Betts <olly@survex.com> 1016 1017 * INSTALL: Copy new Multi-Arch section from xapian-core/INSTALL. 1018 Replace VPATH section with better equivalent from 1019 xapian-core/INSTALL. 1020 1021Wed Mar 23 15:21:41 GMT 2011 Olly Betts <olly@survex.com> 1022 1023 * htmlparse.cc,htmlparse.h,htmlparsetest.cc,metaxmlparse.cc, 1024 metaxmlparse.h,myhtmlparse.cc,myhtmlparse.h,omindex.cc,svgparse.cc, 1025 svgparse.h,xmlparse.cc,xmlparse.h,xpsxmlparse.cc,xpsxmlparse.h: 1026 Instead of throwing a bool to abandon parsing, change methods to 1027 return bool to signify if they want to continue parsing or not. 1028 This is a bit faster (~0.23% for indexing a lot of HTML files). 1029 1030Mon Mar 21 05:48:08 GMT 2011 Olly Betts <olly@survex.com> 1031 1032 * myhtmlparse.cc,myhtmlparse.h,omindex.cc: Add --ignore-exclusions 1033 option, which will index HTML files despite meta robots tags, etc - 1034 omindex is often used in environments where such exclusions aren't 1035 relevant. 1036 1037Fri Mar 18 10:24:58 GMT 2011 Olly Betts <olly@survex.com> 1038 1039 * omindex.cc: Just report the mimetype as unknown instead of saying 1040 "unknown Office 2007 MIME subtype". 1041 1042Fri Mar 18 05:53:21 GMT 2011 Olly Betts <olly@survex.com> 1043 1044 * diritor.h: Avoid using S_IRUSR, etc under __WIN32__. 1045 1046Fri Mar 18 03:00:16 GMT 2011 Olly Betts <olly@survex.com> 1047 1048 * docs/overview.rst,omindex.cc: Ignore *.css and *.js by default too. 1049 1050Thu Mar 17 23:34:07 GMT 2011 Olly Betts <olly@survex.com> 1051 1052 * omindex.cc: For skip messages which are only to be shown in verbose 1053 mode, call skip with new SKIP_VERBOSE_ONLY flag. Pass new 1054 SKIP_SHOW_FILENAME flag for skip messages shown before we say what 1055 file we are indexing so we know to show the filename even in verbose 1056 mode. 1057 1058Thu Mar 17 03:47:54 GMT 2011 Olly Betts <olly@survex.com> 1059 1060 * omindex.cc: Restore handling of exceptions from 1061 DirectoryIterator::get_type(), and handle exceptions from 1062 DirectoryIterator::next() which ended up at the top level 1063 before (though they probably never happen, at least on Linux). 1064 1065Wed Mar 16 06:19:01 GMT 2011 Olly Betts <olly@survex.com> 1066 1067 * omindex.cc: Push all the code associated with indexing a file into 1068 index_file(). 1069 1070Wed Mar 16 02:55:53 GMT 2011 Olly Betts <olly@survex.com> 1071 1072 * omindex.cc: Push try block around index_file() call into the 1073 function. 1074 1075Wed Mar 16 02:51:52 GMT 2011 Olly Betts <olly@survex.com> 1076 1077 * omindex.cc: Factor out handling for skipping files, and improve 1078 these messages by consistently reporting the filename. 1079 1080Tue Mar 15 12:47:12 GMT 2011 Olly Betts <olly@survex.com> 1081 1082 * docs/Makefile.am,docs/index.rst: Add index page which links to all 1083 the other documentation pages. 1084 1085Tue Mar 15 12:20:30 GMT 2011 Olly Betts <olly@survex.com> 1086 1087 * omindex.cc: Add --empty-docs option to allow documents we extract 1088 no body text from to be indexed (existing behaviour), skipped, or 1089 reported and then indexed. 1090 1091Fri Mar 04 14:13:47 GMT 2011 Olly Betts <olly@survex.com> 1092 1093 * docs/omegascript.rst: Minor improvements. 1094 1095Wed Mar 02 11:17:42 GMT 2011 Olly Betts <olly@survex.com> 1096 1097 * NEWS: Update. 1098 1099Wed Mar 02 06:14:41 GMT 2011 Olly Betts <olly@survex.com> 1100 1101 * docs/termprefixes.rst: New standard prefix E for filename extension. 1102 * omindex.cc: Index file extension as E-prefixed term. 1103 1104Mon Feb 28 13:45:32 GMT 2011 Olly Betts <olly@survex.com> 1105 1106 * omindex.cc: Tell xls2csv not to quote fields and to put spaces 1107 not commas between them. Fixes indexing of numeric fields, and 1108 means we don't need to use our CSV parser to get a sample. 1109 1110Mon Feb 28 12:10:53 GMT 2011 Olly Betts <olly@survex.com> 1111 1112 * xmlparse.cc: Add whitespace between chunks of text extracted from 1113 Microsoft Office 2007 formats. 1114 1115Wed Feb 23 12:34:28 GMT 2011 Olly Betts <olly@survex.com> 1116 1117 * templates/xml: Try $field{caption} (which is what omindex sets) 1118 before $field{title} when getting a value for the hit tag's title 1119 attribute - this is consistent with how the query template gets the 1120 title. Add new type attribute which gives $field{type}. 1121 1122Thu Feb 17 05:19:28 GMT 2011 Olly Betts <olly@survex.com> 1123 1124 * templates/xml: Add DBSize attribute to <result> element. 1125 1126Wed Feb 16 03:19:57 GMT 2011 Olly Betts <olly@survex.com> 1127 1128 * Makefile.am,omindex.cc,query.cc,urlencode.cc,urlencode.h: Update 1129 URL encoding to follow RFC3986. 1130 1131Tue Feb 15 03:20:40 GMT 2011 Olly Betts <olly@survex.com> 1132 1133 * omindex.cc: Encode reserved characters in URLs - now links to 1134 files with names containing '#' and '?' will work. 1135 1136Sun Jan 23 13:27:48 GMT 2011 Olly Betts <olly@survex.com> 1137 1138 * docs/overview.rst,omindex.cc: Later Microsoft Works version produce 1139 .xlr spreadsheet files, which are apparently XL files with a 1140 different extension, so handle them as XL files. 1141 1142Thu Jan 20 11:07:46 GMT 2011 Olly Betts <olly@survex.com> 1143 1144 * docs/omegascript.rst,omega.cc,query.cc,templates/query: Allow 1145 QueryParser flags to be set from OmegaScript (ticket#418). 1146 1147Sat Jan 15 11:14:32 GMT 2011 Olly Betts <olly@survex.com> 1148 1149 * NEWS: Update from ChangeLog, 1.0.22 and 1.0.23. 1150 1151Wed Jan 12 02:21:59 GMT 2011 Olly Betts <olly@survex.com> 1152 1153 * query.cc: Fix double Content-Type header in some error reporting 1154 situations (regression introduced in 1.2.4). 1155 1156Mon Jan 10 10:00:00 GMT 2011 Olly Betts <olly@survex.com> 1157 1158 * omindex.cc,pkglibbindir.cc,pkglibbindir.h: Fix typo in function name 1159 (get_pkglibdindir() -> get_pkglibbindir()). 1160 1161Mon Jan 10 09:50:38 GMT 2011 Olly Betts <olly@survex.com> 1162 1163 * diritor.cc,diritor.h: Don't define or try to set euid member of 1164 DirectoryIterator on platforms where we aren't going to use it. 1165 1166Mon Jan 10 09:15:24 GMT 2011 Olly Betts <olly@survex.com> 1167 1168 * diritor.h: Stub out get_owner() and get_group() for __WIN32__. 1169 1170Fri Dec 24 10:35:29 GMT 2010 Olly Betts <olly@survex.com> 1171 1172 * NEWS: Update from ChangeLog. 1173 1174Thu Dec 23 01:53:06 GMT 2010 Olly Betts <olly@survex.com> 1175 1176 * diritor.cc: Fix to work with older libmagic which doesn't have 1177 MAGIC_MIME_TYPE (e.g. on Ubuntu hardy). 1178 1179Sun Dec 19 12:39:23 GMT 2010 Olly Betts <olly@survex.com> 1180 1181 * NEWS,configure.ac: 1.2.4. 1182 1183Sun Dec 19 12:37:58 GMT 2010 Olly Betts <olly@survex.com> 1184 1185 * query.cc: Disable permission filtering based on $REMOTE_USER as that 1186 will break some existing installations if users upgrade, which we 1187 don't want. Probably this should be specifiable from OmegaScript 1188 but it's not worth delaying 1.2.4 while we sort this out. 1189 1190Sun Dec 19 02:46:17 GMT 2010 Olly Betts <olly@survex.com> 1191 1192 * docs/overview.rst,omindex.cc: Change the new name for 1193 "--preserve-unupdated" from "--preserve-removed" to "--no-delete". 1194 1195Sun Dec 19 02:32:29 GMT 2010 Olly Betts <olly@survex.com> 1196 1197 * query.cc: Fix comment typo. 1198 1199Fri Dec 17 12:45:47 GMT 2010 Olly Betts <olly@survex.com> 1200 1201 * commonhelp.cc,commonhelp.h,omindex.cc,scriptindex.cc: Swap the 1202 meanings of -v and -V in omindex for consistency with scriptindex 1203 and typical short options for --verbose and --version in other 1204 packages. For backward compatibility, "omindex -v" is handled 1205 specially and still reports the version. 1206 1207Fri Dec 17 08:31:29 GMT 2010 Olly Betts <olly@survex.com> 1208 1209 * utf8convert.cc: Fix built in converter to handle space in charset 1210 names, which fixes failing utf8converttest when iconv isn't 1211 available. 1212 1213Fri Dec 17 05:36:36 GMT 2010 Olly Betts <olly@survex.com> 1214 1215 * utf8convert.cc: Rework the fixing up of charset names which iconv() 1216 doesn't understand a little. 1217 1218Thu Dec 16 06:35:46 GMT 2010 Olly Betts <olly@survex.com> 1219 1220 * loadfile.cc: If fstat() fails, preserve the errno value rather than 1221 letting close() clobber it. 1222 1223Thu Dec 16 06:31:30 GMT 2010 Olly Betts <olly@survex.com> 1224 1225 * loadfile.cc: Fix file descriptor leak if load_file() is called on 1226 something which isn't a file (found by cppcheck run on the Debian 1227 archive). This case probably couldn't occur in omindex, but could if 1228 you used the LOADFILE action in scriptindex. 1229 1230Thu Dec 09 10:58:48 GMT 2010 Olly Betts <olly@survex.com> 1231 1232 * docs/omegascript.rst: Replace $simplecommand with $query - a concrete 1233 example is more useful. Improve mark-up. 1234 * docs/termprefixes.rst: Remove mention of pre-0.9.7 use of W prefix. 1235 1236Thu Nov 18 12:25:50 GMT 2010 Olly Betts <olly@survex.com> 1237 1238 * omega.cc: Fix reversed condition in recent exception reporting fix. 1239 1240Wed Nov 17 03:46:24 GMT 2010 Olly Betts <olly@survex.com> 1241 1242 * diritor.cc: Add missing magic_cookie argument to calls to 1243 magic_error(). 1244 1245Sat Nov 13 12:17:51 GMT 2010 Olly Betts <olly@survex.com> 1246 1247 * omindex.cc: Build up document data with += for efficiency. 1248 1249Sat Nov 13 12:08:09 GMT 2010 Olly Betts <olly@survex.com> 1250 1251 * omindex.cc: Index author with A prefix. 1252 1253Sat Nov 13 12:00:50 GMT 2010 Olly Betts <olly@survex.com> 1254 1255 * omindex.cc: A file extension can't contain a '/'. 1256 1257Sat Nov 13 11:50:31 GMT 2010 Olly Betts <olly@survex.com> 1258 1259 * omindex.cc: Index the leafname of the file (without any extension) as 1260 if it contained additional keywords. 1261 1262Sat Nov 13 11:32:09 GMT 2010 Olly Betts <olly@survex.com> 1263 1264 * omindex.cc: If a filter command isn't installed, flag this in the 1265 commands map so we don't try running this command again for any 1266 file with the same mimetype (previously we'd rerun it for a different 1267 extension which gave the same mimetype). 1268 1269Fri Nov 12 09:11:35 GMT 2010 Olly Betts <olly@survex.com> 1270 1271 * Makefile.am,configure.ac: Add -no-undefined to AM_LDFLAGS on 1272 platforms which need it to dynamically link such as cygwin (need 1273 to do this taken from ticket#282). 1274 1275Fri Nov 12 03:35:56 GMT 2010 Olly Betts <olly@survex.com> 1276 1277 * omindex.cc: Report MIME type if it's unknown to us. Remove debug 1278 output line. Update comments. 1279 1280Fri Nov 12 03:32:27 GMT 2010 Olly Betts <olly@survex.com> 1281 1282 * diritor.cc: Report errors from libmagic. 1283 1284Fri Nov 12 02:58:20 GMT 2010 Olly Betts <olly@survex.com> 1285 1286 * diritor.cc,diritor.h: Fix to compile when libmagic is detected. 1287 1288Fri Nov 12 01:40:24 GMT 2010 Olly Betts <olly@survex.com> 1289 1290 * diritor.cc: Add missing class qualifier to method definition. 1291 1292Fri Nov 12 01:25:11 GMT 2010 Olly Betts <olly@survex.com> 1293 1294 * INSTALL: Mention libmagic in install instructions. 1295 1296Fri Nov 12 01:16:21 GMT 2010 Olly Betts <olly@survex.com> 1297 1298 * Makefile.am,configure.ac,diritor.cc,diritor.h,omindex.cc: Optionally 1299 use libmagic to detect MIME types for files for which we have no 1300 extension mapping, which allows us to handle files with a misleading 1301 extension, and files with no extension. (ticket#114) 1302 1303Thu Nov 11 23:23:07 GMT 2010 Olly Betts <olly@survex.com> 1304 1305 * omindex.cc: Refactor slightly to handle the unknown extension case 1306 up front, so we lose an indentation level for the known extension 1307 case. 1308 1309Thu Nov 11 12:25:03 GMT 2010 Olly Betts <olly@survex.com> 1310 1311 * omindex.cc: Add new --filter option to allow the user to specify 1312 new filters without patching omindex.cc. 1313 * docs/overview.rst: Document --filter. 1314 1315Thu Nov 11 02:51:55 GMT 2010 Olly Betts <olly@survex.com> 1316 1317 * omindex.cc: Factor out handling for external filter programs which 1318 simply return UTF-8 text on stdout. 1319 1320Mon Nov 08 10:58:46 GMT 2010 Olly Betts <olly@survex.com> 1321 1322 * omindex.cc,svgparse.cc,svgparse.h: Extract author for SVG files. 1323 1324Mon Nov 08 10:40:09 GMT 2010 Olly Betts <olly@survex.com> 1325 1326 * omindex.cc: Extract metadata from Microsoft Office 2007 file formats. 1327 1328Mon Nov 08 10:21:13 GMT 2010 Olly Betts <olly@survex.com> 1329 1330 * myhtmlparse.cc,myhtmlparse.h,omindex.cc: Extract author from HTML 1331 documents. 1332 1333Mon Nov 08 09:46:03 GMT 2010 Olly Betts <olly@survex.com> 1334 1335 * omindex.cc: Escape wildcard patterns being passed to unzip - in the 1336 unlikely event that one of these matched files in or under the 1337 current directory, we might fail to extract all the files we wanted 1338 to. 1339 1340Mon Nov 08 05:03:41 GMT 2010 Olly Betts <olly@survex.com> 1341 1342 * metaxmlparse.cc,metaxmlparse.h,omindex.cc: Extract author from 1343 OpenDocument documents. 1344 1345Mon Nov 08 03:18:26 GMT 2010 Olly Betts <olly@survex.com> 1346 1347 * omindex.cc: Extract author from PDF metadata. 1348 1349Mon Nov 08 03:15:17 GMT 2010 Olly Betts <olly@survex.com> 1350 1351 * metaxmlparse.h: Initialise field member variable. 1352 1353Mon Nov 08 00:28:07 GMT 2010 Olly Betts <olly@survex.com> 1354 1355 * omindex.cc: Index text in headers and footers for .odt and .docx 1356 files. 1357 1358Thu Nov 04 11:55:58 GMT 2010 Olly Betts <olly@survex.com> 1359 1360 * omega.cc,omega.h,query.cc: If we catch an error early on, make sure 1361 that if it's appropriate, we write out a "Content-Type:" HTTP header 1362 and end the headers. 1363 1364Thu Nov 04 11:39:10 GMT 2010 Olly Betts <olly@survex.com> 1365 1366 * utf8converttest.cc: Add back in testcases for charset names with 1367 hyphens in. 1368 1369Thu Nov 04 09:01:43 GMT 2010 Olly Betts <olly@survex.com> 1370 1371 * utils.cc: Fix misuse of BUFSIZE which should be sizeof(buf) (issue 1372 reported by compilation with CPPFLAGS=-D_GLIBCXX_DEBUG). 1373 1374Thu Nov 04 09:01:08 GMT 2010 Richard Boulton <richard@tartarus.org> 1375 1376 * utf8convert.cc,utf8converttest.cc: If iconv can't handle a 1377 charset, check if it's of the form (UTF|UCS)[_ ]?.* and if so, 1378 convert to the official hypenated form. Should fix failure of 1379 utf8converttest on OSX, where it fails due to iconv not 1380 supporting "UTF16". 1381 1382Tue Nov 02 09:48:19 GMT 2010 Olly Betts <olly@survex.com> 1383 1384 * diritor.cc,diritor.h,loadfile.cc,loadfile.h,md5wrap.cc,md5wrap.h, 1385 omindex.cc,scriptindex.cc: Use O_NOATIME if available and either the 1386 file is owned by the current euid, or the current euid is 0 (i.e. 1387 we're running as root). Fixes ticket#222. 1388 1389Fri Oct 29 14:26:25 GMT 2010 Olly Betts <olly@survex.com> 1390 1391 * omindex.cc: Use the CSV parser to generate a nicer sample for files 1392 of type application/vnd.ms-excel. 1393 1394Fri Oct 29 09:26:52 GMT 2010 Olly Betts <olly@survex.com> 1395 1396 * Makefile.am: Put $(PCRE_LIBS) in libtransform_la_LIBADD rather than 1397 omega_LDADD (more correct, but probably doesn't actually make any 1398 difference). 1399 1400Thu Oct 28 14:46:11 GMT 2010 Olly Betts <olly@survex.com> 1401 1402 * omindex.cc: Disable more output unless --verbose is specified. Don't 1403 flush the "Indexing" partial message until we get to the potentially 1404 time consuming actions. 1405 1406Thu Oct 28 13:54:44 GMT 2010 Olly Betts <olly@survex.com> 1407 1408 * docs/overview.rst: Improve mark-up, and tweak wording in a few 1409 places. 1410 1411Thu Oct 28 13:46:36 GMT 2010 Olly Betts <olly@survex.com> 1412 1413 * docs/overview.rst: Update docs for --duplicates and 1414 --preserve-removed. 1415 1416Thu Oct 28 13:27:01 GMT 2010 Olly Betts <olly@survex.com> 1417 1418 * omindex.cc: Deprecated "--preserve-nonduplicates" in favour of new 1419 long option "--preserve-removed" which does the same thing, but has 1420 a (hopefully) clearer name. Rename the variable it controls from 1421 preserve_unupdated to delete_removed_documents (with the opposite 1422 sense). 1423 1424Thu Oct 28 12:08:59 GMT 2010 Olly Betts <olly@survex.com> 1425 1426 * configfile.cc: Only append '/' to directory values if they don't 1427 already have a trailing '/'. 1428 1429Thu Oct 28 11:49:54 GMT 2010 Olly Betts <olly@survex.com> 1430 1431 * runfilter.cc: Make the memory limit for filter processes the size 1432 of physical memory, not 7/8 of this value, which is a little less 1433 arbitrary (ticket#424). 1434 1435Thu Oct 28 11:47:38 GMT 2010 Olly Betts <olly@survex.com> 1436 1437 * omindex.cc: Under --duplicate=ignore, fix so that old documents which 1438 aren't seen get deleted, which wasn't implemented before (to suppress 1439 this deletion, pass -p as well). 1440 1441Thu Oct 28 10:38:21 GMT 2010 Olly Betts <olly@survex.com> 1442 1443 * omindex.cc: Track how many documents in the index we haven't seen 1444 in this index run - if this is 0, we don't need to check for docs 1445 to delete at all; otherwise we can at least use it to know when we 1446 have found them all. Use a PostingIterator over all documents to 1447 avoid having to catch exceptions from delete_document() for gaps 1448 in the used docids. 1449 1450Thu Oct 28 04:52:36 GMT 2010 Olly Betts <olly@survex.com> 1451 1452 * omindex.cc: Add quotes around directory name in "Entering directory" 1453 message. Add directory name to "skipping directory" error message. 1454 1455Thu Oct 28 04:50:37 GMT 2010 Olly Betts <olly@survex.com> 1456 1457 * omindex.cc: Document --verbose in --help. Actually recognise -V. 1458 1459Thu Oct 28 04:01:31 GMT 2010 Olly Betts <olly@survex.com> 1460 1461 * omindex.cc: Move the directory iteration loop out of the try/catch 1462 block for starting the iteration, which means it's indented by a 1463 whole level less. 1464 1465Thu Oct 28 03:47:30 GMT 2010 Olly Betts <olly@survex.com> 1466 1467 * omindex.cc: Add --verbose option, and disable the less interesting 1468 output unless it is specified. 1469 1470Thu Oct 28 03:34:44 GMT 2010 Olly Betts <olly@survex.com> 1471 1472 * omindex.cc: Eliminate the message "Caught unknown exception in 1473 index_directory, rethrowing" as it isn't actually informative. 1474 1475Thu Oct 28 01:43:44 GMT 2010 Olly Betts <olly@survex.com> 1476 1477 * omindex.cc: Variable dbpath doesn't need to be global. 1478 1479Thu Oct 28 01:28:10 GMT 2010 Olly Betts <olly@survex.com> 1480 1481 * omindex.cc: The Host and Path terms are the same for every document 1482 in a single invocation of omindex, so calculate them just once up 1483 front. 1484 1485Thu Oct 28 01:13:36 GMT 2010 Olly Betts <olly@survex.com> 1486 1487 * omindex.cc: Eliminate the leading slash on filenames in output, so 1488 they are now relative filenames on the system. This also simplifies 1489 path building internally. 1490 1491Wed Oct 27 09:51:51 GMT 2010 Olly Betts <olly@survex.com> 1492 1493 * omindex.cc: Use rpm's --qf option to produce output which is simpler 1494 to parse. 1495 1496Wed Oct 27 09:32:22 GMT 2010 Olly Betts <olly@survex.com> 1497 1498 * docs/overview.rst,omindex.cc: Add support for indexing RPM packages 1499 (ticket#493). 1500 1501Wed Oct 27 06:07:59 GMT 2010 Olly Betts <olly@survex.com> 1502 1503 * docs/overview.rst,omindex.cc: Add support for indexing Debian package 1504 files (ticket #493). 1505 1506Wed Oct 27 05:37:02 GMT 2010 Olly Betts <olly@survex.com> 1507 1508 * docs/overview.rst,omindex.cc: Quietly ignore files with mimetype set 1509 to "ignore". The initial list of extensions set to ignore is: 1510 .a .dll .dylib .exe .lib .o .obj .so 1511 1512Wed Oct 27 02:25:01 GMT 2010 Olly Betts <olly@survex.com> 1513 1514 * omindex.cc: Report get_description() for Xapian exceptions, which 1515 is provides additional information above get_msg(). 1516 1517Wed Oct 27 01:56:08 GMT 2010 Olly Betts <olly@survex.com> 1518 1519 * omindex.cc,query.cc,values.h: Add file size as a value, and set up a 1520 NumberValueRangeProcessor so size: works in the query (has to be in 1521 bytes currently). 1522 1523Wed Oct 27 01:31:25 GMT 2010 Olly Betts <olly@survex.com> 1524 1525 * scriptindex.cc: Report get_description() for Xapian exceptions, which 1526 is provides additional information above get_msg(). 1527 1528Tue Oct 26 12:00:58 GMT 2010 Olly Betts <olly@survex.com> 1529 1530 * docs/overview.rst: Document the new emptydocs template. 1531 1532Tue Oct 26 11:51:31 GMT 2010 Olly Betts <olly@survex.com> 1533 1534 * docs/omegascript.rst,query.cc: Add new $emptydocs command which 1535 returns a list of documents with doclength zero. 1536 * query.cc: Extend $field to take an optional DOCID argument, rather 1537 than always using the context from $hitlist. 1538 * templates/emptydocs: New template which lists documents with 1539 doclength zero. 1540 1541Thu Oct 21 12:05:23 GMT 2010 Olly Betts <olly@survex.com> 1542 1543 * configure.ac,unixperm.cc: Fix to build on platforms where 1544 getgrouplist() exists but takes int* not gid_t* (e.g. Mac OS X). 1545 1546Wed Oct 20 10:30:13 GMT 2010 Olly Betts <olly@survex.com> 1547 1548 * omindex.cc,scriptindex.cc: Add boolean terms with add_boolean_term() 1549 so they get wdf of 0 and don't contribute to document length. 1550 1551Sat Oct 16 06:13:23 GMT 2010 Olly Betts <olly@survex.com> 1552 1553 * configure.ac: Probe for any options needed to enable large file 1554 support. Handling files >= 2GB isn't especially useful, but more 1555 importantly this is needed to allow omindex to index files on filing 1556 systems with 64 bit inodes on some platforms (e.g. 32-bit Linux). 1557 1558Mon Oct 11 11:11:07 GMT 2010 Olly Betts <olly@survex.com> 1559 1560 * Makefile.am: Drop special case to remove man pages on "make clean" 1561 in maintainer-mode. 1562 1563Wed Sep 29 04:14:21 GMT 2010 Olly Betts <olly@survex.com> 1564 1565 * Makefile.am,configure.ac,query.cc,unixperm.cc,unixperm.h: Pull out 1566 permission checks into a separate file and check Unix user and group 1567 permissions based on environmental variable REMOTE_USER, if set. 1568 1569Tue Sep 28 08:06:00 GMT 2010 Olly Betts <olly@survex.com> 1570 1571 * Makefile.am: Ship common/realtime.h. 1572 1573Tue Sep 28 06:32:10 GMT 2010 Olly Betts <olly@survex.com> 1574 1575 * query.cc: Apply permission filters if USER and/or GROUP are set. 1576 1577Tue Sep 28 06:14:50 GMT 2010 Olly Betts <olly@survex.com> 1578 1579 * ./: Update svn:externals to latest common from xapian-core. 1580 * query.cc: Use RealTime::now() to time running the query. Include 1581 more enquire set-up in the time. 1582 1583Tue Sep 28 05:26:07 GMT 2010 Olly Betts <olly@survex.com> 1584 1585 * omindex.cc: Index file owner and read permissions, to allow finding 1586 documents with a particular owner, and so searches can be restricted 1587 to documents a user is able to read. 1588 * docs/termprefixes.rst: Document term prefixes used by the above. 1589 1590Tue Sep 28 05:20:01 GMT 2010 Olly Betts <olly@survex.com> 1591 1592 * diritor.h: Rename get_other_read() to is_other_readable() for 1593 consistency. 1594 1595Tue Sep 28 04:16:55 GMT 2010 Olly Betts <olly@survex.com> 1596 1597 * diritor.cc,diritor.h: Rearrange so that the setting of statbuf_valid 1598 gets inlined so the compiler should be able to optimise out 1599 subsequent calls to call_stat(). 1600 1601Tue Sep 28 04:10:28 GMT 2010 Olly Betts <olly@survex.com> 1602 1603 * diritor.h: Add methods to read the owner and group, and to check 1604 who can read the file. 1605 1606Tue Sep 28 01:39:15 GMT 2010 Olly Betts <olly@survex.com> 1607 1608 * NEWS: Fix typo. 1609 1610Tue Sep 28 01:33:44 GMT 2010 Olly Betts <olly@survex.com> 1611 1612 * NEWS: Fix whitespace oddities. 1613 1614Tue Sep 28 01:31:46 GMT 2010 Olly Betts <olly@survex.com> 1615 1616 * NEWS: Update from ChangeLog. 1617 1618Tue Sep 28 01:27:41 GMT 2010 Olly Betts <olly@survex.com> 1619 1620 * omindex.cc: Improve --help for --mime-type option. 1621 1622Mon Sep 20 06:50:45 GMT 2010 Olly Betts <olly@survex.com> 1623 1624 * omindex.cc,svgparse.cc,svgparse.h: Extract any document title and 1625 keywords from SVG files. 1626 1627Mon Sep 20 06:49:44 GMT 2010 Olly Betts <olly@survex.com> 1628 1629 * htmlparse.cc: Call closing_tag() for XML empty tag syntax (like 1630 "<tag foo=bar />"). 1631 1632Mon Sep 20 05:30:54 GMT 2010 Olly Betts <olly@survex.com> 1633 1634 * Makefile.am,docs/overview.rst,omindex.cc,svgparse.cc,svgparse.h: Add 1635 support for indexing SVG files. 1636 1637Tue Sep 07 04:39:59 GMT 2010 Olly Betts <olly@survex.com> 1638 1639 * outlookmsg2html.in: If the required perl modules aren't available, 1640 exit with status 127 which omindex interprets as "filter not 1641 installed" and won't try further .msg files. 1642 1643Tue Sep 07 02:24:36 GMT 2010 Olly Betts <olly@survex.com> 1644 1645 * Makefile.am,configure.ac,docs/overview.rst,omindex.cc, 1646 outlookmsg2html.in,pkglibbindir.cc,pkglibbindir.h: Add support for 1647 indexing .msg files from Microsoft Outlook. (ticket#334) 1648 1649Tue Aug 31 06:32:15 GMT 2010 Olly Betts <olly@survex.com> 1650 1651 * omindex.cc: Fix handling of quoting in CSV files to match what's 1652 most common. 1653 1654Tue Aug 31 05:41:13 GMT 2010 Olly Betts <olly@survex.com> 1655 1656 * docs/overview.rst,omindex.cc: The V in CSV is Values not Variable. 1657 1658Mon Aug 30 14:56:36 GMT 2010 Olly Betts <olly@survex.com> 1659 1660 * docs/overview.rst,omindex.cc: Add support for indexing .csv files. 1661 1662Sat Aug 28 11:46:22 GMT 2010 Olly Betts <olly@survex.com> 1663 1664 * cdb_find.cc,cdb_init.cc,cgiparam.cc,date.cc,md5.cc,query.cc,utils.cc, 1665 values.h: Fix to compile with Sun C++. 1666 1667Sat Aug 28 11:36:25 GMT 2010 Olly Betts <olly@survex.com> 1668 1669 * omega.cc: An ESet can't contain empty terms, so there's no need to 1670 check for them. 1671 1672Tue Aug 24 05:58:28 GMT 2010 Olly Betts <olly@survex.com> 1673 1674 * NEWS,configure.ac: Update for 1.2.3. 1675 1676Mon Aug 23 15:08:11 GMT 2010 Olly Betts <olly@survex.com> 1677 1678 * xapian-omega.spec.in: Don't run autoreconf - it's no longer required. 1679 1680Tue Aug 03 14:11:35 GMT 2010 Olly Betts <olly@survex.com> 1681 1682 * docs/termprefixes.rst: Update "flint and quartz" to "flint and chert" 1683 as quartz is no longer supported. Give exact term length limit for 1684 flint and chert. 1685 1686Sun Jun 27 05:00:39 GMT 2010 Olly Betts <olly@survex.com> 1687 1688 * NEWS,configure.ac: Update for 1.2.2. 1689 1690Sat Jun 26 15:59:59 GMT 2010 Olly Betts <olly@survex.com> 1691 1692 * NEWS.SKELETON: Add blank line to the end. 1693 1694Sat Jun 26 15:59:05 GMT 2010 Olly Betts <olly@survex.com> 1695 1696 * NEWS.SKELETON: Add template NEWS entry. 1697 1698Tue Jun 22 13:55:11 GMT 2010 Olly Betts <olly@survex.com> 1699 1700 * NEWS: Sync with 1.0.21. 1701 * NEWS,configure.ac: Update for 1.2.1. 1702 1703Sun Jun 13 11:55:40 GMT 2010 Olly Betts <olly@survex.com> 1704 1705 * freemem.cc: Merge in __WIN32__ implementation from perftest in 1706 xapian-core. 1707 1708Fri May 14 01:39:43 GMT 2010 Olly Betts <olly@survex.com> 1709 1710 * freemem.cc: Use "safeunistd.h" instead of <unistd.h>. 1711 1712Wed Apr 28 13:38:33 GMT 2010 Olly Betts <olly@survex.com> 1713 1714 * NEWS: Sync with 1.0.20. 1715 1716Wed Apr 28 06:44:56 GMT 2010 Olly Betts <olly@survex.com> 1717 1718 * configure.ac: Tell libtool not to link in deplibs on platforms where 1719 we know they aren't needed. 1720 * configure.ac: On Linux, extract the library search path from ldconfig 1721 which gives us the default entries reliably. 1722 * NEWS,configure.ac: 1.2.0. 1723 1724Thu Apr 15 04:32:06 GMT 2010 Olly Betts <olly@survex.com> 1725 1726 * NEWS,configure.ac: Update for 1.1.5. 1727 1728Mon Feb 15 14:00:26 GMT 2010 Olly Betts <olly@survex.com> 1729 1730 * configure.ac: Update for 1.1.4. 1731 1732Mon Feb 15 13:51:44 GMT 2010 Olly Betts <olly@survex.com> 1733 1734 * NEWS: Add missing notes for 1.1.2 and 1.1.1 including changes from 1735 1.0.14 and 1.0.13 respectively. 1736 1737Mon Feb 15 13:28:12 GMT 2010 Olly Betts <olly@survex.com> 1738 1739 * NEWS: Update from ChangeLog and 1.0.18. 1740 1741Mon Feb 08 00:48:44 GMT 2010 Olly Betts <olly@survex.com> 1742 1743 * Makefile.am: Need to ship common/omassert.h. 1744 1745Sun Feb 07 23:03:45 GMT 2010 Olly Betts <olly@survex.com> 1746 1747 * Makefile.am: Need to ship common/str.h. 1748 1749Sun Feb 07 21:40:03 GMT 2010 Olly Betts <olly@survex.com> 1750 1751 * Makefile.am,omega.cc,omindex.cc,query.cc,utils.cc,utils.h: Use the 1752 optimised str() routine instead of int_to_string() and 1753 long_to_string(). 1754 1755Fri Feb 05 23:29:12 GMT 2010 Olly Betts <olly@survex.com> 1756 1757 * omindex.cc: Increase the wdf boost for the document title from 2 to 1758 5, since 2 isn't really enough. 1759 1760Thu Feb 04 03:20:02 GMT 2010 Olly Betts <olly@survex.com> 1761 1762 * Makefile.am,configure.ac,runfilter.cc: Use safesyswait.h. 1763 * runfilter.cc: Reformat header to @file doxygen comment. Put 1764 '#include "runfilter.h"' right after <config.h>. 1765 1766Wed Dec 10 00:15:10 GMT 2009 Olly Betts <olly@survex.com> 1767 1768 * NEWS: Update from ChangeLog. 1769 1770Wed Dec 09 00:26:19 GMT 2009 Olly Betts <olly@survex.com> 1771 1772 * myhtmlparse.cc: Add missing "using namespace std;". 1773 1774Wed Dec 09 00:20:38 GMT 2009 Olly Betts <olly@survex.com> 1775 1776 * htmlparse.cc: Make the default charset "utf-8" not "UTF-8" as we 1777 lower case explicitly specified character sets to compare to see 1778 if we need to reparse, so this avoids a reparse when UTF-8 is 1779 explicitly specified as well as the default. 1780 1781Tue Dec 08 23:56:46 GMT 2009 Olly Betts <olly@survex.com> 1782 1783 * scriptindex.cc: Don't bomb out if indexing is disallowed or we hit 1784 </body> for a document which had an overridden character set. 1785 Fixes ticket#410. 1786 1787Wed Nov 18 10:48:47 GMT 2009 Olly Betts <olly@survex.com> 1788 1789 * NEWS,configure.ac: Update for 1.1.3. 1790 1791Wed Nov 18 02:37:34 GMT 2009 Olly Betts <olly@survex.com> 1792 1793 * NEWS: Update from 1.0.17 and ChangeLog. 1794 1795Mon Nov 16 09:08:12 GMT 2009 Olly Betts <olly@survex.com> 1796 1797 * utf8converttest.cc: Charset "8859_1" isn't understood by Solaris 1798 libiconv, and isn't likely to be specified on a page, so just 1799 test it for our built-in convertor and GNU libc. 1800 1801Wed Nov 11 04:52:25 GMT 2009 Olly Betts <olly@survex.com> 1802 1803 * configure.ac: Also check for socketpair with -lxnet if it isn't found 1804 without, which enables resource limits on Solaris, and possibly some 1805 other platforms. Fixes ticket#412. 1806 1807Wed Nov 04 01:51:41 GMT 2009 Olly Betts <olly@survex.com> 1808 1809 * freemem.cc: On Linux, _SC_AVPHYS_PAGES excludes pages used by the OS 1810 VM cache, so will often return a really low value, so instead use 1811 _SC_PHYS_PAGES. Reported by Rune Kock in Debian bug#548987. Also 1812 explains ticket#358. 1813 1814Wed Nov 04 00:54:38 GMT 2009 Olly Betts <olly@survex.com> 1815 1816 * common/: Sync with latest version from xapian-core to pick up getopt 1817 fix for Mac OS X 10.6. 1818 1819Mon Nov 02 09:32:22 GMT 2009 Olly Betts <olly@survex.com> 1820 1821 * omindex.cc: Use delete[] (not delete) for array allocated by new[]. 1822 1823Mon Nov 02 07:08:13 GMT 2009 Olly Betts <olly@survex.com> 1824 1825 * runfilter.cc: Fix likely crash if read() is interrupted by a signal. 1826 Identified by Coverity's Scan. 1827 1828Mon Nov 02 06:47:01 GMT 2009 Olly Betts <olly@survex.com> 1829 1830 * scriptindex.cc: Extend exception handling to the whole of main. 1831 Xapian::Stem("english") can't actually throw, but that's not obvious 1832 to static analysis tools, and it is more robust to wrap the whole of 1833 main, and reduces indentation. 1834 1835Mon Nov 02 06:32:41 GMT 2009 Olly Betts <olly@survex.com> 1836 1837 * omindex.cc,scriptindex.cc: Tighten up the type of the error we catch 1838 to detect an unknown stemming language. 1839 1840Thu Sep 17 12:13:10 GMT 2009 Olly Betts <olly@survex.com> 1841 1842 * NEWS: Update from ChangeLog. 1843 1844Thu Sep 10 13:33:06 GMT 2009 Olly Betts <olly@survex.com> 1845 1846 * configure.ac: Default to looking for xapian-config-1.1. 1847 1848Thu Sep 10 06:46:55 GMT 2009 Olly Betts <olly@survex.com> 1849 1850 * NEWS: Sync changes from 1.0.15 and 1.0.16. 1851 1852Wed Sep 09 13:32:25 GMT 2009 Olly Betts <olly@survex.com> 1853 1854 * omega.cc,query.cc,query.h: Fix cross-site scripting vulnerability in 1855 reporting of exceptions (CVE-2009-2947). 1856 1857Fri Aug 28 15:30:07 GMT 2009 Richard Boulton <richard@lemurconsulting.com> 1858 1859 * configure.ac: Check for PERL if in maintainer mode, not just when 1860 building documentation, because making the omegascript vim syntax 1861 mode requires it. 1862 1863Wed Aug 26 14:17:06 GMT 2009 Olly Betts <olly@survex.com> 1864 1865 * templates/query: www.xapian.org -> xapian.org. 1866 1867Tue Aug 25 11:15:38 GMT 2009 Olly Betts <olly@survex.com> 1868 1869 * gen-omegascript-vim: Fix swapped arguments to perl mkdir function. 1870 1871Tue Aug 25 10:39:29 GMT 2009 Olly Betts <olly@survex.com> 1872 1873 * gen-omegascript-vim: Add GPL licence boilerplate. 1874 1875Tue Aug 25 10:29:07 GMT 2009 Olly Betts <olly@survex.com> 1876 1877 * gen-omegascript-vim: Need to create "extra" for a VPATH build. 1878 1879Tue Aug 25 08:39:00 GMT 2009 Olly Betts <olly@survex.com> 1880 1881 * Makefile.am: Fix for VPATH build. 1882 1883Tue Aug 25 06:38:08 GMT 2009 Olly Betts <olly@survex.com> 1884 1885 * Makefile.am,extra/omegascript.vim,extra/omegascript.vim.in, 1886 gen-omegascript-vim: The list of OmegaScript commands in the vim 1887 mode was rather out of date, and a few commands were misclassified. 1888 Fix both problems and avoid future recurrences by automatically 1889 generating those lists from the command list in query.cc. 1890 1891Sat Aug 15 11:31:56 GMT 2009 Olly Betts <olly@survex.com> 1892 1893 * NEWS: Update from ChangeLog. 1894 1895Wed Aug 05 03:50:54 GMT 2009 Olly Betts <olly@survex.com> 1896 1897 * omindex.cc: Implement correct handling of paths when calling 1898 external filter programs on Microsoft Windows. 1899 1900Thu Jul 23 12:07:24 GMT 2009 Olly Betts <olly@survex.com> 1901 1902 * omindex.cc: Remove pointless fallback code. 1903 1904Thu Jul 23 12:06:37 GMT 2009 Olly Betts <olly@survex.com> 1905 1906 * templates/inc/toptermsjs: Use double-quotes rather than single quotes 1907 for parameter values on the <script> tag. 1908 1909Thu Jul 23 11:29:43 GMT 2009 Olly Betts <olly@survex.com> 1910 1911 * docs/omegascript.rst: Document that $date uses UTC. (ticket#314) 1912 1913Thu Jul 23 11:26:15 GMT 2009 Olly Betts <olly@survex.com> 1914 1915 * templates/query: If JavaScript is available, convert the 1916 $field{modtime} to a string on the client-side so that the timezone 1917 is correct. If JavaScript isn't available, fall back to the existing 1918 behaviour of using UTC. (ticket#314) 1919 1920Thu Jul 23 04:12:02 GMT 2009 Olly Betts <olly@survex.com> 1921 1922 * NEWS,configure.ac: Update for 1.1.2. 1923 1924Wed Jul 22 04:33:29 GMT 2009 Olly Betts <olly@survex.com> 1925 1926 * NEWS: Update from ChangeLog and sync with 1.0.13 and 1.0.14. 1927 1928Tue Jul 07 15:05:09 GMT 2009 Olly Betts <olly@survex.com> 1929 1930 * omindex.cc: Consistently use endl not "\n" at the end of messages so 1931 that output is flushed. 1932 1933Tue Jul 07 07:29:21 GMT 2009 Olly Betts <olly@survex.com> 1934 1935 * cdb_init.cc,cdb_int.h,cgiparam.cc,configfile.cc,date.cc, 1936 datematchdecider.cc,datematchdecider.h,freemem.cc,htmlparse.cc, 1937 htmlparsetest.cc,md5.cc,md5test.cc,myhtmlparse.cc,omega.cc, 1938 omindex.cc,query.cc,runfilter.cc,scriptindex.cc,strcasecmp.h, 1939 utf8converttest.cc,utils.cc: Update to use C++ forms for ISO C 1940 standard headers (ticket#330). 1941 1942Mon Jul 06 01:54:35 GMT 2009 Olly Betts <olly@survex.com> 1943 1944 * loadfile.cc: Avoid infinite loop if the file has been truncated 1945 since we read the length, or on Cygwin with the automatic end of 1946 line translation turned on. 1947 1948Sun Jul 05 13:00:57 GMT 2009 Olly Betts <olly@survex.com> 1949 1950 * htmlparse.cc,htmlparse.h: Make HtmlParser::get_parameter() const 1951 (ticket#139). 1952 1953Sun Jul 05 12:59:45 GMT 2009 Olly Betts <olly@survex.com> 1954 1955 * cdb_init.cc: Prefer static_cast<> to C-style cast. 1956 1957Sat Jun 20 03:31:22 GMT 2009 Olly Betts <olly@survex.com> 1958 1959 * docs/overview.rst: www.xapian.org -> xapian.org 1960 1961Thu Jun 11 09:45:45 GMT 2009 Olly Betts <olly@survex.com> 1962 1963 * omindex.cc: Extract pptx notesSlides and comments, if present. If 1964 they aren't, unzip returns exit code 11, which we must ignore 1965 (ticket#290). 1966 1967Thu Jun 11 07:38:57 GMT 2009 Olly Betts <olly@survex.com> 1968 1969 * docs/overview.rst,omindex.cc: Handle the "macroenabled" versions of 1970 MS Office 2007 files too (ticket#290). 1971 1972Wed Jun 10 01:13:14 GMT 2009 Olly Betts <olly@survex.com> 1973 1974 * configure.ac: Update for 1.1.1. 1975 1976Tue Jun 09 14:35:40 GMT 2009 Olly Betts <olly@survex.com> 1977 1978 * NEWS: Update for 1.1.1. 1979 1980Mon May 25 13:38:46 GMT 2009 Olly Betts <olly@survex.com> 1981 1982 * query.cc: If SERVER_PROTOCOL in the environment is set to INCLUDED, 1983 then our output is being included in another page (e.g. using SSI) 1984 so suppress the output of any HTTP headers. 1985 1986Mon May 25 13:02:22 GMT 2009 Olly Betts <olly@survex.com> 1987 1988 * templates/query: Remove extra "}" introduced when adding spelling 1989 support. 1990 1991Mon May 25 12:57:45 GMT 2009 Olly Betts <olly@survex.com> 1992 1993 * cgiparam.cc,commonhelp.cc: Include the corresponding header. 1994 1995Mon May 25 12:56:55 GMT 2009 Olly Betts <olly@survex.com> 1996 1997 * cgiparam.h: Add explicit inclusions of <map> and <string> and qualify 1998 multimap and string with std::. 1999 2000Sat May 23 12:21:33 GMT 2009 Olly Betts <olly@survex.com> 2001 2002 * configure.ac: Sync warning flags used with GCC with xapian-core 2003 apart from -Woverloaded-virtual which fires for 2004 MyHtmlParser::parse_html(). That probably should be tidied up at 2005 some point, but not right now. 2006 2007Wed May 20 11:24:46 GMT 2009 Olly Betts <olly@survex.com> 2008 2009 * omindex.cc: The MD5 checksum of a text file with a BOM was being 2010 incorrectly calculated from the contents converted to UTF-8 2011 since 1.0.7. Noticed by Srijon Biswas. 2012 2013Tue May 05 12:13:17 GMT 2009 Olly Betts <olly@survex.com> 2014 2015 * omindex.cc: We can now use numeric_limits<> since we no longer 2016 support GCC 2.95, so use it and fix a warning on platforms with 2017 32 bit long. 2018 2019Thu Apr 30 14:09:50 GMT 2009 Olly Betts <olly@survex.com> 2020 2021 * Makefile.am,docs/omegascript.rst,query.cc,weight.cc,weight.h: Add 2022 $opt{weighting} to allow the weighting scheme and parameters to be 2023 specified (ticket#298). 2024 2025Tue Apr 28 07:38:54 GMT 2009 Olly Betts <olly@survex.com> 2026 2027 * omindex.cc: Check the last modification time of files before 2028 reindexing (ticket#342). 2029 2030Tue Apr 28 05:17:04 GMT 2009 Olly Betts <olly@survex.com> 2031 2032 * omindex.cc: Drop the copyright info from the output of --version as 2033 it's perennially out of date and we don't report it for any other 2034 Xapian programs. 2035 2036Tue Apr 28 05:03:29 GMT 2009 Olly Betts <olly@survex.com> 2037 2038 * omindex.cc: If the filter for a filetype isn't installed, don't erase 2039 the entry from the mime_map, but instead set it to the empty string 2040 and then use this to report why we subsequently skip files with the 2041 same extension, rather than slightly misleadingly reporting "Unknown 2042 extension". 2043 2044Mon Apr 27 16:34:29 GMT 2009 Olly Betts <olly@survex.com> 2045 2046 * templates/query: Offer any spelling correction QueryParser gives. 2047 2048Mon Apr 27 13:36:19 GMT 2009 Olly Betts <olly@survex.com> 2049 2050 * omindex.cc: Add "--spelling" option to index spelling correction 2051 data. 2052 2053Sun Apr 26 16:28:36 GMT 2009 Olly Betts <olly@survex.com> 2054 2055 * omindex.cc: Make -s work as a short-form for --stemmer (as 2056 documented by "omindex --help" and "man omindex"). 2057 2058Sun Apr 26 15:33:32 GMT 2009 Olly Betts <olly@survex.com> 2059 2060 * docs/omegascript.rst,query.cc: Add $suggestion and $opt{spelling} to 2061 provide access to spelling correction (ticket#296). 2062 2063Sun Apr 26 15:08:40 GMT 2009 Olly Betts <olly@survex.com> 2064 2065 * docs/scriptindex.rst,scriptindex.cc: Add new "spell" action for 2066 scriptindex (ticket#296). 2067 2068Thu Apr 23 07:40:41 GMT 2009 Olly Betts <olly@survex.com> 2069 2070 * docs/scriptindex.rst,scriptindex.cc: Add new "valuenumeric" action 2071 to index a value using Xapian::sortable_serialise() to allow numeric 2072 sorting (ticket#260). 2073 2074Thu Apr 23 07:09:18 GMT 2009 Olly Betts <olly@survex.com> 2075 2076 * Makefile.am,configure.ac,docs/Makefile.am: Fix things up so that in 2077 a bootstrapped SVN tree, automatic regeneration of 2078 autotools-generated files uses the in-tree versions of the autotools. 2079 2080Wed Apr 22 13:52:28 GMT 2009 Olly Betts <olly@survex.com> 2081 2082 * NEWS: Update for 1.1.0. 2083 2084Mon Apr 20 14:20:51 GMT 2009 Olly Betts <olly@survex.com> 2085 2086 * NEWS: Sync changes from 1.0.12. 2087 2088Mon Apr 20 14:15:41 GMT 2009 Olly Betts <olly@survex.com> 2089 2090 * NEWS: Update from ChangeLog and clean up for release. 2091 2092Thu Apr 16 10:02:44 GMT 2009 Olly Betts <olly@survex.com> 2093 2094 * transform.cc: Fix off-by-one error - the return value of pcre_exec() 2095 is one more than the number of groupings. 2096 2097Thu Apr 16 09:23:29 GMT 2009 Olly Betts <olly@survex.com> 2098 2099 * Makefile.am: Need to ship new file transform.h. 2100 2101Thu Apr 16 08:20:01 GMT 2009 Olly Betts <olly@survex.com> 2102 2103 * Makefile.am,docs/omegascript.rst,query.cc,transform.cc,transform.h: 2104 Factor out the implementation of $transform into a separate source 2105 file and compile only that file with $(PCRE_CFLAGS) to avoid 2106 problems reported by James Aylett with Mac OS X on #xapian-devel. 2107 Fix expansion of \1 to \9 to work correctly and document these 2108 and \\. Fix handling of unescaped \ at the end of the pattern, and 2109 leave unrecognised \<x> sequences unchanged. 2110 2111Thu Apr 16 04:38:20 GMT 2009 Olly Betts <olly@survex.com> 2112 2113 * configure.ac: Remove duplicate "AC_SUBST(AM_CXXFLAGS)". 2114 2115Thu Apr 16 04:29:28 GMT 2009 Olly Betts <olly@survex.com> 2116 2117 * configure.ac: Avoid implicitly casting a string literal to char* in 2118 the test for iconv by adding the same explicit cast we use in the 2119 code in utf8convert.cc. Currently the implicit cast is "only" a 2120 warning under GCC, but the user could pass -Werror explicitly in 2121 CXXFLAGS, and this could be promoted to an error in future GCC 2122 versions, and may already be so for some other compilers. 2123 2124Thu Apr 16 03:56:16 GMT 2009 Olly Betts <olly@survex.com> 2125 2126 * configure.ac: Back out previous fix - -Werror has nothing to do with 2127 the issue James reported. 2128 2129Tue Apr 14 15:34:36 GMT 2009 Richard Boulton <richard@lemurconsulting.com> 2130 2131 * configure.ac: Test for compiler flags before checking for 2132 libraries, and use the compiler flags found when checking for 2133 things. In particular, this should fix the test for the type 2134 used by iconv() on MacOS (where it was previously returning "char 2135 *", and the test was giving a warning about converting this to 2136 "const char *", but not failing). Requires a change to the iconv 2137 test to avoid it failing on linux with GCC due to an unrelated 2138 warning in the test code. 2139 2140Sat Apr 04 15:15:18 GMT 2009 Olly Betts <olly@survex.com> 2141 2142 * NEWS: Update from ChangeLog. 2143 2144Wed Mar 25 12:35:42 GMT 2009 Olly Betts <olly@survex.com> 2145 2146 * Makefile.am,configure.ac: Actually use all those warning flags we 2147 carefully determine! 2148 2149Wed Mar 25 12:03:37 GMT 2009 Olly Betts <olly@survex.com> 2150 2151 * Makefile.am,configure.ac: Only put XAPIAN_CXXFLAGS in CXXFLAGS for 2152 the duration of configure (we need it as it may include options to 2153 put the compiler into ISO C++ mode). Set AM_CXXFLAGS to 2154 XAPIAN_CXXFLAGS in Makefile.am. This means that the user can safely 2155 override CXXFLAGS at make-time: "make CXXFLAGS=-Os" 2156 2157Wed Mar 25 10:56:29 GMT 2009 Olly Betts <olly@survex.com> 2158 2159 * query.cc: Cope with write() not writing all the data or being 2160 interrupted by a signal when writing log entries. 2161 2162Wed Mar 25 10:48:14 GMT 2009 Olly Betts <olly@survex.com> 2163 2164 * configure.ac: Move AC_PROG_CXX and AC_LANG_CPLUSPLUS earlier so that 2165 CXXFLAGS is set before we add XAPIAN_CXXFLAGS to it. With libtool 2166 1.5.x this wasn't an issue, as AC_PROG_CXX was implicitly run early 2167 on. With libtool 2.2.x it is as AC_PROG_CXX doesn't touch CXXFLAGS 2168 if it is already set, so we don't get "-O2 -g" set for GCC. 2169 2170Wed Mar 18 06:13:16 GMT 2009 Olly Betts <olly@survex.com> 2171 2172 * scriptindex.cc: Mark "index=nopos" error for removal in 1.3.0 2173 not 1.2.0. Tweak code that produces it to use more literal strings. 2174 2175Wed Mar 18 06:12:06 GMT 2009 Olly Betts <olly@survex.com> 2176 2177 * docs/scriptindex.rst: The deprecated "index=nopos" is now removed 2178 and gives an error explaining what to use instead, so remove the 2179 documentation saying it is deprecated and what to do. 2180 2181Mon Mar 16 14:07:58 GMT 2009 Olly Betts <olly@survex.com> 2182 2183 * NEWS: Sync with 1.0.11. 2184 2185Sat Feb 28 08:31:15 GMT 2009 Olly Betts <olly@survex.com> 2186 2187 * omindex.cc,scriptindex.cc: Use commit() rather than flush(). 2188 2189Sat Feb 28 08:28:26 GMT 2009 Olly Betts <olly@survex.com> 2190 2191 * scriptindex.cc: Don't call reopen() on a WritableDatabase - it 2192 doesn't do anything! 2193 2194Thu Feb 26 06:38:05 GMT 2009 Olly Betts <olly@survex.com> 2195 2196 * NEWS: Update from ChangeLog. 2197 2198Thu Feb 26 06:18:05 GMT 2009 Olly Betts <olly@survex.com> 2199 2200 * omindex.cc: Mark "-l" as requiring an argument so that it actually 2201 works - previously it would always result in a segmentation fault. 2202 2203Thu Feb 26 00:17:56 GMT 2009 Olly Betts <olly@survex.com> 2204 2205 * docs/cgiparams.rst: Note the technique of using a stub database file 2206 to allow a default of searching over multiple databases. 2207 2208Wed Feb 25 12:39:08 GMT 2009 Olly Betts <olly@survex.com> 2209 2210 * configure.ac: Update g++ version check to match recent change to 2211 xapian-core. Also turn on _FORTIFY_SOURCE and make the rare() 2212 and usual() branch prediction hint macros available. 2213 2214Mon Feb 23 06:05:25 GMT 2009 Olly Betts <olly@survex.com> 2215 2216 * Makefile.am,docs/overview.rst,omindex.cc,xpsxmlparse.cc, 2217 xpsxmlparse.h: Add support for XPS files (bug#290). 2218 2219Fri Feb 20 03:25:14 GMT 2009 Olly Betts <olly@survex.com> 2220 2221 * query.cc: Wrap a long comment. 2222 2223Thu Feb 19 10:34:36 GMT 2009 Olly Betts <olly@survex.com> 2224 2225 * omega.cc,query.cc: Prefer str.resize(0) to str = "". 2226 2227Thu Feb 19 06:23:34 GMT 2009 Olly Betts <olly@survex.com> 2228 2229 * docs/overview.rst,omindex.cc: Add support for MS Office 2007 2230 formats (bug#290). 2231 2232Thu Feb 19 04:46:26 GMT 2009 Olly Betts <olly@survex.com> 2233 2234 * metaxmlparse.cc,metaxmlparse.h,xmlparse.cc,xmlparse.h: XmlParser and 2235 MetaXmlParser were overriding opening_tag with the wrong signature so 2236 their implementations weren't ever being used. 2237 2238Fri Jan 09 04:19:32 GMT 2009 Olly Betts <olly@survex.com> 2239 2240 * runfilter.cc: Fix to compile when RLIMIT_AS isn't available (as on 2241 NetBSD and OpenBSD). In this situation, instead use RLIMIT_VMEM or 2242 RLIMIT_DATA if either is available. 2243 2244Wed Dec 10 01:06:03 GMT 2008 Olly Betts <olly@survex.com> 2245 2246 * query.cc: Fix poor grammar in comment. 2247 2248Sat Nov 01 01:49:07 GMT 2008 Olly Betts <olly@survex.com> 2249 2250 * NEWS: Sync with 1.0.9. 2251 2252Fri Oct 31 18:34:49 GMT 2008 Olly Betts <olly@survex.com> 2253 2254 * configure.ac: Sync warning flag handling changes from xapian-core. 2255 2256Thu Oct 23 17:08:22 GMT 2008 Olly Betts <olly@survex.com> 2257 2258 * docs/overview.rst: Document HTML parsing a bit, including robots 2259 meta and htdig_noindex. 2260 2261Sat Oct 18 08:00:24 GMT 2008 Olly Betts <olly@survex.com> 2262 2263 * omega.cc: Catch std::exception and report what its what() method 2264 returns. 2265 2266Thu Oct 09 10:16:05 GMT 2008 Olly Betts <olly@survex.com> 2267 2268 * configure.ac: Update autoconf requirement to 2.63, libtool to 2.2.6. 2269 2270Wed Oct 01 04:48:37 GMT 2008 Olly Betts <olly@survex.com> 2271 2272 * scriptindex.cc: Separate Action constructor cases to avoid 2273 pointlessly calling atoi() on an empty string. 2274 2275Wed Oct 01 03:15:29 GMT 2008 Olly Betts <olly@survex.com> 2276 2277 * omega.cc,omega.h: Remove undocumented and non-functional support for 2278 numeric sorting via: SORT=#<slot> 2279 2280Thu Sep 04 04:26:22 GMT 2008 Olly Betts <olly@survex.com> 2281 2282 * configure.ac: Set version to 1.1.0. 2283 2284Thu Sep 04 04:21:12 GMT 2008 Olly Betts <olly@survex.com> 2285 2286 * NEWS: Sync with 1.0.8 and update from ChangeLog. 2287 2288Wed Sep 03 12:26:58 GMT 2008 Olly Betts <olly@survex.com> 2289 2290 * htmlparse.cc,htmlparse.h,htmlparsetest.cc,myhtmlparse.cc, 2291 myhtmlparse.h,omindex.cc,scriptindex.cc,xmlparse.h: If the character 2292 encoding is specified using <meta http-equiv=...> in an HTML 2293 document then reparse the document if it isn't the encoding we're 2294 already using so that any preceding <title> is converted correctly 2295 (bug#292). 2296 2297 Convert text from meta tag parameters to UTF-8 (bug#293). 2298 2299 Handle <meta charset="..."> (new in HTML 5). 2300 2301 Fix bug in parameter parsing which was probably just a small 2302 performance penalty in real world cases, but could perhaps result in 2303 parsing bogus extra parameters in carefully contrived situations. 2304 2305Tue Aug 05 09:24:33 GMT 2008 Olly Betts <olly@survex.com> 2306 2307 * docs/: Fix a few typos and improve wording in a few places. 2308 2309Tue Aug 05 09:19:56 GMT 2008 Olly Betts <olly@survex.com> 2310 2311 * omindex.cc: Tweak to use string::assign() instead of assigning the 2312 result of string::substr(). 2313 2314Tue Jul 29 23:48:31 GMT 2008 Olly Betts <olly@survex.com> 2315 2316 * runfilter.cc: Add missing <signal.h>, noted on FreeBSD by Henrik 2317 Brix Andersen. 2318 2319Mon Jul 21 12:27:48 GMT 2008 Olly Betts <olly@survex.com> 2320 2321 * commonhelp.cc: Use PACKAGE_BUGREPORT instead of hardcoding the bug 2322 report URL. Remove reference to "bugzilla" as we now use trac 2323 instead. 2324 2325Mon Jul 21 11:58:25 GMT 2008 Olly Betts <olly@survex.com> 2326 2327 * configure.ac: Put the bug report URL as the third parameter to 2328 AC_INIT. Add proper m4 quoting in a few places (nowhere that 2329 should actually change behaviour). Add hard autotools version 2330 requirements to match xapian-core, and remove the version 2331 justification since HACKING now covers that. Drop docdir workaround 2332 for autoconf < 2.60. 2333 2334Wed Jul 09 10:44:37 GMT 2008 Olly Betts <olly@survex.com> 2335 2336 * configure.ac: The workaround to avoid probe code for F77, GCJ, and 2337 RC being added to configure is no longer required now that we're 2338 using libtool 2.2 so remove it. 2339 2340Wed Jul 09 10:13:18 GMT 2008 Olly Betts <olly@survex.com> 2341 2342 * Makefile.am,configure.ac: Use AC_CONFIG_MACRO_DIR and 2343 ACLOCAL_AMFLAGS as libtoolize 2.2.4 recommends. 2344 2345Fri Jul 04 08:29:47 GMT 2008 Olly Betts <olly@survex.com> 2346 2347 * NEWS: Synchronise with 1.0 branch. 2348 2349Fri Jul 04 08:15:03 GMT 2008 Olly Betts <olly@survex.com> 2350 2351 * utf8convert.cc,utf8converttest.cc: UTF-16 with no BOM is meant to be 2352 assumed to be big-endian. GNU libiconv doesn't handle some examples 2353 as expected, so disable them when using iconv() for now. 2354 2355Fri Jul 04 06:39:20 GMT 2008 Olly Betts <olly@survex.com> 2356 2357 * omindex.cc: Handle UCS-2 and UTF-16 text files with a byte-order 2358 mark (BOM). Ignore any UTF-8 "byte-order" mark. 2359 * utf8convert.cc: Handle UCS-2/UTF-16 and explicit BE and LE forms in 2360 the non-iconv code. 2361 * Makefile.am,utf8converttest.cc: Add unit tests of convert_to_utf8(). 2362 2363Fri Jun 27 04:43:18 GMT 2008 Olly Betts <olly@survex.com> 2364 2365 * query.cc: Overhaul the $highlight colour combinations since some 2366 were rather unreadable. Reported by Joey Hess in Debian bug 2367 #484456. 2368 2369Sun Jun 01 15:12:02 GMT 2008 Olly Betts <olly@survex.com> 2370 2371 * configure.ac: Update version to 1.0.7 to match 1.0 branch. 2372 2373Sun May 25 14:56:41 GMT 2008 Olly Betts <olly@survex.com> 2374 2375 * NEWS: Synchronise with 1.0 branch, and update from ChangeLog. 2376 2377Sat May 17 11:42:26 GMT 2008 Olly Betts <olly@survex.com> 2378 2379 * docs/omegascript.rst,docs/scriptindex.rst: Tweak mark-up so 2380 generated HTML gets a non-empty title. 2381 2382Sat May 10 11:14:20 GMT 2008 Olly Betts <olly@survex.com> 2383 2384 * Makefile.am: omega_CPPFLAGS overrides AM_CPPFLAGS, so we need to 2385 explicitly include AM_CPPFLAGS in omega_CPPFLAGS to get 2386 CONFIGFILE_SYSTEM defined when building omega. 2387 2388Fri May 09 19:27:21 GMT 2008 Olly Betts <olly@survex.com> 2389 2390 * Makefile.am: Fix handling of any -I options needed for PCRE. 2391 2392Sun May 04 19:12:08 GMT 2008 Olly Betts <olly@survex.com> 2393 2394 * omindex.cc: Fix comment error regarding catdvi options. 2395 2396Sat May 03 14:02:02 GMT 2008 Olly Betts <olly@survex.com> 2397 2398 * xapian-omega.spec.in: Remove "www." from xapian.org and 2399 oligarchy.co.uk URLs. 2400 2401Sat May 03 13:55:35 GMT 2008 Olly Betts <olly@survex.com> 2402 2403 * cgiparam.cc,htdig2omega,mbox2omega,omindex-config.cc: Update FSF 2404 address. 2405 2406Sat May 03 13:54:25 GMT 2008 Olly Betts <olly@survex.com> 2407 2408 * gnu_getopt.h: Remove old copy of file which is no longer used - we 2409 now share a copy with xapian-core via common/. 2410 2411Sat May 03 10:42:27 GMT 2008 Olly Betts <olly@survex.com> 2412 2413 * configure.ac: Fix header checks to pre-include <sys/types.h> which 2414 Mac OS X needs for some other headers to work. 2415 2416Sat May 03 10:41:18 GMT 2008 Olly Betts <olly@survex.com> 2417 2418 * configure.ac: Improve code which prevents probing for f77, etc. 2419 2420Fri May 02 17:52:44 GMT 2008 Olly Betts <olly@survex.com> 2421 2422 * configure.ac: Fix to fail if --with-iconv is specified and libiconv 2423 isn't, and we aren't using fink on Mac OS X. 2424 2425Fri May 02 15:55:24 GMT 2008 Richard Boulton <richard@lemurconsulting.com> 2426 2427 * configure.ac: If iconv isn't found, set with_iconv to "no", to 2428 prevent USE_ICONV being set. Was previously only doing this if 2429 fink on OS X was found. 2430 2431Fri May 02 14:14:07 GMT 2008 Richard Boulton <richard@lemurconsulting.com> 2432 2433 * query.cc: Cast size to unsigned before division to avoid a 2434 warning about signed overflow. 2435 2436Fri May 02 14:08:39 GMT 2008 Richard Boulton <richard@lemurconsulting.com> 2437 2438 * configure.ac: Synchronise code for working out warning flags used 2439 for builds with that used for xapian-core. Copes with different 2440 formats of version number output by "gcc --version" which should 2441 help to improve output. 2442 2443Tue Apr 15 23:44:10 GMT 2008 Richard Boulton <richard@lemurconsulting.com> 2444 2445 * query.cc: Catch only the specific error which indicates a need to 2446 repeat a get_termfreq() call on the database instead of the mset. 2447 2448Sun Apr 13 11:19:49 GMT 2008 Richard Boulton <richard@lemurconsulting.com> 2449 2450 * freemem.h: Specify units of get_free_physical_memory(). 2451 2452Sun Apr 06 09:05:58 GMT 2008 Olly Betts <olly@survex.com> 2453 2454 * freemem.cc: Fix latent compilation error on FreeBSD, pointed out by 2455 Richard Boulton. 2456 2457Mon Mar 31 02:00:48 GMT 2008 Olly Betts <olly@survex.com> 2458 2459 * configure.ac: Update version to 1.0.6 to match latest release. 2460 2461Wed Mar 12 07:04:56 GMT 2008 Olly Betts <olly@survex.com> 2462 2463 * scriptindex.cc: Make deprecated "index=nopos" an error. 2464 2465Mon Mar 10 03:37:30 GMT 2008 Olly Betts <olly@survex.com> 2466 2467 * Makefile.am,diritor.cc,diritor.h,omindex.cc: Check for readdir() 2468 failing. 2469 2470Thu Mar 06 23:43:11 GMT 2008 Olly Betts <olly@survex.com> 2471 2472 * common/: Update to latest revisions. 2473 * Makefile.am,diritor.h: Use safedirent.h not dirent.h and build 2474 msvc_dirent.cc as part of omindex. 2475 2476Wed Mar 05 23:16:23 GMT 2008 Olly Betts <olly@survex.com> 2477 2478 * NEWS: Update to HEAD with un-backported changes kept separate. 2479 2480Wed Mar 05 19:05:12 GMT 2008 Olly Betts <olly@survex.com> 2481 2482 * NEWS: Update to 1.0 branch point. 2483 2484Sat Feb 02 22:46:40 GMT 2008 Olly Betts <olly@survex.com> 2485 2486 * query.cc: Add (C) notice for Thomas Viehmann. 2487 2488Sat Feb 02 22:46:14 GMT 2008 Olly Betts <olly@survex.com> 2489 2490 * omindex.cc: Back out random change committed by accident. 2491 2492Sat Feb 02 21:23:07 GMT 2008 Olly Betts <olly@survex.com> 2493 2494 * omindex.cc,query.cc: New OmegaScript commands $addfilter, $lower, 2495 $upper. 2496 * docs/omegascript.rst: Document. Improve formatting. 2497 2498Fri Feb 01 01:45:26 GMT 2008 Olly Betts <olly@survex.com> 2499 2500 * INSTALL: PCRE required. 2501 * docs/omegascript.rst: $transform{} now enabled. Fixes bug#231. 2502 2503Fri Feb 01 01:35:58 GMT 2008 Olly Betts <olly@survex.com> 2504 2505 * Makefile.am,configure.ac,query.cc: Add PCRE as a requirement and 2506 add $transform{} command (which has been in the code for ages but 2507 disabled). 2508 2509Sat Jan 19 02:01:02 GMT 2008 Olly Betts <olly@survex.com> 2510 2511 * omindex.cc: Add support for DjVu files. 2512 * docs/overview.rst: Document. 2513 2514Sat Jan 12 03:37:28 GMT 2008 Olly Betts <olly@survex.com> 2515 2516 * freemem.cc: Check "defined HAVE_SYSMP" rather than just "HAVE_SYSMP". 2517 This doesn't change behaviour, but fixes a compile warning on 2518 platforms other than Linux and IRIX. 2519 2520Fri Dec 21 02:13:49 GMT 2007 Olly Betts <olly@survex.com> 2521 2522 * NEWS: Bump release date. 2523 2524Thu Dec 20 21:40:34 GMT 2007 Olly Betts <olly@survex.com> 2525 2526 * NEWS: Another update for 1.0.5. 2527 2528Thu Dec 20 20:08:58 GMT 2007 Olly Betts <olly@survex.com> 2529 2530 * Makefile.am,scriptindex.cc: Fix scriptindex to insert a ':' between 2531 prefix and term using the same criteria which the QueryParser does. 2532 * scriptindex.cc,docs/scriptindex.rst: Action BOOLEAN now ignores an 2533 empty input rather than adding the prefix as a term. Action UNIQUE 2534 now issues an warning for empty input but otherwise ignores it. 2535 2536Thu Dec 20 17:44:57 GMT 2007 Olly Betts <olly@survex.com> 2537 2538 * common/: Update to r9894 to pick up stringutils.cc. 2539 2540Wed Dec 19 03:44:50 GMT 2007 Olly Betts <olly@survex.com> 2541 2542 * NEWS,configure.ac: Update for 1.0.5. 2543 2544Tue Dec 18 00:58:07 GMT 2007 Olly Betts <olly@survex.com> 2545 2546 * NEWS: Update. 2547 2548Thu Dec 13 01:38:43 GMT 2007 Olly Betts <olly@survex.com> 2549 2550 * omindex.cc: Avoid rereading uncompressed AbiWord documents in order 2551 to calculate their MD5 checksums. 2552 2553Thu Dec 13 01:34:53 GMT 2007 Olly Betts <olly@survex.com> 2554 2555 * omindex.cc: Improve comment wording. 2556 2557Thu Dec 13 00:59:35 GMT 2007 Olly Betts <olly@survex.com> 2558 2559 * docs/overview.rst: Document that omindex limits resources that 2560 filter programs can use. Also add a note welcoming suggestions 2561 for additional reliable filter programs. 2562 2563Wed Dec 12 23:49:27 GMT 2007 Olly Betts <olly@survex.com> 2564 2565 * Makefile.am,freemem.cc,freemem.h,runfilter.cc: Limit filter programs 2566 to 7/8 of free physical memory on platforms where we know how to 2567 determine this (currently at least Linux, FreeBSD, IRIX, HP-UX; 2568 probably Solaris and a few others too). Fixes bug#111. 2569 2570Wed Dec 12 18:20:34 GMT 2007 Olly Betts <olly@survex.com> 2571 2572 * docs/termprefixes.rst: Note the version where we stopped generating 2573 terms with a 'W' prefix (0.9.7). 2574 2575Wed Dec 12 18:17:28 GMT 2007 Olly Betts <olly@survex.com> 2576 2577 * docs/overview.rst: omindex hasn't generated "W"-prefix terms since 2578 0.9.7, so remove the documentation saying it does! 2579 2580Wed Dec 12 18:16:52 GMT 2007 Olly Betts <olly@survex.com> 2581 2582 * docs/overview.rst: Update to mention how upper case in extensions is 2583 handled. 2584 2585Wed Dec 12 17:49:12 GMT 2007 Olly Betts <olly@survex.com> 2586 2587 * omindex.cc: If an extension isn't found in the mime_map and contains 2588 uppercase ASCII characters, see if the lower cased extension is in 2589 the mime_map. 2590 2591Wed Dec 12 02:09:02 GMT 2007 Olly Betts <olly@survex.com> 2592 2593 * NEWS: Updated from ChangeLog in preparation for 1.0.5. 2594 2595Mon Dec 10 23:27:40 GMT 2007 Olly Betts <olly@survex.com> 2596 2597 * omindex.cc: '-f' is documented by --help as a short option for 2598 '--follow', but wasn't previously actually recognised. 2599 2600Tue Nov 20 13:08:19 GMT 2007 Olly Betts <olly@survex.com> 2601 2602 * htmlparse.cc: Add "using namespace std;" to ensure that 2603 std::strchr(), etc are imported into the global namespace. 2604 2605Tue Nov 20 01:01:13 GMT 2007 Richard Boulton <richard@lemurconsulting.com> 2606 2607 * commonhelp.cc,diritor.cc,htmlparse.cc,omega.cc,scriptindex.cc: 2608 Add #include of cstring, to fix errors from gcc-4.3 snapshot. 2609 Tidy include ordering in htmlparse.cc 2610 2611Tue Nov 06 12:17:10 GMT 2007 Olly Betts <olly@survex.com> 2612 2613 * docs/Makefile.am: No need to set SUFFIXES manually for suffixes used 2614 in implicit rules. 2615 2616Mon Nov 05 19:32:41 GMT 2007 Olly Betts <olly@survex.com> 2617 2618 * configure.ac: Probe for rst2html. 2619 2620Mon Nov 05 07:24:31 GMT 2007 Olly Betts <olly@survex.com> 2621 2622 * Makefile.am,README,configure.ac,docs/,query.cc: Replace .txt docs 2623 with Jenny's RST-ified versions. 2624 2625Tue Oct 30 04:54:58 GMT 2007 Olly Betts <olly@survex.com> 2626 2627 * NEWS,configure.ac: Update for 1.0.4. 2628 2629Sat Oct 27 05:32:06 BST 2007 Olly Betts <olly@survex.com> 2630 2631 * NEWS: Update. 2632 2633Sat Oct 27 05:30:28 BST 2007 Olly Betts <olly@survex.com> 2634 2635 * query.cc: On balance, it's more helpful to users to moan about a 2636 template which tries to set the same user prefix as both boolean 2637 and probabilistic, even if previous releases didn't. 2638 2639Thu Oct 25 20:38:15 BST 2007 Olly Betts <olly@survex.com> 2640 2641 * common/: Update to latest version. 2642 * query.cc: Remove STRINGIZE macro definition as this is now 2643 defined by stringutils.h. 2644 2645Fri Oct 19 16:17:47 BST 2007 Olly Betts <olly@survex.com> 2646 2647 * query.cc: Fix for reverted add_prefix() API. 2648 2649Sun Sep 30 22:12:46 BST 2007 Richard Boulton <richard@lemurconsulting.com> 2650 2651 * query.cc: Use the new form of add_prefix() to avoid deprecation 2652 warnings at compile time. Carefully avoid calling 2653 add_prefix(f,p,PREFIX_FILTER) for a prefix which has already been 2654 set with add_prefix(f,p,PREFIX_INLINE), because this would cause 2655 an error (and we wish to avoid changing semantics of omegascript 2656 to avoid breaking existing scripts). 2657 * NEWS: Update 2658 2659Fri Sep 28 15:48:50 BST 2007 Olly Betts <olly@survex.com> 2660 2661 * NEWS: Final (?) update for 1.0.3. 2662 2663Fri Sep 28 15:46:11 BST 2007 Olly Betts <olly@survex.com> 2664 2665 * mbox2omega: Expand --help output. 2666 * docs/scriptindex.txt: Refer to mbox2omega as an example of how to 2667 use scriptindex. 2668 2669Fri Sep 28 03:18:25 BST 2007 Olly Betts <olly@survex.com> 2670 2671 * NEWS: Update. 2672 2673Fri Sep 28 03:15:11 BST 2007 Olly Betts <olly@survex.com> 2674 2675 * configure.ac: Update for 1.0.3. Use ustar format for tarball since 2676 we have to for xapian-core anyway. 2677 2678Fri Sep 28 02:42:28 BST 2007 Olly Betts <olly@survex.com> 2679 2680 * ./: Update common SVN rev in svn:externals so the files are in 2681 sync with xapian-core. 2682 2683Wed Sep 19 16:09:36 BST 2007 Olly Betts <olly@survex.com> 2684 2685 * NEWS: Update from ChangeLog entries since 1.0.2. 2686 2687Sat Sep 08 19:24:48 BST 2007 Olly Betts <olly@survex.com> 2688 2689 * configure.ac,runfilter.cc: Impose a 5 minute CPU time limit on 2690 filter programs to prevent problems if a filter program goes into 2691 an infinite loop on a malformed input. Partly addresses bug#111. 2692 2693Fri Sep 07 21:22:43 BST 2007 Olly Betts <olly@survex.com> 2694 2695 * omindex.cc: Fix comment typos. 2696 2697Fri Sep 07 20:56:50 BST 2007 Olly Betts <olly@survex.com> 2698 2699 * docs/overview.txt,omindex.cc: Add supporting for indexing TeX DVI 2700 files. 2701 2702Thu Sep 06 20:59:57 BST 2007 Olly Betts <olly@survex.com> 2703 2704 * query.cc: Fix bug in decimal fraction in $size for files >= 1M in 2705 size. 2706 2707Thu Sep 06 20:13:44 BST 2007 Olly Betts <olly@survex.com> 2708 2709 * templates/query: Set HTML charset to utf-8 since that's what 2710 databases now are by default. Tidy up some HTML gremlins. 2711 Restyle to use CSS to draw a "score bar" instead of using 2712 images. Rework the layout of each hit. Add popup hints on 2713 mouse-over for various items. 2714 2715Thu Sep 06 18:12:07 BST 2007 Olly Betts <olly@survex.com> 2716 2717 * scriptindex.cc: Fix line number tracking in dump files. 2718 2719Thu Sep 06 18:06:28 BST 2007 Olly Betts <olly@survex.com> 2720 2721 * docs/omegascript.txt,query.cc: Add $muldiv{A,B,C} which calculates 2722 int(A*B/C). 2723 2724Thu Sep 06 03:36:36 BST 2007 Olly Betts <olly@survex.com> 2725 2726 * runfilter.cc: Fix file description. 2727 2728Thu Sep 06 00:54:58 BST 2007 Olly Betts <olly@survex.com> 2729 2730 * Makefile.am,omindex.cc,runfilter.cc,runfilter.h: Factor out the 2731 stdout_to_string() function into its own source file. 2732 2733Thu Sep 06 00:45:14 BST 2007 Olly Betts <olly@survex.com> 2734 2735 * cgiparam.h,commonhelp.h,date.h,hashterm.h,htmlparse.h,loadfile.h, 2736 md5wrap.h,metaxmlparse.h,myhtmlparse.h,namedentities.h,omega.h, 2737 sample.h,utf8convert.h,utf8truncate.h,xmlparse.h: Add missing header 2738 guards and standardise existing header guards to use the form 2739 OMEGA_INCLUDED_FOO_H. 2740 2741Thu Sep 06 00:24:54 BST 2007 Olly Betts <olly@survex.com> 2742 2743 * myhtmlparse.cc: Add '#include <config.h>'. 2744 * omega.h: Don't '#include <config.h>'. 2745 2746Mon Sep 03 19:16:37 BST 2007 Olly Betts <olly@survex.com> 2747 2748 * docs/overview.txt,omindex.cc: Add support for indexing AbiWord 2749 documents. 2750 2751Thu Jul 05 00:37:35 BST 2007 Olly Betts <olly@survex.com> 2752 2753 * NEWS: Final (?) update for 1.0.2. 2754 2755Thu Jul 05 00:33:14 BST 2007 Olly Betts <olly@survex.com> 2756 2757 * omindex.cc: Report files we aren't indexing because their extensions 2758 aren't recognised. 2759 2760Wed Jul 04 21:22:02 BST 2007 Richard Boulton <richard@lemurconsulting.com> 2761 2762 * NEWS: Update with release date for release 1.0.2 2763 2764Wed Jul 04 20:43:22 BST 2007 Richard Boulton <richard@lemurconsulting.com> 2765 2766 * configure.ac: Bump version to 1.0.2. 2767 2768Wed Jul 04 17:34:15 BST 2007 Olly Betts <olly@survex.com> 2769 2770 * NEWS: Update. 2771 2772Wed Jul 04 17:31:38 BST 2007 Olly Betts <olly@survex.com> 2773 2774 * Makefile.am,omindex.cc,query.cc: Use stringutils.h from common. 2775 * ./: Update common SVN rev in svn:externals to get the latest 2776 stringutils.h. 2777 * cgiparam.cc: Use string::resize() rather than assigning from a 2778 substring of the string. 2779 2780Mon Jul 02 16:42:01 BST 2007 Richard Boulton <richard@lemurconsulting.com> 2781 2782 * htmlparsetest.cc,md5test.cc: Add #include <stdlib.h>, to get a 2783 definition for exit(). Fixes compilation with gcc-snapshot. 2784 2785Thu Jun 28 18:05:18 BST 2007 Olly Betts <olly@survex.com> 2786 2787 * omindex.cc: If --url isn't passed, default to "/", but print a 2788 warning noting that this default has been used (at least for now). 2789 2790Thu Jun 28 18:04:53 BST 2007 Olly Betts <olly@survex.com> 2791 2792 * docs/scriptindex.txt: Fix typo. 2793 2794Wed Jun 27 15:44:30 BST 2007 Richard Boulton <richard@lemurconsulting.com> 2795 2796 * NEWS: Remove the items which aren't really interesting to users. 2797 2798Wed Jun 27 14:26:26 BST 2007 Richard Boulton <richard@lemurconsulting.com> 2799 2800 * common/: Update svn:externals property to use latest version. 2801 2802 * NEWS: Updated. 2803 2804Sat Jun 23 13:11:15 BST 2007 Olly Betts <olly@survex.com> 2805 2806 * diritor.h: Delete random extra blank line. 2807 2808Sat Jun 23 13:08:35 BST 2007 Olly Betts <olly@survex.com> 2809 2810 * omega.cc,query.cc: Use Xapian::BAD_VALUENO. 2811 2812Sat Jun 16 11:06:08 BST 2007 Richard Boulton <richard@lemurconsulting.com> 2813 2814 * Makefile.am: Pass value of XAPIAN_CONFIG to distcheck, to ensure 2815 that it works with uninstalled copies of Xapian. 2816 2817Mon Jun 11 03:34:53 BST 2007 Olly Betts <olly@survex.com> 2818 2819 * NEWS: Minor wording improvement. 2820 2821Mon Jun 11 03:33:37 BST 2007 Olly Betts <olly@survex.com> 2822 2823 * NEWS: Probably the final update for 1.0.1. 2824 2825Sun Jun 10 22:00:23 BST 2007 Olly Betts <olly@survex.com> 2826 2827 * configure.ac: Drop automake requirement to 1.8.3 to allow RPM spec 2828 file to work on SLES 9. 2829 2830Sun Jun 10 21:49:45 BST 2007 Olly Betts <olly@survex.com> 2831 2832 * configure.ac: Bump version to 1.0.1. 2833 2834Sun Jun 10 02:16:54 BST 2007 Olly Betts <olly@survex.com> 2835 2836 * NEWS: Updated. 2837 2838Sat Jun 09 15:20:25 BST 2007 Olly Betts <olly@survex.com> 2839 2840 * Makefile.am,diritor.cc,diritor.h,omindex.cc: Under Linux (at least) 2841 struct dirent can tell us the type of a directory entry for some 2842 filing systems, so make use of this to avoid calling stat() (or 2843 lstat()) unnecessarily - when indexing /usr/share/doc on my Linux 2844 box, this saves about 14000 explicit calls to stat (leaving about 2845 7000). 2846 2847Thu Jun 07 01:40:43 BST 2007 Olly Betts <olly@survex.com> 2848 2849 * NEWS: Update. 2850 2851Wed Jun 06 15:45:33 BST 2007 Olly Betts <olly@survex.com> 2852 2853 * docs/scriptindex.txt: Document that you can delete a document by 2854 giving a new document which only contains the unique term. 2855 2856Mon Jun 04 16:40:18 BST 2007 Richard Boulton <richard@lemurconsulting.com> 2857 2858 * Makefile.am: Only add manpages to dist_man_MANS if we're not in 2859 maintainer mode with documentation generation turned off. 2860 2861Thu May 31 20:02:16 BST 2007 Olly Betts <olly@survex.com> 2862 2863 * NEWS: Update. 2864 2865Thu May 31 19:16:37 BST 2007 Olly Betts <olly@survex.com> 2866 2867 * configure.ac: Relax automake requirement to 1.9.2 to allow RPM 2868 building on RHEL 4. 2869 2870Wed May 30 14:42:40 BST 2007 Olly Betts <olly@survex.com> 2871 2872 * NEWS: Update for changes since 1.0.0. Removed unused subheading 2873 in 1.0.0 changes. 2874 2875Wed May 30 10:24:57 BST 2007 Olly Betts <olly@survex.com> 2876 2877 * query.cc: Fix handling of query parsing errors (broken by changes in 2878 1.0.0). 2879 2880Tue May 29 01:19:21 BST 2007 Olly Betts <olly@survex.com> 2881 2882 * docs/overview.txt: We no longer use pstotext for PostScript, but 2883 instead use ps2pdf followed by pdftotext, so update the docs to 2884 reflect this. 2885 2886Fri May 18 03:36:28 BST 2007 Olly Betts <olly@survex.com> 2887 2888 * htmlparsetest.cc,myhtmlparse.cc: Fix bug in HTML parser - if the 2889 text between tags consisted entirely of whitespace it would just be 2890 ignored which could run words together. Add regression test, plus 2891 another test for other whitespace handling. 2892 2893Thu May 17 22:27:47 BST 2007 Olly Betts <olly@survex.com> 2894 2895 * NEWS: Final update before release. 2896 2897Thu May 17 20:48:25 BST 2007 Olly Betts <olly@survex.com> 2898 2899 * NEWS: Update. 2900 2901Thu May 17 20:46:43 BST 2007 Olly Betts <olly@survex.com> 2902 2903 * docs/termprefixes.txt: Update to include 'Z' prefix and mention 2904 that 'R' and 'W' aren't used by Xapian now. 2905 2906Thu May 17 19:11:04 BST 2007 Olly Betts <olly@survex.com> 2907 2908 * configure.ac: Bump version to 1.0.0. 2909 2910Thu May 17 18:11:19 BST 2007 Olly Betts <olly@survex.com> 2911 2912 * common/: Update to latest xapian-core revision to pull in 2 argument 2913 mkdir() wrapper for Mingw. 2914 2915Thu May 17 03:29:44 BST 2007 Olly Betts <olly@survex.com> 2916 2917 * Makefile.am,configure.ac: Add support for --disable-documentation 2918 like xapian-core now has. 2919 * configure.ac: Only enable -Werror on --enable-maintainer-mode for 2920 GCC 4 or newer, in line with change in xapian-core. 2921 2922Thu May 17 03:22:10 BST 2007 Olly Betts <olly@survex.com> 2923 2924 * NEWS: Update for 1.0.0. 2925 2926Wed May 16 03:09:44 BST 2007 Olly Betts <olly@survex.com> 2927 2928 * TODO: Update. 2929 2930Tue May 15 18:50:47 BST 2007 Olly Betts <olly@survex.com> 2931 2932 * configure.ac: Add AC_TYPE_PID_T. 2933 2934Tue May 15 04:22:40 BST 2007 Olly Betts <olly@survex.com> 2935 2936 * omindex.cc: Remove FIXME comment which has already been addressed. 2937 2938Mon May 14 04:38:49 BST 2007 Olly Betts <olly@survex.com> 2939 2940 * docs/omegascript.txt: Update docs for $prettyterm{TERM}. 2941 2942Mon May 14 04:31:01 BST 2007 Olly Betts <olly@survex.com> 2943 2944 * omega.cc,omega.h,query.cc,query.h: Rejig how $topterms and other 2945 cases handle terms to fit with the new term generation scheme. 2946 Add 'you' and 'your' as stopwords. 2947 2948Thu May 10 04:48:43 BST 2007 Olly Betts <olly@survex.com> 2949 2950 * ./: Update svn:externals to pull in r8538 of xapian-core's common 2951 subdirectory. 2952 * Makefile.am: Add common/safe.cc to scriptindex_SOURCES. 2953 2954Thu May 10 01:09:14 BST 2007 Olly Betts <olly@survex.com> 2955 2956 * templates/,Makefile.am: The 'query' template no longer uses 2957 $topterms by default - to get them, use the new 'topterms' template. 2958 Also the template fragments which aren't intended for direct use 2959 have been move to templates/inc/. 2960 * docs/overview.txt: Document what each of the OmegaScript templates 2961 does. 2962 * docs/quickstart.txt: Assorted minor improvements. 2963 * xapian-omega.spec.in: Update to install templates/inc too. 2964 2965Wed May 09 23:43:57 BST 2007 Olly Betts <olly@survex.com> 2966 2967 * docs/omegascript.txt,query.cc: Instead of appending a dot to 2968 indicate a stemmed term, wrap the term in double quotes. 2969 2970Sun May 06 21:41:21 BST 2007 Olly Betts <olly@survex.com> 2971 2972 * omindex.cc,scriptindex.cc: Removed commented out code for generating 2973 "W" prefix terms for date searching. We've never made use of them 2974 in Omega, and we'll be moving to using DateMatchDecider by default 2975 eventually anyway. 2976 2977Sun May 06 16:00:47 BST 2007 Olly Betts <olly@survex.com> 2978 2979 * configure.ac: Set version to mythical 0.9.99. 2980 2981Sun May 06 15:52:08 BST 2007 Olly Betts <olly@survex.com> 2982 2983 * Makefile.am,configure.ac,omega.spec.in,xapian-omega.spec.in: 2984 Update RPM spec file to reflect tarball name change from omega 2985 to xapian.omega (patch from Fabrice Colin). Also rename omega.spec 2986 to xapian-omega.spec (rpmbuild looks for any .spec file, but it's 2987 more consistent to keep the names in step). 2988 2989Fri May 04 19:52:44 BST 2007 Olly Betts <olly@survex.com> 2990 2991 * omindex.cc,scriptindex.cc: Use new TermGenerator convenience methods 2992 which take std::string instead of Utf8Iterator. 2993 2994Fri May 04 13:32:11 BST 2007 Olly Betts <olly@survex.com> 2995 2996 * Makefile.am,configure.ac,makemanpage.in: Use makemanpage to generate 2997 manpages. 2998 2999Fri May 04 13:30:36 BST 2007 Olly Betts <olly@survex.com> 3000 3001 * commonhelp.cc: Add missing full stop in description of --stemmer. 3002 3003Fri May 04 04:10:23 BST 2007 Olly Betts <olly@survex.com> 3004 3005 * query.cc: Explicitly include stdlib.h since we use atoi(). 3006 3007Thu May 03 15:16:31 BST 2007 Olly Betts <olly@survex.com> 3008 3009 * Makefile.am,indextext.cc,indextext.h,omindex.cc,scriptindex.cc: 3010 Update to use new TermGenerator class. 3011 3012Thu May 03 04:03:35 BST 2007 Olly Betts <olly@survex.com> 3013 3014 * ./: Update svn:externals to pull rev8430 of xapian-core's common 3015 subdirectory. 3016 * scriptindex.cc: Remove sleep() wrapper. 3017 3018Wed May 02 03:26:38 BST 2007 Olly Betts <olly@survex.com> 3019 3020 * docs/omegascript.txt,query.cc: Removed $freqs as it has been 3021 deprecated for ages. 3022 3023Wed May 02 03:19:18 BST 2007 Olly Betts <olly@survex.com> 3024 3025 * docs/scriptindex.txt: Explicitly note that index=nopos is deprecated 3026 (scriptindex already emits a warning). 3027 3028Wed May 02 03:17:03 BST 2007 Olly Betts <olly@survex.com> 3029 3030 * docs/cgiparams.txt: FMT isn't limited to just `a-z' - the 3031 actual restriction is that it may not contain `..'. 3032 3033Wed May 02 03:02:53 BST 2007 Olly Betts <olly@survex.com> 3034 3035 * scriptindex.cc: Remove -q and -u options - they no longer do 3036 anything and are only accepted for compatibility with really old 3037 versions (0.6.1 and earlier and 0.7.5 and earlier respectively). 3038 3039Wed Apr 25 21:47:48 BST 2007 Olly Betts <olly@survex.com> 3040 3041 * Makefile.am: omega doesn't need indextext.cc. 3042 3043Wed Apr 25 21:46:25 BST 2007 Olly Betts <olly@survex.com> 3044 3045 * query.cc: Remove unused `#include "indextext.h"'. 3046 3047Wed Apr 25 02:37:15 BST 2007 Olly Betts <olly@survex.com> 3048 3049 * Makefile.am,configure.ac: Add support like xapian-core has for 3050 `configure --enable-quiet', `make QUIET=' and `make QUIET=y'. 3051 3052Mon Apr 23 15:42:24 BST 2007 Olly Betts <olly@survex.com> 3053 3054 * date.cc,datematchdecider.cc,utils.cc: Fix compilation with GCC 4.3 3055 snapshot. 3056 3057Mon Apr 23 15:38:00 BST 2007 Olly Betts <olly@survex.com> 3058 3059 * portability/mkdtemp.cc: config.h should always be included first and 3060 with angle brackets. Use safeerrno.h not errno.h. No special 3061 headers are required here for __CYGWIN__, and safesysstat.h provides 3062 a two argument wrapper for mkdir, so we don't need any 3063 __WIN32__-specific magic either. 3064 3065Mon Apr 23 12:14:01 BST 2007 Richard Boulton <richard@lemurconsulting.com> 3066 3067 * portability/mkdtemp.cc: Patch from Charlie Hull to fix windows 3068 compilation. 3069 * scriptindex.cc: #include <time.h> in scriptindex.cc for 3070 localtime(). 3071 3072Sat Apr 21 23:31:02 BST 2007 Olly Betts <olly@survex.com> 3073 3074 * strcasecmp.h: New header containing magic to provide strcasecmp() 3075 and strncasecmp(). 3076 * query.cc,utf8convert.cc: Use strcasecmp.h. 3077 * Makefile.am,cdb_init.cc,cdb_int.h,configfile.cc,getopt.cc, 3078 loadfile.cc,md5wrap.cc,omega.cc,omindex-config.cc,omindex.cc, 3079 query.cc,scriptindex.cc,utf8convert.cc: Add xapian-core's common/ 3080 subdirectory as an svn:external so we can (a) share copies of 3081 gnu_getopt.h and getopt.cc and (b) make use of the "safeunistd.h" 3082 and friends. 3083 3084Sat Apr 21 23:06:49 BST 2007 Olly Betts <olly@survex.com> 3085 3086 * metaxmlparse.cc,metaxmlparse.h: Fix summary comments at the top of 3087 these two files. 3088 3089Sat Apr 21 20:42:03 BST 2007 Olly Betts <olly@survex.com> 3090 3091 * omindex.cc: xapian.h no longer pulls in time.h, which exposes that 3092 we weren't explicitly including it here! 3093 3094Sat Apr 21 20:27:43 BST 2007 Olly Betts <olly@survex.com> 3095 3096 * configure.ac: We require automake 1.9.5 for xapian-core, so require 3097 it here too for consistency. Turn on automake -Wportability option. 3098 3099Sat Apr 21 20:24:17 BST 2007 Olly Betts <olly@survex.com> 3100 3101 * configure.ac: Probe for ssize_t and mode_t and define replacements 3102 if we don't find them. 3103 3104Fri Apr 20 14:38:57 BST 2007 Olly Betts <olly@survex.com> 3105 3106 * datematchdecider.h,omega.h,datematchdecider.cc: Update return 3107 types of MatchDecider and ExpandDecider subclasses. 3108 3109Wed Apr 18 23:44:36 BST 2007 Olly Betts <olly@survex.com> 3110 3111 * utf8convert.cc: Fix to compile when USE_ICONV isn't defined (to_utf8 3112 is now in the Xapian::Unicode namespace). 3113 3114Wed Apr 18 23:15:26 BST 2007 Olly Betts <olly@survex.com> 3115 3116 * docs/cgiparams.txt,query.cc: Remove "bias_weight" and 3117 "bias_halflife" CGI parameters since they rely on 3118 Enquire::set_bias() which has been removed. 3119 3120Tue Apr 17 21:45:40 BST 2007 Richard Boulton <richard@lemurconsulting.com> 3121 3122 * Makefile.am: Link htmlparsetest with Xapian library to get access 3123 to noascii_to_utf8. 3124 3125Tue Apr 17 02:22:42 BST 2007 Olly Betts <olly@survex.com> 3126 3127 * htmlparse.cc: nonascii_to_utf8 is now in the public API. 3128 3129Tue Apr 17 00:55:17 BST 2007 Olly Betts <olly@survex.com> 3130 3131 * Makefile.am,htmlparse.cc,indextext.cc,indextext.h,query.cc,sample.cc, 3132 scriptindex.cc,tclUniData.cc,tclUniData.h,utf8convert.cc,utf8itor.cc, 3133 utf8itor.h,utf8test.cc: Use the new Unicode API routines in the core 3134 Xapian library instead of local copies. 3135 3136Thu Apr 12 17:04:07 BST 2007 Olly Betts <olly@survex.com> 3137 3138 * Makefile.am: omega and scriptindex both need tclUniData.cc. 3139 3140Sat Mar 31 19:58:29 BST 2007 Olly Betts <olly@survex.com> 3141 3142 * query.cc: $filesize{0} is now "0 bytes", $filesize{1} is now "1 3143 byte", $filesize{SIZE} where SIZE is negative is now "". Fix 3144 "comparison of signed and unsigned" warning. Use "%c" to generate 3145 the fractional part. 3146 * docs/omegascript.txt: Document that $filesize{SIZE} is "" when SIZE 3147 is negative. 3148 3149Sat Mar 31 18:25:55 BST 2007 Olly Betts <olly@survex.com> 3150 3151 * query.cc: Ensure that the result of snprintf is zero terminated 3152 since MSVC's snprintf is broken (by design it seems). 3153 * query.cc,docs/omegascript.txt: $filesize enhanced to return a 3154 decimal point for K, M, and G (e.g. "2.1K" and "4.0M" rather than 3155 "2K" and "4M"). 3156 3157Fri Mar 30 19:57:00 BST 2007 Olly Betts <olly@survex.com> 3158 3159 * portability/mkdtemp.cc: Fixes for mingw. 3160 3161Fri Mar 30 02:22:59 BST 2007 Olly Betts <olly@survex.com> 3162 3163 * Makefile.am,scriptindex.cc,utf8truncate.cc,utf8truncate.h: The 3164 "truncate" action now knows not to chop off a multibyte utf-8 3165 character. 3166 3167Fri Mar 30 02:19:05 BST 2007 Olly Betts <olly@survex.com> 3168 3169 * Makefile.am,omindex.cc,sample.cc,sample.h: New sample generating 3170 function which normalises all runs of whitespace to a single space, 3171 and fixes invalid utf-8 in the sample. This means we can now index 3172 an iso-8859-1 text file and mostly get the same results as if it 3173 were utf-8! 3174 3175Thu Mar 29 23:12:20 BST 2007 Olly Betts <olly@survex.com> 3176 3177 * scriptindex.cc: Fix optimisation of "load truncate=N" to actually 3178 work! 3179 3180Thu Mar 29 18:54:11 BST 2007 Olly Betts <olly@survex.com> 3181 3182 * configure.ac: Probe for mkdtemp. 3183 * Makefile.am: Add portability/mkdtemp.cc to omindex_SOURCES if 3184 configure didn't detect it. 3185 * omindex.cc: Prototype mkdtemp if configure didn't detect it. 3186 3187Thu Mar 29 18:47:50 BST 2007 Olly Betts <olly@survex.com> 3188 3189 * portability/mkdtemp.cc: Fix to compile as C++. Replace isdigit() 3190 with a simple range test to avoid locale related quirks. 3191 3192Thu Mar 29 18:28:25 BST 2007 Olly Betts <olly@survex.com> 3193 3194 * portability/mkdtemp.cc: Add portable implementation of mkdtemp for 3195 use on platforms which don't supply it. 3196 3197Thu Mar 29 17:22:18 BST 2007 Olly Betts <olly@survex.com> 3198 3199 * omindex.cc: Index PostScript by converting to PDF with ps2pdf and 3200 then indexing that. This allows us to index PostScript files 3201 containing Unicode characters outside of iso-8859-1, and also 3202 means we now get metadata from PostScript files. 3203 3204Thu Mar 29 03:14:55 BST 2007 Olly Betts <olly@survex.com> 3205 3206 * omega.spec.in: Update to handle documentation being installed in 3207 $prefix/share/doc/xapian-omega. 3208 3209Tue Mar 27 21:42:19 BST 2007 Olly Betts <olly@survex.com> 3210 3211 * configure.ac: datarootdir is new in 2.60 too, so use datadir when 3212 setting docdir for 2.59. 3213 3214Mon Mar 26 15:47:53 BST 2007 Olly Betts <olly@survex.com> 3215 3216 * configure.ac: Add code to ensure that docdir is set for autoconf 3217 2.59 (starting from 2.60, it is defined as standard). 3218 * Makefile.am: Use docdir for installing docs. This means that the 3219 documentation now goes in $prefix/share/doc/xapian-omega rather 3220 than $prefix/share/doc/omega, which is better really. 3221 3222Sat Mar 24 17:21:32 GMT 2007 Olly Betts <olly@survex.com> 3223 3224 * query.cc: Prefer static char[] to static char * (gives better 3225 generated code). 3226 3227Sat Mar 24 17:19:18 GMT 2007 Olly Betts <olly@survex.com> 3228 3229 * omega.cc: Prefer static char[] to static char * (gives better 3230 generated code). 3231 3232Sat Mar 24 17:16:49 GMT 2007 Olly Betts <olly@survex.com> 3233 3234 * configfile.cc: Prefer static char[] to static char * (gives better 3235 generated code). 3236 3237Thu Mar 22 01:11:52 GMT 2007 Olly Betts <olly@survex.com> 3238 3239 * configure.ac: Eliminate libtool probe code for f77, gcj, and rc 3240 which speeds up configure and knocks 29% off its size. 3241 3242Tue Mar 06 01:56:00 GMT 2007 Olly Betts <olly@survex.com> 3243 3244 * configure.ac: Bump version number to 0.9.10 so that snapshots don't 3245 look older than releases. 3246 3247Sun Mar 04 14:42:18 GMT 2007 Olly Betts <olly@survex.com> 3248 3249 * TODO: Remove entries which have already been done! 3250 3251Sat Mar 03 02:24:42 GMT 2007 Olly Betts <olly@survex.com> 3252 3253 * utf8test.cc: Add single utf-8 sequence decoding tests. 3254 3255Fri Mar 02 00:18:09 GMT 2007 Olly Betts <olly@survex.com> 3256 3257 * configure.ac: Perform a link test for posix_fadvise to fix 3258 misdetection on HP-UX. 3259 3260Thu Mar 01 21:48:57 GMT 2007 Olly Betts <olly@survex.com> 3261 3262 * utf8itor.h: Add cast to suppress warning from aCC. 3263 3264Thu Mar 01 21:00:56 GMT 2007 Olly Betts <olly@survex.com> 3265 3266 * configure.ac: Check we can link with libiconv, not just compile. 3267 Some of the HP-UX hosts in the HP testdrive seem to have headers 3268 but no matching library. 3269 3270Thu Mar 01 18:02:37 GMT 2007 Olly Betts <olly@survex.com> 3271 3272 * myhtmlparse.cc: Remove unused function. Move "#include <string.h>" 3273 before any code. 3274 3275Thu Feb 22 15:45:25 GMT 2007 Olly Betts <olly@survex.com> 3276 3277 * configure.ac: xapian-config --cxxflags now includes -ptused for 3278 SGI's C++ compiler, so we don't need to probe for it here. 3279 3280Wed Feb 21 15:17:07 GMT 2007 Olly Betts <olly@survex.com> 3281 3282 * docs/termprefixes.txt: Expand section on boolean prefixes, showing 3283 how to generate them using scriptindex, and how to allow them to be 3284 selected in an HTML form. 3285 3286Mon Feb 19 12:51:24 GMT 2007 Olly Betts <olly@survex.com> 3287 3288 * configure.ac: Previous fix doesn't work. Just drop -O2 instead - 3289 users of SGI's CC can specify "./configure CXXFLAGS=-O2" is they 3290 want optimisation. 3291 3292Sun Feb 18 21:44:09 GMT 2007 Olly Betts <olly@survex.com> 3293 3294 * configure.ac: For SGI's CC, -g overrides -g3 if it comes afterwards, 3295 so we need to modify CXXFLAGS rather than just setting AM_CXXFLAGS. 3296 3297Sat Feb 17 19:25:04 GMT 2007 Olly Betts <olly@survex.com> 3298 3299 * docs/overview.txt,omindex.cc: Add support for indexing MS Works 3300 documents using wps2text (part of libwps). 3301 3302Sat Feb 17 19:06:03 GMT 2007 Olly Betts <olly@survex.com> 3303 3304 * omindex.cc: Don't index empty files. 3305 3306Fri Feb 16 21:14:35 GMT 2007 Olly Betts <olly@survex.com> 3307 3308 * NEWS: Add note that Omega < 0.8.0 NEWS entries are in the 3309 xapian-core NEWS file. 3310 3311Fri Feb 16 20:34:10 GMT 2007 Olly Betts <olly@survex.com> 3312 3313 * indextext.cc: Now I've fixed the bug in UTF-8 decoding, the check 3314 for zero length terms is no longer required. 3315 3316Fri Feb 16 19:34:48 GMT 2007 Olly Betts <olly@survex.com> 3317 3318 * tclUniData.h,utf8itor.h: The tcl unicode routines only have tables 3319 for characters in the BMP. For other characters, assume they're 3320 word characters, but can't be forced to lowercase. 3321 3322Fri Feb 16 19:19:11 GMT 2007 Olly Betts <olly@survex.com> 3323 3324 * utf8itor.cc: Fix bug in decoding of 4 byte utf-8 sequences 3325 - the returned value was 0x400000 too large! Fixes bug#106. 3326 3327Thu Feb 15 19:42:36 GMT 2007 Olly Betts <olly@survex.com> 3328 3329 * indextext.cc,query.cc: Keep embedded apostrophe's in terms rather 3330 than relying on generating a phrase search for them. 3331 3332Thu Feb 15 05:38:12 GMT 2007 Olly Betts <olly@survex.com> 3333 3334 * Makefile.am,datematchdecider.cc,datematchdecider.h, 3335 docs/cgiparams.txt,query.cc: Add an alternative implementation 3336 of date range filtering which uses a MatchDecider. This allows 3337 everything that the existing implementation does, plus you can 3338 support sorting on a choice of dates (e.g. first published or 3339 last updated), and filtering works to a resolution of a minute 3340 rather than a day. Since omindex now adds the last modified 3341 date as value 0, this will work with omindex. 3342 3343Thu Feb 15 04:38:32 GMT 2007 Olly Betts <olly@survex.com> 3344 3345 * configure.ac: SGI's CC needs -g3 instead of -g if we want to use 3346 any -O option. 3347 3348Sat Feb 10 20:53:14 GMT 2007 Olly Betts <olly@survex.com> 3349 3350 * md5.cc: Fix reversed preprocessor conditional so that we generate 3351 correct MD5 checksums on big endian platforms. 3352 3353Sat Feb 10 20:19:23 GMT 2007 Olly Betts <olly@survex.com> 3354 3355 * md5.cc: No need to byte swap when we've just zero filled! 3356 3357Sat Feb 10 18:54:33 GMT 2007 Olly Betts <olly@survex.com> 3358 3359 * indextext.cc,query.cc: Prefer Xapian::Stem::operator() to 3360 Xapian::Stem::stem_word(). 3361 3362Fri Feb 09 05:53:29 GMT 2007 Olly Betts <olly@survex.com> 3363 3364 * docs/omegascript.txt: Rewrite introductory paragraph. Note that 3365 whitespace is significant, and add explicit warning to $setmap. 3366 3367Mon Jan 1 01:56:56 GMT 2007 Richard Boulton <richard@lemurconsulting.com> 3368 3369 * indextext.cc: Fix parsing of text containing certain unicode 3370 characters. Such text could have resulted in zero length terms 3371 being added to documents. (The minimal example I found causing 3372 this problem was a document containing only the unicode character 3373 0x28a0f, which is a CJK Unified Ideograph). 3374 3375 Addresses bug #106, though may not be a complete fix - see the 3376 bug for details. 3377 3378Sun Dec 31 17:22:56 GMT 2006 Richard Boulton <richard@lemurconsulting.com> 3379 3380 * scriptindex.cc: Update short option list for scriptindex to match 3381 documented usage (-h, -V and -s were not working). 3382 3383Thu Dec 21 14:57:28 GMT 2006 Olly Betts <olly@survex.com> 3384 3385 * query.cc: Remove support for xB, xDATE1, xDATE2, xDAYSMINUS, 3386 and xDEFAULTOP which were deprecated in favour of xFILTER in 3387 0.7.5 (over 3 years ago). 3388 3389Thu Dec 21 14:52:38 GMT 2006 Olly Betts <olly@survex.com> 3390 3391 * docs/cgiparams.txt: Remove documentation of the removed deprecated 3392 aliases. 3393 3394Thu Dec 21 14:39:04 GMT 2006 Olly Betts <olly@survex.com> 3395 3396 * omega.cc,query.cc: Remove deprecated aliases for CGI parameters 3397 (deprecated in 0.6.3 or 0.6.5, more than 3.5 years ago): 3398 RAW_SEARCH (now RAWSEARCH), DATE1 (now START), DATE2 (now END), 3399 DAYSMINUS (now SPAN but with slightly different semantics), 3400 and MIN_HITS (now MINHITS). 3401 3402Thu Dec 21 01:04:00 GMT 2006 Olly Betts <olly@survex.com> 3403 3404 * utf8convert.cc: Fix headers included for iconv and not-iconv. 3405 3406Wed Dec 20 23:53:41 GMT 2006 Olly Betts <olly@survex.com> 3407 3408 * configure.ac,utf8convert.cc: If iconv isn't found by configure, fall 3409 back on simple conversion routines which handle iso-8859-1. 3410 Configuring --without-iconv forces these routines to be used. 3411 Configuring --with-iconv forces configure to fail if it can't find 3412 iconv. 3413 3414Tue Dec 19 20:35:04 GMT 2006 Olly Betts <olly@survex.com> 3415 3416 * utf8itor.h: Need <string.h> for strlen. 3417 3418Tue Dec 19 19:53:52 GMT 2006 Olly Betts <olly@survex.com> 3419 3420 * Makefile.am,configure.ac: Add "-liconv" if it's needed. If we're on 3421 OS X, also check for libiconv installed with fink. 3422 3423Fri Dec 15 05:43:40 GMT 2006 Olly Betts <olly@survex.com> 3424 3425 * values.h: Add include guard. 3426 3427Sun Dec 10 04:33:26 GMT 2006 Olly Betts <olly@survex.com> 3428 3429 * query.cc: Fix $substr{} with negative start to actually work. Fix 3430 $substr{} to never cause a C++ exception. 3431 * docs/omegascript.txt,query.cc: Enhance $substr{} to accept a 3432 negative length (meaning to count back from the end of the string). 3433 3434Sun Dec 10 03:05:09 GMT 2006 Olly Betts <olly@survex.com> 3435 3436 * commonhelp.cc: "--help" now says that the default stemming language 3437 is "english" 3438 3439Thu Nov 16 23:06:25 GMT 2006 Olly Betts <olly@survex.com> 3440 3441 * docs/omegascript.txt,query.cc,utils.cc,utils.h: Add $weight command 3442 to OmegaScript which returns the raw document weight - mostly useful 3443 for debugging purposes. 3444 3445Thu Nov 16 04:02:10 GMT 2006 Olly Betts <olly@survex.com> 3446 3447 * omega.spec.in: Remove "." from the end of the Summary. 3448 3449Thu Nov 16 03:03:25 GMT 2006 Olly Betts <olly@survex.com> 3450 3451 * configure.ac: As of xapian-core 0.8.0, XO_LIB_XAPIAN doesn't need to 3452 be called with arguments if you want a hard requirement on xapian, 3453 so remove the arguments. 3454 3455Thu Nov 16 02:07:31 GMT 2006 Olly Betts <olly@survex.com> 3456 3457 * configure.ac: Change the project name to "xapian-omega" since that's 3458 what the RPMs and Debian packages call it (there's a Rogue-like game 3459 called Omega). 3460 3461Thu Nov 16 02:01:55 GMT 2006 Olly Betts <olly@survex.com> 3462 3463 * omega.cc: Fix backwards setting of sort_after. Fix generation of 3464 sort setup flags for filters. 3465 3466Thu Nov 16 01:21:32 GMT 2006 Olly Betts <olly@survex.com> 3467 3468 * docs/cgiparams.txt,omega.cc,omega.h,query.cc: Implement new CGI 3469 parameters for finer control of sorting and ranking - SORTAFTER 3470 and DOCIDORDER. 3471 * omega.cc: Set up the filters variable so we know to revert to 3472 page 1 if the sorting options are changed. 3473 3474Tue Nov 14 15:27:09 GMT 2006 Olly Betts <olly@survex.com> 3475 3476 * md5test.cc: Need <stdio.h> for sprintf. 3477 3478Tue Nov 14 03:19:13 GMT 2006 Olly Betts <olly@survex.com> 3479 3480 * configure.ac: Note a couple of platforms which take the different 3481 iconv input types. 3482 3483Tue Nov 14 03:16:37 GMT 2006 Olly Betts <olly@survex.com> 3484 3485 * configure.ac,utf8convert.cc: The input pointer to iconv can be 3486 either "char **" or "const char **" so probe at configure time. 3487 3488Mon Nov 13 20:22:50 GMT 2006 Olly Betts <olly@survex.com> 3489 3490 * utf8convert.cc: Need <algorithm> for swap(). 3491 3492Mon Nov 13 02:27:51 GMT 2006 Olly Betts <olly@survex.com> 3493 3494 * Makefile.am,md5test.cc: Add tests for md5 code. 3495 3496Mon Nov 13 02:06:51 GMT 2006 Olly Betts <olly@survex.com> 3497 3498 * Merge in utf8 branch: 3499 3500 Fri Sep 15 06:03:50 BST 2006 Olly Betts <olly@survex.com> 3501 3502 * utf8convert.cc: Compilation fix for Sun C++. 3503 3504 Thu Sep 14 23:55:20 BST 2006 Olly Betts <olly@survex.com> 3505 3506 * Makefile.am,htmlparse.cc,htmlparse.h,indextext.cc, 3507 indextext.h,makesymboltabh.pl,myhtmlparse.cc,myhtmlparse.h, 3508 namedentities.h,omindex.cc,query.cc,scriptindex.cc, 3509 symboltab.h,tclUniData.cc,tclUniData.h,utf8convert.cc, 3510 utf8convert.h,utf8itor.cc,utf8itor.h, utf8test.cc: Convert 3511 to work in UTF-8. 3512 3513Thu Nov 09 00:20:19 GMT 2006 Olly Betts <olly@survex.com> 3514 3515 * NEWS,configure.ac: Update for 0.9.9. 3516 3517Wed Nov 08 22:45:10 GMT 2006 Olly Betts <olly@survex.com> 3518 3519 * omega.spec.in: Run "autoreconf --force" to avoid rpath on x86_64 3520 FC6. 3521 3522Sun Nov 05 17:08:48 GMT 2006 Olly Betts <olly@survex.com> 3523 3524 * scriptindex.cc: The "date" action was modifying the value it 3525 operated on, which it isn't meant to do - fixed. 3526 3527Sun Nov 05 02:25:48 GMT 2006 Olly Betts <olly@survex.com> 3528 3529 * query.cc: Report an error if $setmap is called with an even number 3530 of parameters. 3531 3532Thu Nov 02 16:08:27 GMT 2006 Olly Betts <olly@survex.com> 3533 3534 * NEWS,configure.ac: Update for 0.9.8. 3535 3536Thu Nov 02 15:43:31 GMT 2006 Olly Betts <olly@survex.com> 3537 3538 * configure.ac: Update comment about "-ptused". 3539 3540Wed Nov 01 16:23:13 GMT 2006 Olly Betts <olly@survex.com> 3541 3542 * cdb_init.cc: Fix warning in mingw build. 3543 3544Wed Nov 01 13:43:54 GMT 2006 Olly Betts <olly@survex.com> 3545 3546 * cdb_init.cc,query.cc: Fix warnings. 3547 3548Wed Nov 01 04:00:20 GMT 2006 Olly Betts <olly@survex.com> 3549 3550 * md5.cc,md5.h: Fix warnings about changing alignment requirements 3551 when casting pointers. 3552 3553Tue Oct 31 02:47:23 GMT 2006 Olly Betts <olly@survex.com> 3554 3555 * cdb_init.cc,configure.ac,getopt.cc,omega.cc,query.cc,scriptindex.cc: 3556 Enable more warnings for GCC (and fix them in the code). Enable 3557 appropriate warnings for Intel's C++ compiler. 3558 3559Tue Oct 31 00:02:19 GMT 2006 Olly Betts <olly@survex.com> 3560 3561 * htmlparsetest.cc,omindex.cc: Fix GCC warnings. 3562 3563Mon Oct 30 23:57:09 GMT 2006 Olly Betts <olly@survex.com> 3564 3565 * query.cc: $substr where the start is negative and longer than the 3566 string (e.g. $substr{abcd,-5,1}) should now work as intended. 3567 3568Mon Oct 30 21:02:18 GMT 2006 Olly Betts <olly@survex.com> 3569 3570 * scriptindex.cc: Fix GCC warnings uncovered by actually substituting 3571 AM_CXXFLAGS. 3572 3573Mon Oct 30 21:01:26 GMT 2006 Olly Betts <olly@survex.com> 3574 3575 * configure.ac: Actually substitute AM_CXXFLAGS in the Makefile. 3576 * configure.ac: Fix AM_CXXFLAGS for IRIX. 3577 3578Sat Oct 28 12:31:31 BST 2006 Olly Betts <olly@survex.com> 3579 3580 * myhtmlparse.cc: Add missing "#include <ctype.h>". 3581 3582Sat Oct 28 02:23:09 BST 2006 Olly Betts <olly@survex.com> 3583 3584 * htmlparse.cc,indextext.cc,indextext.h,myhtmlparse.cc,omega.cc, 3585 omega.h,omindex.cc,query.cc,scriptindex.cc: Ensure that we always 3586 pass an unsigned char value to isupper(), toupper(), etc as they 3587 are undefined on other values (glibc makes them work for signed 3588 char values too, but this is an extension). 3589 3590Fri Oct 27 00:36:34 BST 2006 Olly Betts <olly@survex.com> 3591 3592 * configure.ac,md5.h,values.h: HAVE_STDINT_H is already defined 3593 by autoconf based on trying the C compiler with AC_CHECK_HEADERS 3594 so define HAVE_WORKING_STDINT_H instead. 3595 3596Wed Oct 25 01:36:43 BST 2006 Olly Betts <olly@survex.com> 3597 3598 * configure.ac: Need a more sophisticated test for the stdint.h 3599 problem on IRIX. 3600 3601Tue Oct 24 02:12:13 BST 2006 Olly Betts <olly@survex.com> 3602 3603 * metaxmlparse.cc,omega.h: Fix warnings from SGI's C++ compiler. 3604 3605Tue Oct 24 02:11:11 BST 2006 Olly Betts <olly@survex.com> 3606 3607 * htmlparse.cc,query.cc,scriptindex.cc: Remove unused static 3608 functions. 3609 3610Tue Oct 24 01:51:05 BST 2006 Olly Betts <olly@survex.com> 3611 3612 * configure.ac: Pass magic options to SGI's C++ compiler to allow 3613 linking of templates to work. 3614 3615Tue Oct 24 00:46:06 BST 2006 Olly Betts <olly@survex.com> 3616 3617 * configure.ac: IRIX doesn't allow stdint.h to be included from C++ 3618 code, so we need a smarter configure test than AC_CHECK_HEADERS. 3619 3620Sun Oct 22 03:30:11 BST 2006 Olly Betts <olly@survex.com> 3621 3622 * configure.ac: Tell AC_CHECK_HEADERS to suppress its backward 3623 compatibility mode, so it only checks headers with the compiler. 3624 This speeds up configure a little, and is what we do elsewhere. 3625 3626Tue Oct 10 17:21:13 BST 2006 Olly Betts <olly@survex.com> 3627 3628 * NEWS: Update for actual 0.9.7 release. 3629 3630Mon Oct 09 18:26:14 BST 2006 Olly Betts <olly@survex.com> 3631 3632 * docs/termprefixes.txt: "$setmap{title,S}" should be 3633 "$setmap{prefix,title,S}". 3634 3635Sun Oct 08 21:43:16 BST 2006 Olly Betts <olly@survex.com> 3636 3637 * NEWS,configure.ac: Update for 0.9.7. 3638 3639Fri Sep 15 16:56:49 BST 2006 Olly Betts <olly@survex.com> 3640 3641 * cgiparam.cc: Compilation fix for Sun C++. 3642 3643Fri Sep 15 06:00:50 BST 2006 Olly Betts <olly@survex.com> 3644 3645 * configure.ac,query.cc: Compilation fix for Sun C++. 3646 3647Thu Sep 14 15:41:33 BST 2006 Olly Betts <olly@survex.com> 3648 3649 * htmlparse.cc: Include <stdlib.h> so atoi() is prototyped. 3650 3651Wed Sep 13 16:37:32 BST 2006 Olly Betts <olly@survex.com> 3652 3653 * configure.ac,md5.h,values.h: Use stdint.h if we have it. 3654 3655Tue Sep 12 11:57:16 BST 2006 Olly Betts <olly@survex.com> 3656 3657 * myhtmlparse.cc: Need "#include <string.h>" for strchr. 3658 3659Mon Sep 11 20:24:27 BST 2006 Olly Betts <olly@survex.com> 3660 3661 * values.h: Only want our own ntohl for MS Windows. 3662 3663Mon Sep 11 16:36:54 BST 2006 Olly Betts <olly@survex.com> 3664 3665 * omega.cc,query.cc: Now xapian-config will switch Sun's C++ compiler 3666 into ANSI C++ compliant mode, so clean out all our special cased 3667 bits of code. 3668 3669Mon Sep 11 14:23:44 BST 2006 Olly Betts <olly@survex.com> 3670 3671 * md5.h,values.h: Apply previous fix for DJGPP too. 3672 3673Sun Sep 10 19:04:17 BST 2006 Olly Betts <olly@survex.com> 3674 3675 * md5.h,values.h: Using htonl from winsock.h requires use to link 3676 with the winsock DLL, which is overkill so just add a simple 3677 implementation for htonl - we know MS Windows is little-endian. 3678 3679Sat Sep 09 21:48:22 BST 2006 Olly Betts <olly@survex.com> 3680 3681 * md5.h,values.h: Sigh, winsock.h uses u_long instead of uint32_t 3682 in the htonl prototype. 3683 3684Sat Sep 09 19:19:15 BST 2006 Olly Betts <olly@survex.com> 3685 3686 * omindex.cc: Fix typo in previous commit. 3687 3688Sat Sep 09 17:11:40 BST 2006 Olly Betts <olly@survex.com> 3689 3690 * configure.ac,omindex.cc: Mingw doesn't have sys/wait.h or 3691 WEXITSTATUS. 3692 3693Sat Sep 09 16:44:29 BST 2006 Olly Betts <olly@survex.com> 3694 3695 * md5.h,values.h: On MS Windows, we need to #include <winsock.h>. 3696 3697Fri Sep 08 08:01:15 BST 2006 Olly Betts <olly@survex.com> 3698 3699 * query.cc: Sun C++'s std::count() isn't very "std" -- it has the 3700 wrong prototype! 3701 3702Fri Sep 08 03:39:14 BST 2006 Olly Betts <olly@survex.com> 3703 3704 * md5.h,values.h: openbsd needs arpa/inet.h to be included before 3705 netinet/in.h. 3706 3707Wed Sep 06 21:31:33 BST 2006 Olly Betts <olly@survex.com> 3708 3709 * md5wrap.cc: #include <unistd.h> 3710 3711Wed Sep 06 18:03:23 BST 2006 Olly Betts <olly@survex.com> 3712 3713 * Makefile.am: Ship values.h. 3714 3715Wed Sep 06 03:52:27 BST 2006 Olly Betts <olly@survex.com> 3716 3717 * configfile.cc: Changed my mind - don't allow comments on the end of 3718 lines. 3719 * docs/overview.txt: Document that omega.conf can have comments and 3720 blank lines in. 3721 3722Wed Sep 06 03:46:16 BST 2006 Olly Betts <olly@survex.com> 3723 3724 * configfile.cc,omega.conf: Fix code which reads omega.conf to be line 3725 based as documented rather than the wacky whitespace based scheme 3726 that was actually implemented. Allow "#" comments and blank lines 3727 in omega.conf. 3728 3729Wed Sep 06 01:26:17 BST 2006 Olly Betts <olly@survex.com> 3730 3731 * omindex.cc: If popen() fails, treat it as a read error. 3732 3733Wed Sep 06 00:49:47 BST 2006 Olly Betts <olly@survex.com> 3734 3735 * omindex.cc: Fix escaping of filenames to cast characters to 3736 "unsigned char" so that isalnum() works correctly everywhere. 3737 Not a security hole as dangerous characters were still being 3738 escaped. 3739 3740Tue Sep 05 06:49:30 BST 2006 Olly Betts <olly@survex.com> 3741 3742 * Makefile.am: Run htmlparsetest on "make check". 3743 3744Tue Sep 05 06:46:18 BST 2006 Olly Betts <olly@survex.com> 3745 3746 * Makefile.am,htmlparse.cc,htmlparse.h,metaxmlparse.cc,metaxmlparse.h, 3747 myhtmlparse.h,omindex.cc,xmlparse.cc,xmlparse.h: Parse the XML from 3748 OpenDocument and OpenOffice using new subclasses of HtmlParser. 3749 Only extract meta.xml once. 3750 3751Tue Sep 05 06:45:02 BST 2006 Olly Betts <olly@survex.com> 3752 3753 * Makefile.am,htmlparsetest.cc: Add htmlparsetest which tests the 3754 MyHtmlParser class. 3755 3756Tue Sep 05 04:36:46 BST 2006 Olly Betts <olly@survex.com> 3757 3758 * omindex.cc: Note UTF-8 runes for pdfinfo and pdftotext. 3759 3760Tue Sep 05 04:29:21 BST 2006 Olly Betts <olly@survex.com> 3761 3762 * omindex.cc: Only run pdfinfo once and pull out the 3763 fields we want using string operations, instead of 3764 running it twice filtered through sed. 3765 3766Tue Sep 05 03:53:00 BST 2006 Olly Betts <olly@survex.com> 3767 3768 * htmlparse.cc,htmlparse.h: Don't get confused by "a<b" in 3769 Javascript in a <script> tag. Fixes bug#91. 3770 3771Sat Sep 02 04:29:12 BST 2006 Olly Betts <olly@survex.com> 3772 3773 * omindex.cc: Call pclose() not fclose() on a FILE* obtained from 3774 popen(). If a filter program isn't installed, then don't try it 3775 again for the same extension (not perfect but an improvement - 3776 previously we indexed an empty document!) 3777 3778Sat Sep 02 02:07:30 BST 2006 Olly Betts <olly@survex.com> 3779 3780 * Makefile.am,configure.ac,docs/omegascript.txt,md5.cc,md5.h, 3781 md5wrap.cc,md5wrap.h,omindex.cc,query.cc,values.h: Generate 3782 an MD5 checksum of each file indexed and store it in value #1 3783 to allow duplicates to be collapsed. Add $pack and $unpack 3784 OmegaScript commands to allow big endian binary values to 3785 be encoded and decoded. Add the file last modified time 3786 as value #0. 3787 3788Fri Sep 01 04:37:09 BST 2006 Olly Betts <olly@survex.com> 3789 3790 * omindex.cc: Tweak comment and whitespace. 3791 3792Fri Sep 01 04:19:39 BST 2006 Olly Betts <olly@survex.com> 3793 3794 * README: Update reference to "CVS" to say "SVN". 3795 3796Thu Aug 31 20:22:33 BST 2006 Olly Betts <olly@survex.com> 3797 3798 * loadfile.cc: #include <algorithm> for std::min(). 3799 3800Thu Aug 31 02:35:36 BST 2006 Olly Betts <olly@survex.com> 3801 3802 * loadfile.cc: More missing #include-s. 3803 3804Thu Aug 31 01:53:31 BST 2006 Olly Betts <olly@survex.com> 3805 3806 * loadfile.cc: Add #include <unistd.h>. 3807 3808Wed Aug 30 23:21:49 BST 2006 Olly Betts <olly@survex.com> 3809 3810 * Makefile.am: Include loadfile.h in the tarball. 3811 3812Mon Aug 28 18:09:28 BST 2006 Olly Betts <olly@survex.com> 3813 3814 * omindex.cc: Don't generate 'W' terms since omega doesn't use them. 3815 3816Mon Aug 28 03:06:46 BST 2006 Olly Betts <olly@survex.com> 3817 3818 * query.cc,templates/query: Use '\t' to separate terms in xP since 3819 filter terms might contain '.'. Fixes bug#87. 3820 3821Sun Aug 27 01:36:40 BST 2006 Olly Betts <olly@survex.com> 3822 3823 * indextext.cc: Don't generate terms with more than 3 trailing 3824 symbols ('-', '+', or '#'). 3825 3826Sun Aug 27 01:11:45 BST 2006 Olly Betts <olly@survex.com> 3827 3828 * omindex.cc: Added "size" field to document data; don't add "modtime" 3829 field if the timestamp is (time_t)-1. 3830 3831Sun Aug 27 00:36:12 BST 2006 Olly Betts <olly@survex.com> 3832 3833 * omindex.cc,templates/query,utils.cc,utils.h: Store the file's last 3834 modified time in the document data as "modtime" so it shows up in 3835 search results (and tweak the query template so the display of this 3836 information looks nicer). 3837 3838Fri Aug 25 22:55:23 BST 2006 Olly Betts <olly@survex.com> 3839 3840 * docs/overview.txt,omindex.cc: Run xls2csv on MS Excel files; run 3841 catppt on MS Powerpoint files; also index MS Word templates (.dot). 3842 3843Thu Aug 24 21:40:10 BST 2006 Olly Betts <olly@survex.com> 3844 3845 * htmlparse.cc: Support htdig's "ignore this bit" comments. 3846 3847Thu Aug 24 12:55:26 BST 2006 Olly Betts <olly@survex.com> 3848 3849 * query.cc: Fix $highlight{} to work with capitalised words (it used 3850 to work but regressed in 0.8.2). 3851 3852Thu Aug 24 12:38:50 BST 2006 Olly Betts <olly@survex.com> 3853 3854 * Makefile.am,omindex.cc,query.cc: Use the new routines in loadfile.cc 3855 to replace code to do the same thing in omindex and omega. 3856 3857Thu Aug 24 12:37:16 BST 2006 Olly Betts <olly@survex.com> 3858 3859 * scriptindex.cc: Fix handling of check whether a record has content 3860 in the case where the same field is processed more than once. 3861 3862Thu Aug 24 12:35:32 BST 2006 Olly Betts <olly@survex.com> 3863 3864 * Makefile.am,docs/scriptindex.txt,loadfile.cc,loadfile.h, 3865 scriptindex.cc: Add new "load" action to allow the contents of an 3866 external file to be loaded. 3867 3868Thu Aug 24 12:05:23 BST 2006 Olly Betts <olly@survex.com> 3869 3870 * configure.ac: Check for strftime. 3871 3872Sun Jul 09 01:40:09 BST 2006 Olly Betts <olly@survex.com> 3873 3874 * docs/omegascript.txt: Note that (by design) an omegascript template 3875 can't contain an infinite loop. 3876 3877Sun May 21 11:42:54 BST 2006 Olly Betts <olly@survex.com> 3878 3879 * Makefile.am: Make use of the dist_ prefix to avoid having to list 3880 files in EXTRA_DIST as well as in *_SCRIPTS, *_DATA, and man_MANS. 3881 * Makefile.am: Prefer $(sysconfdir) to @sysconfdir@ since the former 3882 can be overridden on the "make" command line. 3883 3884Sat May 20 06:16:27 BST 2006 Olly Betts <olly@survex.com> 3885 3886 * Makefile.am,configure.ac: Specify required automake version in 3887 the call to AM_INIT_AUTOMAKE in configure.ac. 3888 3889Thu May 18 14:12:13 BST 2006 Olly Betts <olly@survex.com> 3890 3891 * docs/overview.txt,docs/quickstart.txt: Use the default path to the 3892 database directories in examples. Tweak the formatting in a few 3893 places. Give a path to the omega CGI binary in the example showing 3894 how to run it from the command line. 3895 3896Wed May 17 15:28:01 BST 2006 Olly Betts <olly@survex.com> 3897 3898 * omega.spec.in: Fix so that the documentation gets packaged. 3899 3900Tue May 16 06:56:26 BST 2006 Olly Betts <olly@survex.com> 3901 3902 * configure.ac: Remove unused variable from snprintf testing code. 3903 3904Mon May 15 02:18:01 BST 2006 Olly Betts <olly@survex.com> 3905 3906 * NEWS,configure.ac: Updated for 0.9.6. 3907 3908Sat May 13 20:43:08 BST 2006 Olly Betts <olly@survex.com> 3909 3910 * configure.ac: Update snprintf detection to match xapian-core. 3911 3912Fri May 12 20:12:40 BST 2006 Olly Betts <olly@survex.com> 3913 3914 * docs/omegascript.txt: Clarified description of $now. 3915 3916Thu Apr 27 23:45:26 BST 2006 Olly Betts <olly@survex.com> 3917 3918 * docs/omegascript.txt,query.cc: Added new OmegaScript commands 3919 $filterterms and $substr. 3920 3921Thu Apr 27 18:37:50 BST 2006 Olly Betts <olly@survex.com> 3922 3923 * scriptindex.cc: Use const reference instead of just a reference. 3924 3925Sun Apr 23 18:32:20 BST 2006 Olly Betts <olly@survex.com> 3926 3927 * scriptindex.cc: Fix "index" and "indexnopos" without a prefix to 3928 set the weight correctly (bug introduced in 0.9.5). 3929 3930Wed Apr 19 13:37:15 BST 2006 Fabrice Colin 3931 3932 * omega.spec.in: Create and package /var/lib/omega/cdb and 3933 /var/log/omega. 3934 3935Tue Apr 11 19:29:34 BST 2006 Olly Betts <olly@survex.com> 3936 3937 * configure.ac,htmlparse.cc,query.cc,scriptindex.cc: Disable MSVC 3938 warning 4800 (on int to bool conversions) in config.h and then we 3939 can remove the "fixes" elsewhere. 3940 3941Mon Apr 10 16:26:08 BST 2006 Olly Betts <olly@survex.com> 3942 3943 * date.cc,hashterm.cc,htmlparse.cc,omega.cc,omindex.cc,query.cc, 3944 scriptindex.cc: Fix MSVC7 warnings. 3945 3946Sat Apr 08 20:04:33 BST 2006 Olly Betts <olly@survex.com> 3947 3948 * NEWS,configure.ac: Updated for 0.9.5. 3949 3950Fri Apr 07 16:45:36 BST 2006 Olly Betts <olly@survex.com> 3951 3952 * omindex.cc,query.cc: Tweak for MSVC compilation. 3953 3954Fri Apr 07 03:23:22 BST 2006 Olly Betts <olly@survex.com> 3955 3956 * omega.spec.in: Man pages may be gzipped. 3957 3958Thu Apr 06 14:28:08 BST 2006 Olly Betts <olly@survex.com> 3959 3960 * README: Add pointer to documentation. 3961 3962Thu Apr 06 03:32:21 BST 2006 Olly Betts <olly@survex.com> 3963 3964 * omega.spec.in: Include man pages in RPM. 3965 3966Thu Apr 06 03:06:56 BST 2006 Olly Betts <olly@survex.com> 3967 3968 * Makefile.am,commonhelp.cc,commonhelp.h,configure.ac,omindex.cc, 3969 scriptindex.cc: Add man pages for omindex and scriptindex. 3970 3971Thu Apr 06 02:56:09 BST 2006 Olly Betts <olly@survex.com> 3972 3973 * mbox2omega.script: Use new "hash" command. 3974 3975Wed Apr 05 19:29:14 BST 2006 Olly Betts <olly@survex.com> 3976 3977 * Makefile.am,docs/scriptindex.txt,hashterm.cc,hashterm.h, 3978 omindex.cc,scriptindex.cc: Add new "hash" command to allow hashed 3979 terms to be generated from long URLs like omindex does. 3980 * htdig2omega.script: Use new "hash" command. 3981 * scriptindex.cc: Fix "useless weight" warning to not incorrectly 3982 fire when "index" or "indexnopos" has no parameter. 3983 3984Wed Apr 05 15:03:28 BST 2006 Olly Betts <olly@survex.com> 3985 3986 * scriptindex.cc: Check if we successfully opened the index script 3987 and give an error if not. 3988 3989Fri Mar 10 05:21:13 GMT 2006 Olly Betts <olly@survex.com> 3990 3991 * dbi2omega: Check DBIDRIVER environmental variable to allow a driver 3992 other than mysql to be specified without modifying the script. 3993 3994Wed Mar 01 02:28:57 GMT 2006 Olly Betts <olly@survex.com> 3995 3996 * scriptindex.cc: Don't repeat the "note" part of warnings; Warn if 3997 "unique=<prefix>" is used without a corresponding "boolean=<prefix>"; 3998 Warn that "index=nopos" is deprecated and should be replaced by 3999 "indexnopos". 4000 4001Tue Feb 28 23:46:57 GMT 2006 Olly Betts <olly@survex.com> 4002 4003 * scriptindex.cc: Report a useless weight action, even if it's 4004 followed by another non-useless action (e.g. field); convert weight 4005 actions into a numeric parameter on index and indexnopos Action 4006 objects; add explanatory text "(note that actions are executed from 4007 left to right)" when reporting useless actions. 4008 4009Sun Feb 26 00:25:10 GMT 2006 Olly Betts <olly@survex.com> 4010 4011 * query.cc: Fix $opt[fieldnames] handling. Previously it would try 4012 to kick in if you didn't set fieldnames but set any alphabetically 4013 later option! 4014 4015Tue Feb 21 00:18:25 GMT 2006 Olly Betts <olly@survex.com> 4016 4017 * configure.ac,NEWS: Updated for 0.9.4. 4018 4019Sun Feb 19 23:20:49 GMT 2006 Olly Betts <olly@survex.com> 4020 4021 * COPYING: Updated FSF address. 4022 4023Thu Feb 16 00:10:22 GMT 2006 Olly Betts <olly@survex.com> 4024 4025 * NEWS,configure.ac: Updated for 0.9.3. 4026 4027Wed Feb 08 13:01:15 GMT 2006 Olly Betts <olly@survex.com> 4028 4029 * templates/query: Make the page title shorter so there's more chance 4030 it will fit on icon bars, etc. 4031 4032Wed Feb 08 10:08:24 GMT 2006 Olly Betts <olly@survex.com> 4033 4034 * docs/overview.txt: Add pointer to documentation of the supported 4035 query syntax. 4036 4037Mon Feb 06 15:19:17 GMT 2006 Olly Betts <olly@survex.com> 4038 4039 * docs/termprefixes.txt: Fix typo. 4040 4041Sat Jan 14 22:40:43 GMT 2006 Olly Betts <olly@survex.com> 4042 4043 * configure.ac: Copy over fixed snprintf checks from xapian-core. 4044 4045Fri Jan 13 03:21:15 GMT 2006 Olly Betts <olly@survex.com> 4046 4047 * configure.ac: The configure test for snprintf uses memcmp, so 4048 we need to "#include <string.h>" for it to work reliably. 4049 4050Mon Jan 09 04:23:54 GMT 2006 Olly Betts <olly@survex.com> 4051 4052 * date.cc,query.cc: Add "#include <stdarg.h>" where we use 4053 va_list, etc. 4054 4055Mon Jan 09 04:17:54 GMT 2006 Olly Betts <olly@survex.com> 4056 4057 * cdb_init.cc: Fix more compilation issues with cdb no-mmap code. 4058 4059Mon Jan 09 03:42:18 GMT 2006 Olly Betts <olly@survex.com> 4060 4061 * omega.cc,utils.cc,utils.h: Replace remaining use of split with 4062 a direct walk of the string. 4063 4064Mon Jan 09 03:19:49 GMT 2006 Olly Betts <olly@survex.com> 4065 4066 * query.cc: Don't split strings of docids in R parameters into a 4067 vector<string> - just walk the string directly. The code is 4068 as simple, and much more efficient if a lot of documents are 4069 marked relevant. 4070 4071Mon Jan 09 02:46:34 GMT 2006 Olly Betts <olly@survex.com> 4072 4073 * Makefile.am,date.cc,omindex.cc,query.cc,scriptindex.cc,utils.cc, 4074 utils.h: Use snprintf where available. 4075 4076Sun Jan 08 22:41:47 GMT 2006 Olly Betts <olly@survex.com> 4077 4078 * cdb_init.cc: Fixed malloc-based version to compile. 4079 4080Sun Jan 08 21:05:46 GMT 2006 Olly Betts <olly@survex.com> 4081 4082 * cdb_find.cc,cdb_hash.cc,cdb_unpack.cc: #include <config.h>. 4083 * configure.ac: Test for mmap. 4084 * cdb_init.cc: If mmap isn't found, and this isn't WIN32 fall back on 4085 the very crude approach of loading the whole file into a malloc-ed 4086 block. For a small cdb file, that'll give acceptable performance 4087 at least. 4088 4089Fri Jan 06 21:29:37 GMT 2006 Olly Betts <olly@survex.com> 4090 4091 * symboltab.h: Fix A after \xbf being interpereted as an overlong 4092 escape sequence. 4093 4094Fri Jan 06 21:26:57 GMT 2006 Olly Betts <olly@survex.com> 4095 4096 * query.cc: Fix printf type mismatch on 64 bit platforms. 4097 4098Fri Jan 06 21:00:34 GMT 2006 Olly Betts <olly@survex.com> 4099 4100 * docs/omegascript.txt,query.cc: Added $find{LIST,STRING}. 4101 4102Fri Jan 06 20:52:31 GMT 2006 Olly Betts <olly@survex.com> 4103 4104 * symboltab.h: Write top-bit set characters using \xXX notation to 4105 avoid warnings from Intel's C++ compiler. 4106 4107Fri Jan 06 18:15:42 GMT 2006 Olly Betts <olly@survex.com> 4108 4109 * query.cc: Removed unused variable. 4110 4111Fri Jan 06 18:14:33 GMT 2006 Olly Betts <olly@survex.com> 4112 4113 * query.cc: Cast time_t to unsigned long to avoid problems on 64bit 4114 platforms. 4115 4116Fri Jan 06 18:12:38 GMT 2006 Olly Betts <olly@survex.com> 4117 4118 * docs/omegascript.txt: Note in the $cgi description that it returns 4119 an arbitrary value if there's more than one, and pointing to 4120 $cgilist. 4121 4122Thu Jan 05 05:54:58 GMT 2006 Olly Betts <olly@survex.com> 4123 4124 * cdb_init.cc: Fix mingw compilation. 4125 4126Thu Jan 05 03:24:07 GMT 2006 Olly Betts <olly@survex.com> 4127 4128 * cdb_init.cc: Fix to hopefully compile on Solaris which has a broken 4129 sys/mman.h when used from C++. 4130 4131Wed Jan 04 20:44:44 GMT 2006 Olly Betts <olly@survex.com> 4132 4133 * query.cc: Fixed to compile with GCC 3.0. 4134 4135Wed Jan 04 04:33:15 GMT 2006 Olly Betts <olly@survex.com> 4136 4137 * Makefile.am,cdb.h,cdb_find.cc,cdb_hash.cc,cdb_init.cc,cdb_int.h, 4138 cdb_unpack.cc,configfile.cc,configfile.h,docs/omegascript.txt, 4139 omega.conf,query.cc: Add $lookup{CDBFILE,KEY} command to perform 4140 a lookup in a CDB file. 4141 4142Wed Jan 04 03:06:31 GMT 2006 Olly Betts <olly@survex.com> 4143 4144 * docs/omegascript.txt,docs/overview.txt,query.cc: Added new feature 4145 which allows you to avoid storing fieldnames in every document 4146 (which can save a lot of disk space for a large database). Instead 4147 you just store the field values, one per line, and add something 4148 like "$set{fieldnames,$split{caption sample url}}" to the 4149 OmegaScript template to specify the fieldnames to use. 4150 * docs/omegascript.txt,query.cc: Add new "$split{}" command which 4151 splits a string to give an OmegaScript list. 4152 * query.cc: Fix $url{} to escape "+" to "%2b". 4153 * query.cc: Speed up $highlight{} - only compare terms which are the 4154 same length. 4155 4156Tue Jan 03 22:38:01 GMT 2006 Olly Betts <olly@survex.com> 4157 4158 * configfile.cc: Rename file_readable() to file_exists() to better 4159 reflect what the function actually does! 4160 4161Tue Jan 03 17:43:40 GMT 2006 Olly Betts <olly@survex.com> 4162 4163 * templates/opensearch: Add missing escaping. 4164 4165Mon Dec 19 10:27:30 GMT 2005 Olly Betts <olly@survex.com> 4166 4167 * Makefile.am,commonhelp.cc,commonhelp.h,docs/overview.txt,omindex.cc, 4168 scriptindex.cc: Add "--stemmer" option to omindex and scriptindex 4169 to allow the stemming language to be set. 4170 * omindex.cc,scriptindex.cc: More consistent --help and --version 4171 output. Update FSF address. 4172 4173Mon Dec 19 06:03:31 GMT 2005 Olly Betts <olly@survex.com> 4174 4175 * query.cc: Explicitly use "unsigned char" when %-encoding in $url 4176 so that top bit set characters are correctly handled on platforms 4177 where char is signed by default. 4178 4179Sun Dec 11 09:30:44 GMT 2005 Olly Betts <olly@survex.com> 4180 4181 * templates/godmode: If a non-existent docid is specified, report the 4182 error and prompt the user to enter another docid. Fixes bug#60. 4183 4184Sun Dec 11 09:27:18 GMT 2005 Olly Betts <olly@survex.com> 4185 4186 * docs/cgiparams.txt,omega.cc,omega.h,query.cc: Add "SORTREVERSE" 4187 CGI parameter which allows the sort order to be reversed when 4188 sorting on a value. Remove "SORTBANDS" CGI parameter since it 4189 no longer does anything. 4190 4191Sun Dec 11 09:26:14 GMT 2005 Olly Betts <olly@survex.com> 4192 4193 * omindex.cc: Improve wording of comment. 4194 4195Sun Dec 11 09:22:58 GMT 2005 Olly Betts <olly@survex.com> 4196 4197 * docs/overview.txt,omindex.cc: Add support for OpenDocument format 4198 mimetypes and extensions out of the box. 4199 4200Sun Dec 11 09:16:57 GMT 2005 Olly Betts <olly@survex.com> 4201 4202 * docs/omegascript.txt,query.cc: If executing an OmegaScript command 4203 causes a Xapian exception to be thrown, catch it and copy the error 4204 message into error_msg (which is read by the $error command). 4205 4206Sun Dec 11 09:12:12 GMT 2005 Olly Betts <olly@survex.com> 4207 4208 * htmlparse.cc: Tweak a few comments; "while (1)" -> "while (true)". 4209 4210Sun Dec 11 09:09:40 GMT 2005 Olly Betts <olly@survex.com> 4211 4212 * docs/overview.txt: The U prefix (URL term) was grouped with the date 4213 searching prefixes, but it makes more sense to group it with the 4214 prefixes relating to parts of the URL (H for hostname, P for path, 4215 etc). 4216 4217Sun Oct 02 16:28:59 BST 2005 Olly Betts <olly@survex.com> 4218 4219 * scriptindex.cc: Use "int database_mode" (set to the value to pass to 4220 WritableDatabase's ctor) instead of "bool overwrite" to implement 4221 --overwrite. 4222 * scriptindex.cc: Remove code to handle "-q" as it no longer actually 4223 controls anything. Just ignore it for backwards compatibility. 4224 * scriptindex.cc: Tweak --help output to not wrap on a default 4225 install. 4226 4227Sat Sep 10 14:57:19 BST 2005 Olly Betts <olly@survex.com> 4228 4229 * docs/omegascript.txt: Improve descriptions of $collapsed, $value, 4230 $version. 4231 4232Fri Jul 29 10:05:21 BST 2005 James Aylett <james@tartarus.org> 4233 4234 * omindex.cc: add --preserve-nonduplicates / -p option to not 4235 delete any documents that aren't updated, in replace duplicates 4236 mode (so that multiple runs of omindex on different subsites 4237 don't stomp on each other). 4238 4239 * docs/overview.txt: update to match the above. 4240 4241Fri Jul 15 11:12:28 BST 2005 Olly Betts <olly@survex.com> 4242 4243 * configure.ac: Updated for 0.9.2. 4244 4245Fri Jul 15 02:18:40 BST 2005 Olly Betts <olly@survex.com> 4246 4247 * NEWS: Updated for 0.9.2. 4248 4249Sat Jul 02 14:56:35 BST 2005 Olly Betts <olly@survex.com> 4250 4251 * query.cc: Workaround further Sun C++ crapness. 4252 4253Wed Jun 29 03:19:22 BST 2005 Olly Betts <olly@survex.com> 4254 4255 * docs/omegascript.txt,query.cc: Changed $highlight so 4256 if OPEN and CLOSE aren't specified, they default to 4257 highlighting each word from the query with a different 4258 background colour like gmane does (previous default was to use 4259 '<strong>' and '</strong>'). 4260 * query.cc: Removed surplus whitespace. 4261 4262Fri Jun 24 02:51:38 BST 2005 Olly Betts <olly@survex.com> 4263 4264 * query.cc: Call QueryParser::set_database() as this is now used to 4265 decide what to do for terms like "C#". 4266 * docs/omegascript.txt,docs/termprefixes.txt,query.cc: Add the 4267 ability to set boolean prefixes for the QueryParser by setting 4268 a "boolprefix" map in the omegascript template. 4269 4270Fri Jun 24 02:40:10 BST 2005 Olly Betts <olly@survex.com> 4271 4272 * scriptindex.cc: Fix infinite loop if there's no newline at the end 4273 of a dumpfile. 4274 4275Thu Jun 23 16:42:41 BST 2005 Olly Betts <olly@survex.com> 4276 4277 * docs/termprefixes.txt: Explain who to use termprefixes with 4278 scriptindex and omega, since that's what most people will want to 4279 know. 4280 4281Thu Jun 23 16:41:15 BST 2005 Olly Betts <olly@survex.com> 4282 4283 * query.cc,docs/omegascript.txt: Added $length{} and $stoplist{} 4284 commands to OmegaScript. 4285 * docs/omegascript.txt: Use standard "S" prefix for title in example 4286 for $setmap, rather than "XT". 4287 4288Mon Jun 06 17:59:10 BST 2005 Olly Betts <olly@survex.com> 4289 4290 * NEWS: Another 0.9.1 update. 4291 4292Mon Jun 06 17:52:44 BST 2005 Olly Betts <olly@survex.com> 4293 4294 * NEWS: Updated for 0.9.1. 4295 4296Mon Jun 06 17:51:58 BST 2005 Olly Betts <olly@survex.com> 4297 4298 * configure.ac: Updated for 0.9.1. 4299 4300Mon May 23 23:36:48 BST 2005 Fabrice Colin <fabrice.colin@gmail.com> 4301 4302 * omega.spec.in: Updated for 0.9.0. 4303 4304Fri May 13 23:21:02 BST 2005 Olly Betts <olly@survex.com> 4305 4306 * NEWS: Updated for 0.9.0. 4307 4308Fri May 13 00:39:44 BST 2005 Olly Betts <olly@survex.com> 4309 4310 * configure.ac: Updated for 0.9.0. 4311 4312Fri May 13 00:35:21 BST 2005 Olly Betts <olly@survex.com> 4313 4314 * scriptindex.cc: Improved handling of extra blank lines in dump file; 4315 Strip multiple \r characters from end of line; Complain if a dump 4316 file doesn't appear to have been = escaped correctly; Flush 4317 database after each input file to ensure all changes from a file 4318 make it in. 4319 * docs/omegascript.txt: Whitespace tweak. 4320 4321Wed May 11 02:28:41 BST 2005 Olly Betts <olly@survex.com> 4322 4323 * NEWS: Started to update for 0.9.0. 4324 4325Sun May 08 02:16:07 BST 2005 Olly Betts <olly@survex.com> 4326 4327 * query.cc: Use Query::get_terms_begin() not 4328 QueryParser::termlist_begin(). 4329 4330Sun May 08 02:11:49 BST 2005 Olly Betts <olly@survex.com> 4331 4332 * Makefile.am: Use AM_CPPFLAGS not CPPFLAGS (CPPFLAGS is for the 4333 user). 4334 4335Wed May 4 11:32:18 BST 2005 Richard Boulton <richard@tartarus.org> 4336 4337 * configfile.cc: Configuration file is now looked for in various 4338 locations: the first location in which a file is found is used. 4339 Firstly, if the OMEGA_CONFIG_FILE environment variable is set, 4340 the location given in it is checked. Secondly, the file 4341 "omega.conf" in the same directory as the executable is checked. 4342 Finally, the file "${sysconfdir}/omega.conf" (eg, /etc/omega.conf 4343 on Linux) is checked. If none of these locations contain a file, 4344 default values are used. 4345 * docs/overview.txt: Update to describe new configuration file 4346 locations. 4347 * Makefile.am: Install omega.conf to ${sysconfdir} by default. 4348 Define CONFIGFILE_SYSTEM with an appropriate value to find the 4349 system configuration file. 4350 4351Wed May 4 11:20:26 BST 2005 Richard Boulton <richard@tartarus.org> 4352 4353 * query.cc: Use new set_stemming_strategy() API method, rather than 4354 old set_stemming_options() method. The old method didn't compile 4355 because it's being passed a stemming_strategy value, which there 4356 isn't a prototype for. 4357 4358Fri Apr 29 10:27:05 BST 2005 Olly Betts <olly@survex.com> 4359 4360 * scriptindex.cc: Improved comments. 4361 4362Fri Apr 15 03:12:02 BST 2005 Olly Betts <olly@survex.com> 4363 4364 * docs/termprefixes.txt: Updated QueryParser prefix documentation to 4365 remove references to CVS HEAD. 4366 * docs/termprefixes.txt: Capitalise "Month" to indicate why it has 4367 prefix "M" (in line with all the other entries in the list). 4368 4369Fri Apr 15 02:55:06 BST 2005 Olly Betts <olly@survex.com> 4370 4371 * indextext.cc: Generate terms like "c#". 4372 * query.cc: Highlight words like "C#". 4373 4374Fri Apr 15 02:53:22 BST 2005 Olly Betts <olly@survex.com> 4375 4376 * query.cc: Clearer code for adding boolean filters are added to the 4377 query. 4378 4379Wed Apr 06 02:47:14 BST 2005 Olly Betts <olly@survex.com> 4380 4381 * omindex.cc: Tweak the hashing of URLs so that it works the same 4382 way on all platforms (previously it would depend on sizeof(long)). 4383 This means an incompatibility with any existing database built on 4384 a platform where sizeof(long) > 4 where URLs were hashed (i.e. 4385 URLs were > 228 bytes if sizeof(long) == 8), but we really want 4386 databases to be portable between platforms. 4387 4388Wed Apr 06 02:44:58 BST 2005 Olly Betts <olly@survex.com> 4389 4390 * omindex.cc,docs/overview.txt: Removed useless "DUPE_duplicate" 4391 option. 4392 4393Wed Apr 06 00:48:08 BST 2005 Olly Betts <olly@survex.com> 4394 4395 * omindex.cc,docs/overview.txt: Added support for using pod2text for 4396 indexing Perl documentation. 4397 4398Wed Apr 06 00:25:47 BST 2005 Olly Betts <olly@survex.com> 4399 4400 * omindex.cc,docs/overview.txt: Replace -l/--no-recurse with 4401 -l/--depth-limit which takes an argument allowing recursion 4402 to be restriction to any depth, not just 0 or infinite! 4403 4404Tue Apr 05 23:45:39 BST 2005 Olly Betts <olly@survex.com> 4405 4406 * mbox2omega,mbox2omega.script,Makefile.am: Added mbox2omega which 4407 allows a mail folder to be indexed. Mostly it's an example as 4408 there's no mechanism included to show the full original message. 4409 4410Tue Apr 05 23:41:44 BST 2005 Olly Betts <olly@survex.com> 4411 4412 * scriptindex.cc: Tidy up STL header includes. 4413 4414Tue Apr 05 23:34:36 BST 2005 Olly Betts <olly@survex.com> 4415 4416 * docs/omegascript.txt: Clarify $field description slightly. 4417 4418Tue Apr 05 23:33:33 BST 2005 Olly Betts <olly@survex.com> 4419 4420 * indextext.h: Add typedefs to allow AccentNormalisingItor to be used 4421 as an STL iterator. 4422 4423Tue Apr 05 00:47:52 BST 2005 Olly Betts <olly@survex.com> 4424 4425 * docs/cgiparams.txt,docs/omegascript.txt: Fixed 3 references to 4426 OmXxxx classes. 4427 4428Tue Apr 05 00:41:45 BST 2005 Olly Betts <olly@survex.com> 4429 4430 * debian/.cvsignore,.cvsignore: Remove .cvsignore files, as they're 4431 not used by SVN. 4432 4433Mon Mar 21 16:43:07 GMT 2005 Richard Boulton <richard@tartarus.org> 4434 4435 * templates/opensearch: Add new template to implement basic 4436 opensearch feeds of search results. 4437 * Makefile.am: Include opensearch template in distribution. 4438 4439Thu Mar 03 02:20:26 GMT 2005 Olly Betts <olly@survex.com> 4440 4441 * templates/query2: Remove Sam's unfinished rewrite of the query 4442 template. It's not been worked on for nearly two years, and we 4443 don't ship it. 4444 4445Wed Mar 02 03:09:52 GMT 2005 Olly Betts <olly@survex.com> 4446 4447 * COPYING: Put in CVS. 4448 4449Tue Mar 01 02:09:35 GMT 2005 Olly Betts <olly@survex.com> 4450 4451 * omindex.cc,docs/overview.txt: Extend -M/--mime-type to allow an 4452 existing mapping to be removed by omitting the type. 4453 4454Thu Feb 24 17:42:35 GMT 2005 Olly Betts <olly@survex.com> 4455 4456 * Makefile.am: Actually ship docs/termprefixes.txt (and make it harder 4457 to fail to ship new docs in future). 4458 4459Thu Feb 24 02:10:09 GMT 2005 Olly Betts <olly@survex.com> 4460 4461 * Makefile.am,docs/termprefixes.txt: Added a single document covering 4462 all aspects of term prefixes. 4463 4464Wed Feb 23 14:59:46 GMT 2005 Olly Betts <olly@survex.com> 4465 4466 * docs/omegascript.txt: Moved $collapsed into correct place 4467 alphabetically! 4468 4469Wed Feb 16 03:46:51 GMT 2005 Olly Betts <olly@survex.com> 4470 4471 * docs/cgiparams.txt,docs/overview.txt: Improved description of how 4472 B filters are handled when building the query. 4473 4474Wed Feb 16 03:44:24 GMT 2005 Olly Betts <olly@survex.com> 4475 4476 * omindex.cc: Fixed so that we get lstat() prototype on Linux systems 4477 where we have posix_fadvise(). 4478 4479Mon Jan 17 03:35:35 GMT 2005 Olly Betts <olly@survex.com> 4480 4481 * query.cc: Corrected a comment. 4482 4483Mon Jan 17 03:32:25 GMT 2005 Olly Betts <olly@survex.com> 4484 4485 * query.cc: Updated to use the new QueryParser API. 4486 4487Wed Jan 05 03:15:43 GMT 2005 Olly Betts <olly@survex.com> 4488 4489 * docs/scriptindex.txt: Note that actions are applied in the specified 4490 order. 4491 4492Thu Dec 23 19:12:57 GMT 2004 Olly Betts <olly@survex.com> 4493 4494 * INSTALL: "xapian-examples" -> "omega". 4495 4496Thu Dec 23 19:10:04 GMT 2004 Olly Betts <olly@survex.com> 4497 4498 * configure.ac,NEWS: Version 0.8.5. 4499 4500Thu Dec 23 19:09:01 GMT 2004 Olly Betts <olly@survex.com> 4501 4502 * INSTALL,README: Added better installation instructions. 4503 4504Mon Dec 20 17:26:26 GMT 2004 Olly Betts <olly@survex.com> 4505 4506 * configure.ac,omindex.cc: Fixed "ignore symlinks" code to compile on 4507 systems without lstat (e.g. mingw). 4508 4509Mon Dec 20 12:18:18 GMT 2004 Olly Betts <olly@survex.com> 4510 4511 * omindex.cc: Fix the "ignore symlinks" code to actually compile on 4512 certain Linux boxes. 4513 4514Mon Dec 20 11:33:59 GMT 2004 Olly Betts <olly@survex.com> 4515 4516 * query.cc: If an exception is thrown, make sure that the HTTP headers 4517 get written so that we don't cause "500 Internal Server Error". 4518 This problem was introduced by the change to allow a user specified 4519 Content-Type in 0.8.0. Partly addresses bug#60. 4520 4521Fri Dec 17 22:50:01 GMT 2004 Olly Betts <olly@survex.com> 4522 4523 * omindex.cc: Only try to delete removed documents in DUPE_replace 4524 mode. 4525 4526Thu Dec 16 11:43:28 GMT 2004 Olly Betts <olly@survex.com> 4527 4528 * scriptindex.cc: Fixed "Unknown Exception" when trying to "unhtml" 4529 text which contains "</body>" (bug#61). This bug was introduced in 4530 0.8.4. 4531 4532Thu Dec 16 11:28:25 GMT 2004 Olly Betts <olly@survex.com> 4533 4534 * myhtmlparse.cc: <h1> - <h6> and </h1> - </h6> should leave a 4535 space into the dumped HTML. 4536 4537Wed Dec 15 15:53:55 GMT 2004 Richard Boulton <richard@tartarus.org> 4538 4539 * dbi2omega: Add a comment to the start of the file detailing what 4540 dbi2omega does. 4541 4542Wed Dec 15 15:08:41 GMT 2004 Richard Boulton <richard@tartarus.org> 4543 4544 * omindex.cc: Change behaviour of crawler such that it doesn't 4545 follow symbolic links any more. Add "--follow" command 4546 line option to turn following of symlinks back on. 4547 4548Wed Dec 08 16:31:46 GMT 2004 Olly Betts <olly@survex.com> 4549 4550 * NEWS: Final update for 0.8.4. 4551 4552Tue Dec 07 18:16:32 GMT 2004 Olly Betts <olly@survex.com> 4553 4554 * indextext.h: Fixed to compile with GCC 3.x. 4555 4556Tue Dec 07 18:15:39 GMT 2004 Olly Betts <olly@survex.com> 4557 4558 * omega.cc,omindex.cc,scriptindex.cc: Use the new 4559 Database/WritableDatabase constructors. 4560 4561Tue Nov 30 22:02:33 GMT 2004 Olly Betts <olly@survex.com> 4562 4563 * NEWS,configure.ac: Updated for 0.8.4 release. 4564 4565Wed Nov 24 04:50:52 GMT 2004 Olly Betts <olly@survex.com> 4566 4567 * templates/godmode: Finished off godmode template. 4568 4569Wed Nov 24 04:12:09 GMT 2004 Olly Betts <olly@survex.com> 4570 4571 * query.cc: If there's only a boolean query so we promote it to be 4572 the query, switch to boolean weights. 4573 4574Wed Nov 24 03:29:36 GMT 2004 Olly Betts <olly@survex.com> 4575 4576 * Makefile.am,myhtmlparse.cc,myhtmlparse.h,omindex.cc,scriptindex.cc: 4577 Factored out MyHtmlParser into a separate file so it can be used 4578 in scriptindex too to give scriptindex the same improved HTML 4579 parsing which omindex just got. 4580 4581Wed Nov 24 02:22:49 GMT 2004 Olly Betts <olly@survex.com> 4582 4583 * omindex.cc: Removed bogus extra line from code which was meant to 4584 truncate at a word boundary, but has never actually worked! 4585 4586Wed Nov 24 02:20:36 GMT 2004 Olly Betts <olly@survex.com> 4587 4588 * omindex.cc: Improved HTML to text conversion - the parser now knows 4589 that some tags should be regarded as word breaks and some shouldn't 4590 (previously all tags were treated as word breaks). 4591 4592Wed Nov 24 00:22:39 GMT 2004 Olly Betts <olly@survex.com> 4593 4594 * omindex.cc: Removed debug output; don't include \xa0 in the list of 4595 whitespace characters for now, as that's a bit character set 4596 specific... 4597 4598Wed Nov 24 00:04:42 GMT 2004 Olly Betts <olly@survex.com> 4599 4600 * omindex.cc: HTML extraction now turns strips leading and trailing 4601 whitespace and converts all other consecutive groups of whitespace 4602 to a single space. 4603 4604Tue Nov 23 20:29:14 GMT 2004 Olly Betts <olly@survex.com> 4605 4606 * Makefile.am: XAPIAN_FLAGS already links with xapianqueryparser 4607 so remove -lxapianqueryparser from omega_LDADD as it was causing 4608 problems on cygwin. 4609 4610Wed Nov 17 18:51:28 GMT 2004 Olly Betts <olly@survex.com> 4611 4612 * omindex.cc: Index RTF documents with unrtf, if available. 4613 * docs/overview.txt: Document this. 4614 4615Wed Nov 17 16:31:01 GMT 2004 Olly Betts <olly@survex.com> 4616 4617 * omindex.cc: If a filename to be passed to a filter program has a 4618 leading "-", protect it from possible interpretation as an option 4619 by prepending "./". 4620 4621Wed Nov 17 16:29:55 GMT 2004 Olly Betts <olly@survex.com> 4622 4623 * omindex.cc: Index Wordperfect documents with wpd2text, if available. 4624 * docs/overview.txt: Document this. 4625 4626Wed Nov 17 15:12:08 GMT 2004 Olly Betts <olly@survex.com> 4627 4628 * omindex.cc: Index MS Word documents with antiword, if available. 4629 * docs/overview.txt: Document this. 4630 4631Wed Nov 17 04:29:15 GMT 2004 Olly Betts <olly@survex.com> 4632 4633 * omindex.cc: Add simple code to index OpenOffice documents. 4634 * docs/overview.txt: Update documentation to mention this. 4635 4636Tue Nov 09 03:04:44 GMT 2004 Olly Betts <olly@survex.com> 4637 4638 * configure.ac,Makefile.am: We now get -AA or -std strict_ansi from 4639 xapian-config, so we don't need to probe for them ourselves. 4640 4641Sun Nov 07 16:36:42 GMT 2004 Olly Betts <olly@survex.com> 4642 4643 * utils.cc: Fixed to work with updated snprintf configure test, 4644 4645Sun Nov 07 04:55:26 GMT 2004 Olly Betts <olly@survex.com> 4646 4647 * configure.ac: rearrange so that libtool is active when we test if 4648 the c++ compiler can link a program so it can pull in libstdc++ 4649 through a .la file; updated snprintf test to the new one from 4650 xapian-core. 4651 4652Fri Nov 05 17:20:13 GMT 2004 Olly Betts <olly@survex.com> 4653 4654 * configure.ac: AM_CONFIG_HEADER -> AC_CONFIG_HEADERS; Run tests using 4655 the C++ compiler; select ANSI mode for aCC and cxx; Check GXX not 4656 GCC when choosing warning flags. 4657 4658Wed Nov 03 20:15:34 GMT 2004 Olly Betts <olly@survex.com> 4659 4660 * query.cc: Updated to use Query::empty() instead of 4661 Query::is_empty(). 4662 4663Wed Nov 03 20:12:37 GMT 2004 Olly Betts <olly@survex.com> 4664 4665 * Makefile.am,getopt.cc,getopt.h,getopt1.cc,gnu_getopt.h,omindex.cc, 4666 scriptindex.cc: Updated to reworked getopt from xapian-core. 4667 4668Wed Nov 03 04:11:03 GMT 2004 Olly Betts <olly@survex.com> 4669 4670 * getopt.cc: Defining _NO_PROTO is a really bad idea for C++ code! 4671 4672Tue Nov 02 18:54:12 GMT 2004 Olly Betts <olly@survex.com> 4673 4674 * getopt.cc: Protect getopt definition for possible getopt macro 4675 declared in getopt.h. 4676 4677Tue Nov 02 17:56:08 GMT 2004 Olly Betts <olly@survex.com> 4678 4679 * indextext.h: Fixed 2 warnings. 4680 4681Tue Nov 02 06:54:17 GMT 2004 Olly Betts <olly@survex.com> 4682 4683 * getopt.cc,getopt1.cc: Fixed function declarations to not use K&R C 4684 syntax. 4685 4686Tue Nov 02 05:40:06 GMT 2004 Olly Betts <olly@survex.com> 4687 4688 * Makefile.am,configure.ac,getopt.c,getopt1.c,getopt.cc,getopt1.cc: 4689 Compile everything as C++. 4690 4691Mon Sep 20 14:52:24 BST 2004 Olly Betts <olly@survex.com> 4692 4693 * NEWS,configure.ac: Version 0.8.3. 4694 4695Mon Sep 20 14:49:26 BST 2004 Olly Betts <olly@survex.com> 4696 4697 * Makefile.am,configure.ac: Require same versions of autoconf and 4698 automake that xapian-core does. 4699 4700Mon Sep 20 14:45:53 BST 2004 Olly Betts <olly@survex.com> 4701 4702 * omega.spec.in: Update from Fabrice Colin. The most notable change 4703 is that the RPM is now called xapian-omega because there's already 4704 an omega RPM (in Fedora Core at least) which is some game. 4705 4706Thu Sep 16 00:57:13 BST 2004 Olly Betts <olly@survex.com> 4707 4708 * cgiparam.cc,configfile.cc,configfile.h,htmlparse.cc,indextext.cc, 4709 omega.cc,omindex-config.cc: All C++ sources should #include 4710 <config.h> as the first header; no header files should #include 4711 <config.h>. 4712 4713Thu Sep 16 00:54:31 BST 2004 Olly Betts <olly@survex.com> 4714 4715 * scriptindex.cc: --version now actually reports the version. --help 4716 now exits with status 0 rather than status 1. 4717 4718Tue Sep 14 03:00:32 BST 2004 Olly Betts <olly@survex.com> 4719 4720 * omega.spec.in: Updated URL for sources; include htdig2omega and 4721 htdig2omega.script in the RPM. 4722 4723Tue Sep 14 02:56:52 BST 2004 Olly Betts <olly@survex.com> 4724 4725 * Makefile.am: Install htdig2omega.script in ${prefix}/share/omega/ 4726 rather than ${prefix}/share/. 4727 4728Mon Sep 13 03:22:55 BST 2004 Olly Betts <olly@survex.com> 4729 4730 * NEWS,configure.ac: Version 0.8.2. 4731 4732Thu Sep 09 15:11:45 BST 2004 Olly Betts <olly@survex.com> 4733 4734 * NEWS: Updated. 4735 4736Thu Sep 09 14:41:41 BST 2004 Olly Betts <olly@survex.com> 4737 4738 * query.cc: Use new checkatleast parameter to Enquire::get_mset to 4739 implement MINHITS. 4740 4741Thu Sep 02 01:45:46 BST 2004 Olly Betts <olly@survex.com> 4742 4743 * templates/query: Always report database not found - previously we 4744 only did so if there was a query. Also fixed missing </center> 4745 tag which happened in certain cases. 4746 4747Wed Aug 25 23:19:47 BST 2004 Olly Betts <olly@survex.com> 4748 4749 * omindex.cc: When running with "replace duplicates" mode (the 4750 default), detect documents removed since the last indexing 4751 run and delete them from the database (bug #34). 4752 4753Tue Aug 24 19:23:55 BST 2004 Olly Betts <olly@survex.com> 4754 4755 * omega.cc: Added FIXME comment noting that SORT and SORTBANDS should 4756 be tracked and the results reset to the first page if they change. 4757 4758Tue Aug 24 19:23:07 BST 2004 Olly Betts <olly@survex.com> 4759 4760 * Makefile.am: Install htdig2omega and htdig2omega.script. 4761 4762Mon Aug 23 22:29:53 BST 2004 Olly Betts <olly@survex.com> 4763 4764 * scriptindex.cc: Report index file name and line number when 4765 reporting errors in it. Added warning for redundant actions, 4766 such as "truncate" as the last action in a rule. 4767 4768Mon Aug 23 22:03:25 BST 2004 Olly Betts <olly@survex.com> 4769 4770 * omindex.cc: Use the new replace_document(term, doc) method. 4771 4772Sun Aug 22 13:11:23 BST 2004 Olly Betts <olly@survex.com> 4773 4774 * configure.in,configure.ac: Renamed configure.in to configure.ac. 4775 4776Sat Aug 21 12:41:43 BST 2004 Olly Betts <olly@survex.com> 4777 4778 * docs/omegascript.txt: Added note about that $add{$hit,1} gives 4779 the "hit number". 4780 4781Fri Aug 20 20:28:16 BST 2004 Olly Betts <olly@survex.com> 4782 4783 * Makefile.am: Link with -lxapianqueryparser, not -lomqueryparser. 4784 4785Thu Aug 19 19:13:34 BST 2004 Olly Betts <olly@survex.com> 4786 4787 * Makefile.am: And actually ship htdig2omega and htdig2omega.script! 4788 4789Thu Aug 19 19:02:40 BST 2004 Olly Betts <olly@survex.com> 4790 4791 * htdig2omega,htdig2omega.script: Added perl script and corresponding 4792 scriptindex index script which allow an ht://dig database to be 4793 imported into Xapian. This provides an easy way to provide a search 4794 of remote websites using omega (by spidering them with ht://dig). 4795 4796Sun Aug 15 01:48:58 BST 2004 Olly Betts <olly@survex.com> 4797 4798 * indextext.cc,indextext.h,omindex.cc,query.cc,scriptindex.cc, 4799 symboltab.h: Fixed $highlight to understand accented characters 4800 (bug#9). 4801 4802Wed Jun 30 14:58:12 BST 2004 Olly Betts <olly@survex.com> 4803 4804 * NEWS,configure.in: Version 0.8.1. 4805 4806Tue Jun 29 17:26:41 BST 2004 Richard Boulton <richard@tartarus.org> 4807 4808 * Makefile.am: Remove Debian files from distribution tarballs, 4809 since there will often be multiple patch releases for each 4810 release. Debian files will be available from an apt repository 4811 in future. 4812 4813Tue Jun 29 01:45:06 BST 2004 Olly Betts <olly@survex.com> 4814 4815 * omindex.cc: Renamed hash() to hash_string() to avoid colliding 4816 with something on IRIX; Removed explicit initialisation of 4817 mime_types - perhaps that's spooking the SGI CC prelinker. 4818 4819Sun Jun 27 23:47:35 BST 2004 Olly Betts <olly@survex.com> 4820 4821 * omega.cc: Change MORELIKE to pick up to 40 terms, rather than up to 4822 6 (feedback on the mailing list suggests this gives much better 4823 results). 4824 4825Fri Jun 11 02:22:38 BST 2004 Olly Betts <olly@survex.com> 4826 4827 * scriptindex.cc: Added catch for std::bad_alloc. 4828 4829Mon Apr 19 14:43:17 BST 2004 Olly Betts <olly@survex.com> 4830 4831 * NEWS: Final update for 0.8.0. 4832 4833Sun Apr 18 22:31:24 BST 2004 Olly Betts <olly@survex.com> 4834 4835 * omindex.cc: Only need _POSIX_C_SOURCE on Linux, and it seems to 4836 cause problems with Sun's C++ compiler. 4837 4838Sun Apr 18 17:50:35 BST 2004 Olly Betts <olly@survex.com> 4839 4840 * omindex.cc: _POSIX_C_SOURCE works better than _POSIX_SOURCE for 4841 making posix_fadvise prototype visible on Linux. 4842 4843Thu Apr 15 02:05:49 BST 2004 Olly Betts <olly@survex.com> 4844 4845 * omindex.cc: And another _POSIX_SOURCE attempt! 4846 4847Thu Apr 15 01:43:51 BST 2004 Olly Betts <olly@survex.com> 4848 4849 * omindex.cc: Another stab at _POSIX_SOURCE... 4850 4851Thu Apr 15 01:25:29 BST 2004 Olly Betts <olly@survex.com> 4852 4853 * omindex.cc: Added a missing underscore (_POSIX_SOURCE not 4854 POSIX_SOURCE!) 4855 4856Thu Apr 15 00:48:12 BST 2004 Olly Betts <olly@survex.com> 4857 4858 * omindex.cc: Defined POSIX_SOURCE to a suitable value to get 4859 posix_fadvise on some versions of redhat. 4860 4861Mon Apr 12 01:06:58 BST 2004 Olly Betts <olly@survex.com> 4862 4863 * NEWS,configure.in: Version 0.8.0. 4864 4865Mon Apr 12 00:03:57 BST 2004 Olly Betts <olly@survex.com> 4866 4867 * indextext.cc,query.cc: Don't create R terms for terms which start 4868 with a digit. 4869 4870Sun Apr 11 23:47:33 BST 2004 Olly Betts <olly@survex.com> 4871 4872 * omindex.cc: Fixed inconsistent indenting. 4873 4874Sun Apr 11 23:11:51 BST 2004 Olly Betts <olly@survex.com> 4875 4876 * omindex.cc: Call posix_fadvise with POSIX_FADV_DONTNEED just before 4877 closing an input file. Again should help improve indexing 4878 throughput. 4879 4880Fri Apr 02 16:09:03 BST 2004 Olly Betts <olly@survex.com> 4881 4882 * configure.in,omindex.cc: Use O_STREAMING and/or posix_fadvise() 4883 when reading files to be indexed (if available). This helps to 4884 keep the Xapian database in cache, and greatly improve indexing 4885 throughput. 4886 4887Tue Mar 30 00:06:15 BST 2004 Olly Betts <olly@survex.com> 4888 4889 * NEWS: We're now putting omega news here rather than in xapian-core 4890 so composed draft version for the forthcoming 0.8.0 release. 4891 4892Tue Mar 29 23:56:27 BST 2004 Olly Betts <olly@survex.com> 4893 4894 * templates/xml: Remove unused OmegaScript code: 4895 `$set{topterms,$or{$ne{$msize,0},$query}}'. 4896 4897Tue Mar 29 23:55:40 BST 2004 Olly Betts <olly@survex.com> 4898 4899 * Makefile.am: scriptindex needs to link to getopt.c and getopt1.c. 4900 4901Tue Mar 23 19:20:19 GMT 2004 Olly Betts <olly@survex.com> 4902 4903 * templates/xml: Correct spelling of `relavence' to `relevance'. 4904 NB: if you're parsing the XML output, you'll need to fix this 4905 spelling in your parser! 4906 4907Sun Mar 21 14:23:23 GMT 2004 Olly Betts <olly@survex.com> 4908 4909 * scriptindex.cc: Use getopt for option parsing. Change default to 4910 *not* overwriting the database (use --overwrite if you really want 4911 to do this); -u is now accepted but ignored. 4912 4913Fri Mar 12 02:11:28 GMT 2004 Olly Betts <olly@survex.com> 4914 4915 * templates/xml: "Content-Type: application/html" is more appropriate 4916 than text/xml. 4917 4918Fri Mar 12 02:09:33 GMT 2004 Olly Betts <olly@survex.com> 4919 4920 * omindex.cc: Added --overwrite option which forces an existing 4921 database to be deleted before indexing begins. 4922 4923Wed Mar 10 14:39:13 GMT 2004 Olly Betts <olly@survex.com> 4924 4925 * templates/xml: "Content-Type: text/xml". 4926 4927Wed Mar 10 00:08:40 GMT 2004 Olly Betts <olly@survex.com> 4928 4929 * docs/scriptindex.txt: Make more explicit that boolean produces a 4930 *single* boolean term. 4931 4932Tue Mar 09 19:08:19 GMT 2004 Olly Betts <olly@survex.com> 4933 4934 * indextext.cc,omindex.cc,scriptindex.cc: Updated to use add_term() 4935 instead of add_term_nopos(). 4936 4937Wed Mar 03 14:55:50 GMT 2004 Olly Betts <olly@survex.com> 4938 4939 * scriptindex.cc: Use true/false for assigning to booleans, not 1/0. 4940 4941Sat Feb 21 18:33:15 GMT 2004 Olly Betts <olly@survex.com> 4942 4943 * omega.cc,query.cc,docs/omegascript.txt: Added $httpheader 4944 Omegascript to allow arbitrary HTTP headers and alternative 4945 Content-Type headers to be specified. 4946 4947Sat Feb 14 00:32:06 GMT 2004 Olly Betts <olly@survex.com> 4948 4949 * query.cc: If the probabilistic query was bad, don't try to run the 4950 match. 4951 4952Sat Feb 14 00:11:52 GMT 2004 Olly Betts <olly@survex.com> 4953 4954 * docs/cgiparams.txt: Note that START and END should be in the format 4955 YYYYMMDD. 4956 4957Sat Feb 14 00:07:41 GMT 2004 Olly Betts <olly@survex.com> 4958 4959 * query.cc: Don't crash if there's a date filter but no probabilistic 4960 query. 4961 4962Wed Nov 26 22:44:49 GMT 2003 Olly Betts <olly@survex.com> 4963 4964 * indextext.cc: Raw terms with a multicharacter prefix are now indexed 4965 with a : inserted (e.g. as XFOO:Rterm). This matches what the query 4966 parser does. 4967 4968Wed Nov 26 16:25:16 GMT 2003 Olly Betts <olly@survex.com> 4969 4970 * configure.in: Version 0.7.5. 4971 4972Sun Nov 23 03:28:21 GMT 2003 Olly Betts <olly@survex.com> 4973 4974 * query.cc,docs/omegascript.txt: Added note that $setmap{prefix,...} 4975 needs be used before any commands which require the query to be 4976 parsed. 4977 4978Thu Nov 20 02:44:55 GMT 2003 Olly Betts <olly@survex.com> 4979 4980 * docs/omegascript.txt: Expanded documentation of $set and $setmap to 4981 include values which Omega itself makes use of. 4982 4983Thu Nov 20 02:43:03 GMT 2003 Olly Betts <olly@survex.com> 4984 4985 * omega.cc,query.cc: Set default value for $opt{stemmer} to "english" 4986 rather than taking "" to mean English. 4987 4988Tue Oct 21 21:29:18 BST 2003 Olly Betts <olly@survex.com> 4989 4990 * query.cc: Fixed $setmap{} to not add bogus entries. 4991 4992Tue Oct 21 21:20:31 BST 2003 Olly Betts <olly@survex.com> 4993 4994 * query.cc: Allow the QueryParser prefix map to be set up using 4995 $setmap{prefix,...} (e.g. $setmap{prefix,subject,XT,abstract,XA}). 4996 4997Tue Oct 21 21:13:59 BST 2003 Olly Betts <olly@survex.com> 4998 4999 * query.cc: Only parse probabilistic query once! 5000 5001Tue Oct 21 20:03:27 BST 2003 Olly Betts <olly@survex.com> 5002 5003 * omega.cc,omega.h,query.cc,query.h: Reworked so that the 5004 probabilistic query isn't parsed until we need some 5005 information from it. This means that we can now use options 5006 set by the omegascript template to control the behaviour of the 5007 query parser. 5008 5009Thu Oct 16 21:17:01 BST 2003 Olly Betts <olly@survex.com> 5010 5011 * omega.cc: Renamed `big_buf' to `query_string' and eliminated `more' 5012 flag and use of goto; tidied up order of reading CGI variables; use 5013 const refs to value strings in cgi_params map rather than copying 5014 the strings out. 5015 5016Sat Oct 11 20:43:04 BST 2003 Olly Betts <olly@survex.com> 5017 5018 * omega.cc,omega.h,query.cc: Make rset an object rather than a pointer 5019 to an object. 5020 5021Fri Oct 10 18:06:10 BST 2003 Olly Betts <olly@survex.com> 5022 5023 * query.cc: Removed the unfinished code for caching omegascript 5024 command expansions. Added code to cache $dbsize. The only other 5025 value correctly marked for caching is already being cached! 5026 5027Thu Oct 02 15:18:19 BST 2003 Olly Betts <olly@survex.com> 5028 5029 * configure.in: Version 0.7.4. 5030 5031Thu Oct 02 15:16:41 BST 2003 Olly Betts <olly@survex.com> 5032 5033 * query.cc: $date doesn't require the match to be run to work, but 5034 $topdoc does! 5035 5036Tue Sep 30 18:32:25 BST 2003 Olly Betts <olly@survex.com> 5037 5038 * query.cc: Cleaner version of T macro. 5039 5040Tue Sep 30 18:09:30 BST 2003 Olly Betts <olly@survex.com> 5041 5042 * query.cc: Hopefully the final piece in the Sun C++ puzzle. 5043 5044Tue Sep 30 00:59:50 BST 2003 Olly Betts <olly@survex.com> 5045 5046 * query.cc: Cleaned up a recent fix by using clean generic code which 5047 works on Sun's C++ too. 5048 5049Mon Sep 29 17:12:10 BST 2003 Olly Betts <olly@survex.com> 5050 5051 * cgiparam.cc: Portability fixes for Sun's C++ compiler. 5052 5053Mon Sep 29 13:26:22 BST 2003 Olly Betts <olly@survex.com> 5054 5055 * query.cc: Another Sun C++ fix. 5056 5057Mon Sep 29 11:49:30 BST 2003 Olly Betts <olly@survex.com> 5058 5059 * query.cc,omega.cc: More fixes for Sun's really rather rubbish 5060 C++ compiler. 5061 5062Mon Sep 29 01:39:56 BST 2003 Olly Betts <olly@survex.com> 5063 5064 * query.cc: Fixes for compiling with Sun's C++ compiler. 5065 5066Mon Sep 29 01:17:39 BST 2003 Olly Betts <olly@survex.com> 5067 5068 * omega.cc: Added workaround for compilation problem with Sun's C++. 5069 5070Fri Aug 08 01:39:51 BST 2003 Olly Betts <olly@survex.com> 5071 5072 * configure.in: Version 0.7.3. 5073 5074Sat Aug 02 01:52:38 BST 2003 Olly Betts <olly@survex.com> 5075 5076 * configure.in,omindex.cc,query.cc: Fixed to compile on mingw 5077 where ftime() returns void. 5078 5079Fri Aug 01 20:59:57 BST 2003 Olly Betts <olly@survex.com> 5080 5081 * scriptindex.cc: Added #define for sleep() on __WIN32__. 5082 5083Wed Jul 30 19:05:17 BST 2003 Olly Betts <olly@survex.com> 5084 5085 * getopt.h: Copied over latest getopt.h from xapian-core. 5086 5087Sun Jul 27 16:34:19 BST 2003 Olly Betts <olly@survex.com> 5088 5089 * Makefile.am,getopt.c,getopt.h,getopt1.c: Copied our version of GNU 5090 getopt here from xapian-core so we can build omindex on non-glibc 5091 platforms (modifications are for better C++ compatibility). 5092 5093Mon Jul 21 01:16:59 BST 2003 Olly Betts <olly@survex.com> 5094 5095 * configure.in: Use libtool; OM_PATH_XAPIAN -> XO_LIB_XAPIAN. 5096 5097Sat Jul 19 19:26:03 BST 2003 Olly Betts <olly@survex.com> 5098 5099 * omindex.cc: Added missing `#include <errno.h>'. 5100 5101Sat Jul 19 19:24:50 BST 2003 Olly Betts <olly@survex.com> 5102 5103 * indextext.cc: Fixed signed character issue. 5104 5105Thu Jul 17 00:51:42 BST 2003 Olly Betts <olly@survex.com> 5106 5107 * bootstrap: Removed bootstrap in favour of top-level bootstrap. 5108 5109Tue Jul 15 16:27:52 BST 2003 Olly Betts <olly@survex.com> 5110 5111 * omindex.cc: file_to_string() and stdout_to_string() now throw an 5112 exception on a read error, avoiding the " "-for-empty-file bodge. 5113 5114Tue Jul 15 15:18:32 BST 2003 James Aylett <james@tartarus.org> 5115 5116 * omindex.cc: fix file_to_string() to return the file on 5117 success, and not leak memory on empty files. Fix callers 5118 to give up on unreadable files, not vice versa. Fix 5119 logging messages to distinguish re-indexed/added. 5120 5121Fri Jul 11 15:09:55 BST 2003 Olly Betts <olly@survex.com> 5122 5123 * configure.in: Version 0.7.2. 5124 5125Fri Jul 11 12:08:57 BST 2003 Olly Betts <olly@survex.com> 5126 5127 * omega.cc: If the same database is listed more than once, only search 5128 the first occurrence. 5129 5130Fri Jul 11 11:57:24 BST 2003 Olly Betts <olly@survex.com> 5131 5132 * configure.in,utils.cc: Use snprintf. 5133 5134Tue Jul 08 17:56:39 BST 2003 Olly Betts <olly@survex.com> 5135 5136 * configure.in: Version 0.7.1. 5137 5138Tue Jul 08 17:34:01 BST 2003 Olly Betts <olly@survex.com> 5139 5140 * omindex.cc: Fixed compilation problem. 5141 5142Fri Jul 04 22:12:32 BST 2003 Olly Betts <olly@survex.com> 5143 5144 * bootstrap: add missing ';;' as case pattern delimiter 5145 5146Thu Jul 03 23:34:50 BST 2003 Olly Betts <olly@survex.com> 5147 5148 * configure.in: Version 0.7.0. 5149 5150Thu Jul 03 23:33:05 BST 2003 Olly Betts <olly@survex.com> 5151 5152 * omindex.cc: Abort parsing of document if it's excluded from 5153 indexing; ignore anything outside of the first <body>...</body>, 5154 if present. 5155 5156Tue Jun 24 00:45:28 BST 2003 Olly Betts <olly@survex.com> 5157 5158 * docs/overview.txt: Added note about hashing of long URL terms and 5159 reworked structure a little. 5160 5161Mon Jun 23 21:11:41 BST 2003 Olly Betts <olly@survex.com> 5162 5163 * bootstrap: Check for Bison 1.875 which doesn't work with Xapian. 5164 5165Mon Jun 23 16:52:47 BST 2003 Olly Betts <olly@survex.com> 5166 5167 * omega.cc,omindex.cc,scriptindex.cc: Xapian::PostListIterator -> 5168 Xapian::PostingIterator. 5169 5170Thu Jun 19 20:02:00 BST 2003 Olly Betts <olly@survex.com> 5171 5172 * symboltab.h: Convert hardspace to space. 5173 5174Wed Jun 18 16:32:34 BST 2003 Olly Betts <olly@survex.com> 5175 5176 * scriptindex.cc: Removed already disabled unique id hashing to docid 5177 code. Xapian doesn't support setting arbitrary docids - if it ever 5178 does we can retrieve this code from CVS. 5179 5180Wed Jun 18 16:28:33 BST 2003 Olly Betts <olly@survex.com> 5181 5182 * Makefile.am,indextext.cc,indextext.h,omindex.cc,scriptindex.cc: 5183 Normalise accents in probabilistic terms. 5184 5185Tue Jun 17 17:54:32 BST 2003 Olly Betts <olly@survex.com> 5186 5187 * omindex.cc: Read output from pstotext and pdftotext via pipes rather 5188 than temporary files to side-step the whole problem of secure 5189 temporary file creation; Use pdfinfo to get the title and keywords 5190 from when indexing a PDF; Safe filename escaping tweaked to not 5191 escape common safe punctuation. 5192 5193Tue Jun 17 17:50:00 BST 2003 Olly Betts <olly@survex.com> 5194 5195 * htmlparse.cc,htmlparse.h: Moved initialisation of named_ents out of 5196 header - it's not a sensible candidate for inlining. 5197 5198Wed Jun 11 02:32:25 BST 2003 Olly Betts <olly@survex.com> 5199 5200 * date.cc,date.h,omega.cc,omega.h,omindex.cc,query.cc,query.h, 5201 scriptindex.cc: Om -> Xapian::, etc. 5202 5203Fri Jun 6 01:04:12 BST 2003 Richard Boulton <richard@tartarus.org> 5204 5205 * omindex.cc: Implement an upper limit on the length of URL 5206 terms. Currently, this is set at 240 characters - it can 5207 probably be increased slightly, but I'm not sure exactly 5208 how long a term can safely be. If the URL term would be 5209 longer than this, its last few bytes are replaced by a 5210 hash of the tail of the URL. This means that (apart from 5211 hopefully very rare collisions) urlterms should still be 5212 unique ids for documents. 5213 5214Fri Jun 06 00:14:13 BST 2003 Richard Boulton <richard@tartarus.org> 5215 5216 * omindex.cc: Clean up processing of HTML documents: 5217 - Ignore the contents of <script> and <style> tags in HTML. 5218 - Strip initial whitespace in each tag in an HTML document. 5219 - Try not to split words in half when truncating title and 5220 summary. 5221 5222Tue Jun 03 11:15:28 BST 2003 Olly Betts <olly@survex.com> 5223 5224 * templates/query: Create log entry in query.log. 5225 5226Thu May 29 18:03:54 BST 2003 Olly Betts <olly@survex.com> 5227 5228 * query.cc: Fixed bug in DEFAULT_LOG_ENTRY's Omegascript. 5229 5230Thu May 29 00:22:28 BST 2003 Olly Betts <olly@survex.com> 5231 5232 * query.cc: Set STEM_LANGUAGE near the start of the file so it's easy 5233 for users to change until we get better configurability. 5234 5235Thu May 29 00:00:28 BST 2003 Olly Betts <olly@survex.com> 5236 5237 * Makefile.am,date.cc,date.h,query.cc: Split code to build a 5238 date range filter into a separate file. 5239 5240Wed May 28 23:38:02 BST 2003 Olly Betts <olly@survex.com> 5241 5242 * configfile.cc,configfile.h,omega.cc,omega.conf,query.cc,query.h, 5243 docs/omegascript.txt,docs/overview.txt,docs/quickstart.txt: 5244 Replaced half-hearted logging support with flexible 5245 OmegaScript-based approach with new $log command. Also added 5246 $now to allow the current date/time to be logged. 5247 5248Tue May 27 17:55:24 BST 2003 Olly Betts <olly@survex.com> 5249 5250 * query.cc: Added missing "#include <assert.h>". 5251 5252Mon May 26 22:41:26 BST 2003 Olly Betts <olly@survex.com> 5253 5254 * configure.in: Don't use libtool; Use AC_CONFIG_FILES - it's the new 5255 autoconf way! 5256 5257Mon May 26 12:12:22 BST 2003 Olly Betts <olly@survex.com> 5258 5259 * omega.spec.in: Removed %changelog - it hasn't been reliably updated 5260 and only really makes sense when the packaging is done by a third 5261 party anyway. 5262 5263Mon May 26 12:01:55 BST 2003 Olly Betts <olly@survex.com> 5264 5265 * query.cc: If the query is empty, don't bother running it through 5266 enquire. 5267 5268Wed Apr 30 01:18:47 BST 2003 Olly Betts <olly@survex.com> 5269 5270 * docs/cgiparams.txt,docs/omegascript.txt: Minor improvements. 5271 5272Wed Apr 30 01:14:46 BST 2003 Olly Betts <olly@survex.com> 5273 5274 * query.cc: Use correct types for docid and value_no in $value. 5275 5276Wed Apr 23 16:15:07 BST 2003 Sam Liddicott <sam@liddicott.com> 5277 5278 * templates/xml: add collapse info to xml template. 5279 5280Wed Apr 23 14:00:37 BST 2003 Olly Betts <olly@survex.com> 5281 5282 * omega.spec.in: Merged changes from Fabrice Colin. 5283 5284Thu Apr 10 03:14:51 BST 2003 Olly Betts <olly@survex.com> 5285 5286 * configure.in: Updated for 0.6.5 release. 5287 5288Wed Apr 09 13:56:14 BST 2003 Olly Betts <olly@survex.com> 5289 5290 * omega.cc,query.cc,omega.h,docs/cgiparams.txt: Renamed DATE1, DATE2, 5291 and DAYSMINUS to the more meaningful START, END, and SPAN (NB SPAN 5292 is days before END, or after START, or before today - whereas 5293 SPAN was before *DATE1* or before today). The old parameters names 5294 are supported (with the original semantics) for now. 5295 5296Wed Apr 09 13:44:28 BST 2003 Olly Betts <olly@survex.com> 5297 5298 * Makefile.am: Install docs in /usr/share/doc/omega to be FHS 5299 compliant. 5300 * omega.spec.in: Consistently use %{contentdir} instead of /var/lib; 5301 removed redundant second setting of %docdir. 5302 5303Wed Apr 09 01:21:57 BST 2003 Olly Betts <olly@survex.com> 5304 5305 * Makefile.am: Removed bogus extra "\". 5306 5307Mon Mar 31 19:42:24 BST 2003 Olly Betts <olly@survex.com> 5308 5309 * Makefile.am: Install documentation! 5310 * omega.spec.in: Merged in changes to RPM packaging from Fabrice Colin 5311 and reworked further. 5312 5313Fri Mar 28 17:47:45 GMT 2003 Olly Betts <olly@survex.com> 5314 5315 * templates/query,templates/query2: Removed bogus setting of defunct 5316 xB parameter; correctly propagate multiple B parameters. 5317 5318Fri Mar 28 17:45:41 GMT 2003 Olly Betts <olly@survex.com> 5319 5320 * omindex.cc: Report correct version number (was hard-wired to 1.0!) 5321 5322Tue Mar 25 14:46:10 GMT 2003 Olly Betts <olly@survex.com> 5323 5324 * query.cc: If xP and P are both empty, classify as SAME_QUERY not 5325 NEW_QUERY as there may be a boolean query too. 5326 * query.cc: Fixed off-by-one error in rounding down topdoc - it was 5327 possible to get to an empty page of hits if there were exactly a 5328 multiple of HITSPERPAGE matches and the matcher over-estimated the 5329 number of matches and Omega displayed page links. 5330 5331Mon Mar 24 09:40:04 GMT 2003 Sam Liddicott <sam.liddicott@orange.co.uk> 5332 5333 * templates/query: Added propagation of B boolean filter 5334 * templates/query2: factored about a bit more, query2 is 5335 a more modular version of query which will ultimately 5336 lend itself to customisation a bit more to the uninitiated. 5337 5338Tue Mar 04 01:02:12 GMT 2003 Olly Betts <olly@survex.com> 5339 5340 * omega.cc: Fixed handling of multiple DB parameters to be as 5341 documented. 5342 5343Fri Feb 28 09:52:03 GMT 2003 Sam Liddicott <sam.liddicott@orange.co.uk> 5344 5345 * Added $collapsed to omegascript to give the number of hits 5346 collapsed into the current hit, eg: 5347 5348 $if{$ne{$collapsed,0},$collapsed hidden results 5349 ($value{$cgi{COLLAPSE}})} 5350 5351 * templates/godmode: removed euro ferret icon reference 5352 * templates/godmode: added value dumping, for values from 0-255 5353 5354Thu Feb 27 11:58:13 GMT 2003 Olly Betts <olly@survex.com> 5355 5356 * Makefile.am,query.cc,docs/omegascript.txt,templates/query: 5357 Added $transform{} which does regexp manipulation (currently 5358 disabled); Added $uniq{} to eliminate duplicates from a sorted 5359 list; Fixed a query with repeated terms to be identified as 5360 SAME_QUERY not EXTENDED_QUERY; remove duplicates from terms 5361 listed in term frequencies. 5362 5363Wed Feb 26 17:50:26 GMT 2003 Olly Betts <olly@survex.com> 5364 5365 * scriptindex.cc: Allow '_' in fieldnames. Diagnose bad characters 5366 in fieldnames better. 5367 5368Wed Feb 26 15:13:02 GMT 2003 Sam Liddicott <sam.liddicott@orange.co.uk> 5369 5370 * dbi2omega: Add DBUSER and DBPASSWD env var support so that password 5371 protected DB's can easily be used 5372 * add cgi parameter COLLAPSE to collapse on key values 5373 * Add $value{key[,docid]} support to omegascript 5374 5375Wed Feb 26 09:58:01 GMT 2003 Sam Liddicott <sam.liddicott@orange.co.uk> 5376 5377 * bootstrap: Fix success message when building in non-src dir 5378 as configure is written to the src dir. 5379 5380Mon Jan 6 12:47:55 GMT 2003 James Aylett <james@tartarus.org> 5381 5382 * scriptindex.cc: build fix 5383 5384Tue Dec 24 20:12:23 GMT 2002 Olly Betts <olly@survex.com> 5385 5386 * configure.in: Version 0.6.4. 5387 5388Tue Dec 24 20:06:47 GMT 2002 Olly Betts <olly@survex.com> 5389 5390 * scriptindex.cc: Minor tweak. 5391 5392Tue Dec 24 19:58:57 GMT 2002 Olly Betts <olly@survex.com> 5393 5394 * omega.cc,docs/cgiparams.txt: Prefer MINHITS to MIN_HITS and 5395 RAWSEARCH to RAW_SEARCH since none of the other CGI parameter 5396 names have _ separating words. Also support old names for now. 5397 5398Mon Dec 23 03:23:33 GMT 2002 Olly Betts <olly@survex.com> 5399 5400 * query.cc,docs/omegascript.txt,templates/query: Added $unstem to map 5401 a stemmed term to the form(s) used in the query; $queryterms now 5402 only includes the first occurrence of each stemmed form; $prettyterm 5403 uses the unstem map. 5404 5405Sat Dec 21 17:47:33 GMT 2002 Olly Betts <olly@survex.com> 5406 5407 * scriptindex.cc,docs/scriptindex.txt: Replaced index=nopos with 5408 indexnopos action; index and indexnopos now take an optional 5409 prefix argument; index=nopos is handled specially for backwards 5410 compatibility. 5411 5412Sat Dec 21 17:18:02 GMT 2002 Olly Betts <olly@survex.com> 5413 5414 * scriptindex.cc,docs/scriptindex.txt: Added new scriptindex action 5415 date=FORMAT to generate terms for date range searching. 5416 5417Sat Dec 21 01:51:32 GMT 2002 Olly Betts <olly@survex.com> 5418 5419 * templates/query: Stop topterms sticking out of green box with 5420 gecko based browsers. 5421 5422Sat Dec 21 01:44:53 GMT 2002 Olly Betts <olly@survex.com> 5423 5424 * Makefile.am: Distribute docs/scriptindex.txt. 5425 * docs/omegascript.txt: It's $setrelevant not $set_relevant. 5426 5427Sat Dec 14 13:54:10 GMT 2002 Olly Betts <olly@survex.com> 5428 5429 * configure.in: Version 0.6.3; removed -Wno-long-long as we don't use 5430 long long here. 5431 * query.cc: Compilation fixes. 5432 * templates/query: Don't call $topterms twice! 5433 5434Sat Dec 14 01:10:48 GMT 2002 Olly Betts <olly@survex.com> 5435 5436 * query.cc: Updated in line with removal of OmSettings. 5437 5438Wed Dec 11 00:58:49 GMT 2002 Olly Betts <olly@survex.com> 5439 5440 * configure.in,query.cc,docs/omegascript.txt,templates/query: 5441 Added $time which reports how long the match took - when searching 5442 on a remote website, it's hard to gauge how much time is taken by 5443 the search, and how much by the web server and browser; renamed 5444 and_vec to or_vec which better describes its purpose. 5445 5446Mon Dec 09 17:11:26 GMT 2002 Olly Betts <olly@survex.com> 5447 5448 * query.cc,docs/omegascript.txt,templates/query: Added $dbsize 5449 to return the number of documents in the database being searched. 5450 Use this in the default query template on the "front page" shown 5451 when there's no search. 5452 5453Mon Dec 09 02:55:46 GMT 2002 Olly Betts <olly@survex.com> 5454 5455 * query.cc,docs/omegascript.txt,templates/query: Added $msizeexact 5456 which returns "true" if $msize if exact (or "" if it is estimated). 5457 This means that you'll see "... of about N matches" less often - 5458 notably it's gone when searching for a single term, which is a 5459 pretty common case. 5460 5461Sun Dec 08 08:42:47 GMT 2002 Olly Betts <olly@survex.com> 5462 5463 * scriptindex.cc: Replaced icky unportable code which set the filename 5464 to "/dev/fd/0" in order to read from stdin. 5465 5466Sun Dec 08 06:39:30 GMT 2002 Olly Betts <olly@survex.com> 5467 5468 * query.cc,docs/omegascript.txt: Fixed $hitlist to complain if more 5469 than one parameter is passed; $topterms now defaults to 16 terms 5470 rather than 20; $topterms now weeds out terms which stem to the 5471 same as those in the query, or those already in $topterms. 5472 5473Sun Dec 08 06:36:04 GMT 2002 Olly Betts <olly@survex.com> 5474 5475 * templates/query: Make background white - the very light grey just 5476 looks dirty; fixed exclusion of TopTerms Javascript when there 5477 are not TopTerms; sample now <small>; language and size now 5478 appear when the corresponding fields are present; fixed 5479 unmatched </small>; fixed missing list of terms matching 5480 each document. 5481 5482Sat Dec 07 21:20:31 GMT 2002 Olly Betts <olly@survex.com> 5483 5484 * configure.in: Version 0.6.2. 5485 5486Sat Dec 07 21:04:31 GMT 2002 Olly Betts <olly@survex.com> 5487 5488 * query.cc: Prefer "while (true)" to "while (1)". 5489 5490Fri Dec 06 04:41:05 GMT 2002 Olly Betts <olly@survex.com> 5491 5492 * omindex.cc: Index .php files by default; non-zero return code if 5493 an exception is caught. 5494 5495Fri Dec 06 04:30:17 GMT 2002 Olly Betts <olly@survex.com> 5496 5497 * htmlparse.cc: Ignore PHP tags and their contents; fixed tag 5498 scanning code to never read one character past the end of 5499 the document. 5500 5501Wed Dec 04 18:42:51 GMT 2002 Olly Betts <olly@survex.com> 5502 5503 * omega.cc,omega.h,omindex.cc,query.cc,scriptindex.cc: 5504 Updated in line with OmSettings related changes to the API. 5505 5506Wed Dec 04 17:13:43 GMT 2002 Olly Betts <olly@survex.com> 5507 5508 * query.cc: Fixed $dbname to return "default" for the default 5509 database, rather than "" - this fixes paging in searches of the 5510 default database. 5511 * templates/query: Removed xDEFAULTOP hidden field which is no longer 5512 used. 5513 5514Wed Dec 04 11:57:13 GMT 2002 Olly Betts <olly@survex.com> 5515 5516 * templates/query: Removed bogus unmatched '}'. 5517 5518Thu Nov 28 20:24:08 GMT 2002 Olly Betts <olly@survex.com> 5519 5520 * omega.cc,query.cc: Updated in line with OmEnquire::get_eset() no 5521 longer taking an OmSettings object. 5522 5523Wed Nov 27 19:02:12 GMT 2002 Olly Betts <olly@survex.com> 5524 5525 * dbi2omega: Return fields in table order; more efficient; 5526 report any error reading a row; if we get a NULL field, 5527 don't output it, and suppress perl warning about use of 5528 an undefined program. 5529 5530Wed Nov 27 05:22:04 GMT 2002 Olly Betts <olly@survex.com> 5531 5532 * configure.in: Set version to 0.6.0. 5533 5534Wed Nov 27 05:21:00 GMT 2002 Olly Betts <olly@survex.com> 5535 5536 * configure.in,htmlparse.h,omindex.cc,scriptindex.cc: 5537 Use "-Wall -W" rather than "-Wall -Wunused", and fixed the 5538 warnings this reveals. 5539 5540Wed Nov 27 04:20:13 GMT 2002 Olly Betts <olly@survex.com> 5541 5542 * Makefile.am,dbi2omega: Added perl script to dump any database 5543 which perl DBI can access into the dump format expected by 5544 scriptindex. 5545 5546Wed Oct 30 02:02:32 GMT 2002 Olly Betts <olly@survex.com> 5547 5548 * omega.spec.in: Use bootstrap instead of buildall; don't use "-j4" 5549 with make - most people don't all have quad processor boxes! 5550 5551Wed Oct 30 01:56:31 GMT 2002 Olly Betts <olly@survex.com> 5552 5553 * buildall: Removed in favour of bootstrap script. 5554 5555Tue Oct 29 02:01:58 GMT 2002 Olly Betts <olly@survex.com> 5556 5557 * omindex.cc,scriptindex.cc: Added MAX_PROB_TERM_LENGTH (set to 5558 64) to limit size of probabilistic terms. 5559 5560Sat Oct 12 17:09:55 BST 2002 Olly Betts <olly@survex.com> 5561 5562 * bootstrap: Copied bootstrap script from xapian-core. 5563 5564Sat Oct 12 17:05:37 BST 2002 Olly Betts <olly@survex.com> 5565 5566 * configure.in: Version 0.5.3. 5567 5568Wed Oct 09 16:55:56 BST 2002 Olly Betts <olly@survex.com> 5569 5570 * omega.cc,omega.h,query.cc,docs/{cgiparams.txt,omegascript.txt}, 5571 templates/query: revamped the "reset first page when filter changes" 5572 scheme - all filtery things are now serialised and put into the 5573 xFILTER CGI parameter, which copes with multiple B values. Support 5574 for the old way (xB, xDATE1, xDATE2, xDAYSMINUS, xDEFAULTOP) is 5575 included for now (but only copes with a single B value). Added (and 5576 documented) $filters Omegascript command to implement this. 5577 * query.cc: fixed handling of case when topdoc is non-zero, but 5578 no matches were found. This was causing topdoc to be set to -6! 5579 * query.cc: fixed handling of prefixes starting with an X. 5580 5581Wed Oct 09 15:35:54 BST 2002 Olly Betts <olly@survex.com> 5582 5583 * .cvsignore: Added scriptindex and omega-*.tar.gz; removed libtool. 5584 5585Sun Oct 06 18:56:40 BST 2002 Olly Betts <olly@survex.com> 5586 5587 * configure.in: Version 0.5.2. 5588 5589Thu Oct 03 16:42:06 BST 2002 Olly Betts <olly@survex.com> 5590 5591 * query.cc: Added CMD_hit to enumeration. 5592 5593Wed Oct 02 17:02:25 BST 2002 Olly Betts <olly@survex.com> 5594 5595 * configure.in: Version 0.5.1. 5596 * Makefile.am,configure.in: require automake 1.6.3 and autoconf 2.54 5597 since xapian-core does anyway, and it neatens configure.in slightly. 5598 5599Wed Oct 02 16:58:39 BST 2002 Olly Betts <olly@survex.com> 5600 5601 * query.cc,docs/omegascript.txt: Added $hit which gives the m-set 5602 number of the current hit. 5603 5604Sun Sep 22 15:47:33 BST 2002 Olly Betts <olly@survex.com> 5605 5606 * configfile.cc: Corrected use of string.data() to string.c_str(). 5607 5608Sun Sep 22 03:53:35 BST 2002 Olly Betts <olly@survex.com> 5609 5610 * templates/query: Updated xapian url to http://www.xapian.org/ 5611 5612Fri Sep 20 15:36:35 BST 2002 Olly Betts <olly@survex.com> 5613 5614 * configure.in: Version 0.5.0. 5615 5616Sun Sep 15 03:07:31 BST 2002 Richard Boulton <richard.boulton@omsee.com> 5617 5618 * buildall: Update to latest version, to fix bug with VPATH version 5619 checking for autoconf. 5620 5621Thu Sep 12 15:11:16 BST 2002 Olly Betts <olly@survex.com> 5622 5623 * htmlparse.cc: Add comment about string::replace() invalidating 5624 iterators. 5625 5626Thu Sep 12 13:38:05 BST 2002 Olly Betts <olly@survex.com> 5627 5628 * omegascript.vim,omegascript.txt,query.cc: cosmetic tweaks. 5629 5630Thu Sep 5 14:47:54 BST 2002 Richard Boulton <richard@tartarus.org> 5631 5632 * configure.in: Don't use libtool. I don't know why I ever thought 5633 it was needed. 5634 5635Thu Sep 5 14:11:51 BST 2002 Richard Boulton <richard@tartarus.org> 5636 5637 * query.cc: Change $and to return true iff all its arguments are 5638 not false, rather than if one or more of the arguments is false. 5639 * docs/omegascript.txt: Update documentation of $and{} 5640 5641Fri Aug 23 13:27:02 BST 2002 James Aylett <tartarus@users.sourceforge.net> 5642 5643 * docs/quickstart.txt: encourage people to call their first 5644 database 'default' since this will work straight off. 5645 5646Wed Aug 21 17:52:36 BST 2002 Richard Boulton <richard@tartarus.org> 5647 5648 * query.cc: Add $slice{} command, to slice a list at a set of 5649 positions (given by a second list). 5650 Also, bugfix: require $hitlist{} to take at least one parameter: 5651 it currently segfaults if given none. 5652 * docs/omegascript.txt: Document $slice{}. 5653 * extra/omegascript.vim: Update syntax highlighting. 5654 5655Wed Aug 21 18:03:43 BST 2002 James Aylett <tartarus@users.sourceforge.net> 5656 5657 * omindex.cc: tidy up output so it doesn't wrap so much 5658 5659Wed Aug 21 18:01:38 BST 2002 James Aylett <tartarus@users.sourceforge.net> 5660 5661 * htmlparse.cc: fixed bug in entity reference handling 5662 5663Wed Aug 21 13:21:12 BST 2002 James Aylett <tartarus@users.sourceforge.net> 5664 5665 * omindex.cc: Bugfix to metaterm generation when operating on an 5666 absolute URL that is also at the root of its web server. 5667 5668Wed Aug 21 10:48:06 BST 2002 Richard Boulton <richard@tartarus.org> 5669 5670 * scriptindex.cc: If a field has multiple instances, keep all of 5671 them (previously only kept the final occurrence). 5672 * docs/scriptindex.txt: Mention that multiple instances of fields 5673 are permitted. 5674 5675Tue Aug 20 18:02:45 BST 2002 James Aylett <tartarus@users.sourceforge.net> 5676 5677 * docs/quickstart.txt: correct for new(ish) omindex behaviour 5678 5679Sat Aug 17 13:38:57 BST 2002 Richard Boulton <richard@tartarus.org> 5680 5681 * extra/omegascript.vim: Quick attempt at a vim syntax highlighting 5682 file for omegascript. Recognises files only if they're in a 5683 directory called "templates": perhaps we should adopt a suffix to 5684 make recognition easier. 5685 Read the file for installation instructions. 5686 5687Thu Aug 15 11:21:20 BST 2002 Richard Boulton <richard@tartarus.org> 5688 5689 * scriptindex.cc: Allow updating of databases by a command line 5690 switch, and also turn off verbose output (can be turned back 5691 on with a switch). 5692 * docs/scriptindex.txt: Document the "unique" tag. 5693 5694Thu Aug 15 11:18:21 BST 2002 Richard Boulton <richard@tartarus.org> 5695 5696 * buildall: Copy buildall from xapian-core - the old one breaks 5697 for me (due to odd aclocal paths) but the new one is fine. 5698 We should make a common module to hold build stuff to be shared 5699 between modules, though. 5700 5701Mon Aug 12 01:34:42 BST 2002 Richard Boulton <richard@tartarus.org> 5702 5703 * scriptindex.cc: Bug fix - index without positional information 5704 if "nopos" is specified, rather than the other way around. 5705 Bug fix - don't completely eradicate newlines in multiline values, 5706 until they have a chance to be converted to spaces. 5707 Delete documents if no fields other than unique fields are 5708 specifed. 5709 Add some simple debugging, and write messages to a log file in 5710 the database directory. 5711 5712 * configure.in: Use libtool. 5713 5714Fri Aug 9 13:57:32 BST 2002 Richard Boulton <richard@tartarus.org> 5715 5716 * scriptindex.cc: Fix compile errors, by changing string 5717 constructors to take begin and end iterators, instead of a begin 5718 and a length. 5719 5720Fri Jul 05 19:33:55 BST 2002 Olly Betts <olly@survex.com> 5721 5722 * omega.spec.in: Fixed wrt /usr/lib/omega/bin/omega. 5723 5724Fri Jul 05 19:20:05 BST 2002 Olly Betts <olly@survex.com> 5725 5726 * Makefile.am, docs/quickstart.txt: Install omega as 5727 ${prefix}/lib/omega/bin/omega. 5728 5729Thu Jul 04 02:11:46 BST 2002 Olly Betts <olly@survex.com> 5730 5731 * scriptindex.cc, docs/scriptindex.txt: new indexer - indexing 5732 behaviour is controlled by a simple but powerful script. 5733 5734 * Makefile.am: tidied up. 5735 5736 * configfile.cc, docs/quickstart.txt: database and templates default to 5737 being in /var/lib/omega rather than /home/omega. 5738 5739 * docs/quickstart.txt: describe the new test mode (command line) rather 5740 than the old one (stdin). 5741 5742 * omega.cc, docs/cgiparams.txt: If xP isn't set, honour paging and 5743 R-set. So RAW_SEARCH now only disables snapping TOPDOC to a multiple 5744 of HITSPERPAGE. 5745 5746 * query.cc: "using namespace std;" 5747 5748Fri Jun 14 00:07:20 BST 2002 Olly Betts <olly@survex.com> 5749 5750 * $prettyterm{} no longer adds a trailing '.' if the term also exists 5751 with an R prefix and stems to itself. 5752 5753Fri Jun 14 00:02:16 BST 2002 Olly Betts <olly@survex.com> 5754 5755 * MORELIKE can now take a termname - this allows MORELIKE to be used 5756 with a unique id from an external database if it has been indexed 5757 as a boolean term. 5758 5759Thu Jun 13 00:01:11 BST 2002 Olly Betts <olly@survex.com> 5760 5761 * omega.conf: removed trailing slashes from directory names. 5762 5763 * query.cc: removed extra slash added to template_dir; improved 5764 reporting of errors opening template file. 5765 5766Wed Jun 12 23:51:11 BST 2002 Olly Betts <olly@survex.com> 5767 5768 * Added an alternative test mode - you can now pass parameters as 5769 command line arguments, which is more convenient for repeating 5770 the same test query, and for automated testing, e.g.: 5771 5772 omega 'P=information retrieval' DB=papers 5773 5774 If the first parameter starts with a "-" and doesn't contain an 5775 "=", omega now outputs the version string and stops (to gracefully 5776 handle "omega --version" and "omega --help". 5777 5778Wed Jun 12 23:39:20 BST 2002 Olly Betts <olly@survex.com> 5779 5780 * omindex.cc: removed OLD_PREFIXES code - shout if you were using it. 5781 5782Fri May 17 14:09:25 BST 2002 Olly Betts <olly@survex.com> 5783 5784 * Pass the database to the query parser (not used there at present, 5785 but will allow wildcarded searches, etc to be implemented). 5786 5787Thu May 16 17:57:34 BST 2002 Olly Betts <olly@survex.com> 5788 5789 * <algo.h> -> <algorithm>. 5790 5791Thu May 16 15:41:14 BST 2002 Sam Liddicott <sam@ananova.com> 5792 5793 * Removed extra package again! 5794 5795 * Moved images to /var/www/icons/omega till we think of something 5796 better. Should be the most harmless solution that still works 5797 without requireing too much brains on the part of the installer 5798 5799Thu May 16 14:53:54 BST 2002 Sam Liddicott <sam@ananova.com> 5800 5801 * Moved images to a separate optional package to stop touching 5802 user's web tree until we work out what to do. sysadmin can 5803 still install images if he wants and on a redhat box they will 5804 end up in the right place. This will no doubt get revisted later, 5805 that's fine by me. 5806 5807Thu May 16 13:31:27 BST 2002 Sam Liddicott <sam@ananova.com> 5808 5809 * Added loads more missing files like images and templates to the 5810 package 5811 5812 * Also fixed the templates to use the new images dir (if they used 5813 images, which they actually don't) 5814 5815Thu May 16 12:56:55 BST 2002 Sam Liddicott <sam@ananova.com> 5816 5817 * Fixes to spec file to add various missing files 5818 5819Wed May 15 12:59:37 BST 2002 Olly Betts <olly@survex.com> 5820 5821 * omindex now understand acronyms (N.A.T.O. E.T ...). 5822 5823 * $highlight{} now understands "&" (AT&T M&S ...) and acronyms. 5824 5825Tue May 14 13:08:41 BST 2002 Olly Betts <olly@survex.com> 5826 5827 * Index <word>&<word> as a single term (e.g. AT&T, M&S, A&P). 5828 5829Tue May 14 12:37:49 BST 2002 Olly Betts <olly@survex.com> 5830 5831 * omindex.cc: cleaned up a little. 5832 5833Tue May 14 11:24:42 BST 2002 Olly Betts <olly@survex.com> 5834 5835 * Fixed config.h inclusion; using std::*. 5836 5837Tue May 14 11:18:37 BST 2002 Olly Betts <olly@survex.com> 5838 5839 * Updated. 5840 5841Tue May 14 11:16:03 BST 2002 Olly Betts <olly@survex.com> 5842 5843 * Added SORT and SORTBANDS. 5844 5845Mon May 13 12:52:29 BST 2002 Olly Betts <olly@survex.com> 5846 5847 * Autoconf 2.50. 5848 5849 * Commented out omindex-config (since it's unfinished) and XML support 5850 (since only omindex-config uses it). 5851 5852Thu May 02 16:06:02 BST 2002 Olly Betts <olly@survex.com> 5853 5854 * Updated to reflect removal of OmData. 5855 5856Wed May 01 11:26:59 BST 2002 Olly Betts <olly@survex.com> 5857 5858 * Changed to use queryparser in libomqueryparser. 5859 5860Tue Apr 23 15:10:42 BST 2002 Olly Betts <olly@survex.com> 5861 5862 * Make buildall smart enough to generate aclocal.m4 properly and 5863 remove acinclude.m4. It now also extracts the package name from 5864 configure.in so we can use the same buildall everywhere; fixed 5865 problem with double use of AM_CXXFLAGS in Makefile.am. 5866 5867Tue Apr 23 14:27:29 BST 2002 Olly Betts <olly@survex.com> 5868 5869 * Updated for xapian-config and xapian.m4 changes. 5870 5871Thu Apr 18 14:37:05 BST 2002 Olly Betts <olly@survex.com> 5872 5873 * Updated buildall; minor tweaks to configure.in. 5874 5875Wed Apr 17 12:31:18 BST 2002 Olly Betts <olly@survex.com> 5876 5877 * Removed references to xapian-config uninst options. 5878 5879Fri Apr 12 15:48:33 BST 2002 Olly Betts <olly@survex.com> 5880 5881 * Remove parsequery.cc on "make maintainer-clean". 5882 5883Fri Apr 12 16:19:19 BST 2002 Olly Betts <olly@survex.com> 5884 5885 * Require automake 1.5. 5886 5887Fri Apr 12 12:47:04 BST 2002 Olly Betts <olly@survex.com> 5888 5889 * Tweaked what gets interpreted as a phrase. 5890 5891Fri Apr 12 12:44:00 BST 2002 Olly Betts <olly@survex.com> 5892 5893 * Fixed to use AM_CFLAGS and AM_CXXFLAGS. 5894 5895Mon Apr 01 23:34:09 BST 2002 Olly Betts <olly@survex.com> 5896 5897 * Fixed support for decimal numeric entities (e.g. "ö") 5898 5899 * Added support for all iso-8859-1 named entities (e.g. "ö") 5900 5901Mon Apr 01 15:07:31 BST 2002 Olly Betts <olly@survex.com> 5902 5903 * Applied patch from "orion orion" to fix problem in HTML parsing. 5904 5905Mon Mar 25 13:11:14 GMT 2002 Olly Betts <olly@survex.com> 5906 5907 * More tolerant treatment of random punctuation in query. 5908 5909Mon Feb 4 14:57:36 GMT 2002 Sam Liddicott <sam@ananova.com> 5910 5911 * Added support for repeated fields in document data. 5912 $field{fieldname} may now return multiple tab separated values if 5913 more than one instance of a field exists in the document data 5914 5915Tue Jan 15 16:29:39 GMT 2002 Sam Liddicott <sam@ananova.com> 5916 5917 * Fixed date_range_filter for the case where DATE1 and DATE2 don't 5918 share the same MONTH and YEAR and M## terms for intermediate months 5919 need calculating between the years. 5920 5921Thu Jan 10 15:39:43 GMT 2002 Sam Liddicott <sam@ananova.com> 5922 5923 * Added $htmlstrip{} to strip out html tags 5924 5925Thu Jan 10 14:34:35 GMT 2002 James Aylett <tartarus@users.sourceforge.net> 5926 5927 * toptermsjs snippet now included inside the HEAD, so it's 5928 actually legal HTML. Snippet now sets the required 'type' 5929 attribute as well. (It keeps the technically illegal 5930 'language' attribute because I have a sneaking suspicion it 5931 won't work otherwise.) 5932 5933Thu Jan 10 14:30:19 GMT 2002 James Aylett <tartarus@users.sourceforge.net> 5934 5935 * $opt with two arguments now acts as a lookup for a $setmap 5936 map. This was previously documented in a misleading fashion. 5937 The new system is backwards compatible with the old. 5938 5939Wed Jan 9 Sam Liddicott <sam@ananova.com> 5940 5941 * Added RAW_SEARCH as cgi param which when set stops change-search 5942 detection being performed and processes rset, topdoc and page-change 5943 parameters ( [ ] < > 1 2 etc etc ) anyway 5944 5945 * Added MIN_HITS cgi param to request many more hits than can 5946 fit on the page so we can be confident that the next few 5947 consecutive pages will really be needed 5948 5949 * Added xml template which when combined with RAW_SEARCH=1 5950 can be very useful when searching is done from another 5951 script 5952 5953Fri Dec 21 17:56:02 GMT 2001 Olly Betts <olly@survex.com> 5954 5955 * Namespace fixes to allow use of find and find_if on Redhat's 5956 "GCC 2.96". 5957 5958Fri Dec 21 17:53:59 GMT 2001 Olly Betts <olly@survex.com> 5959 5960 * Added quick'n'dirty interface to allow experimentation with 5961 OmBiasFunctor. 5962 5963Thu Dec 20 14:46:33 GMT 2001 Olly Betts <olly@survex.com> 5964 5965 * Document xDB, xDAYSMINUS, xDATE1, xDATE2, xB. 5966 5967Thu Dec 20 12:55:29 GMT 2001 Olly Betts <olly@survex.com> 5968 5969 * Use double quotes on parameters to <BODY>. 5970 5971Mon Dec 17 15:01:43 GMT 2001 Olly Betts <olly@survex.com> 5972 5973 * Get rid of whitespace between hundreds and tens image in page 5974 links. 5975 5976Fri Dec 14 17:26:48 GMT 2001 Olly Betts <olly@survex.com> 5977 5978 * Force first page of hits if DB, DEFAULTOP, B, DAYSMINUS, DATE1, 5979 or DATE2 changes; also clear relevance judgements if DB changes. 5980 5981Fri Dec 14 16:21:07 GMT 2001 Olly Betts <olly@survex.com> 5982 5983 * Removed restriction on minimum page size (was 10) - for a shopping 5984 type application with images next to each hit, 5 or fewer per page 5985 might be reasonable; even one result per page makes sense for some 5986 applications. 5987 5988Fri Dec 14 15:37:20 GMT 2001 Olly Betts <olly@survex.com> 5989 5990 * Added $error to make nicer error reporting possible. 5991 5992Fri Dec 14 14:49:18 GMT 2001 Olly Betts <olly@survex.com> 5993 5994 * Give more helpful messages for query syntax errors in cases where 5995 we can without elaborate YACC hackery. 5996 5997Thu Dec 13 15:10:24 GMT 2001 Olly Betts <olly@survex.com> 5998 5999 * For image page buttons, display pages 10-999 by using 2 or 3 images. 6000 6001Thu Dec 13 15:02:16 GMT 2001 Olly Betts <olly@survex.com> 6002 6003 * New operators: $div{}, $mod{}, $mul{}, $sub{}, $ge{}, $gt{}, $le{}, 6004 $lt{}. 6005 6006Wed Dec 12 16:37:47 GMT 2001 Olly Betts <olly@survex.com> 6007 6008 * Updated omegascript documentation. 6009 6010Wed Dec 12 15:43:19 GMT 2001 Olly Betts <olly@survex.com> 6011 6012 * Fixed TOPDOC clipping. 6013 6014Wed Dec 12 15:36:20 GMT 2001 Olly Betts <olly@survex.com> 6015 6016 * templates/query: Fixed typo which caused "..." to appear after 6017 page buttons when it wasn't appropriate. 6018 6019Wed Dec 12 15:11:23 GMT 2001 Olly Betts <olly@survex.com> 6020 6021 * omega: Added stopword list (still hardcoded at present though). 6022 6023Wed Dec 12 12:46:57 GMT 2001 Olly Betts <olly@survex.com> 6024 6025 * omindex: index unstemmed terms with prefix 'R' (mnemonic: Raw). 6026 6027 * omega: $topterms will now return terms with prefix 'R'. 6028 6029 * parsequery.yy: fixed handling of DEFAULT_OP; "+first second" and 6030 "-first second" now work; stopwording queries working (currently 6031 stopword list is hardwired to just "the") - stopwords are ignored 6032 when used as normal terms, but not in phrases, or with + and -. 6033 6034 * templates/query: make use of $prettyterm{}. 6035 6036Wed Dec 12 11:11:30 GMT 2001 Olly Betts <olly@survex.com> 6037 6038 * $highlight{} now uses find_if not find_first_of (faster). 6039 6040 * Fixed detection of new/old/extended query when a term occurs 6041 in the query more than once. 6042 6043 * Added $prettyterm{TERM} to convert a probabilistic term for 6044 display to the user. 6045 6046 * $map would allow more than two arguments, but ignore them. Fixed 6047 to take exactly two. 6048 6049Fri Dec 07 15:59:21 GMT 2001 Olly Betts <olly@survex.com> 6050 6051 * Added macros to OmegaScript. 6052 6053 * template/query: updated to use macros. 6054 6055 * Removed specialcase to allow no-argument commands to accept an empty 6056 argument list (e.g. "$thispage{}" rather than "$thispage"). The only 6057 reason this was useful was to allow "$thispage{}s" which can just as 6058 well be written using a comment to force the parser do what you want, 6059 e.g. "$thispage${}s". 6060 6061Thu Dec 06 18:59:34 GMT 2001 Olly Betts <olly@survex.com> 6062 6063 * If a stemmer is set, and all_stem isn't, only stemmer terms starting 6064 with a lowercase letter. 6065 6066Thu Dec 06 18:49:40 GMT 2001 Olly Betts <olly@survex.com> 6067 6068 * parsequery.yy: changed to use find_if() (faster than find_first_of()). 6069 6070Thu Dec 06 17:46:37 GMT 2001 Olly Betts <olly@survex.com> 6071 6072 * Base page links on estimated number of matches, not minimum. 6073 6074Wed Dec 05 17:07:33 GMT 2001 Olly Betts <olly@survex.com> 6075 6076 * omindex: minor speed tweaks. 6077 6078Wed Dec 05 16:52:21 GMT 2001 Olly Betts <olly@survex.com> 6079 6080 * omindex: further HTML parser speed-ups. 6081 6082Wed Dec 05 16:31:33 GMT 2001 Olly Betts <olly@survex.com> 6083 6084 * omindex: sped up HTML parsing. 6085 6086Wed Dec 05 14:52:53 GMT 2001 Olly Betts <olly@survex.com> 6087 6088 * omindex: parsing terms from text is now twice as fast. 6089 6090Thu Nov 29 16:53:45 GMT 2001 Olly Betts <olly@survex.com> 6091 6092 * NEAR phrases (e.g. "a NEAR b NEAR c") now work; removed "{a b c}" 6093 syntax for NEAR phrases. 6094 6095Thu Nov 29 15:25:54 GMT 2001 Olly Betts <olly@survex.com> 6096 6097 * $highlight{} now allows you to specify the tags to use for the 6098 highlighting. 6099 6100Thu Nov 29 15:24:53 GMT 2001 Olly Betts <olly@survex.com> 6101 6102 * topdoc is unsigned so subtracting and then checking if it's < 0 6103 doesn't work... 6104 6105Wed Nov 28 15:45:39 GMT 2001 Olly Betts <olly@survex.com> 6106 6107 * Fixed clipping of hit page in case when there are a multiple of 6108 HITSPERPAGE matches. 6109 6110Wed Nov 28 14:03:48 GMT 2001 Olly Betts <olly@survex.com> 6111 6112 * Added $hostname{URL}; $version output now says "Xapian - omega 6113 <version>". 6114 6115Wed Nov 28 13:04:46 GMT 2001 Olly Betts <olly@survex.com> 6116 6117 * docs/cgiparams.txt: Minor corrections and updates. 6118 6119Wed Nov 28 13:03:40 GMT 2001 Olly Betts <olly@survex.com> 6120 6121 * If we're asked for a page of hits beyond the end of the matches, clip 6122 to the last page of matches rather than the first. 6123 6124Wed Nov 28 13:02:31 GMT 2001 Olly Betts <olly@survex.com> 6125 6126 * For an EXTENDED_QUERY, force the first page of hits. 6127 6128Wed Nov 28 12:56:56 2001 James Aylett <tartarus@users.sourceforge.net> 6129 6130 * Lower case terms when constructing the query (otherwise why 6131 do we store them in the database that way? :-) 6132 6133Wed Nov 28 12:36:49 GMT 2001 Olly Betts <olly@survex.com> 6134 6135 * Fettled default query template. 6136 6137Wed Nov 28 12:33:52 GMT 2001 Olly Betts <olly@survex.com> 6138 6139 * Request one more match than the last we want to display so we can 6140 tell if the next page of hits is empty or not - otherwise we risk 6141 offering a "next page" link when there are no more hits. 6142 6143Mon Nov 26 16:28:00 2001 James Aylett <tartarus@users.sourceforge.net> 6144 6145 * --no-recurse / -l option added; useful if your sites are 6146 nested in their disc storage (particularly things like 6147 http://example.com/ being a distinct site, with 6148 http://example.com/product being within it) 6149 6150 * --mime-type now really works (it was --mime-map in the code) 6151 6152 * documentation updated further 6153 6154Mon Nov 26 14:39:00 2001 James Aylett <tartarus@users.sourceforge.net> 6155 6156 * options parsing fixed so minimised/unrecognised long options 6157 doesn't segfault 6158 6159Mon Nov 26 14:00:13 2001 James Aylett <tartarus@users.sourceforge.net> 6160 6161 * omindex can now index part of a site (previously 'subsite') 6162 by having an index base within the site's disc storage 6163 6164Mon Nov 26 13:57:10 2001 James Aylett <tartarus@users.sourceforge.net> 6165 6166 * Documentation updated for recent changes 6167 6168Thu Nov 22 13:24:45 GMT 2001 Olly Betts <olly@survex.com> 6169 6170 * Use $nice{} in query template, but don't use $freqs. Use numbers as 6171 page image button tooltips on Netscape 4. 6172 6173Thu Nov 22 13:02:17 GMT 2001 Olly Betts <olly@survex.com> 6174 6175 * Herded escaped CGI parameter mangling code back into cgiparam.cc; 6176 added special handling for numeric image button names. 6177 6178Thu Nov 22 12:55:00 GMT 2001 Olly Betts <olly@survex.com> 6179 6180 * Fixed $nice to put the comma (or dot) in the right place. 6181 6182Tue Nov 20 17:30:19 GMT 2001 Olly Betts <olly@survex.com> 6183 6184 * $lastpage now returns 0 when there are no matches (previously 6185 gave a very large answer). 6186 6187Tue Nov 20 12:30:47 GMT 2001 Olly Betts <olly@survex.com> 6188 6189 * $terms now only returns terms which were in the parsed query 6190 (boolean filter terms are excluded). 6191 6192Tue Nov 20 12:07:54 GMT 2001 Olly Betts <olly@survex.com> 6193 6194 * Fixed bug in date range filtering (got it wrong when start and end 6195 date were in the same month). 6196 6197 * DAYSMINUS now counts back from DATE1 (if specified) rather than 6198 always counting back from the present. 6199 6200Mon Nov 19 17:13:24 GMT 2001 Olly Betts <olly@survex.com> 6201 6202 * Added date-range filtering (not fully tested yet). 6203 6204Mon Nov 19 15:21:31 GMT 2001 Olly Betts <olly@survex.com> 6205 6206 * Fixed (c) message displayed by -v (BrightStation "PLC" not "Inc.", 6207 first (c) 1999). 6208 6209Fri Nov 16 11:49:20 GMT 2001 Olly Betts <olly@survex.com> 6210 6211 * New OmegaScript commands: $allterms{<docid>}, $freq{<term>}, 6212 $nice{<number>}, $set_relevant{<docid>}. 6213 6214 * $map{} now returns a list (shouldn't affect most users - if 6215 the extra tabs are a problem, change `$map{...}' to 6216 `$list{$map{...},}' ). 6217 6218 * Template `query' now preserves value of THRESHOLD. 6219 6220 * Template `godmode' fixed to actually work. 6221 6222Wed Nov 14 15:04:13 GMT 2001 Olly Betts <olly@survex.com> 6223 6224 * Fixed to compile with GCC3.0 6225 6226Wed Nov 14 14:54:53 GMT 2001 Olly Betts <olly@survex.com> 6227 6228 * Updated for changes to OmQuery 6229 6230Tue Nov 06 13:10:15 GMT 2001 Olly Betts <olly@survex.com> 6231 6232 * Updated .cvsignore. 6233 6234Tue Nov 06 13:02:04 GMT 2001 Olly Betts <olly@survex.com> 6235 6236 * Fixed lookup of CGI parameter THRESHOLD. 6237 6238Tue Nov 6 12:38:37 GMT 2001 Richard Boulton <richard@tartarus.org> 6239 6240 * Moved configure.ac to configure.in: depending on autoconf 2.13 is 6241 not needed yet. 6242 6243Tue Nov 06 12:23:55 GMT 2001 Olly Betts <olly@survex.com> 6244 6245 * Added support for percentage threshold cutoff (CGI var THRESHOLD); 6246 Code for calculating better percentages has been pushed into Xapian 6247 so removed it from here. 6248 6249Mon Nov 5 12:42:26 GMT 2001 Richard Boulton <richard@tartarus.org> 6250 6251 * Omega moved to new home, from om-examples/omega. 6252 Standalone build system added. 6253 6254 6255 6256