This ChangeLog file is no longer maintained - see the git repo history for more recent changes: https://xapian.org/bleeding Fri Sep 25 07:30:55 GMT 2015 Olly Betts * NEWS: Update. Fri Sep 25 04:51:20 GMT 2015 Olly Betts * myhtmlparse.cc: Remove unused header. Fri Sep 25 04:49:49 GMT 2015 Olly Betts * loadfile.cc,md5wrap.cc: Stop trying to use O_STREAMING - the patch to implement it was never merged into the Linux kernel, and I can't find any evidence that other platforms implement it. The constant value O_STREAMING used now seems to be used for the part of O_SYNC which isn't covered by O_DSYNC, which seems likely to hurt performance if anything. Fri Sep 25 04:47:06 GMT 2015 Olly Betts * omindex.cc: Use SAMPLE_SIZE in help text rather than literal 512. Add '--title-size' option. Fri Sep 25 04:40:53 GMT 2015 Olly Betts * docs/omegascript.rst,query.cc: Fix documentation of $last to say it's the MSet index *one beyond* the end of the current page. Reported by Andrew Chilton. Fri Sep 25 04:39:54 GMT 2015 Olly Betts * docs/overview.rst: SVG extraction is built-in too. Fri Sep 25 04:32:04 GMT 2015 Olly Betts * docs/cgiparams.rst: Document behaviour if xDB is not set. Fri Sep 25 04:31:15 GMT 2015 Olly Betts * docs/cgiparams.rst,query.cc: If xFILTERS is not set, don't force the first page as that's unhelpful if someone fails to set it in their template. Fri Sep 25 01:22:11 GMT 2015 Olly Betts * configure.ac: Don't provide our own implementation of sleep() under __WIN32__ if there's already one - mingw provides one, and in some situations it seems to clash with ours. Reported to xapian-discuss by John Alveris. Wed May 20 09:57:45 GMT 2015 Olly Betts * NEWS: Bump release date for 1.2.21. Fri May 15 05:32:35 GMT 2015 Olly Betts * NEWS: Another update for 1.2.21. Fri May 15 05:16:05 GMT 2015 Olly Betts * Makefile.am,configure.ac: Use -no-install or -no-fast-install when linking test programs which never get installed, which means libtool can often avoid creating a shell script wrapper. Thu May 14 14:52:14 GMT 2015 Olly Betts * NEWS,configure.ac: Update for 1.2.21. Thu May 14 10:17:32 GMT 2015 Olly Betts * NEWS: Update. Thu May 14 05:16:08 GMT 2015 Olly Betts * omindex.cc: Message tweak. Thu May 14 05:15:18 GMT 2015 Olly Betts * outlookmsg2html.in: Fix handling of message/rfc822 subparts. Thu May 14 04:16:34 GMT 2015 Olly Betts * cdb_hash.cc,md5.cc: Remove 'register' as it's deprecated, and likely to just be ignored by any modern compiler anyway. Wed May 06 10:08:33 GMT 2015 Olly Betts * NEWS: Update. Wed May 06 07:26:28 GMT 2015 Olly Betts * omega.cc: Drop compilation date and time from output - they prevent reproducible builds and the version number is sufficient information. Wed May 06 07:24:27 GMT 2015 Olly Betts * commonhelp.cc,omega.cc,omindex.cc,scriptindex.cc: Add spaces between literal strings and macros which expand to literal strings for C++11 compatibility. Wed May 06 04:56:39 GMT 2015 Olly Betts * Makefile.am: Add common/stringutils.cc to urlenctest_SOURCES, needed now urldecode.h uses C_isxdigit(). Wed May 06 04:53:56 GMT 2015 Olly Betts * configfile.cc,htmlparse.cc,myhtmlparse.cc,omega.cc,omindex.cc, query.cc,scriptindex.cc,urldecode.h: Consistently use C_isupper(), C_toupper(), etc as these versions aren't affected by the locale setting, and also allow signed char values (so we don't need to cast the argument to unsigned char). Tue May 05 14:17:16 GMT 2015 Olly Betts * templates/query: When listing matching terms, don't make the commas italic. Tue May 05 12:43:59 GMT 2015 Olly Betts * docs/overview.rst: Document 'E' prefixed boolean terms for filtering by extension (see #668, reported by bramvdh). Tue May 05 12:42:59 GMT 2015 Olly Betts * docs/overview.rst: Whitespace cleanup. Tue May 05 12:41:52 GMT 2015 Olly Betts * templates/xml: Add XML declaration. Tue May 05 12:40:44 GMT 2015 Olly Betts * templates/query: Eliminate blank line before . Tue May 05 12:39:01 GMT 2015 Olly Betts * templates/godmode: Return charset utf-8 in the content-type. Tue May 05 12:29:46 GMT 2015 Olly Betts * docs/encodings.rst: $prettyurl undoes %-encoding of UTF-8 in 1.2.21 and later too. Tue May 05 12:27:14 GMT 2015 Olly Betts * docs/encodings.rst: Note that one should ensure that Omega gets sent form submissions encoded in UTF-8. Tue May 05 12:23:09 GMT 2015 Olly Betts * docs/encodings.rst: Discuss encodings of filenames (see #550). Tue May 05 12:20:00 GMT 2015 Olly Betts * urldecode.h,urlenctest.cc: $prettyurl now decodes valid UTF-8 sequences. Fixes #550 and #644, reported by catkin and terencz. Tue May 05 12:14:06 GMT 2015 Olly Betts * docs/: Add a document about character encoding, as suggested by James Aylett in #550. Tue May 05 12:08:39 GMT 2015 Olly Betts * urldecode.h,urlenctest.cc: Improve decoding done by $prettyurl - we now leave the query and fragment parts of the URL alone and don't decode an escaped "/" (omindex doesn't create URLs with any of these, so we only risk breaking other URLs which have them), and we decode some additional ASCII characters in the path part: []@!$&'()*+.;= (addresses #550 in part) Wed Mar 04 00:29:44 GMT 2015 Olly Betts * NEWS,configure.ac: Update for 1.2.20. Wed Mar 04 00:03:35 GMT 2015 Olly Betts * Backport change from trunk: * configure.ac: Avoid doing link tests with libmagic in configure as they fail on mingw due to not automatically picking up libraries which libmagic itself depends on. Sat Feb 14 10:46:47 GMT 2015 Olly Betts * NEWS: Update. Tue Jan 27 04:37:12 GMT 2015 Olly Betts * docs/overview.rst,omindex.cc: Interpret a command of "false" in "--filter" as meaning to ignore files with that MIME type. Sat Jan 24 12:14:33 GMT 2015 Olly Betts * NEWS: Update. Thu Jan 22 01:44:01 GMT 2015 Olly Betts * omindex.cc: Ignore extensions .msi and .msp, which are Microsoft installer files, but which libmagic sometimes incorrectly identifies as application/msword. Fri Dec 19 03:21:13 GMT 2014 Olly Betts * docs/cgiparams.rst: Improve wording of docs for SORT parameter. Tue Jan 06 21:15:14 GMT 2015 Olly Betts * configure.ac: Use pkg-config in preference to determine flags needed to compile and link with PCRE, as this will just work when cross-compiling (at least under MXE). Sun Dec 21 21:54:48 GMT 2014 Olly Betts * query.cc: Handle [=0 as [=1. Tue Dec 16 03:53:25 GMT 2014 Olly Betts * configure.ac: Define MINGW_HAS_SECURE_API under mingw to get _putenv_s() declared in stdlib.h. Wed Dec 03 04:17:18 GMT 2014 Olly Betts * templates/xml: Update handling of DATE1, DATE2 and DAYSMINUS which were renamed in 0.6.x and the compatibility aliases removed in 1.0.0. Wed Dec 03 04:15:51 GMT 2014 Olly Betts * docs/omegascript.rst: Update documentation references to DATE1, DATE2, and DAYSMINUS which were renamed in 0.6.x and the compatibility aliases removed in 1.0.0. Wed Dec 03 02:29:37 GMT 2014 Olly Betts * omindex.cc: Remove unused '#include '. Wed Dec 03 02:18:51 GMT 2014 Olly Betts * diritor.h: Fix get_mtime() to return time_t not off_t. In practice, this probably wouldn't have caused issues until at least 2038. Fri Nov 28 11:38:56 GMT 2014 Olly Betts * configure.ac: Move AC_CANONICAL_HOST before first use of $host_os. In practice this wasn't a problem, as LT_INIT implicitly calls AC_CANONICAL_HOST before this point anyway. Wed Nov 26 03:55:13 GMT 2014 Olly Betts * configure.ac: Enable automake option 'subdir-objects' to avoid warning from newer automake. Tue Oct 21 00:33:58 GMT 2014 Olly Betts * NEWS,configure.ac: Update for 1.2.19. Mon Oct 20 10:52:38 GMT 2014 Olly Betts * NEWS: Update. Mon Oct 20 10:44:27 GMT 2014 Olly Betts * Backport change from trunk: * docs/overview.rst: Note that pdftotext is part of poppler as well as xpdf. (Noted by Paul Wise) Sun Jun 22 04:32:15 GMT 2014 Olly Betts * NEWS,configure.ac: Update for 1.2.18. Sat Jun 21 00:50:56 GMT 2014 Olly Betts * Backport change from trunk: * Makefile.am: Don't compile in unixperm.cc - it isn't currently used, and it fails to build with mingw. (fixes #635) Thu Jun 19 13:56:02 GMT 2014 Olly Betts * NEWS: Update. Thu Jun 19 13:11:43 GMT 2014 Olly Betts * omindex.cc,urlencode.cc,urlencode.h: To improve compatibility with existing databases in cases which are OK in practice, only URL encode the starting URL if it contains a character <= 31 or at least one of '#', '%', ':' or '?'. Thu Jun 19 08:44:22 GMT 2014 Olly Betts * Backport change from trunk: * omindex.cc,urlencode.cc,urlencode.h: URL encode starting URL properly. Thu Jun 19 08:27:13 GMT 2014 Olly Betts * Backport change from trunk: * docs/omegascript.rst: Put ``...`` around Xapian C++ class names. Thu Jun 19 08:10:17 GMT 2014 Olly Betts * Backport change from trunk: * diritor.cc: Fix "applications/msword" to "application/msword" in the fallback code for CDF files. Thu Jun 19 07:55:11 GMT 2014 Olly Betts * Backport change from trunk: * diritor.cc: In fallback for CDF files, compare the extension *without* leading dot. Thu Jun 19 07:02:12 GMT 2014 Olly Betts * Backport change from trunk: * diritor.cc: libmagic can return a second string starting "Composite Document File V2 Document" for the mime-type, so just look for that prefix. And newer libmagic returns "application/CDFV2-corrupt" in these cases, so handle that too. Fri May 23 13:43:33 GMT 2014 Olly Betts * Backport change from trunk: * date.cc,scriptindex.cc: Pass std::string by const reference. Fri May 23 13:42:02 GMT 2014 Olly Betts * Backport change from trunk: * query.cc: Removed unused inline function. Fri May 23 13:38:33 GMT 2014 Olly Betts * Backport change from trunk: * omindex.cc: Report the size limit in the message when we skip a file which exceeds it. Wed Apr 30 05:44:54 GMT 2014 Olly Betts * NEWS: Update. Mon Apr 14 10:28:51 GMT 2014 Olly Betts * Backport change from trunk: * diritor.cc: Work around libmagic returning a MIME content-type of "Composite Document File V2 Document, No summary info". Mon Apr 14 09:38:51 GMT 2014 Olly Betts * Backport change from trunk: * expand.cc: Fix warning when built with GCC 4.7.2 using -Os. Fri Mar 14 12:12:59 GMT 2014 Olly Betts * Backport change from trunk: * Makefile.am,docs/omegascript.rst,expand.cc,expand.h,omega.cc, query.cc: Add support for setting the query expansion scheme to use. Tue Jan 28 23:50:11 GMT 2014 Olly Betts * NEWS,configure.ac: Update for 1.2.17. Sun Jan 05 22:11:39 GMT 2014 Olly Betts * NEWS: Update. Thu Dec 26 01:21:42 GMT 2013 Olly Betts * Backport change from trunk: * Makefile.am,portability/mkdtemp.cc,portability/mkdtemp.h,omindex.cc: Add header with prototype of mkdtemp() to avoid "no previous declaration" warning on platforms which don't have mkdtemp() as standard. Fri Dec 20 08:00:25 GMT 2013 Olly Betts * NEWS: Update. Fri Dec 20 07:04:32 GMT 2013 Olly Betts * Backport change from trunk: * docs/overview.rst: Add Abiword as an example use of --filter, based on patch from Frank J Bruzzaniti (fixes#383). Update unoconv example to talk about LibreOffice instead of OpenOffice. Tue Dec 03 23:13:48 GMT 2013 Olly Betts * NEWS: Final update for 1.2.16. Tue Dec 03 07:33:40 GMT 2013 Olly Betts * Backport change from trunk: * configure.ac: Define __MSVCRT_VERSION__ to 0x0601 on mingw so we get __ftime64() defined in the headers. Sat Nov 30 20:36:36 GMT 2013 Olly Betts * Makefile.am: Need to ship common/safewinsock2.h now. Fri Nov 29 04:34:18 GMT 2013 Olly Betts * Backport change from trunk: * configure.ac: Sync GCC checks with xapian-core. Fri Nov 29 01:09:01 GMT 2013 Olly Betts * configure.ac: Update for 1.2.16. Thu Oct 31 12:33:31 GMT 2013 Olly Betts * NEWS: Update. Sun Oct 13 23:25:11 GMT 2013 Olly Betts * Backport change from trunk: * omindex.cc: Group-readable files which are owner-readable but not world-readable should still get a "readable by owner" term added. Reported by Emmanuel Garette. Tue Oct 08 01:06:45 GMT 2013 Olly Betts * NEWS: Update from ChangeLog. Sun Oct 06 08:17:57 GMT 2013 Olly Betts * Backport change from trunk: * configure.ac: Sync compiler warning flag machinery against xapian-core. The changes are special handling for clang, passing -fshow-column where supported, and handling for new warning flags in GCC 4.6 and 4.7. Sun Oct 06 08:13:56 GMT 2013 Olly Betts * Backport changes from trunk: * xlsxparse.cc: Handle pre-defined numfmtid codes for dates. * omindex.cc,xlsxparse.cc,xlsxparse.h: Fix detection of cells with a date format to work with xlsx files other than my first example. * omindex.cc,xlsxparse.cc,xlsxparse.h: Decode dates for xlsx files. Sun Oct 06 08:01:59 GMT 2013 Olly Betts * Backport change from trunk: * omindex.cc,docs/overview.rst: Ignore 'adm', 'cur', and 'ico' by default. Sat Oct 05 06:46:32 GMT 2013 Olly Betts * Backport change from trunk: * weight.cc: Use "" not <> to include local header weight.h. Sat Oct 05 06:44:23 GMT 2013 Olly Betts * Backport change from trunk: * configure.ac,xapian-omega.spec.in: Compress source tarballs with xz instead of gzip. Sat Oct 05 06:40:57 GMT 2013 Olly Betts * Backport change from trunk: * diritor.h: Mark DirectoryIterator ctor as 'explicit'. Sat Oct 05 06:37:08 GMT 2013 Olly Betts * Backport change from trunk: * datematchdecider.h: Fix filename in comment at top of file. Sat Oct 05 06:35:19 GMT 2013 Olly Betts * Backport change from trunk: * omindex.cc: Fix off-by-one when finding documents to delete which would sometimes cause omindex to fail to delete documents from the database when they weren't refound during an index update. Sat Oct 05 06:28:56 GMT 2013 Olly Betts * Backport change from trunk: * Makefile.am,README,docs/Makefile.am: SVN -> git. Tue Apr 16 23:41:20 GMT 2013 Olly Betts * NEWS,configure.ac: Update for 1.2.15. Tue Apr 16 10:43:52 GMT 2013 Olly Betts * NEWS: Update from ChangeLog. Wed Mar 27 09:39:12 GMT 2013 Olly Betts * Backport change from trunk: * Makefile.am: Don't link utf8convert.cc code into omega CGI. Thu Mar 14 22:43:50 GMT 2013 Olly Betts * NEWS,configure.ac: Update for 1.2.14. Thu Mar 14 12:34:29 GMT 2013 Olly Betts * Backport changes from trunk: * docs/overview.rst: Update to add com to the list of ignored extensions. * omindex.cc: Ignore .com files by default. Thu Mar 07 09:41:25 GMT 2013 Olly Betts * NEWS: Update from ChangeLog. Sat Feb 16 08:20:36 GMT 2013 Olly Betts * Backport change from trunk: * xlsxparse.cc: Correct "max" -> "min" when reserving space for shared strings. This only means we now reserve a more appropriate amount of space to start with. Wed Jan 09 11:15:03 GMT 2013 Olly Betts * NEWS,configure.ac: Update for 1.2.13. Wed Jan 09 10:10:18 GMT 2013 Olly Betts * NEWS: Format new entries. Tue Jan 08 11:09:38 GMT 2013 Olly Betts * Backport change from trunk: * commonhelp.cc: Tweak wording about default to match other options better. Tue Jan 08 10:58:58 GMT 2013 Olly Betts * Backport change from trunk: * omindex.cc: Note default size limit on files to index is unlimited. Update --help to reflect that --sample-size now accepts the same formats as --max-size). Tue Jan 08 10:41:27 GMT 2013 Olly Betts * Backport change from trunk: * omindex.cc: When generating a sample for a CSV file, limit the reserved size to the CSV file size as sample_size could be set really high by the user. Tue Jan 08 10:31:13 GMT 2013 Olly Betts * Backport change from trunk: * diritor.cc: MAGIC_MIME_TYPE was added in 4.22, so note that in the comment about its conditional use. Tue Jan 08 10:28:07 GMT 2013 Olly Betts * Backport change from trunk: * Makefile.am: In automake, INCLUDES is now deprecated in favour of AM_CPPFLAGS so update to use the latter. Tue Jan 08 10:14:47 GMT 2013 Olly Betts * Backport change from trunk: * md5wrap.cc: If we get a read error while calculating the md5 checksum of a file, fail rather than returning the checksum of the file up to that point. Tue Jan 08 10:13:36 GMT 2013 Olly Betts * Backport change from trunk: * omindex.cc: Calculate the md5 from the loaded file contents when indexing SVG and Atom files. Use a const ref to avoid a string copy of the file contents for HTML and uncompressed ABI word. Wed Nov 21 05:02:18 GMT 2012 Olly Betts * NEWS: Update from ChangeLog. Wed Nov 21 05:00:51 GMT 2012 Olly Betts * Backport change from trunk: * omindex.cc: Fix omindex not to segfault when -F option without a ':' is passed. Sat Nov 10 08:08:20 GMT 2012 Olly Betts * NEWS: Update from ChangeLog. Sat Oct 13 12:24:59 GMT 2012 Olly Betts * Backport change from trunk: * Makefile.am,omindex.cc: Replace shell_protect() with append_filename_argument() from common/append_filename_arg.h. Extracting text using external filters now works for filenames containing a newline character. * common/: Update svn:externals to latest version of xapian-core on 1.2 branch to get common/append_filename_arg.h. Wed Jul 18 10:51:39 GMT 2012 Olly Betts * Backport change from trunk: * urldecode.h: Fix to decode escaped character at the end of the string. * urlenctest.cc: Add regression testcase. Wed Jun 27 12:15:37 GMT 2012 Olly Betts * NEWS,configure.ac: Update for 1.2.12. Tue Jun 26 12:06:15 GMT 2012 Olly Betts * NEWS,configure.ac: Update for 1.2.11. Tue Jun 26 12:04:16 GMT 2012 Olly Betts * Backport change from trunk: * configure.ac: Set link_all_deplibs_CXX=no on solaris, like we already do for xapian-core. Tue Jun 26 08:48:58 GMT 2012 Olly Betts * NEWS: Update from ChangeLog. Fri Jun 22 12:16:30 GMT 2012 Olly Betts * Backport changes from trunk: * xlsxparse.cc: Check for "uniquecount" parameter, not "unqiueCount" as we normalise parameter names to lower case. * omindex.cc: unzip extracts files in the order they are in the archive, not the order they are on the command line, so call unzip twice when the order of extraction matters. * Makefile.am,omindex.cc,opendocparse.cc,opendocparse.h,xmlparse.cc: Improve handling of headers and footers on OpenDocument documents. Fri Jun 22 12:12:56 GMT 2012 Olly Betts * Backport change from trunk: * omindex.cc: Tweak previous change. Fri Jun 22 12:10:34 GMT 2012 Olly Betts * Backport changes from trunk: * omindex.cc,xlsxparse.cc,xlsxparse.h: Rework .xlsx parsing to substitute the shared strings into the positions they are used in, so that the sample actually matches what appears in the spreadsheet. * xlsxparse.cc,xlsxparse.h: Subclass XlsxParser directly from HtmlParser. * Makefile.am,omindex.cc,xlsxparse.cc,xlsxparse.h: Index calculated numbers from .xlsx files. Fri Jun 22 12:03:06 GMT 2012 Olly Betts * Backport change from trunk: * omindex.cc: pdftotext outputs a formfeed between each page, which messes up our "empty body" check, so trim any trailing formfeeds before the check. * omindex.cc: Fix the "trim trailing formfeeds" code not to remove one character too many. * omindex.cc: Properly fix the "trim trailing formfeeds" code. Sat Jun 09 11:50:16 GMT 2012 Olly Betts * Backport change from trunk: * Cherry pick changes from Mihai Bivol's GSoC snippets branch: * omindex.cc: Add option for the document sample size. * omindex.cc: Add short option for sample-size * omindex.cc: Make sample-size consistent with max-size Fri Jun 08 12:48:47 GMT 2012 Olly Betts * NEWS: Update from ChangeLog. Mon May 21 07:28:26 GMT 2012 Olly Betts * Backport change from trunk: * htmlparsetest.cc,myhtmlparse.cc,myhtmlparse.h: Change parsing of multiple tags and text outside of to match the behaviour if modern web browsers. (ticket#599) Mon May 21 06:34:43 GMT 2012 Olly Betts * Backport change from trunk: * configure.ac: Set link_all_deplibs_CXX=no on freebsd and openbsd, like we already do for xapian-core. Wed May 09 10:43:41 GMT 2012 Olly Betts * NEWS,configure.ac: Update for 1.2.10. Wed May 09 08:19:21 GMT 2012 Olly Betts * NEWS: Update from ChangeLog. Wed May 09 01:20:25 GMT 2012 Olly Betts * Backport change from trunk: * runfilter.cc: Add cast to rlim_t, required for C++11 compatibility according to new error from GCC 4.7 (reported by Gaurav Arora). Tue May 08 05:23:11 GMT 2012 Olly Betts * Backport changes from trunk: * atomparse.cc: For type="html", use the charset of the XML rather than utf-8. * Makefile.am,atomparse.cc,atomparse.h,overview.rst,omindex.cc: Add support for atom feed files, patch from Mihai Bivol in ticket#595. * Makefile.am,atomparsetest.cc: Add tests for AtomParser. Tue May 08 04:58:57 GMT 2012 Olly Betts * Backport change from trunk: * htmlparse.cc,htmlparsetest.cc: Add support for CDATA to HTML parser. Tue May 08 04:48:36 GMT 2012 Olly Betts * Backport change from trunk: * omindex.cc: Add --max-size option, based on patch from ndaley in ticket#587. Wed Mar 21 11:21:08 GMT 2012 Olly Betts * NEWS: Update from ChangeLog. Tue Mar 13 10:16:19 GMT 2012 Olly Betts * Backport change from trunk: * omindex.cc: If the document with the highest existing docid was updated, we'd previously report it as "added", but now we correctly report it as "updated". Tue Mar 13 10:12:10 GMT 2012 Olly Betts * Backport change from trunk: * omindex.cc: Catch and report std::exception. Thu Mar 08 02:08:53 GMT 2012 Olly Betts * NEWS,configure.ac: Update for 1.2.9. Tue Mar 06 11:13:03 GMT 2012 Olly Betts * Backport changes from trunk: * docs/overview.rst,omindex.cc: More extensions to ignore by default: fon pyd ttf * docs/overview.rst: Wrap over-long line. * docs/overview.rst,omindex.cc: Add more extensions to the default ignore list: bin dat db jar lnk pyc pyo sqlite sqlite3 sqlite-journal tmp Tue Mar 06 11:10:59 GMT 2012 Olly Betts * Backport change from trunk: * docs/overview.rst,htmlparse.cc,htmlparsetest.cc: Add support for ignoring sections bracketed by and like we already do for . Patch from Raphael Geissert. Wed Feb 29 10:58:43 GMT 2012 Olly Betts * NEWS: Update from ChangeLog. Mon Jan 23 12:48:00 GMT 2012 Olly Betts * Backport change from trunk: * docs/overview.rst: Document that libmagic is used to determine the MIME type if the extension isn't known. Partly addresses ticket#569. Mon Jan 23 12:44:36 GMT 2012 Olly Betts * Backport change from trunk: * docs/overview.rst: We now limit time as well as CPU and memory for external filters. Tue Dec 13 12:42:10 GMT 2011 Olly Betts * NEWS: Final update for 1.2.8. Tue Dec 13 12:38:04 GMT 2011 Olly Betts * Backport change from trunk: * docs/omegascript.rst,query.cc,templates/emptydocs,templates/godmode, templates/query,urldecode.h,urlenctest.cc: Add new $prettyurl{} command which undoes RFC3986 URL escaping which doesn't affect semantics in practice. Partly addresses ticket#550. Tue Dec 13 11:19:56 GMT 2011 Olly Betts * NEWS,configure.ac: Update for 1.2.8. Thu Dec 08 08:27:28 GMT 2011 Olly Betts * Backport change from trunk: * omindex.cc: Improve --help output (and man page which is generated from it). Closes bug#572. Thu Dec 08 04:53:40 GMT 2011 Olly Betts * Backport change from trunk: * Makefile.am: Ship new header urldecode.h. Thu Dec 08 04:46:18 GMT 2011 Olly Betts * Backport change from trunk: * Makefile.am,cgiparam.cc,urldecode.h,urlenctest.cc: Add new implementation of URL decoding - the old one didn't handle various corner cases well, and had two cut and pasted variants for handling a input from a C string (GET) or from stdin (POST). Also add a new unit test program to test URL encoding and decoding. Fixes bug#578. Tue Dec 06 13:30:58 GMT 2011 Olly Betts * Backport change from trunk: * scriptindex.cc: If no rules are found in the index script, report an error and give up - this is inevitably the result of a mistake, and adding empty documents to the database isn't helpful. Tue Dec 06 13:28:32 GMT 2011 Olly Betts * Backport change from trunk: * docs/omegascript.rst: Add note to discourage use of percentage scores. * templates/query: Don't show the percentage score in the default template. Tue Dec 06 13:26:30 GMT 2011 Olly Betts * Backport change from trunk: * configure.ac,runfilter.cc: If we don't get any data from a filter for 5 minutes, give up - it has probably ended up blocked indefinitely. Tue Dec 06 12:51:24 GMT 2011 Olly Betts * Backport changes from trunk: * templates/query: HTML escape topterms. * templates/godmode: HTML escape the contents of document values. Tue Dec 06 12:48:24 GMT 2011 Olly Betts * Backport change from trunk: * scriptindex.cc: MyHtmlParser::parse_html() no longer throws bool to stop parsing early, so we no longer need to catch it. Tue Dec 06 12:45:48 GMT 2011 Olly Betts * Backport change from trunk: * configure.ac: Sync changes from xapian-core: Don't pass -Wshadow for GCC < 4.1; don't pass -Wstrict-null-sentinel for GCC 4.0.x; only enable symbol visibility on platforms where it is supported; remove now superfluous check for GCC >= 3. Also, add FIXME for enabling -Woverloaded-virtual. Tue Dec 06 11:57:58 GMT 2011 Olly Betts * Backport change from trunk: * scriptindex.cc: Add link to http://xapian.org/docs/omega/scriptindex.html to --help output (and so also to the man page which is generated from this). Tue Dec 06 11:30:42 GMT 2011 Olly Betts * Backport change from trunk: * omega.cc: If P had trailing spaces, we would remove all but one - fixed to remove all of them! Thu Sep 01 12:31:59 GMT 2011 Olly Betts * Backport change from trunk: * xapian-omega.spec.in: Package outlookmsg2html helper. Wed Aug 10 06:13:41 GMT 2011 Olly Betts * NEWS: Finalise 1.2.7. Tue Aug 09 09:00:12 GMT 2011 Olly Betts * NEWS,configure.ac: Initial update for 1.2.7. Thu Jun 23 01:01:25 GMT 2011 Olly Betts * Backport change from trunk: * query.cc: Improve $version output from "Xapian - xapian-omega 1.2.6" to "xapian-omega 1.2.6". * docs/omegascript.rst: Update example to match (and use less ancient version!) Thu Jun 23 00:38:18 GMT 2011 Olly Betts * Backport change from trunk: * dbi2omega: Remove uninteresting reference to 0.9.4. Thu Jun 23 00:29:28 GMT 2011 Olly Betts * Backport change from trunk: * hashterm.cc: Avoid unnecessary temporary string object. Thu Jun 23 00:26:13 GMT 2011 Olly Betts * Backport change from trunk: * hashterm.cc: Fix comment typo. Wed Jun 22 16:59:05 GMT 2011 Olly Betts * Backport change from trunk: * xapian-omega.spec.in: We're ABI compatible within a release series so make dependency on xapian-core-libs >= rather than =. Wed Jun 22 16:55:10 GMT 2011 Olly Betts * Backport change from trunk: * scriptindex.cc: Avoid unnecessary temporary string object. Mon Jun 13 08:00:16 GMT 2011 Olly Betts * Backport change from trunk: * docs/termprefixes.rst: Expand to document mapping a user prefix to multiple term prefixes. Mon Jun 13 03:23:47 GMT 2011 Olly Betts * docs/overview.rst: Improve documentation of htdig_noindex. Sun Jun 12 11:52:29 GMT 2011 Olly Betts * NEWS: Final update for 1.2.6. Fri Jun 10 12:02:32 GMT 2011 Olly Betts * NEWS,configure.ac: Update in preparation for 1.2.6. Fri Jun 10 03:28:33 GMT 2011 Olly Betts * templates/inc/anyallexactradio: Remove unused duplicate of anyallradio. Fri Jun 10 03:21:25 GMT 2011 Olly Betts * configure.ac,omindex-config.cc,omindex-config.html: Strip out partly written and long untouched omindex-config utility. Thu Jun 09 14:20:46 GMT 2011 Olly Betts * weight.cc: Fix a compiler warning (I failed to note the compiler unfortunately). Sun May 29 13:00:26 GMT 2011 Olly Betts * templates/query: Make search query input type=search. Sun May 29 12:24:43 GMT 2011 Olly Betts * templates/query: Autofocus the search query input (using HTML autofocus attribute with Javascript fallback for older browsers). (ticket#544) Wed May 25 14:33:18 GMT 2011 Olly Betts * docs/omegascript.rst: Correct the documentation of the colours used by $highlight{}. Fri May 13 05:50:35 GMT 2011 Olly Betts * docs/overview.rst: Add using unoconv as more complex example of using --filter (ticket#324). Wed Apr 20 07:00:56 GMT 2011 Olly Betts * NEWS: Fix typo; clarify wording. Mon Apr 04 13:58:06 GMT 2011 Olly Betts * NEWS: Update release date. Mon Apr 04 13:53:34 GMT 2011 Olly Betts * templates/xml: Fix syntax error from recent edit. Sun Apr 03 10:54:04 GMT 2011 Olly Betts * NEWS,configure.ac: Update for 1.2.5. Sat Apr 02 14:15:32 GMT 2011 Olly Betts * templates/query: Use $add{$field{modtime}} to ensure it is numeric. Sat Apr 02 14:14:06 GMT 2011 Olly Betts * templates/godmode: More missing escaping. Sat Apr 02 14:07:45 GMT 2011 Olly Betts * templates/xml: Remove double escaping. Sat Apr 02 13:58:44 GMT 2011 Olly Betts * templates/query: More escaping fixes. Sat Apr 02 13:55:03 GMT 2011 Olly Betts * templates/emptydocs,templates/opensearch,templates/xml: More missing escaping. Sat Apr 02 12:34:42 GMT 2011 Olly Betts * templates/query: Add missing escaping. Sat Apr 02 11:48:43 GMT 2011 Olly Betts * templates/godmode: Add missing escaping. Sat Apr 02 10:34:58 GMT 2011 Olly Betts * templates/xml: Remove support for undocumented HILITECLASS CGI variable. There's no evidence I can find using Google code search or web search that this has been used anywhere, and it's problematic to escape properly. Sat Mar 26 14:51:36 GMT 2011 Olly Betts * INSTALL: Copy new Multi-Arch section from xapian-core/INSTALL. Replace VPATH section with better equivalent from xapian-core/INSTALL. Wed Mar 23 15:21:41 GMT 2011 Olly Betts * htmlparse.cc,htmlparse.h,htmlparsetest.cc,metaxmlparse.cc, metaxmlparse.h,myhtmlparse.cc,myhtmlparse.h,omindex.cc,svgparse.cc, svgparse.h,xmlparse.cc,xmlparse.h,xpsxmlparse.cc,xpsxmlparse.h: Instead of throwing a bool to abandon parsing, change methods to return bool to signify if they want to continue parsing or not. This is a bit faster (~0.23% for indexing a lot of HTML files). Mon Mar 21 05:48:08 GMT 2011 Olly Betts * myhtmlparse.cc,myhtmlparse.h,omindex.cc: Add --ignore-exclusions option, which will index HTML files despite meta robots tags, etc - omindex is often used in environments where such exclusions aren't relevant. Fri Mar 18 10:24:58 GMT 2011 Olly Betts * omindex.cc: Just report the mimetype as unknown instead of saying "unknown Office 2007 MIME subtype". Fri Mar 18 05:53:21 GMT 2011 Olly Betts * diritor.h: Avoid using S_IRUSR, etc under __WIN32__. Fri Mar 18 03:00:16 GMT 2011 Olly Betts * docs/overview.rst,omindex.cc: Ignore *.css and *.js by default too. Thu Mar 17 23:34:07 GMT 2011 Olly Betts * omindex.cc: For skip messages which are only to be shown in verbose mode, call skip with new SKIP_VERBOSE_ONLY flag. Pass new SKIP_SHOW_FILENAME flag for skip messages shown before we say what file we are indexing so we know to show the filename even in verbose mode. Thu Mar 17 03:47:54 GMT 2011 Olly Betts * omindex.cc: Restore handling of exceptions from DirectoryIterator::get_type(), and handle exceptions from DirectoryIterator::next() which ended up at the top level before (though they probably never happen, at least on Linux). Wed Mar 16 06:19:01 GMT 2011 Olly Betts * omindex.cc: Push all the code associated with indexing a file into index_file(). Wed Mar 16 02:55:53 GMT 2011 Olly Betts * omindex.cc: Push try block around index_file() call into the function. Wed Mar 16 02:51:52 GMT 2011 Olly Betts * omindex.cc: Factor out handling for skipping files, and improve these messages by consistently reporting the filename. Tue Mar 15 12:47:12 GMT 2011 Olly Betts * docs/Makefile.am,docs/index.rst: Add index page which links to all the other documentation pages. Tue Mar 15 12:20:30 GMT 2011 Olly Betts * omindex.cc: Add --empty-docs option to allow documents we extract no body text from to be indexed (existing behaviour), skipped, or reported and then indexed. Fri Mar 04 14:13:47 GMT 2011 Olly Betts * docs/omegascript.rst: Minor improvements. Wed Mar 02 11:17:42 GMT 2011 Olly Betts * NEWS: Update. Wed Mar 02 06:14:41 GMT 2011 Olly Betts * docs/termprefixes.rst: New standard prefix E for filename extension. * omindex.cc: Index file extension as E-prefixed term. Mon Feb 28 13:45:32 GMT 2011 Olly Betts * omindex.cc: Tell xls2csv not to quote fields and to put spaces not commas between them. Fixes indexing of numeric fields, and means we don't need to use our CSV parser to get a sample. Mon Feb 28 12:10:53 GMT 2011 Olly Betts * xmlparse.cc: Add whitespace between chunks of text extracted from Microsoft Office 2007 formats. Wed Feb 23 12:34:28 GMT 2011 Olly Betts * templates/xml: Try $field{caption} (which is what omindex sets) before $field{title} when getting a value for the hit tag's title attribute - this is consistent with how the query template gets the title. Add new type attribute which gives $field{type}. Thu Feb 17 05:19:28 GMT 2011 Olly Betts * templates/xml: Add DBSize attribute to element. Wed Feb 16 03:19:57 GMT 2011 Olly Betts * Makefile.am,omindex.cc,query.cc,urlencode.cc,urlencode.h: Update URL encoding to follow RFC3986. Tue Feb 15 03:20:40 GMT 2011 Olly Betts * omindex.cc: Encode reserved characters in URLs - now links to files with names containing '#' and '?' will work. Sun Jan 23 13:27:48 GMT 2011 Olly Betts * docs/overview.rst,omindex.cc: Later Microsoft Works version produce .xlr spreadsheet files, which are apparently XL files with a different extension, so handle them as XL files. Thu Jan 20 11:07:46 GMT 2011 Olly Betts * docs/omegascript.rst,omega.cc,query.cc,templates/query: Allow QueryParser flags to be set from OmegaScript (ticket#418). Sat Jan 15 11:14:32 GMT 2011 Olly Betts * NEWS: Update from ChangeLog, 1.0.22 and 1.0.23. Wed Jan 12 02:21:59 GMT 2011 Olly Betts * query.cc: Fix double Content-Type header in some error reporting situations (regression introduced in 1.2.4). Mon Jan 10 10:00:00 GMT 2011 Olly Betts * omindex.cc,pkglibbindir.cc,pkglibbindir.h: Fix typo in function name (get_pkglibdindir() -> get_pkglibbindir()). Mon Jan 10 09:50:38 GMT 2011 Olly Betts * diritor.cc,diritor.h: Don't define or try to set euid member of DirectoryIterator on platforms where we aren't going to use it. Mon Jan 10 09:15:24 GMT 2011 Olly Betts * diritor.h: Stub out get_owner() and get_group() for __WIN32__. Fri Dec 24 10:35:29 GMT 2010 Olly Betts * NEWS: Update from ChangeLog. Thu Dec 23 01:53:06 GMT 2010 Olly Betts * diritor.cc: Fix to work with older libmagic which doesn't have MAGIC_MIME_TYPE (e.g. on Ubuntu hardy). Sun Dec 19 12:39:23 GMT 2010 Olly Betts * NEWS,configure.ac: 1.2.4. Sun Dec 19 12:37:58 GMT 2010 Olly Betts * query.cc: Disable permission filtering based on $REMOTE_USER as that will break some existing installations if users upgrade, which we don't want. Probably this should be specifiable from OmegaScript but it's not worth delaying 1.2.4 while we sort this out. Sun Dec 19 02:46:17 GMT 2010 Olly Betts * docs/overview.rst,omindex.cc: Change the new name for "--preserve-unupdated" from "--preserve-removed" to "--no-delete". Sun Dec 19 02:32:29 GMT 2010 Olly Betts * query.cc: Fix comment typo. Fri Dec 17 12:45:47 GMT 2010 Olly Betts * commonhelp.cc,commonhelp.h,omindex.cc,scriptindex.cc: Swap the meanings of -v and -V in omindex for consistency with scriptindex and typical short options for --verbose and --version in other packages. For backward compatibility, "omindex -v" is handled specially and still reports the version. Fri Dec 17 08:31:29 GMT 2010 Olly Betts * utf8convert.cc: Fix built in converter to handle space in charset names, which fixes failing utf8converttest when iconv isn't available. Fri Dec 17 05:36:36 GMT 2010 Olly Betts * utf8convert.cc: Rework the fixing up of charset names which iconv() doesn't understand a little. Thu Dec 16 06:35:46 GMT 2010 Olly Betts * loadfile.cc: If fstat() fails, preserve the errno value rather than letting close() clobber it. Thu Dec 16 06:31:30 GMT 2010 Olly Betts * loadfile.cc: Fix file descriptor leak if load_file() is called on something which isn't a file (found by cppcheck run on the Debian archive). This case probably couldn't occur in omindex, but could if you used the LOADFILE action in scriptindex. Thu Dec 09 10:58:48 GMT 2010 Olly Betts * docs/omegascript.rst: Replace $simplecommand with $query - a concrete example is more useful. Improve mark-up. * docs/termprefixes.rst: Remove mention of pre-0.9.7 use of W prefix. Thu Nov 18 12:25:50 GMT 2010 Olly Betts * omega.cc: Fix reversed condition in recent exception reporting fix. Wed Nov 17 03:46:24 GMT 2010 Olly Betts * diritor.cc: Add missing magic_cookie argument to calls to magic_error(). Sat Nov 13 12:17:51 GMT 2010 Olly Betts * omindex.cc: Build up document data with += for efficiency. Sat Nov 13 12:08:09 GMT 2010 Olly Betts * omindex.cc: Index author with A prefix. Sat Nov 13 12:00:50 GMT 2010 Olly Betts * omindex.cc: A file extension can't contain a '/'. Sat Nov 13 11:50:31 GMT 2010 Olly Betts * omindex.cc: Index the leafname of the file (without any extension) as if it contained additional keywords. Sat Nov 13 11:32:09 GMT 2010 Olly Betts * omindex.cc: If a filter command isn't installed, flag this in the commands map so we don't try running this command again for any file with the same mimetype (previously we'd rerun it for a different extension which gave the same mimetype). Fri Nov 12 09:11:35 GMT 2010 Olly Betts * Makefile.am,configure.ac: Add -no-undefined to AM_LDFLAGS on platforms which need it to dynamically link such as cygwin (need to do this taken from ticket#282). Fri Nov 12 03:35:56 GMT 2010 Olly Betts * omindex.cc: Report MIME type if it's unknown to us. Remove debug output line. Update comments. Fri Nov 12 03:32:27 GMT 2010 Olly Betts * diritor.cc: Report errors from libmagic. Fri Nov 12 02:58:20 GMT 2010 Olly Betts * diritor.cc,diritor.h: Fix to compile when libmagic is detected. Fri Nov 12 01:40:24 GMT 2010 Olly Betts * diritor.cc: Add missing class qualifier to method definition. Fri Nov 12 01:25:11 GMT 2010 Olly Betts * INSTALL: Mention libmagic in install instructions. Fri Nov 12 01:16:21 GMT 2010 Olly Betts * Makefile.am,configure.ac,diritor.cc,diritor.h,omindex.cc: Optionally use libmagic to detect MIME types for files for which we have no extension mapping, which allows us to handle files with a misleading extension, and files with no extension. (ticket#114) Thu Nov 11 23:23:07 GMT 2010 Olly Betts * omindex.cc: Refactor slightly to handle the unknown extension case up front, so we lose an indentation level for the known extension case. Thu Nov 11 12:25:03 GMT 2010 Olly Betts * omindex.cc: Add new --filter option to allow the user to specify new filters without patching omindex.cc. * docs/overview.rst: Document --filter. Thu Nov 11 02:51:55 GMT 2010 Olly Betts * omindex.cc: Factor out handling for external filter programs which simply return UTF-8 text on stdout. Mon Nov 08 10:58:46 GMT 2010 Olly Betts * omindex.cc,svgparse.cc,svgparse.h: Extract author for SVG files. Mon Nov 08 10:40:09 GMT 2010 Olly Betts * omindex.cc: Extract metadata from Microsoft Office 2007 file formats. Mon Nov 08 10:21:13 GMT 2010 Olly Betts * myhtmlparse.cc,myhtmlparse.h,omindex.cc: Extract author from HTML documents. Mon Nov 08 09:46:03 GMT 2010 Olly Betts * omindex.cc: Escape wildcard patterns being passed to unzip - in the unlikely event that one of these matched files in or under the current directory, we might fail to extract all the files we wanted to. Mon Nov 08 05:03:41 GMT 2010 Olly Betts * metaxmlparse.cc,metaxmlparse.h,omindex.cc: Extract author from OpenDocument documents. Mon Nov 08 03:18:26 GMT 2010 Olly Betts * omindex.cc: Extract author from PDF metadata. Mon Nov 08 03:15:17 GMT 2010 Olly Betts * metaxmlparse.h: Initialise field member variable. Mon Nov 08 00:28:07 GMT 2010 Olly Betts * omindex.cc: Index text in headers and footers for .odt and .docx files. Thu Nov 04 11:55:58 GMT 2010 Olly Betts * omega.cc,omega.h,query.cc: If we catch an error early on, make sure that if it's appropriate, we write out a "Content-Type:" HTTP header and end the headers. Thu Nov 04 11:39:10 GMT 2010 Olly Betts * utf8converttest.cc: Add back in testcases for charset names with hyphens in. Thu Nov 04 09:01:43 GMT 2010 Olly Betts * utils.cc: Fix misuse of BUFSIZE which should be sizeof(buf) (issue reported by compilation with CPPFLAGS=-D_GLIBCXX_DEBUG). Thu Nov 04 09:01:08 GMT 2010 Richard Boulton * utf8convert.cc,utf8converttest.cc: If iconv can't handle a charset, check if it's of the form (UTF|UCS)[_ ]?.* and if so, convert to the official hypenated form. Should fix failure of utf8converttest on OSX, where it fails due to iconv not supporting "UTF16". Tue Nov 02 09:48:19 GMT 2010 Olly Betts * diritor.cc,diritor.h,loadfile.cc,loadfile.h,md5wrap.cc,md5wrap.h, omindex.cc,scriptindex.cc: Use O_NOATIME if available and either the file is owned by the current euid, or the current euid is 0 (i.e. we're running as root). Fixes ticket#222. Fri Oct 29 14:26:25 GMT 2010 Olly Betts * omindex.cc: Use the CSV parser to generate a nicer sample for files of type application/vnd.ms-excel. Fri Oct 29 09:26:52 GMT 2010 Olly Betts * Makefile.am: Put $(PCRE_LIBS) in libtransform_la_LIBADD rather than omega_LDADD (more correct, but probably doesn't actually make any difference). Thu Oct 28 14:46:11 GMT 2010 Olly Betts * omindex.cc: Disable more output unless --verbose is specified. Don't flush the "Indexing" partial message until we get to the potentially time consuming actions. Thu Oct 28 13:54:44 GMT 2010 Olly Betts * docs/overview.rst: Improve mark-up, and tweak wording in a few places. Thu Oct 28 13:46:36 GMT 2010 Olly Betts * docs/overview.rst: Update docs for --duplicates and --preserve-removed. Thu Oct 28 13:27:01 GMT 2010 Olly Betts * omindex.cc: Deprecated "--preserve-nonduplicates" in favour of new long option "--preserve-removed" which does the same thing, but has a (hopefully) clearer name. Rename the variable it controls from preserve_unupdated to delete_removed_documents (with the opposite sense). Thu Oct 28 12:08:59 GMT 2010 Olly Betts * configfile.cc: Only append '/' to directory values if they don't already have a trailing '/'. Thu Oct 28 11:49:54 GMT 2010 Olly Betts * runfilter.cc: Make the memory limit for filter processes the size of physical memory, not 7/8 of this value, which is a little less arbitrary (ticket#424). Thu Oct 28 11:47:38 GMT 2010 Olly Betts * omindex.cc: Under --duplicate=ignore, fix so that old documents which aren't seen get deleted, which wasn't implemented before (to suppress this deletion, pass -p as well). Thu Oct 28 10:38:21 GMT 2010 Olly Betts * omindex.cc: Track how many documents in the index we haven't seen in this index run - if this is 0, we don't need to check for docs to delete at all; otherwise we can at least use it to know when we have found them all. Use a PostingIterator over all documents to avoid having to catch exceptions from delete_document() for gaps in the used docids. Thu Oct 28 04:52:36 GMT 2010 Olly Betts * omindex.cc: Add quotes around directory name in "Entering directory" message. Add directory name to "skipping directory" error message. Thu Oct 28 04:50:37 GMT 2010 Olly Betts * omindex.cc: Document --verbose in --help. Actually recognise -V. Thu Oct 28 04:01:31 GMT 2010 Olly Betts * omindex.cc: Move the directory iteration loop out of the try/catch block for starting the iteration, which means it's indented by a whole level less. Thu Oct 28 03:47:30 GMT 2010 Olly Betts * omindex.cc: Add --verbose option, and disable the less interesting output unless it is specified. Thu Oct 28 03:34:44 GMT 2010 Olly Betts * omindex.cc: Eliminate the message "Caught unknown exception in index_directory, rethrowing" as it isn't actually informative. Thu Oct 28 01:43:44 GMT 2010 Olly Betts * omindex.cc: Variable dbpath doesn't need to be global. Thu Oct 28 01:28:10 GMT 2010 Olly Betts * omindex.cc: The Host and Path terms are the same for every document in a single invocation of omindex, so calculate them just once up front. Thu Oct 28 01:13:36 GMT 2010 Olly Betts * omindex.cc: Eliminate the leading slash on filenames in output, so they are now relative filenames on the system. This also simplifies path building internally. Wed Oct 27 09:51:51 GMT 2010 Olly Betts * omindex.cc: Use rpm's --qf option to produce output which is simpler to parse. Wed Oct 27 09:32:22 GMT 2010 Olly Betts * docs/overview.rst,omindex.cc: Add support for indexing RPM packages (ticket#493). Wed Oct 27 06:07:59 GMT 2010 Olly Betts * docs/overview.rst,omindex.cc: Add support for indexing Debian package files (ticket #493). Wed Oct 27 05:37:02 GMT 2010 Olly Betts * docs/overview.rst,omindex.cc: Quietly ignore files with mimetype set to "ignore". The initial list of extensions set to ignore is: .a .dll .dylib .exe .lib .o .obj .so Wed Oct 27 02:25:01 GMT 2010 Olly Betts * omindex.cc: Report get_description() for Xapian exceptions, which is provides additional information above get_msg(). Wed Oct 27 01:56:08 GMT 2010 Olly Betts * omindex.cc,query.cc,values.h: Add file size as a value, and set up a NumberValueRangeProcessor so size: works in the query (has to be in bytes currently). Wed Oct 27 01:31:25 GMT 2010 Olly Betts * scriptindex.cc: Report get_description() for Xapian exceptions, which is provides additional information above get_msg(). Tue Oct 26 12:00:58 GMT 2010 Olly Betts * docs/overview.rst: Document the new emptydocs template. Tue Oct 26 11:51:31 GMT 2010 Olly Betts * docs/omegascript.rst,query.cc: Add new $emptydocs command which returns a list of documents with doclength zero. * query.cc: Extend $field to take an optional DOCID argument, rather than always using the context from $hitlist. * templates/emptydocs: New template which lists documents with doclength zero. Thu Oct 21 12:05:23 GMT 2010 Olly Betts * configure.ac,unixperm.cc: Fix to build on platforms where getgrouplist() exists but takes int* not gid_t* (e.g. Mac OS X). Wed Oct 20 10:30:13 GMT 2010 Olly Betts * omindex.cc,scriptindex.cc: Add boolean terms with add_boolean_term() so they get wdf of 0 and don't contribute to document length. Sat Oct 16 06:13:23 GMT 2010 Olly Betts * configure.ac: Probe for any options needed to enable large file support. Handling files >= 2GB isn't especially useful, but more importantly this is needed to allow omindex to index files on filing systems with 64 bit inodes on some platforms (e.g. 32-bit Linux). Mon Oct 11 11:11:07 GMT 2010 Olly Betts * Makefile.am: Drop special case to remove man pages on "make clean" in maintainer-mode. Wed Sep 29 04:14:21 GMT 2010 Olly Betts * Makefile.am,configure.ac,query.cc,unixperm.cc,unixperm.h: Pull out permission checks into a separate file and check Unix user and group permissions based on environmental variable REMOTE_USER, if set. Tue Sep 28 08:06:00 GMT 2010 Olly Betts * Makefile.am: Ship common/realtime.h. Tue Sep 28 06:32:10 GMT 2010 Olly Betts * query.cc: Apply permission filters if USER and/or GROUP are set. Tue Sep 28 06:14:50 GMT 2010 Olly Betts * ./: Update svn:externals to latest common from xapian-core. * query.cc: Use RealTime::now() to time running the query. Include more enquire set-up in the time. Tue Sep 28 05:26:07 GMT 2010 Olly Betts * omindex.cc: Index file owner and read permissions, to allow finding documents with a particular owner, and so searches can be restricted to documents a user is able to read. * docs/termprefixes.rst: Document term prefixes used by the above. Tue Sep 28 05:20:01 GMT 2010 Olly Betts * diritor.h: Rename get_other_read() to is_other_readable() for consistency. Tue Sep 28 04:16:55 GMT 2010 Olly Betts * diritor.cc,diritor.h: Rearrange so that the setting of statbuf_valid gets inlined so the compiler should be able to optimise out subsequent calls to call_stat(). Tue Sep 28 04:10:28 GMT 2010 Olly Betts * diritor.h: Add methods to read the owner and group, and to check who can read the file. Tue Sep 28 01:39:15 GMT 2010 Olly Betts * NEWS: Fix typo. Tue Sep 28 01:33:44 GMT 2010 Olly Betts * NEWS: Fix whitespace oddities. Tue Sep 28 01:31:46 GMT 2010 Olly Betts * NEWS: Update from ChangeLog. Tue Sep 28 01:27:41 GMT 2010 Olly Betts * omindex.cc: Improve --help for --mime-type option. Mon Sep 20 06:50:45 GMT 2010 Olly Betts * omindex.cc,svgparse.cc,svgparse.h: Extract any document title and keywords from SVG files. Mon Sep 20 06:49:44 GMT 2010 Olly Betts * htmlparse.cc: Call closing_tag() for XML empty tag syntax (like ""). Mon Sep 20 05:30:54 GMT 2010 Olly Betts * Makefile.am,docs/overview.rst,omindex.cc,svgparse.cc,svgparse.h: Add support for indexing SVG files. Tue Sep 07 04:39:59 GMT 2010 Olly Betts * outlookmsg2html.in: If the required perl modules aren't available, exit with status 127 which omindex interprets as "filter not installed" and won't try further .msg files. Tue Sep 07 02:24:36 GMT 2010 Olly Betts * Makefile.am,configure.ac,docs/overview.rst,omindex.cc, outlookmsg2html.in,pkglibbindir.cc,pkglibbindir.h: Add support for indexing .msg files from Microsoft Outlook. (ticket#334) Tue Aug 31 06:32:15 GMT 2010 Olly Betts * omindex.cc: Fix handling of quoting in CSV files to match what's most common. Tue Aug 31 05:41:13 GMT 2010 Olly Betts * docs/overview.rst,omindex.cc: The V in CSV is Values not Variable. Mon Aug 30 14:56:36 GMT 2010 Olly Betts * docs/overview.rst,omindex.cc: Add support for indexing .csv files. Sat Aug 28 11:46:22 GMT 2010 Olly Betts * cdb_find.cc,cdb_init.cc,cgiparam.cc,date.cc,md5.cc,query.cc,utils.cc, values.h: Fix to compile with Sun C++. Sat Aug 28 11:36:25 GMT 2010 Olly Betts * omega.cc: An ESet can't contain empty terms, so there's no need to check for them. Tue Aug 24 05:58:28 GMT 2010 Olly Betts * NEWS,configure.ac: Update for 1.2.3. Mon Aug 23 15:08:11 GMT 2010 Olly Betts * xapian-omega.spec.in: Don't run autoreconf - it's no longer required. Tue Aug 03 14:11:35 GMT 2010 Olly Betts * docs/termprefixes.rst: Update "flint and quartz" to "flint and chert" as quartz is no longer supported. Give exact term length limit for flint and chert. Sun Jun 27 05:00:39 GMT 2010 Olly Betts * NEWS,configure.ac: Update for 1.2.2. Sat Jun 26 15:59:59 GMT 2010 Olly Betts * NEWS.SKELETON: Add blank line to the end. Sat Jun 26 15:59:05 GMT 2010 Olly Betts * NEWS.SKELETON: Add template NEWS entry. Tue Jun 22 13:55:11 GMT 2010 Olly Betts * NEWS: Sync with 1.0.21. * NEWS,configure.ac: Update for 1.2.1. Sun Jun 13 11:55:40 GMT 2010 Olly Betts * freemem.cc: Merge in __WIN32__ implementation from perftest in xapian-core. Fri May 14 01:39:43 GMT 2010 Olly Betts * freemem.cc: Use "safeunistd.h" instead of . Wed Apr 28 13:38:33 GMT 2010 Olly Betts * NEWS: Sync with 1.0.20. Wed Apr 28 06:44:56 GMT 2010 Olly Betts * configure.ac: Tell libtool not to link in deplibs on platforms where we know they aren't needed. * configure.ac: On Linux, extract the library search path from ldconfig which gives us the default entries reliably. * NEWS,configure.ac: 1.2.0. Thu Apr 15 04:32:06 GMT 2010 Olly Betts * NEWS,configure.ac: Update for 1.1.5. Mon Feb 15 14:00:26 GMT 2010 Olly Betts * configure.ac: Update for 1.1.4. Mon Feb 15 13:51:44 GMT 2010 Olly Betts * NEWS: Add missing notes for 1.1.2 and 1.1.1 including changes from 1.0.14 and 1.0.13 respectively. Mon Feb 15 13:28:12 GMT 2010 Olly Betts * NEWS: Update from ChangeLog and 1.0.18. Mon Feb 08 00:48:44 GMT 2010 Olly Betts * Makefile.am: Need to ship common/omassert.h. Sun Feb 07 23:03:45 GMT 2010 Olly Betts * Makefile.am: Need to ship common/str.h. Sun Feb 07 21:40:03 GMT 2010 Olly Betts * Makefile.am,omega.cc,omindex.cc,query.cc,utils.cc,utils.h: Use the optimised str() routine instead of int_to_string() and long_to_string(). Fri Feb 05 23:29:12 GMT 2010 Olly Betts * omindex.cc: Increase the wdf boost for the document title from 2 to 5, since 2 isn't really enough. Thu Feb 04 03:20:02 GMT 2010 Olly Betts * Makefile.am,configure.ac,runfilter.cc: Use safesyswait.h. * runfilter.cc: Reformat header to @file doxygen comment. Put '#include "runfilter.h"' right after . Wed Dec 10 00:15:10 GMT 2009 Olly Betts * NEWS: Update from ChangeLog. Wed Dec 09 00:26:19 GMT 2009 Olly Betts * myhtmlparse.cc: Add missing "using namespace std;". Wed Dec 09 00:20:38 GMT 2009 Olly Betts * htmlparse.cc: Make the default charset "utf-8" not "UTF-8" as we lower case explicitly specified character sets to compare to see if we need to reparse, so this avoids a reparse when UTF-8 is explicitly specified as well as the default. Tue Dec 08 23:56:46 GMT 2009 Olly Betts * scriptindex.cc: Don't bomb out if indexing is disallowed or we hit for a document which had an overridden character set. Fixes ticket#410. Wed Nov 18 10:48:47 GMT 2009 Olly Betts * NEWS,configure.ac: Update for 1.1.3. Wed Nov 18 02:37:34 GMT 2009 Olly Betts * NEWS: Update from 1.0.17 and ChangeLog. Mon Nov 16 09:08:12 GMT 2009 Olly Betts * utf8converttest.cc: Charset "8859_1" isn't understood by Solaris libiconv, and isn't likely to be specified on a page, so just test it for our built-in convertor and GNU libc. Wed Nov 11 04:52:25 GMT 2009 Olly Betts * configure.ac: Also check for socketpair with -lxnet if it isn't found without, which enables resource limits on Solaris, and possibly some other platforms. Fixes ticket#412. Wed Nov 04 01:51:41 GMT 2009 Olly Betts * freemem.cc: On Linux, _SC_AVPHYS_PAGES excludes pages used by the OS VM cache, so will often return a really low value, so instead use _SC_PHYS_PAGES. Reported by Rune Kock in Debian bug#548987. Also explains ticket#358. Wed Nov 04 00:54:38 GMT 2009 Olly Betts * common/: Sync with latest version from xapian-core to pick up getopt fix for Mac OS X 10.6. Mon Nov 02 09:32:22 GMT 2009 Olly Betts * omindex.cc: Use delete[] (not delete) for array allocated by new[]. Mon Nov 02 07:08:13 GMT 2009 Olly Betts * runfilter.cc: Fix likely crash if read() is interrupted by a signal. Identified by Coverity's Scan. Mon Nov 02 06:47:01 GMT 2009 Olly Betts * scriptindex.cc: Extend exception handling to the whole of main. Xapian::Stem("english") can't actually throw, but that's not obvious to static analysis tools, and it is more robust to wrap the whole of main, and reduces indentation. Mon Nov 02 06:32:41 GMT 2009 Olly Betts * omindex.cc,scriptindex.cc: Tighten up the type of the error we catch to detect an unknown stemming language. Thu Sep 17 12:13:10 GMT 2009 Olly Betts * NEWS: Update from ChangeLog. Thu Sep 10 13:33:06 GMT 2009 Olly Betts * configure.ac: Default to looking for xapian-config-1.1. Thu Sep 10 06:46:55 GMT 2009 Olly Betts * NEWS: Sync changes from 1.0.15 and 1.0.16. Wed Sep 09 13:32:25 GMT 2009 Olly Betts * omega.cc,query.cc,query.h: Fix cross-site scripting vulnerability in reporting of exceptions (CVE-2009-2947). Fri Aug 28 15:30:07 GMT 2009 Richard Boulton * configure.ac: Check for PERL if in maintainer mode, not just when building documentation, because making the omegascript vim syntax mode requires it. Wed Aug 26 14:17:06 GMT 2009 Olly Betts * templates/query: www.xapian.org -> xapian.org. Tue Aug 25 11:15:38 GMT 2009 Olly Betts * gen-omegascript-vim: Fix swapped arguments to perl mkdir function. Tue Aug 25 10:39:29 GMT 2009 Olly Betts * gen-omegascript-vim: Add GPL licence boilerplate. Tue Aug 25 10:29:07 GMT 2009 Olly Betts * gen-omegascript-vim: Need to create "extra" for a VPATH build. Tue Aug 25 08:39:00 GMT 2009 Olly Betts * Makefile.am: Fix for VPATH build. Tue Aug 25 06:38:08 GMT 2009 Olly Betts * Makefile.am,extra/omegascript.vim,extra/omegascript.vim.in, gen-omegascript-vim: The list of OmegaScript commands in the vim mode was rather out of date, and a few commands were misclassified. Fix both problems and avoid future recurrences by automatically generating those lists from the command list in query.cc. Sat Aug 15 11:31:56 GMT 2009 Olly Betts * NEWS: Update from ChangeLog. Wed Aug 05 03:50:54 GMT 2009 Olly Betts * omindex.cc: Implement correct handling of paths when calling external filter programs on Microsoft Windows. Thu Jul 23 12:07:24 GMT 2009 Olly Betts * omindex.cc: Remove pointless fallback code. Thu Jul 23 12:06:37 GMT 2009 Olly Betts * templates/inc/toptermsjs: Use double-quotes rather than single quotes for parameter values on the