1In this directory: 2 3html-xml-utils-*.tar.gz 4 A number of simple utilities for manipulating HTML and XML files. 5 See INSTALL for generic installation instructions. 6 Get the source at: http://www.w3.org/Tools/HTML-XML-utils/ 7 8htmlutils-*.tar.gz 9 Old versions (before version 0.1) 10 11 12Note 1: Your package manager may have a precompiled copy 13already. There are versions in Debian, Ubuntu, Macports and others. In 14that case you need to download from here only if you want a different 15version or want to hack on the source. 16 17Note 2: the names changed in version 5.0: most programs got an "hx" 18prefix. Please, uninstall any version < 5.0 before installing a 19version >= 5.0 20 21 22cexport (1) - create headerfile of exported declarations from a C file 23hxaddid (1) - add ID's to selected elements 24hxcite (1) - replace bibliographic references by hyperlinks 25hxcite-mkbib (1) - expand references and create bibliography 26hxcopy (1) - copy an HTML file while preserving relative links 27hxcount (1) - count elements and attributes in HTML or XML files 28hxextract (1) - extract selected elements 29hxclean (1) - apply heuristics to correct an HTML file 30hxprune (1) - remove marked elements from an HTML file 31hxincl (1) - expand included HTML or XML files 32hxindex (1) - create an alphabetically sorted index 33hxmkbib (1) - create bibliography from a template 34hxmultitoc (1) - create a table of contents for a set of HTML files 35hxname2id - move some ID= or NAME= from A elements to their parents 36hxnormalize (1) - pretty-print an HTML file 37hxnum (1) - number section headings in an HTML file 38hxpipe (1) - convert XML to a format easier to parse with Perl or AWK 39hxprintlinks (1) - number links & add table of URLs at end of an HTML file 40hxremove (1) - remove selected elements from an XML file 41hxtabletrans (1) - transpose an HTML or XHTML table 42hxtoc (1) - insert a table of contents in an HTML file 43hxuncdata (1) - replace CDATA sections by character entities 44hxunent (1) - replace HTML predefined character entities to UTF-8 45hxunpipe (1) - convert output of pipe back to XML format 46hxunxmlns (1) - replace "global names" by XML Namespace prefixes 47hxwls (1) - list links in an HTML file 48hxxmlns (1) - replace XML Namespace prefixes by "global names" 49asc2xml, xml2asc (1) - convert between UTF8 and &#nnn; entities 50hxref (1) - generate cross-references 51hxselect (1) - extract elements that match a (CSS) selector 52 53 54 55This package is configured with automake/autoconf. Generic 56instructions are in the file INSTALL. Here are some specific problems 57that may arise: 58 591) Error when running lex: 60 61 lex scan.l && mv lex.yy.c scan.c 62 "scan.l":line 2: Error: missing translation value 63 64 The scan.l file uses features of flex that do not exist in lex. 65 However, it is not necessary to run lex, since the file scan.c is 66 provided in the package. Just do a "touch scan.c" to make sure 67 "make" will not try to generate it anew. 68 692) Warning about "libidn not found": 70 71 Without libidn2 or libidn, hxwls will not be able to translate 72 Internationalized Domain Names to ASCII (option -a). You can install 73 either libidn2 or libidn. 74 75 If you install them in a non-standard location, use --with-libidn2 76 or --with-libidn when invoking ./configure. E.g., if you install 77 libidn from MacPorts on Mac OS X, run: 78 79 ./configure --with-libidn=/opt/local 80 81 82$Date: 2016/04/14 00:42:15 $ 83