• Home
  • History
  • Annotate
Name Date Size #Lines LOC

..03-May-2022-

libstd/H03-May-2022-4,5463,361

1.3.2a_to_1.3.2.diffH A D14-Jan-200426.7 KiB770688

Area.CH A D23-Nov-200310.9 KiB479365

Area.hH A D03-May-20225.2 KiB192109

CHANGESH A D14-Jan-200421.1 KiB589502

COPYINGH A D04-Oct-200117.6 KiB340281

CREDITSH A D23-Nov-2003953 3423

HTMLControl.CH A D23-Nov-200315.3 KiB598394

HTMLControl.hH A D03-May-20222.6 KiB9836

HTMLParser.CH A D23-Nov-2003111.8 KiB3,1762,847

HTMLParser.hH A D12-Aug-200212 KiB539475

HTMLParser.kH A D12-Aug-200212 KiB539506

HTMLParser.yH A D23-Nov-200325.8 KiB979828

INSTALLH A D31-Aug-20027.7 KiB187137

KNOWN_BUGSH A D23-Nov-20031.5 KiB3926

Makefile.inH A D03-May-20224.2 KiB13569

Properties.CH A D23-Nov-20034.1 KiB15077

Properties.hH A D03-May-20222.7 KiB8934

READMEH A D14-Jan-20048 KiB196148

RELEASE_NOTESH A D14-Jan-2004871 2113

TODOH A D12-Jan-20042.6 KiB6846

ascii.substitutesH A D12-Jan-20043.8 KiB256251

auto_aptr.hH A D04-Oct-20013.2 KiB9922

cmp_nocase.CH A D21-Sep-19992.6 KiB6717

cmp_nocase.hH A D04-Oct-20012.7 KiB7815

configureH A D03-May-20226.7 KiB323244

format.CH A D23-Nov-200343.3 KiB1,6411,209

format.hH A D03-May-20222.4 KiB7622

html.CH A D23-Nov-200316.2 KiB646508

html.hH A D04-Oct-200119.5 KiB630441

html2text.CH A D23-Nov-200311.1 KiB385291

ls-alRFH A D14-Jan-20044 KiB7168

pretty.styleH A D08-Jun-20011 KiB5754

sgml.CH A D23-Nov-200316 KiB427319

sgml.hH A D26-Oct-19992.6 KiB636

table.CH A D22-Jul-200218.3 KiB593389

urlistream.CH A D22-Jul-20029.4 KiB400233

urlistream.hH A D23-Nov-20032.8 KiB9637

README

1## This is the README file for html2text          Wed Jan 14 14:35:57 CET 2004
2## ===========================================================================
3
4html2text is a command line utility, written in C++, that converts HTML
5documents into plain text. It was written up to version 1.2.2 for and is
6copyrighted by GMRS Software GmbH, Unterschlei�heim.
7
8html2text reads HTML documents from standard input or a (local or remote)
9URI, and formats them into a stream of plain text characters that is written
10to standard output or into an output-file, preserving the original positions
11of table fields.
12
13ISO 8859-1 is used for output by default, plain-ASCII output can be chosen by
14setting the "-ascii" command line option. Type "html2text -help" for an
15overview of all command line options.
16
17Examples:
18html2text <file> | less
19html2text -o outfile.txt -ascii -nobs <file>
20
21The rendering is largely customisable through the "html2textrc" file and the
22"-style" command line option, that may be used to change quickly some
23formatting defaults. See the html2textrc(5) manual page for details.
24
25Although html2text was written for the conversion of HTML 3.2 documents, most
26constructs of HTML 4 are renderred as well, including most SGML entities,
27provided that they are written as "named entities" and not as a numeric value.
28The program tries to parse even XHTML documents and the HTML produced by word
29processors, but this not always as successful as other HTML parsers, because
30html2text is, as already said, for all that an HTML 3.2 converter.
31
32The program accepts also syntactically incorrect input, attempting to
33interpret it "reasonably". If the output is however not satisfactory, of if
34rendering fails completely, and you have the possibility to correct the HTML
35source code, you may want to use the "-unparse" or "-check" options to find
36out what exactly html2text's problem is.
37
38This program was written because GMRS was looking for a good, free
39HTML-to-text converter for UNIX, and they couldn't find one on the net. The
40best they could find was lynx, i.e. "lynx -dump", but lynx could not cope with
41tables.
42
43
44# ----------------------------------------------------------------------------
45# For information on compiling and installing the package on your system,
46# please refer to the file INSTALL.
47
48html2text was developed and is tested under Linux. However, it uses no
49O/S-specific features and should be easily portable to other platforms (at
50least to other UNIX-ish platforms). It is reported to compile and work on the
51following platforms:
52
53	+ AIX 4.3/g++ 2.95.1
54	+ AIX 4.3.2.0/g++ 2.95.2.1
55	+ CYGWIN_NT-5.0 1.5.4/gcc 3.2
56	+ FreeBSD 5.1/gcc 3.2.1
57	+ IRIX64 6.5/MIPS 7.41
58	+ Linux 2.2.18/g++ 2.95.2
59	+ Linux 2.4.16/g++ 2.95.3
60	+ Linux 2.4.22/gcc 3.3.2
61	+ NetBSD 1.6.1/gcc 2.95
62	+ SINIX/CDS++ 2.0A00
63
64You will find some hints for porting it to other platforms at the end of the
65file "INSTALL".
66
67Note for version 1.3.2(a): Version 1.3.2 is distributed in two "flavours":
681.3.2A contains changes needed for g++ 3.3 and later, which are not
69backwards-compatible. Thus, if you use an older (or other) compiler, please
70use version 1.3.2 (without 'a'), if you have g++ 3.3 (and up) installed,
71please use version 1.3.2A. Cross-patches (from 1.3.2 to 1.3.2a and viceversa)
72are included into the source code packages of either flavours.
73
74
75# ----------------------------------------------------------------------------
76# Published under the terms of the GNU General Public License.
77
78This program is free software; you can redistribute it and/or modify it under
79the terms of the GNU General Public License as published by the Free Software
80Foundation; either version 2 of the License, or (at your option) any later
81version.
82
83This program is distributed in the hope that it will be useful, but WITHOUT
84ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS
85FOR A PARTICULAR PURPOSE. See the GNU General Public License for more
86details.
87
88You should have received a copy of the GNU General Public License along with
89this program; if not, write to the Free Software Foundation, Inc., 59 Temple
90Place - Suite 330, Boston, MA 02111-1307, USA.
91
92
93# ----------------------------------------------------------------------------
94# GMRS agreed to change the program's license terms to GPL.
95
96Message-ID: <01c401c10f72$d11c3660$12c8a8c0@jag>
97Reply-To: "David Geffen" <geffen@one4net.com>
98From: "David Geffen" <david@one4net.com>
99To: <mbayer@zedat.fu-berlin.de>
100Date: Wed, 18 Jul 2001 12:17:14 +0200
101Organization: GMRS Software GmbH
102
103Hallo Herr Bayer,
104
105html2text darf unter die GPL veroeffentlicht werden, solange one4net keinerlei
106Nachteile oder Verpflichtungen dadurch entstehen.
107
108Mit freundlichen Gruessen
109David Geffen
110
111
112----- Original Message -----
113From: "Martin Bayer" <mbayer@zedat.fu-berlin.de>
114To: <geffen@one4net.com>
115Sent: Thursday, July 12, 2001 5:39 PM
116Subject: Re: Lizenzbedingungen von 'html2text'
117
118
119> Guten Tag!
120>
121> On Mon, Jun 25, 2001 at 03:23:31PM +0200, David Geffen wrote:
122> > > Aus diesem Grunde m�chte ich Sie herzlich bitten, zu �berlegen, ob es
123> > > f�r GMRS nicht m�glich w�re, 'html2text' nachtr�glich unter die GPL zu
124> > > stellen.
125> >
126> > ich bin erst heute zurueck aus dem Urlaub gekommen.
127> >
128> > Ich werde mich in den naechsten paar Tage dazu melden.
129>
130> Darf ich Sie fragen, ob Sie in dieser Angelegenheit bereits zu einem
131> Entschluss gekommen sind? Es ist mittlerweile gelungen, das Programm nach
132> g++3 zu portieren, und da w�re es sch�n, wenn bereits diese neue Version
133> unter GPL ver�ffentlicht werden k�nne.
134>
135> Mit den besten Gr��en
136> --
137> Martin Bayer
138>                                                 c.ne Ostiense, 212/E/15
139> E-Mail: mail@mbayer.de                          I-00154 Roma
140> WWW: http://www.mbayer.de                       GSM: +39 3476605285
141
142
143# ----------------------------------------------------------------------------
144# This program is not provided nor supported by GMRS any longer.
145
146Since GMRS decided not to develop nor to support this program any longer,
147they also did not provide its source code any more. With this, I realised,
148the source code of this program was hardly to obtain, as most archives
149included at best a precompiled version. Because I liked the features, I
150offered a webspace where this program now is living at,
151
152	http://userpage.fu-berlin.de/~mbayer/tools/html2text.html
153
154I'm afraid in this way I've become the maintainer of this package, even if I
155actually don't have time free to spend on working on the program by myself.
156Please keep this in mind if you are going to write me. :-)
157
158The source code can also be obtained from the Ibiblio network at
159
160	[ftp|http]://ftp.ibiblio.org/pub/linux/apps/www/converters/
161
162If you are going to retrieve the source code from within automated scripts,
163e.g. by a software packaging manager, please prefer downloading it from the
164Ibiblio server or one of its mirrors.
165
166
167# ----------------------------------------------------------------------------
168# �We accept patches.�
169
170Please include in all your messages information on
171	� the version of html2text you are referring to (`html2text -version`),
172          if you obtained the program in binary form, the version number as
173	  supplied by your package manager (e.g. `rpm -q html2text`);
174	� name and version of your operating system (`uname -a`);
175	� name and version of your compiler (`cc -v`).
176
177If you think you found a possible security impact, please let _me_ know
178_first_.
179
180If you think you found a bug, please try first to find out its possible
181reason by yourself, using the "-unparse", "-check", "-debug-scanner", and
182"-debug-parser" command line options, in order to save other people's time. I
183will not consider any "bug report" that just claims "your program is
184buggy!!!!!1", nor will I answer to any mail asking me O/S-specific questions.
185
186I will include into the TODO list any sensible feature request.
187
188And, last but not least, patches are always very welcome. :-)
189
190
191Martin Bayer <mbayer@zedat.fu-berlin.de>
192
193	For all e-mails, use of PGP (GPG) is encouraged. You will find my
194	public key (ID: 0xCB537B60) on my homepage and on keyservers. The key's
195	fingerprint is: "46A1 B556 41CD C77A 0261  D22F 41A6 EB90 CB53 7B60".
196