• Home
  • History
  • Annotate
Name Date Size #Lines LOC

..03-May-2022-

INTERNALSH A D07-Oct-19987.1 KiB172121

Makefile.inH A D03-May-20223 KiB11887

Makefile.linuxH A D07-Oct-19981.4 KiB6942

READMEH A D06-Aug-19997.3 KiB190138

alloca.cH A D07-Oct-199813 KiB467272

col-align.ccH A D03-May-20222.1 KiB11093

compat.cH A D07-Oct-19981.1 KiB8068

config.guessH A D06-Aug-199926.7 KiB974860

config.h.botH A D07-Oct-1998122 85

config.h.inH A D07-Oct-19982.2 KiB8157

config.h.topH A D07-Oct-199844 32

config.subH A D06-Aug-199919.4 KiB957848

configureH A D06-Aug-199966.1 KiB2,2151,814

configure.inH A D06-Aug-19992.1 KiB7869

deHTMLdate.ccH A D30-Nov-1998915 4128

deL1date.ccH A D30-Nov-1998793 3928

dedate.ccH A D30-Nov-1998854 4028

fake_linkH A D07-Oct-199895 74

fifo.hH A D03-May-20224.2 KiB267231

fmt-html.hH A D07-Oct-1998995 4930

fmt-latex.hH A D07-Oct-1998920 4628

getopt.cH A D07-Oct-199821.4 KiB753396

getopt.hH A D07-Oct-19984.4 KiB13348

getopt1.cH A D07-Oct-19984.3 KiB185118

html-embed.ccH A D07-Oct-19987.2 KiB382314

html-fmt.ccH A D07-Oct-199810.6 KiB488396

html-table.ccH A D03-May-20225.9 KiB300228

html-table.hH A D07-Oct-1998520 2823

install-shH A D07-Oct-19984.7 KiB239152

interface.hH A D07-Oct-19981.4 KiB6145

latex-embed.ccH A D07-Oct-199810.4 KiB556458

latex-fmt.ccH A D07-Oct-199811.7 KiB538448

latex-table.ccH A D03-May-20226 KiB304250

latex-table.hH A D07-Oct-1998527 2823

lib.hH A D30-Nov-19982.5 KiB11786

map_chars.ccH A D03-May-2022889 5846

nullproc.ccH A D07-Oct-1998320 138

num_unit_probe.cH A D07-Oct-19981.3 KiB6861

part_num_probe.cH A D07-Oct-1998782 4637

reader.ccH A D03-May-202224.4 KiB1,2481,012

reader.hH A D03-May-20225.5 KiB280219

rtest2.ccH A D03-May-20221.8 KiB11590

scan_num.ccH A D07-Oct-19981,022 7766

shrink_width.ccH A D07-Oct-1998748 3626

strip.ccH A D03-May-202212.6 KiB528430

strip.hH A D03-May-20222.9 KiB137117

tblock.ccH A D03-May-20224.3 KiB244210

tblock.hH A D03-May-2022888 5442

text-fmt.ccH A D07-Oct-19986.3 KiB288230

text-table.ccH A D03-May-20226.7 KiB339274

text-table.hH A D07-Oct-1998453 2621

transcriptH A D06-Aug-19993.9 KiB8483

tune.hH A D07-Oct-1998808 3115

ukdate.ccH A D07-Oct-19981.1 KiB6248

usdate.ccH A D07-Oct-19981.1 KiB6648

use_getopt.hH A D07-Oct-1998210 95

word2x.1H A D07-Oct-19983 KiB10483

word2x.ccH A D06-Aug-19997.1 KiB396328

word6.hH A D07-Oct-1998796 3812

wordwrap.ccH A D07-Oct-19981.3 KiB7564

README

1$Id: README,v 1.7 1997/04/13 02:59:59 dps Exp $
2
3What is new in version 0.005 of word2x
4
5Update version to 0.005
6Fix version number bug
7Update config.guess an config.sub
8Re-generate configure.in with newer autoconf
9Fix various ANSI violations that g++ 2.95 disliked
10
11
12What is new in version 0.004 of word2x
13
14Stupid bug in word2x_junk_filter::filter_junk bug which ignored the last
15character read squashed.
16
17Added german support from word2x port EX2.
18
19
20What it was new in version 0.003 of word2x
21
22word2x-0.003 is version word2x-0.002 with a major bug in strip.cc
23eliminated. word2x-0.002 was 0.001 retro-fitted with some quite new
24junk filtering code with lots of tunable parameters (i.e. all of
25tune.h).  This code is extracted from the envolving, and currently
26incomplete, source tree of the next major release. (When this happens I
27will stop supporting or maintaing any 0.00x versions).
28
29The major change is much better junk filtering, losing less text and
30throwing out more junk; unicode documents should now
31work. Increasing numbers of problem document which have OLE junk in
32places that break the code are appearing. Splitting the document with
33lls (from the LAOLA package) and attacking the WordDocument stream
34works---sometime I will have a useable library that can do this
35automagically. (In word2x-0.002 you have a very good chance of
36tickling the strip.cc bug and its (buggy) bug trap).
37
38Documents that do cause problems after the suggested work-around to
39word2x@duncan.telstar.net please. The immediate fix is to try one of
40the other two programs. (Free software people are prepared to
41co-operate with the "competition"). There are links to all the
42"competition" I know of on the word2x home page at
43http://word2x.alcom.co.uk (hosted by the alcom.co.uk free of charge,
44despite the fact charges normally apply).
45
46
47Installing word2x
48
49You need a C++ compiler and a version of make that does understands
50how to make .o files from .cc files, for example GNU make. Ideally you
51have getopt_long already in your C library but you might not. If this
52applies set GETOPT to gopt.o in the Makefile. getopt_long is the
53version supplied by the free software foundation in glibc-1.09
54
55If your make does not know then add a rule. for GNU make the rule is
56
57%.o: %.cc
58	$(CPP) $(CPPFLAGS) -c -o $@ $<
59
60Please note that a warning about a contravariance violation is normal.
61
62As this is program only recently escaped, YMMV. The main reason for
63its escape was incessant irration that comp.os.linux.misc posters
64manage about word .doc files (IMHO this is justified). I had wrote
65this program for myself and my word problem; I let it run wild in the
66hope that is it useful for others. [I now know is it is helping some
67people]
68
69Further information on other converters is avialable in the list of
70converters avialble via <http://www.kfa-juelich.de/isr/1/texconv.html>
71(word2x seems to have a monopoly on converters from word to latex not
72requiring word and avialable on non-MS platforms).
73
74The program has been compiled on (the first two by me personally):
75
76Linux 2.1.30 (Unix)
77SunOS (Unix)
78DEC Alpha AXP under OSF/1 (Unix)
79IBM SP/2 (RS6000) under AIX (Unix) [SP/2s are heavy computing power...]
80
81It is known not compile with
82
83Borland C++ 3.1 (PC version).
84
85If any manages to compile on a PC version, please tell
86Duncan Simpson <dps@duncan.telstar.net> and
87W.Hennings <W.Hennings@kfa-juelich.de>
88
89Limited flat (linear) memory might be lethal, esp. if your system
90lacks alloca. If have not learned to steal what is free then you can
91send money (prefably UK funds), postcards, etc to the author at
92
93Frax House, Kingston Bagpuize, OXON OX13 5AW
94
95or for the next couple of years
96
97Flat 6, 93 Westridge Road, Southampton
98
99I neither suggest that you do donate nor that you do not donate.
100
101
102SunOS
103
104Can be problematic. Setting LD to ./sunos_link and defining add
105produced a binary that worked for me with one warning about
106strncasecmp. I guess SUN's ld is incompatible with g++ or something;
107using ar and ranlib, aka the sunos_link shell script, works. The
108configuration script hopefuk does this stuff for you.
109
110
111Reported bugs
112
113On some platforms it misses the first 3/4 of a page. If you are
114afflicted get out your copy of hexdump and adjust the start offset in
115word2x to the correct value. This should be fixed now.
116
117Copyright
118
119This program is(c) D.P.Simson 1997. The program is licenced under the
120GPL version 2, or any later version (at your option). This means DOS
121people must distribute source as per the GPL.
122
123The stuff I did not write is:
124
125config.guess and config.sub come from GNU autoconf and are thus
126(c) The Free Software Foundation.
127
128getopt.c, getopt1.c and getopt.h are (c) The Free Software Foundation.
129I am fairly sure the LGPL requires these files to be distributed as
130well.
131
132alloca.c is almost certaintly also (c) The Free Sofwtware Foundation.
133
134install-sh is probably (c) The X consortium
135
136
137Introductory proganda
138
139Despite the fact that open formats like rtf are good and widely
140avialable far too many idiots seem to insist on using word .doc
141format. This program is an attempt to limit the damage this causes
142users of non-microsoft systems and text processing systems, for
143example LaTeX.
144
145It is designed to be retargetable and avoid some of the travesties of
146proper typsetting comitted by word, which is hobbled by the lack of
147litagures in TrueType fonts (and the lack of different design sizes to
148some extent). There is quite a large amount of guesswork from context
149to reduce the impact of my lack of understanding a document the way
150word does. One even sees interesting things like
151<Paragraph mode> 550* <eqn> \F(foo, bar) <end eqn> * 42 * (pixels per em)
152<End paragraph>
153which is not too good! There may be multiple bits of alternating roman
154and equation, multiple items of text in brackets, etc.
155etc. Fortunately the reader converts these, in two stages, to a single
156maths insert. Maths inserts with embedded newlines get rendered as
157eqnarray* in LaTeX mode. All maths is just deleted in text mode (would
158someone like to add this support?).
159
160LaTeX mode sees the equation example above as <eqn insert> 550
161* \F(foo, bar * baz) * 42 * (pixels per em) <end eqn insert> and
162renders it as
163% Some comments omited for brevity
164$$550 \times {\text{foo} \over \text{bar} \times \text{baz} } \times 42
165\times \text{(pixels per em)}$$
166which looks a lot better than word's own version, which uses awful
167stars instead of proper times signs.
168
169Text mode implements tables with real columns, unlike catdoc. Long
170entries are folded automatically and there is some semi-intelligent
171width reduction.  Hypenation is not supported so if someone instists
172on using supercalifragilistic... then an overlong line might result
173(anyone care to fix this? I thought it was just overkill to implement
174the hypenation algorithm along with all the rest).
175
176Apart from the pictures and a little trailing junk the code does a
177good job on the TrueType documents. The readme generates some error
178messages about extra ^Us amoung other things due to a lack of
179understanding of some of the inserts used in some documents. Anyone
180who can decode more types of insert, please tell me about it and
181preferably send a patch so I can avoid extra programming (got too much
182real work to be doing).
183
184If someone wishes to contribute *roff output I would include it. Extra
185understanding of equations also gratefully recieved as the examples in
186the TrueType docs are rather limited. Bibliography and any other you
187can tell me about also grateful listened to.
188
189Duncan (-:
190