xref: /openbsd/gnu/usr.bin/gcc/gcc/doc/trouble.texi (revision 3d8817e4)
1@c Copyright (C) 1988, 1989, 1992, 1993, 1994, 1995, 1996, 1997, 1998,
2@c 1999, 2000, 2001 Free Software Foundation, Inc.
3@c This is part of the GCC manual.
4@c For copying conditions, see the file gcc.texi.
5
6@node Trouble
7@chapter Known Causes of Trouble with GCC
8@cindex bugs, known
9@cindex installation trouble
10@cindex known causes of trouble
11
12This section describes known problems that affect users of GCC@.  Most
13of these are not GCC bugs per se---if they were, we would fix them.
14But the result for a user may be like the result of a bug.
15
16Some of these problems are due to bugs in other software, some are
17missing features that are too much work to add, and some are places
18where people's opinions differ as to what is best.
19
20@menu
21* Actual Bugs::		      Bugs we will fix later.
22* Cross-Compiler Problems::   Common problems of cross compiling with GCC.
23* Interoperation::      Problems using GCC with other compilers,
24			   and with certain linkers, assemblers and debuggers.
25* External Bugs::	Problems compiling certain programs.
26* Incompatibilities::   GCC is incompatible with traditional C.
27* Fixed Headers::       GCC uses corrected versions of system header files.
28                           This is necessary, but doesn't always work smoothly.
29* Standard Libraries::  GCC uses the system C library, which might not be
30                           compliant with the ISO C standard.
31* Disappointments::     Regrettable things we can't change, but not quite bugs.
32* C++ Misunderstandings::     Common misunderstandings with GNU C++.
33* Protoize Caveats::    Things to watch out for when using @code{protoize}.
34* Non-bugs::		Things we think are right, but some others disagree.
35* Warnings and Errors:: Which problems in your code get warnings,
36                         and which get errors.
37@end menu
38
39@node Actual Bugs
40@section Actual Bugs We Haven't Fixed Yet
41
42@itemize @bullet
43@item
44The @code{fixincludes} script interacts badly with automounters; if the
45directory of system header files is automounted, it tends to be
46unmounted while @code{fixincludes} is running.  This would seem to be a
47bug in the automounter.  We don't know any good way to work around it.
48
49@item
50The @code{fixproto} script will sometimes add prototypes for the
51@code{sigsetjmp} and @code{siglongjmp} functions that reference the
52@code{jmp_buf} type before that type is defined.  To work around this,
53edit the offending file and place the typedef in front of the
54prototypes.
55
56@item
57@opindex pedantic-errors
58When @option{-pedantic-errors} is specified, GCC will incorrectly give
59an error message when a function name is specified in an expression
60involving the comma operator.
61@end itemize
62
63@node Cross-Compiler Problems
64@section Cross-Compiler Problems
65
66You may run into problems with cross compilation on certain machines,
67for several reasons.
68
69@itemize @bullet
70@item
71Cross compilation can run into trouble for certain machines because
72some target machines' assemblers require floating point numbers to be
73written as @emph{integer} constants in certain contexts.
74
75The compiler writes these integer constants by examining the floating
76point value as an integer and printing that integer, because this is
77simple to write and independent of the details of the floating point
78representation.  But this does not work if the compiler is running on
79a different machine with an incompatible floating point format, or
80even a different byte-ordering.
81
82In addition, correct constant folding of floating point values
83requires representing them in the target machine's format.
84(The C standard does not quite require this, but in practice
85it is the only way to win.)
86
87It is now possible to overcome these problems by defining macros such
88as @code{REAL_VALUE_TYPE}.  But doing so is a substantial amount of
89work for each target machine.
90@xref{Cross-compilation,,Cross Compilation and Floating Point,
91gccint, GNU Compiler Collection (GCC) Internals}.
92
93@item
94At present, the program @file{mips-tfile} which adds debug
95support to object files on MIPS systems does not work in a cross
96compile environment.
97@end itemize
98
99@node Interoperation
100@section Interoperation
101
102This section lists various difficulties encountered in using GCC
103together with other compilers or with the assemblers, linkers,
104libraries and debuggers on certain systems.
105
106@itemize @bullet
107@item
108On many platforms, GCC supports a different ABI for C++ than do other
109compilers, so the object files compiled by GCC cannot be used with object
110files generated by another C++ compiler.
111
112An area where the difference is most apparent is name mangling.  The use
113of different name mangling is intentional, to protect you from more subtle
114problems.
115Compilers differ as to many internal details of C++ implementation,
116including: how class instances are laid out, how multiple inheritance is
117implemented, and how virtual function calls are handled.  If the name
118encoding were made the same, your programs would link against libraries
119provided from other compilers---but the programs would then crash when
120run.  Incompatible libraries are then detected at link time, rather than
121at run time.
122
123@item
124Older GDB versions sometimes fail to read the output of GCC version
1252.  If you have trouble, get GDB version 4.4 or later.
126
127@item
128@cindex DBX
129DBX rejects some files produced by GCC, though it accepts similar
130constructs in output from PCC@.  Until someone can supply a coherent
131description of what is valid DBX input and what is not, there is
132nothing I can do about these problems.  You are on your own.
133
134@item
135The GNU assembler (GAS) does not support PIC@.  To generate PIC code, you
136must use some other assembler, such as @file{/bin/as}.
137
138@item
139On some BSD systems, including some versions of Ultrix, use of profiling
140causes static variable destructors (currently used only in C++) not to
141be run.
142
143@ignore
144@cindex @code{vfork}, for the Sun-4
145@item
146There is a bug in @code{vfork} on the Sun-4 which causes the registers
147of the child process to clobber those of the parent.  Because of this,
148programs that call @code{vfork} are likely to lose when compiled
149optimized with GCC when the child code alters registers which contain
150C variables in the parent.  This affects variables which are live in the
151parent across the call to @code{vfork}.
152
153If you encounter this, you can work around the problem by declaring
154variables @code{volatile} in the function that calls @code{vfork}, until
155the problem goes away, or by not declaring them @code{register} and not
156using @option{-O} for those source files.
157@end ignore
158
159@item
160On some SGI systems, when you use @option{-lgl_s} as an option,
161it gets translated magically to @samp{-lgl_s -lX11_s -lc_s}.
162Naturally, this does not happen when you use GCC@.
163You must specify all three options explicitly.
164
165@item
166On a SPARC, GCC aligns all values of type @code{double} on an 8-byte
167boundary, and it expects every @code{double} to be so aligned.  The Sun
168compiler usually gives @code{double} values 8-byte alignment, with one
169exception: function arguments of type @code{double} may not be aligned.
170
171As a result, if a function compiled with Sun CC takes the address of an
172argument of type @code{double} and passes this pointer of type
173@code{double *} to a function compiled with GCC, dereferencing the
174pointer may cause a fatal signal.
175
176One way to solve this problem is to compile your entire program with GCC@.
177Another solution is to modify the function that is compiled with
178Sun CC to copy the argument into a local variable; local variables
179are always properly aligned.  A third solution is to modify the function
180that uses the pointer to dereference it via the following function
181@code{access_double} instead of directly with @samp{*}:
182
183@smallexample
184inline double
185access_double (double *unaligned_ptr)
186@{
187  union d2i @{ double d; int i[2]; @};
188
189  union d2i *p = (union d2i *) unaligned_ptr;
190  union d2i u;
191
192  u.i[0] = p->i[0];
193  u.i[1] = p->i[1];
194
195  return u.d;
196@}
197@end smallexample
198
199@noindent
200Storing into the pointer can be done likewise with the same union.
201
202@item
203On Solaris, the @code{malloc} function in the @file{libmalloc.a} library
204may allocate memory that is only 4 byte aligned.  Since GCC on the
205SPARC assumes that doubles are 8 byte aligned, this may result in a
206fatal signal if doubles are stored in memory allocated by the
207@file{libmalloc.a} library.
208
209The solution is to not use the @file{libmalloc.a} library.  Use instead
210@code{malloc} and related functions from @file{libc.a}; they do not have
211this problem.
212
213@item
214Sun forgot to include a static version of @file{libdl.a} with some
215versions of SunOS (mainly 4.1).  This results in undefined symbols when
216linking static binaries (that is, if you use @option{-static}).  If you
217see undefined symbols @code{_dlclose}, @code{_dlsym} or @code{_dlopen}
218when linking, compile and link against the file
219@file{mit/util/misc/dlsym.c} from the MIT version of X windows.
220
221@item
222The 128-bit long double format that the SPARC port supports currently
223works by using the architecturally defined quad-word floating point
224instructions.  Since there is no hardware that supports these
225instructions they must be emulated by the operating system.  Long
226doubles do not work in Sun OS versions 4.0.3 and earlier, because the
227kernel emulator uses an obsolete and incompatible format.  Long doubles
228do not work in Sun OS version 4.1.1 due to a problem in a Sun library.
229Long doubles do work on Sun OS versions 4.1.2 and higher, but GCC
230does not enable them by default.  Long doubles appear to work in Sun OS
2315.x (Solaris 2.x).
232
233@item
234On HP-UX version 9.01 on the HP PA, the HP compiler @code{cc} does not
235compile GCC correctly.  We do not yet know why.  However, GCC
236compiled on earlier HP-UX versions works properly on HP-UX 9.01 and can
237compile itself properly on 9.01.
238
239@item
240On the HP PA machine, ADB sometimes fails to work on functions compiled
241with GCC@.  Specifically, it fails to work on functions that use
242@code{alloca} or variable-size arrays.  This is because GCC doesn't
243generate HP-UX unwind descriptors for such functions.  It may even be
244impossible to generate them.
245
246@item
247Debugging (@option{-g}) is not supported on the HP PA machine, unless you use
248the preliminary GNU tools.
249
250@item
251Taking the address of a label may generate errors from the HP-UX
252PA assembler.  GAS for the PA does not have this problem.
253
254@item
255Using floating point parameters for indirect calls to static functions
256will not work when using the HP assembler.  There simply is no way for GCC
257to specify what registers hold arguments for static functions when using
258the HP assembler.  GAS for the PA does not have this problem.
259
260@item
261In extremely rare cases involving some very large functions you may
262receive errors from the HP linker complaining about an out of bounds
263unconditional branch offset.  This used to occur more often in previous
264versions of GCC, but is now exceptionally rare.  If you should run
265into it, you can work around by making your function smaller.
266
267@item
268GCC compiled code sometimes emits warnings from the HP-UX assembler of
269the form:
270
271@smallexample
272(warning) Use of GR3 when
273  frame >= 8192 may cause conflict.
274@end smallexample
275
276These warnings are harmless and can be safely ignored.
277
278@item
279On the IBM RS/6000, compiling code of the form
280
281@smallexample
282extern int foo;
283
284@dots{} foo @dots{}
285
286static int foo;
287@end smallexample
288
289@noindent
290will cause the linker to report an undefined symbol @code{foo}.
291Although this behavior differs from most other systems, it is not a
292bug because redefining an @code{extern} variable as @code{static}
293is undefined in ISO C@.
294
295@item
296In extremely rare cases involving some very large functions you may
297receive errors from the AIX Assembler complaining about a displacement
298that is too large.  If you should run into it, you can work around by
299making your function smaller.
300
301@item
302The @file{libstdc++.a} library in GCC relies on the SVR4 dynamic
303linker semantics which merges global symbols between libraries and
304applications, especially necessary for C++ streams functionality.
305This is not the default behavior of AIX shared libraries and dynamic
306linking.  @file{libstdc++.a} is built on AIX with ``runtime-linking''
307enabled so that symbol merging can occur.  To utilize this feature,
308the application linked with @file{libstdc++.a} must include the
309@option{-Wl,-brtl} flag on the link line.  G++ cannot impose this
310because this option may interfere with the semantics of the user
311program and users may not always use @samp{g++} to link his or her
312application. Applications are not required to use the
313@option{-Wl,-brtl} flag on the link line---the rest of the
314@file{libstdc++.a} library which is not dependent on the symbol
315merging semantics will continue to function correctly.
316
317@item
318An application can interpose its own definition of functions for
319functions invoked by @file{libstdc++.a} with ``runtime-linking''
320enabled on AIX.  To accomplish this the application must be linked
321with ``runtime-linking'' option and the functions explicitly must be
322exported by the application (@option{-Wl,-brtl,-bE:exportfile}).
323
324@item
325AIX on the RS/6000 provides support (NLS) for environments outside of
326the United States.  Compilers and assemblers use NLS to support
327locale-specific representations of various objects including
328floating-point numbers (@samp{.} vs @samp{,} for separating decimal
329fractions). There have been problems reported where the library linked
330with GCC does not produce the same floating-point formats that the
331assembler accepts. If you have this problem, set the @env{LANG}
332environment variable to @samp{C} or @samp{En_US}.
333
334@item
335@opindex fdollars-in-identifiers
336Even if you specify @option{-fdollars-in-identifiers},
337you cannot successfully use @samp{$} in identifiers on the RS/6000 due
338to a restriction in the IBM assembler.  GAS supports these
339identifiers.
340
341@cindex VAX calling convention
342@cindex Ultrix calling convention
343@item
344@opindex fcall-saved
345On Ultrix, the Fortran compiler expects registers 2 through 5 to be saved
346by function calls.  However, the C compiler uses conventions compatible
347with BSD Unix: registers 2 through 5 may be clobbered by function calls.
348
349GCC uses the same convention as the Ultrix C compiler.  You can use
350these options to produce code compatible with the Fortran compiler:
351
352@smallexample
353-fcall-saved-r2 -fcall-saved-r3 -fcall-saved-r4 -fcall-saved-r5
354@end smallexample
355
356@item
357On the Alpha, you may get assembler errors about invalid syntax as a
358result of floating point constants.  This is due to a bug in the C
359library functions @code{ecvt}, @code{fcvt} and @code{gcvt}.  Given valid
360floating point numbers, they sometimes print @samp{NaN}.
361@end itemize
362
363@node External Bugs
364@section Problems Compiling Certain Programs
365
366@c prevent bad page break with this line
367Certain programs have problems compiling.
368
369@itemize @bullet
370@item
371Parse errors may occur compiling X11 on a Decstation running Ultrix 4.2
372because of problems in DEC's versions of the X11 header files
373@file{X11/Xlib.h} and @file{X11/Xutil.h}.  People recommend adding
374@option{-I/usr/include/mit} to use the MIT versions of the header files,
375or fixing the header files by adding this:
376
377@example
378#ifdef __STDC__
379#define NeedFunctionPrototypes 0
380#endif
381@end example
382
383@item
384On various 386 Unix systems derived from System V, including SCO, ISC,
385and ESIX, you may get error messages about running out of virtual memory
386while compiling certain programs.
387
388You can prevent this problem by linking GCC with the GNU malloc
389(which thus replaces the malloc that comes with the system).  GNU malloc
390is available as a separate package, and also in the file
391@file{src/gmalloc.c} in the GNU Emacs 19 distribution.
392
393If you have installed GNU malloc as a separate library package, use this
394option when you relink GCC:
395
396@example
397MALLOC=/usr/local/lib/libgmalloc.a
398@end example
399
400Alternatively, if you have compiled @file{gmalloc.c} from Emacs 19, copy
401the object file to @file{gmalloc.o} and use this option when you relink
402GCC:
403
404@example
405MALLOC=gmalloc.o
406@end example
407@end itemize
408
409@node Incompatibilities
410@section Incompatibilities of GCC
411@cindex incompatibilities of GCC
412@opindex traditional
413
414There are several noteworthy incompatibilities between GNU C and K&R
415(non-ISO) versions of C@.
416
417@itemize @bullet
418@cindex string constants
419@cindex read-only strings
420@cindex shared strings
421@item
422GCC normally makes string constants read-only.  If several
423identical-looking string constants are used, GCC stores only one
424copy of the string.
425
426@cindex @code{mktemp}, and constant strings
427One consequence is that you cannot call @code{mktemp} with a string
428constant argument.  The function @code{mktemp} always alters the
429string its argument points to.
430
431@cindex @code{sscanf}, and constant strings
432@cindex @code{fscanf}, and constant strings
433@cindex @code{scanf}, and constant strings
434Another consequence is that @code{sscanf} does not work on some systems
435when passed a string constant as its format control string or input.
436This is because @code{sscanf} incorrectly tries to write into the string
437constant.  Likewise @code{fscanf} and @code{scanf}.
438
439@opindex fwritable-strings
440The best solution to these problems is to change the program to use
441@code{char}-array variables with initialization strings for these
442purposes instead of string constants.  But if this is not possible,
443you can use the @option{-fwritable-strings} flag, which directs GCC
444to handle string constants the same way most C compilers do.
445
446@item
447@code{-2147483648} is positive.
448
449This is because 2147483648 cannot fit in the type @code{int}, so
450(following the ISO C rules) its data type is @code{unsigned long int}.
451Negating this value yields 2147483648 again.
452
453@item
454GCC does not substitute macro arguments when they appear inside of
455string constants.  For example, the following macro in GCC
456
457@example
458#define foo(a) "a"
459@end example
460
461@noindent
462will produce output @code{"a"} regardless of what the argument @var{a} is.
463
464@cindex @code{setjmp} incompatibilities
465@cindex @code{longjmp} incompatibilities
466@item
467When you use @code{setjmp} and @code{longjmp}, the only automatic
468variables guaranteed to remain valid are those declared
469@code{volatile}.  This is a consequence of automatic register
470allocation.  Consider this function:
471
472@example
473jmp_buf j;
474
475foo ()
476@{
477  int a, b;
478
479  a = fun1 ();
480  if (setjmp (j))
481    return a;
482
483  a = fun2 ();
484  /* @r{@code{longjmp (j)} may occur in @code{fun3}.} */
485  return a + fun3 ();
486@}
487@end example
488
489Here @code{a} may or may not be restored to its first value when the
490@code{longjmp} occurs.  If @code{a} is allocated in a register, then
491its first value is restored; otherwise, it keeps the last value stored
492in it.
493
494@opindex W
495If you use the @option{-W} option with the @option{-O} option, you will
496get a warning when GCC thinks such a problem might be possible.
497
498@item
499Programs that use preprocessing directives in the middle of macro
500arguments do not work with GCC@.  For example, a program like this
501will not work:
502
503@example
504@group
505foobar (
506#define luser
507        hack)
508@end group
509@end example
510
511ISO C does not permit such a construct.
512
513@item
514K&R compilers allow comments to cross over an inclusion boundary
515(i.e.@: started in an include file and ended in the including file).  I think
516this would be quite ugly and can't imagine it could be needed.
517
518@cindex external declaration scope
519@cindex scope of external declarations
520@cindex declaration scope
521@item
522Declarations of external variables and functions within a block apply
523only to the block containing the declaration.  In other words, they
524have the same scope as any other declaration in the same place.
525
526In some other C compilers, a @code{extern} declaration affects all the
527rest of the file even if it happens within a block.
528
529@item
530In traditional C, you can combine @code{long}, etc., with a typedef name,
531as shown here:
532
533@example
534typedef int foo;
535typedef long foo bar;
536@end example
537
538In ISO C, this is not allowed: @code{long} and other type modifiers
539require an explicit @code{int}.
540
541@cindex typedef names as function parameters
542@item
543PCC allows typedef names to be used as function parameters.
544
545@item
546Traditional C allows the following erroneous pair of declarations to
547appear together in a given scope:
548
549@example
550typedef int foo;
551typedef foo foo;
552@end example
553
554@item
555GCC treats all characters of identifiers as significant.  According to
556K&R-1 (2.2), ``No more than the first eight characters are significant,
557although more may be used.''.  Also according to K&R-1 (2.2), ``An
558identifier is a sequence of letters and digits; the first character must
559be a letter.  The underscore _ counts as a letter.'', but GCC also
560allows dollar signs in identifiers.
561
562@cindex whitespace
563@item
564PCC allows whitespace in the middle of compound assignment operators
565such as @samp{+=}.  GCC, following the ISO standard, does not
566allow this.
567
568@cindex apostrophes
569@cindex '
570@item
571GCC complains about unterminated character constants inside of
572preprocessing conditionals that fail.  Some programs have English
573comments enclosed in conditionals that are guaranteed to fail; if these
574comments contain apostrophes, GCC will probably report an error.  For
575example, this code would produce an error:
576
577@example
578#if 0
579You can't expect this to work.
580#endif
581@end example
582
583The best solution to such a problem is to put the text into an actual
584C comment delimited by @samp{/*@dots{}*/}.
585
586@item
587Many user programs contain the declaration @samp{long time ();}.  In the
588past, the system header files on many systems did not actually declare
589@code{time}, so it did not matter what type your program declared it to
590return.  But in systems with ISO C headers, @code{time} is declared to
591return @code{time_t}, and if that is not the same as @code{long}, then
592@samp{long time ();} is erroneous.
593
594The solution is to change your program to use appropriate system headers
595(@code{<time.h>} on systems with ISO C headers) and not to declare
596@code{time} if the system header files declare it, or failing that to
597use @code{time_t} as the return type of @code{time}.
598
599@cindex @code{float} as function value type
600@item
601When compiling functions that return @code{float}, PCC converts it to
602a double.  GCC actually returns a @code{float}.  If you are concerned
603with PCC compatibility, you should declare your functions to return
604@code{double}; you might as well say what you mean.
605
606@cindex structures
607@cindex unions
608@item
609When compiling functions that return structures or unions, GCC
610output code normally uses a method different from that used on most
611versions of Unix.  As a result, code compiled with GCC cannot call
612a structure-returning function compiled with PCC, and vice versa.
613
614The method used by GCC is as follows: a structure or union which is
6151, 2, 4 or 8 bytes long is returned like a scalar.  A structure or union
616with any other size is stored into an address supplied by the caller
617(usually in a special, fixed register, but on some machines it is passed
618on the stack).  The machine-description macros @code{STRUCT_VALUE} and
619@code{STRUCT_INCOMING_VALUE} tell GCC where to pass this address.
620
621By contrast, PCC on most target machines returns structures and unions
622of any size by copying the data into an area of static storage, and then
623returning the address of that storage as if it were a pointer value.
624The caller must copy the data from that memory area to the place where
625the value is wanted.  GCC does not use this method because it is
626slower and nonreentrant.
627
628On some newer machines, PCC uses a reentrant convention for all
629structure and union returning.  GCC on most of these machines uses a
630compatible convention when returning structures and unions in memory,
631but still returns small structures and unions in registers.
632
633@opindex fpcc-struct-return
634You can tell GCC to use a compatible convention for all structure and
635union returning with the option @option{-fpcc-struct-return}.
636
637@cindex preprocessing tokens
638@cindex preprocessing numbers
639@item
640GCC complains about program fragments such as @samp{0x74ae-0x4000}
641which appear to be two hexadecimal constants separated by the minus
642operator.  Actually, this string is a single @dfn{preprocessing token}.
643Each such token must correspond to one token in C@.  Since this does not,
644GCC prints an error message.  Although it may appear obvious that what
645is meant is an operator and two values, the ISO C standard specifically
646requires that this be treated as erroneous.
647
648A @dfn{preprocessing token} is a @dfn{preprocessing number} if it
649begins with a digit and is followed by letters, underscores, digits,
650periods and @samp{e+}, @samp{e-}, @samp{E+}, @samp{E-}, @samp{p+},
651@samp{p-}, @samp{P+}, or @samp{P-} character sequences.  (In strict C89
652mode, the sequences @samp{p+}, @samp{p-}, @samp{P+} and @samp{P-} cannot
653appear in preprocessing numbers.)
654
655To make the above program fragment valid, place whitespace in front of
656the minus sign.  This whitespace will end the preprocessing number.
657@end itemize
658
659@node Fixed Headers
660@section Fixed Header Files
661
662GCC needs to install corrected versions of some system header files.
663This is because most target systems have some header files that won't
664work with GCC unless they are changed.  Some have bugs, some are
665incompatible with ISO C, and some depend on special features of other
666compilers.
667
668Installing GCC automatically creates and installs the fixed header
669files, by running a program called @code{fixincludes} (or for certain
670targets an alternative such as @code{fixinc.svr4}).  Normally, you
671don't need to pay attention to this.  But there are cases where it
672doesn't do the right thing automatically.
673
674@itemize @bullet
675@item
676If you update the system's header files, such as by installing a new
677system version, the fixed header files of GCC are not automatically
678updated.  The easiest way to update them is to reinstall GCC@.  (If
679you want to be clever, look in the makefile and you can find a
680shortcut.)
681
682@item
683On some systems, in particular SunOS 4, header file directories contain
684machine-specific symbolic links in certain places.  This makes it
685possible to share most of the header files among hosts running the
686same version of SunOS 4 on different machine models.
687
688The programs that fix the header files do not understand this special
689way of using symbolic links; therefore, the directory of fixed header
690files is good only for the machine model used to build it.
691
692In SunOS 4, only programs that look inside the kernel will notice the
693difference between machine models.  Therefore, for most purposes, you
694need not be concerned about this.
695
696It is possible to make separate sets of fixed header files for the
697different machine models, and arrange a structure of symbolic links so
698as to use the proper set, but you'll have to do this by hand.
699
700@item
701On Lynxos, GCC by default does not fix the header files.  This is
702because bugs in the shell cause the @code{fixincludes} script to fail.
703
704This means you will encounter problems due to bugs in the system header
705files.  It may be no comfort that they aren't GCC's fault, but it
706does mean that there's nothing for us to do about them.
707@end itemize
708
709@node Standard Libraries
710@section Standard Libraries
711
712@opindex Wall
713GCC by itself attempts to be a conforming freestanding implementation.
714@xref{Standards,,Language Standards Supported by GCC}, for details of
715what this means.  Beyond the library facilities required of such an
716implementation, the rest of the C library is supplied by the vendor of
717the operating system.  If that C library doesn't conform to the C
718standards, then your programs might get warnings (especially when using
719@option{-Wall}) that you don't expect.
720
721For example, the @code{sprintf} function on SunOS 4.1.3 returns
722@code{char *} while the C standard says that @code{sprintf} returns an
723@code{int}.  The @code{fixincludes} program could make the prototype for
724this function match the Standard, but that would be wrong, since the
725function will still return @code{char *}.
726
727If you need a Standard compliant library, then you need to find one, as
728GCC does not provide one.  The GNU C library (called @code{glibc})
729provides ISO C, POSIX, BSD, SystemV and X/Open compatibility for
730GNU/Linux and HURD-based GNU systems; no recent version of it supports
731other systems, though some very old versions did.  Version 2.2 of the
732GNU C library includes nearly complete C99 support.  You could also ask
733your operating system vendor if newer libraries are available.
734
735@node Disappointments
736@section Disappointments and Misunderstandings
737
738These problems are perhaps regrettable, but we don't know any practical
739way around them.
740
741@itemize @bullet
742@item
743Certain local variables aren't recognized by debuggers when you compile
744with optimization.
745
746This occurs because sometimes GCC optimizes the variable out of
747existence.  There is no way to tell the debugger how to compute the
748value such a variable ``would have had'', and it is not clear that would
749be desirable anyway.  So GCC simply does not mention the eliminated
750variable when it writes debugging information.
751
752You have to expect a certain amount of disagreement between the
753executable and your source code, when you use optimization.
754
755@cindex conflicting types
756@cindex scope of declaration
757@item
758Users often think it is a bug when GCC reports an error for code
759like this:
760
761@example
762int foo (struct mumble *);
763
764struct mumble @{ @dots{} @};
765
766int foo (struct mumble *x)
767@{ @dots{} @}
768@end example
769
770This code really is erroneous, because the scope of @code{struct
771mumble} in the prototype is limited to the argument list containing it.
772It does not refer to the @code{struct mumble} defined with file scope
773immediately below---they are two unrelated types with similar names in
774different scopes.
775
776But in the definition of @code{foo}, the file-scope type is used
777because that is available to be inherited.  Thus, the definition and
778the prototype do not match, and you get an error.
779
780This behavior may seem silly, but it's what the ISO standard specifies.
781It is easy enough for you to make your code work by moving the
782definition of @code{struct mumble} above the prototype.  It's not worth
783being incompatible with ISO C just to avoid an error for the example
784shown above.
785
786@item
787Accesses to bit-fields even in volatile objects works by accessing larger
788objects, such as a byte or a word.  You cannot rely on what size of
789object is accessed in order to read or write the bit-field; it may even
790vary for a given bit-field according to the precise usage.
791
792If you care about controlling the amount of memory that is accessed, use
793volatile but do not use bit-fields.
794
795@item
796GCC comes with shell scripts to fix certain known problems in system
797header files.  They install corrected copies of various header files in
798a special directory where only GCC will normally look for them.  The
799scripts adapt to various systems by searching all the system header
800files for the problem cases that we know about.
801
802If new system header files are installed, nothing automatically arranges
803to update the corrected header files.  You will have to reinstall GCC
804to fix the new header files.  More specifically, go to the build
805directory and delete the files @file{stmp-fixinc} and
806@file{stmp-headers}, and the subdirectory @code{include}; then do
807@samp{make install} again.
808
809@item
810@cindex floating point precision
811On 68000 and x86 systems, for instance, you can get paradoxical results
812if you test the precise values of floating point numbers.  For example,
813you can find that a floating point value which is not a NaN is not equal
814to itself.  This results from the fact that the floating point registers
815hold a few more bits of precision than fit in a @code{double} in memory.
816Compiled code moves values between memory and floating point registers
817at its convenience, and moving them into memory truncates them.
818
819@opindex ffloat-store
820You can partially avoid this problem by using the @option{-ffloat-store}
821option (@pxref{Optimize Options}).
822
823@item
824On AIX and other platforms without weak symbol support, templates
825need to be instantiated explicitly and symbols for static members
826of templates will not be generated.
827
828@item
829On AIX, GCC scans object files and library archives for static
830constructors and destructors when linking an application before the
831linker prunes unreferenced symbols.  This is necessary to prevent the
832AIX linker from mistakenly assuming that static constructor or
833destructor are unused and removing them before the scanning can occur.
834All static constructors and destructors found will be referenced even
835though the modules in which they occur may not be used by the program.
836This may lead to both increased executable size and unexpected symbol
837references.
838@end itemize
839
840@node C++ Misunderstandings
841@section Common Misunderstandings with GNU C++
842
843@cindex misunderstandings in C++
844@cindex surprises in C++
845@cindex C++ misunderstandings
846C++ is a complex language and an evolving one, and its standard
847definition (the ISO C++ standard) was only recently completed.  As a
848result, your C++ compiler may occasionally surprise you, even when its
849behavior is correct.  This section discusses some areas that frequently
850give rise to questions of this sort.
851
852@menu
853* Static Definitions::  Static member declarations are not definitions
854* Temporaries::         Temporaries may vanish before you expect
855* Copy Assignment::     Copy Assignment operators copy virtual bases twice
856@end menu
857
858@node Static Definitions
859@subsection Declare @emph{and} Define Static Members
860
861@cindex C++ static data, declaring and defining
862@cindex static data in C++, declaring and defining
863@cindex declaring static data in C++
864@cindex defining static data in C++
865When a class has static data members, it is not enough to @emph{declare}
866the static member; you must also @emph{define} it.  For example:
867
868@example
869class Foo
870@{
871  @dots{}
872  void method();
873  static int bar;
874@};
875@end example
876
877This declaration only establishes that the class @code{Foo} has an
878@code{int} named @code{Foo::bar}, and a member function named
879@code{Foo::method}.  But you still need to define @emph{both}
880@code{method} and @code{bar} elsewhere.  According to the ISO
881standard, you must supply an initializer in one (and only one) source
882file, such as:
883
884@example
885int Foo::bar = 0;
886@end example
887
888Other C++ compilers may not correctly implement the standard behavior.
889As a result, when you switch to @command{g++} from one of these compilers,
890you may discover that a program that appeared to work correctly in fact
891does not conform to the standard: @command{g++} reports as undefined
892symbols any static data members that lack definitions.
893
894@node Temporaries
895@subsection Temporaries May Vanish Before You Expect
896
897@cindex temporaries, lifetime of
898@cindex portions of temporary objects, pointers to
899It is dangerous to use pointers or references to @emph{portions} of a
900temporary object.  The compiler may very well delete the object before
901you expect it to, leaving a pointer to garbage.  The most common place
902where this problem crops up is in classes like string classes,
903especially ones that define a conversion function to type @code{char *}
904or @code{const char *}---which is one reason why the standard
905@code{string} class requires you to call the @code{c_str} member
906function.  However, any class that returns a pointer to some internal
907structure is potentially subject to this problem.
908
909For example, a program may use a function @code{strfunc} that returns
910@code{string} objects, and another function @code{charfunc} that
911operates on pointers to @code{char}:
912
913@example
914string strfunc ();
915void charfunc (const char *);
916
917void
918f ()
919@{
920  const char *p = strfunc().c_str();
921  @dots{}
922  charfunc (p);
923  @dots{}
924  charfunc (p);
925@}
926@end example
927
928@noindent
929In this situation, it may seem reasonable to save a pointer to the C
930string returned by the @code{c_str} member function and use that rather
931than call @code{c_str} repeatedly.  However, the temporary string
932created by the call to @code{strfunc} is destroyed after @code{p} is
933initialized, at which point @code{p} is left pointing to freed memory.
934
935Code like this may run successfully under some other compilers,
936particularly obsolete cfront-based compilers that delete temporaries
937along with normal local variables.  However, the GNU C++ behavior is
938standard-conforming, so if your program depends on late destruction of
939temporaries it is not portable.
940
941The safe way to write such code is to give the temporary a name, which
942forces it to remain until the end of the scope of the name.  For
943example:
944
945@example
946const string& tmp = strfunc ();
947charfunc (tmp.c_str ());
948@end example
949
950@node Copy Assignment
951@subsection Implicit Copy-Assignment for Virtual Bases
952
953When a base class is virtual, only one subobject of the base class
954belongs to each full object.  Also, the constructors and destructors are
955invoked only once, and called from the most-derived class.  However, such
956objects behave unspecified when being assigned.  For example:
957
958@example
959struct Base@{
960  char *name;
961  Base(char *n) : name(strdup(n))@{@}
962  Base& operator= (const Base& other)@{
963   free (name);
964   name = strdup (other.name);
965  @}
966@};
967
968struct A:virtual Base@{
969  int val;
970  A():Base("A")@{@}
971@};
972
973struct B:virtual Base@{
974  int bval;
975  B():Base("B")@{@}
976@};
977
978struct Derived:public A, public B@{
979  Derived():Base("Derived")@{@}
980@};
981
982void func(Derived &d1, Derived &d2)
983@{
984  d1 = d2;
985@}
986@end example
987
988The C++ standard specifies that @samp{Base::Base} is only called once
989when constructing or copy-constructing a Derived object.  It is
990unspecified whether @samp{Base::operator=} is called more than once when
991the implicit copy-assignment for Derived objects is invoked (as it is
992inside @samp{func} in the example).
993
994g++ implements the ``intuitive'' algorithm for copy-assignment: assign all
995direct bases, then assign all members.  In that algorithm, the virtual
996base subobject can be encountered more than once.  In the example, copying
997proceeds in the following order: @samp{val}, @samp{name} (via
998@code{strdup}), @samp{bval}, and @samp{name} again.
999
1000If application code relies on copy-assignment, a user-defined
1001copy-assignment operator removes any uncertainties.  With such an
1002operator, the application can define whether and how the virtual base
1003subobject is assigned.
1004
1005@node Protoize Caveats
1006@section Caveats of using @command{protoize}
1007
1008The conversion programs @command{protoize} and @command{unprotoize} can
1009sometimes change a source file in a way that won't work unless you
1010rearrange it.
1011
1012@itemize @bullet
1013@item
1014@command{protoize} can insert references to a type name or type tag before
1015the definition, or in a file where they are not defined.
1016
1017If this happens, compiler error messages should show you where the new
1018references are, so fixing the file by hand is straightforward.
1019
1020@item
1021There are some C constructs which @command{protoize} cannot figure out.
1022For example, it can't determine argument types for declaring a
1023pointer-to-function variable; this you must do by hand.  @command{protoize}
1024inserts a comment containing @samp{???} each time it finds such a
1025variable; so you can find all such variables by searching for this
1026string.  ISO C does not require declaring the argument types of
1027pointer-to-function types.
1028
1029@item
1030Using @command{unprotoize} can easily introduce bugs.  If the program
1031relied on prototypes to bring about conversion of arguments, these
1032conversions will not take place in the program without prototypes.
1033One case in which you can be sure @command{unprotoize} is safe is when
1034you are removing prototypes that were made with @command{protoize}; if
1035the program worked before without any prototypes, it will work again
1036without them.
1037
1038@opindex Wconversion
1039You can find all the places where this problem might occur by compiling
1040the program with the @option{-Wconversion} option.  It prints a warning
1041whenever an argument is converted.
1042
1043@item
1044Both conversion programs can be confused if there are macro calls in and
1045around the text to be converted.  In other words, the standard syntax
1046for a declaration or definition must not result from expanding a macro.
1047This problem is inherent in the design of C and cannot be fixed.  If
1048only a few functions have confusing macro calls, you can easily convert
1049them manually.
1050
1051@item
1052@command{protoize} cannot get the argument types for a function whose
1053definition was not actually compiled due to preprocessing conditionals.
1054When this happens, @command{protoize} changes nothing in regard to such
1055a function.  @command{protoize} tries to detect such instances and warn
1056about them.
1057
1058You can generally work around this problem by using @command{protoize} step
1059by step, each time specifying a different set of @option{-D} options for
1060compilation, until all of the functions have been converted.  There is
1061no automatic way to verify that you have got them all, however.
1062
1063@item
1064Confusion may result if there is an occasion to convert a function
1065declaration or definition in a region of source code where there is more
1066than one formal parameter list present.  Thus, attempts to convert code
1067containing multiple (conditionally compiled) versions of a single
1068function header (in the same vicinity) may not produce the desired (or
1069expected) results.
1070
1071If you plan on converting source files which contain such code, it is
1072recommended that you first make sure that each conditionally compiled
1073region of source code which contains an alternative function header also
1074contains at least one additional follower token (past the final right
1075parenthesis of the function header).  This should circumvent the
1076problem.
1077
1078@item
1079@command{unprotoize} can become confused when trying to convert a function
1080definition or declaration which contains a declaration for a
1081pointer-to-function formal argument which has the same name as the
1082function being defined or declared.  We recommend you avoid such choices
1083of formal parameter names.
1084
1085@item
1086You might also want to correct some of the indentation by hand and break
1087long lines.  (The conversion programs don't write lines longer than
1088eighty characters in any case.)
1089@end itemize
1090
1091@node Non-bugs
1092@section Certain Changes We Don't Want to Make
1093
1094This section lists changes that people frequently request, but which
1095we do not make because we think GCC is better without them.
1096
1097@itemize @bullet
1098@item
1099Checking the number and type of arguments to a function which has an
1100old-fashioned definition and no prototype.
1101
1102Such a feature would work only occasionally---only for calls that appear
1103in the same file as the called function, following the definition.  The
1104only way to check all calls reliably is to add a prototype for the
1105function.  But adding a prototype eliminates the motivation for this
1106feature.  So the feature is not worthwhile.
1107
1108@item
1109Warning about using an expression whose type is signed as a shift count.
1110
1111Shift count operands are probably signed more often than unsigned.
1112Warning about this would cause far more annoyance than good.
1113
1114@item
1115Warning about assigning a signed value to an unsigned variable.
1116
1117Such assignments must be very common; warning about them would cause
1118more annoyance than good.
1119
1120@item
1121Warning when a non-void function value is ignored.
1122
1123Coming as I do from a Lisp background, I balk at the idea that there is
1124something dangerous about discarding a value.  There are functions that
1125return values which some callers may find useful; it makes no sense to
1126clutter the program with a cast to @code{void} whenever the value isn't
1127useful.
1128
1129@item
1130@opindex fshort-enums
1131Making @option{-fshort-enums} the default.
1132
1133This would cause storage layout to be incompatible with most other C
1134compilers.  And it doesn't seem very important, given that you can get
1135the same result in other ways.  The case where it matters most is when
1136the enumeration-valued object is inside a structure, and in that case
1137you can specify a field width explicitly.
1138
1139@item
1140Making bit-fields unsigned by default on particular machines where ``the
1141ABI standard'' says to do so.
1142
1143The ISO C standard leaves it up to the implementation whether a bit-field
1144declared plain @code{int} is signed or not.  This in effect creates two
1145alternative dialects of C@.
1146
1147@opindex fsigned-bitfields
1148@opindex funsigned-bitfields
1149The GNU C compiler supports both dialects; you can specify the signed
1150dialect with @option{-fsigned-bitfields} and the unsigned dialect with
1151@option{-funsigned-bitfields}.  However, this leaves open the question of
1152which dialect to use by default.
1153
1154Currently, the preferred dialect makes plain bit-fields signed, because
1155this is simplest.  Since @code{int} is the same as @code{signed int} in
1156every other context, it is cleanest for them to be the same in bit-fields
1157as well.
1158
1159Some computer manufacturers have published Application Binary Interface
1160standards which specify that plain bit-fields should be unsigned.  It is
1161a mistake, however, to say anything about this issue in an ABI@.  This is
1162because the handling of plain bit-fields distinguishes two dialects of C@.
1163Both dialects are meaningful on every type of machine.  Whether a
1164particular object file was compiled using signed bit-fields or unsigned
1165is of no concern to other object files, even if they access the same
1166bit-fields in the same data structures.
1167
1168A given program is written in one or the other of these two dialects.
1169The program stands a chance to work on most any machine if it is
1170compiled with the proper dialect.  It is unlikely to work at all if
1171compiled with the wrong dialect.
1172
1173Many users appreciate the GNU C compiler because it provides an
1174environment that is uniform across machines.  These users would be
1175inconvenienced if the compiler treated plain bit-fields differently on
1176certain machines.
1177
1178Occasionally users write programs intended only for a particular machine
1179type.  On these occasions, the users would benefit if the GNU C compiler
1180were to support by default the same dialect as the other compilers on
1181that machine.  But such applications are rare.  And users writing a
1182program to run on more than one type of machine cannot possibly benefit
1183from this kind of compatibility.
1184
1185This is why GCC does and will treat plain bit-fields in the same
1186fashion on all types of machines (by default).
1187
1188There are some arguments for making bit-fields unsigned by default on all
1189machines.  If, for example, this becomes a universal de facto standard,
1190it would make sense for GCC to go along with it.  This is something
1191to be considered in the future.
1192
1193(Of course, users strongly concerned about portability should indicate
1194explicitly in each bit-field whether it is signed or not.  In this way,
1195they write programs which have the same meaning in both C dialects.)
1196
1197@item
1198@opindex ansi
1199@opindex std
1200Undefining @code{__STDC__} when @option{-ansi} is not used.
1201
1202Currently, GCC defines @code{__STDC__} unconditionally.  This provides
1203good results in practice.
1204
1205Programmers normally use conditionals on @code{__STDC__} to ask whether
1206it is safe to use certain features of ISO C, such as function
1207prototypes or ISO token concatenation.  Since plain @command{gcc} supports
1208all the features of ISO C, the correct answer to these questions is
1209``yes''.
1210
1211Some users try to use @code{__STDC__} to check for the availability of
1212certain library facilities.  This is actually incorrect usage in an ISO
1213C program, because the ISO C standard says that a conforming
1214freestanding implementation should define @code{__STDC__} even though it
1215does not have the library facilities.  @samp{gcc -ansi -pedantic} is a
1216conforming freestanding implementation, and it is therefore required to
1217define @code{__STDC__}, even though it does not come with an ISO C
1218library.
1219
1220Sometimes people say that defining @code{__STDC__} in a compiler that
1221does not completely conform to the ISO C standard somehow violates the
1222standard.  This is illogical.  The standard is a standard for compilers
1223that claim to support ISO C, such as @samp{gcc -ansi}---not for other
1224compilers such as plain @command{gcc}.  Whatever the ISO C standard says
1225is relevant to the design of plain @command{gcc} without @option{-ansi} only
1226for pragmatic reasons, not as a requirement.
1227
1228GCC normally defines @code{__STDC__} to be 1, and in addition
1229defines @code{__STRICT_ANSI__} if you specify the @option{-ansi} option,
1230or a @option{-std} option for strict conformance to some version of ISO C@.
1231On some hosts, system include files use a different convention, where
1232@code{__STDC__} is normally 0, but is 1 if the user specifies strict
1233conformance to the C Standard.  GCC follows the host convention when
1234processing system include files, but when processing user files it follows
1235the usual GNU C convention.
1236
1237@item
1238Undefining @code{__STDC__} in C++.
1239
1240Programs written to compile with C++-to-C translators get the
1241value of @code{__STDC__} that goes with the C compiler that is
1242subsequently used.  These programs must test @code{__STDC__}
1243to determine what kind of C preprocessor that compiler uses:
1244whether they should concatenate tokens in the ISO C fashion
1245or in the traditional fashion.
1246
1247These programs work properly with GNU C++ if @code{__STDC__} is defined.
1248They would not work otherwise.
1249
1250In addition, many header files are written to provide prototypes in ISO
1251C but not in traditional C@.  Many of these header files can work without
1252change in C++ provided @code{__STDC__} is defined.  If @code{__STDC__}
1253is not defined, they will all fail, and will all need to be changed to
1254test explicitly for C++ as well.
1255
1256@item
1257Deleting ``empty'' loops.
1258
1259Historically, GCC has not deleted ``empty'' loops under the
1260assumption that the most likely reason you would put one in a program is
1261to have a delay, so deleting them will not make real programs run any
1262faster.
1263
1264However, the rationale here is that optimization of a nonempty loop
1265cannot produce an empty one, which holds for C but is not always the
1266case for C++.
1267
1268@opindex funroll-loops
1269Moreover, with @option{-funroll-loops} small ``empty'' loops are already
1270removed, so the current behavior is both sub-optimal and inconsistent
1271and will change in the future.
1272
1273@item
1274Making side effects happen in the same order as in some other compiler.
1275
1276@cindex side effects, order of evaluation
1277@cindex order of evaluation, side effects
1278It is never safe to depend on the order of evaluation of side effects.
1279For example, a function call like this may very well behave differently
1280from one compiler to another:
1281
1282@example
1283void func (int, int);
1284
1285int i = 2;
1286func (i++, i++);
1287@end example
1288
1289There is no guarantee (in either the C or the C++ standard language
1290definitions) that the increments will be evaluated in any particular
1291order.  Either increment might happen first.  @code{func} might get the
1292arguments @samp{2, 3}, or it might get @samp{3, 2}, or even @samp{2, 2}.
1293
1294@item
1295Not allowing structures with volatile fields in registers.
1296
1297Strictly speaking, there is no prohibition in the ISO C standard
1298against allowing structures with volatile fields in registers, but
1299it does not seem to make any sense and is probably not what you wanted
1300to do.  So the compiler will give an error message in this case.
1301
1302@item
1303Making certain warnings into errors by default.
1304
1305Some ISO C testsuites report failure when the compiler does not produce
1306an error message for a certain program.
1307
1308@opindex pedantic-errors
1309ISO C requires a ``diagnostic'' message for certain kinds of invalid
1310programs, but a warning is defined by GCC to count as a diagnostic.  If
1311GCC produces a warning but not an error, that is correct ISO C support.
1312If test suites call this ``failure'', they should be run with the GCC
1313option @option{-pedantic-errors}, which will turn these warnings into
1314errors.
1315
1316@end itemize
1317
1318@node Warnings and Errors
1319@section Warning Messages and Error Messages
1320
1321@cindex error messages
1322@cindex warnings vs errors
1323@cindex messages, warning and error
1324The GNU compiler can produce two kinds of diagnostics: errors and
1325warnings.  Each kind has a different purpose:
1326
1327@itemize @w{}
1328@item
1329@dfn{Errors} report problems that make it impossible to compile your
1330program.  GCC reports errors with the source file name and line
1331number where the problem is apparent.
1332
1333@item
1334@dfn{Warnings} report other unusual conditions in your code that
1335@emph{may} indicate a problem, although compilation can (and does)
1336proceed.  Warning messages also report the source file name and line
1337number, but include the text @samp{warning:} to distinguish them
1338from error messages.
1339@end itemize
1340
1341Warnings may indicate danger points where you should check to make sure
1342that your program really does what you intend; or the use of obsolete
1343features; or the use of nonstandard features of GNU C or C++.  Many
1344warnings are issued only if you ask for them, with one of the @option{-W}
1345options (for instance, @option{-Wall} requests a variety of useful
1346warnings).
1347
1348@opindex pedantic
1349@opindex pedantic-errors
1350GCC always tries to compile your program if possible; it never
1351gratuitously rejects a program whose meaning is clear merely because
1352(for instance) it fails to conform to a standard.  In some cases,
1353however, the C and C++ standards specify that certain extensions are
1354forbidden, and a diagnostic @emph{must} be issued by a conforming
1355compiler.  The @option{-pedantic} option tells GCC to issue warnings in
1356such cases; @option{-pedantic-errors} says to make them errors instead.
1357This does not mean that @emph{all} non-ISO constructs get warnings
1358or errors.
1359
1360@xref{Warning Options,,Options to Request or Suppress Warnings}, for
1361more detail on these and related command-line options.
1362