Thread

Topic: negative integer literals

Author: fjh@mundook.cs.mu.OZ.AU (Fergus Henderson)
Date: 1997/04/09

James Kanze <james-albert.kanze@vx.cit.alcatel.fr> writes:

>fjh@mundook.cs.mu.OZ.AU (Fergus Henderson) writes:
>
>There is obviously a quality of implementation issue involved, but it is
>a very poor choice to activate extensions automatically.  If I
>accidentally type "long long", my program shouldn't compile *UNLESS*
>I've specified that I want the extension.

Sure, it is a quality of implementation issue as to whether it is
better to issue warnings or errors in such cases.  Which is better
may depend on the situation, so perhaps it is best to leave the
choice up to the user, as g++ does: with g++, you can choose between
`-pedantic' and `-pedantic-errors'.

>|>  For a program that contains `long long', after issuing a diagnostic,
>|>  the compiler is free to choose whatever semantics it likes.  However,
>|>  for a program that does not contain `long long', but does contain a
>|>  literal that will fit in `unsigned long' but not in `long', e.g.
>|>  2147483648, the DWP specifies the semantics, and a conforming compiler
>|>  is _not_ free to make the type of such literals `long long'.
>
>Correct.  Presumably, to be useful, such a compiler will have two modes,
>one which accepts long long, and handles type promotion correctly for
>that case, and another which doesn't, and handles type promotion
>correctly for C++.  While I would consider it a very bad choice, I
>suppose that a compiler could select the "long long" accepting mode
>automatically (after issuing a diagnostic) if the program contained
>"long long", and the normal mode otherwise.

Unfortunately this is quite difficult to implement, and perhaps
impossible to implement efficiently -- it requires a prepass of the
whole program to see whether it contains "long long" before doing
semantic analysis or code generation.

>I believe that g++'s policy is to have the extensions active by default;

I agree this is a poor policy.

>|>  In that case, it wouldn't be a true extension, because you would be
>|>  changing the semantics of the base language.
>
>OK.  I can buy this.  Two points, however:
>
>1. Are you saying that C++ should adapt rules to facilitate such
>extensions?

I'm saying that

(a) the g++ decision not to change the promotion rules
    when adding `long long' is a reasonable one
    (because changing promotion rules might break existing code,
    if compiled with extensions enabled);

(b) given (a), the sample program fragment
 long long int l = -2147483648;
    does not give the desired results;

(c) hence the way negative literals are handled in C++ at the moment
    could cause problems in practice.

That's all I was trying to say.
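
Point (b) can be reproduced on a current compiler by modelling the 1997
literal explicitly. This is only a sketch: it assumes std::uint32_t can
stand in for the unsigned 32-bit type the literal had on the
implementations under discussion, and the names kLiteral, as_int and
as_long_long are made up for illustration.

```cpp
#include <cassert>
#include <cstdint>

// Stand-in for the literal 2147483648 on a 32-bit implementation of the
// time, where it had an unsigned 32-bit type (assumption of this sketch).
constexpr std::uint32_t kLiteral = 2147483648u;

// Unary minus on an unsigned value stays unsigned: 2^32 - 2^31 = 2^31.
// The cast keeps the model at 32 bits even on a platform with a wider int.
constexpr std::uint32_t negated_literal() {
    return static_cast<std::uint32_t>(-kLiteral);
}

// Models "int i = -2147483648;": the unsigned result is narrowed to a
// signed 32-bit int. The wrap-around is guaranteed since C++20 and was
// implementation-defined before that.
inline std::int32_t as_int() {
    return static_cast<std::int32_t>(negated_literal());
}

// Models "long long l = -2147483648;": the unsigned result fits in the
// wider type, so it keeps its positive value.
inline long long as_long_long() {
    return static_cast<long long>(negated_literal());
}
```

With these definitions as_int() is negative while as_long_long() is the
positive number 2147483648, which matches the two asserts reported
earlier in the thread.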

In general I do think that all other things being equal, languages
should be designed to facilitate extensions.  Of course other things
are never equal, so as always it is a matter of choosing appropriate
trade-offs.

--
Fergus Henderson <fjh@cs.mu.oz.au>   |  "I have always known that the pursuit
WWW: <http://www.cs.mu.oz.au/~fjh>   |  of excellence is a lethal habit"
PGP: finger fjh@128.250.37.3         |     -- the last words of T. S. Garp.
---
[ comp.std.c++ is moderated.  To submit articles: Try just posting with your
                newsreader.  If that fails, use mailto:std-c++@ncar.ucar.edu
  comp.std.c++ FAQ: http://reality.sgi.com/austern/std-c++/faq.html
  Moderation policy: http://reality.sgi.com/austern/std-c++/policy.html
  Comments? mailto:std-c++-request@ncar.ucar.edu
]

Author: Neal Becker <neal@ctd.comsat.com>
Date: 1997/03/31

>>>>> "James" == James Kanze <james-albert.kanze@vx.cit.alcatel.fr> writes:

    James> The problem is that the value of the expression -2147483648 is not
    James> representable in a 32 bit signed int, since the value is the positive
    James> number 2147483648.  It would take some extremely tricky special case
    James> wording to make it work.

I'm going out on a limb here since I haven't been following this
discussion.  But with 2's complement arithmetic the range that is
represented by N bits is:

[-2^(N-1) ... 2^(N-1) - 1].  Inclusive.

So -2^(31) IS representable within a signed 32 bit integer!

Author: James Kuyper <kuyper@wizard.net>
Date: 1997/04/01

Neal Becker wrote:
>
> >>>>> "James" == James Kanze <james-albert.kanze@vx.cit.alcatel.fr> writes:
>
>     James> The problem is that the value of the expression -2147483648 is not
>     James> representable in a 32 bit signed int, since the value is the
>     James> positive number 2147483648.  It would take some extremely tricky
>     James> special case wording to make it work.
>
> I'm going out on a limb here since I haven't been following this
> discussion.  But with 2's complement arithmetic the range that is
> represented by N bits is:
>
> [-2^(N-1) ... 2^(N-1) - 1].  Inclusive.
>
> So -2^(31) IS representable within a signed 32 bit integer!

Unfortunately, as this thread has discussed ad nauseam, according to the
C standard -2147483648 is not a negative integer literal with the value
of -2^31 (standard C doesn't have negative integer literals). Instead,
(in the 32 bit context assumed throughout this thread) it is a C
expression with the value 2147483648, which won't fit in a signed 32
bit integer. Some of us think this is a problem with the C standard,
though apparently a difficult one to fix. See previous messages on this
thread for more details.
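
This is also why many library headers avoid spelling the most negative
value as a single negated literal and write it as an expression whose
every part is representable. A sketch of that idiom, using a made-up
constant name kInt32Min and assuming std::int32_t is available:

```cpp
#include <cassert>
#include <cstdint>
#include <limits>

// Every sub-expression fits in a signed 32-bit type: 2147483647 is the
// largest such value, and subtracting 1 from its negation gives -2^31.
constexpr std::int32_t kInt32Min = -2147483647 - 1;

static_assert(kInt32Min == std::numeric_limits<std::int32_t>::min(),
              "the idiom yields the most negative 32-bit value");
```

No intermediate value of 2147483648 is ever formed, so the question of
that literal's type does not arise.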

Author: James Kanze <james-albert.kanze@vx.cit.alcatel.fr>
Date: 1997/04/01

fjh@mundook.cs.mu.OZ.AU (Fergus Henderson) writes:

|>  James Kanze <james-albert.kanze@vx.cit.alcatel.fr> writes:
|>
|>  >Christopher Eltschka <celtschk@physik.tu-muenchen.de> writes:
|>  >
|>  >|>  There can still be problems with this value:
|>  >|>  try f.ex. with gcc under Linux
|>  >|>
|>  >|>  int i=-2147483648;
|>  >|>  assert(i<0); /* works */
|>  >|>
|>  >|>  long long l=-2147483648;
|>  >|>  assert(l<0); /* fails */
|>  >
|>  >There is no "long long" type in C++, so this is a compiler extension.
|>  >IMHO, if a compiler offers such an extension, it should also extend
|>  >preprocessor arithmetic to cover the case.  It would also seem natural
|>  >that the implicit promotions take this into account, so that 2147483648
|>  >would have type "long long", and the above would work.
|>
|>  But if a compiler did that, then it would no longer be conforming.

Of course not.  A compiler which accepts a program containing "long
long" is not conforming, period, regardless of what it does with its
constant expressions.

|>  The standard specifies that integer literals that can fit into an unsigned
|>  long but not a long should have type unsigned long.  An extension
|>  which gave such a value the type "long long" would not be a conforming
|>  extension.  I don't see how "implicit promotions" could fix this.

The standard also specifies that the sequence of tokens "long long" is a
syntax violation, which requires the compiler to issue a diagnostic.

I think that the extension is worthwhile.  (Actually, I think it's a
hack, but for the point of argument...)  But if you are adding integral
types, which can be used in constant integral expressions, you have to
redefine the promotion rules in some way anyway.  What do you do if the
literal is too large to fit in an unsigned long?

In practice, such extensions should only be active if you have given a
specific option requesting them.  If I request a long long type, I
certainly expect it to fit coherently into the type system, and give the
expected results.  And if I don't request it (and I wouldn't unless I
needed it), then the sequence "long long" had better be a compiler
error, so there is no problem with regards to the type of the expression
which initializes it.

|>  >This is all a quality of implementation issue however.  The standard
|>  >says nothing about long long.
|>
|>  Yes, but it does place restrictions on what constitutes a conforming
|>  extension, which constrain how a conforming implementation can implement
|>  `long long'.

Yes.  It says explicitly that a conforming implementation cannot
implement long long.

|>  Hopefully C9X will change the rules when it adds `long long', so that
|>  the code fragment above will work in conforming C9X implementations
|>  (even though doing so would technically not be backwards compatible
|>  with C89).

Agreed.  Although I'm not sure that the new type will be spelled "long
long".  (If so, where do we go from there?  In another ten years, we'll
have "long long long", and then "long long long long", and then...  But
that argument is for another thread, preferably in comp.std.c:-).)
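
For what it is worth, C99 and later C++11 did change the rule in the
direction hoped for above: an unsuffixed decimal literal takes the first
of int, long, long long that can represent it, and never an unsigned
type. A small check, assuming a conforming C++11 or later compiler (the
function name fragment_is_negative is made up for illustration):

```cpp
#include <cassert>
#include <limits>
#include <type_traits>

// Under the C++11 rules the literal 2147483648 has a signed type wide
// enough to hold it, whatever the width of int and long.
static_assert(std::is_signed<decltype(2147483648)>::value,
              "unsuffixed decimal literals are signed");
static_assert(std::numeric_limits<decltype(2147483648)>::digits >= 32,
              "the literal's type can hold 2^31");

// The fragment from earlier in the thread, wrapped in a function.
inline bool fragment_is_negative() {
    long long l = -2147483648;
    return l < 0;
}
```

Under those rules fragment_is_negative() returns true, because the
negation is carried out in a signed type.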

--
James Kanze      home:     kanze@gabi-soft.fr        +33 (0)1 39 55 85 62
                 office:   kanze@vx.cit.alcatel.fr   +33 (0)1 69 63 14 54
GABI Software, Sarl., 22 rue Jacques-Lemercier, F-78000 Versailles France
     -- Conseils en informatique industrielle --

Author: fjh@mundook.cs.mu.OZ.AU (Fergus Henderson)
Date: 1997/04/01

Neal Becker <neal@ctd.comsat.com> writes:

>>>>>> "James" == James Kanze <james-albert.kanze@vx.cit.alcatel.fr> writes:
>
>    James> The problem is that the value of the expression -2147483648 is not
>    James> representable in a 32 bit signed int, since the value is the positive
>    James> number 2147483648.  It would take some extremely tricky special case
>    James> wording to make it work.
>
>I'm going out on a limb here since I haven't been following this
>discussion.  But [...]
>-2^(31) IS representable within a signed 32 bit integer!

James Kanze did not deny that the *number* -2^31 is representable in a
32 bit signed int.  We're all aware of that.  He said that the
*value of the expression* `-2147483648', namely the positive number
2147483648, is not representable in a 32 bit signed int.

Why is the value of the expression `-2147483648' a positive number?
Because `2147483648' is a literal of type `unsigned long' (for the
implementations under discussion), and C++ defines unary minus on
unsigned integral types to return a value of the same unsigned integral
type.
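
The rule for unary minus on unsigned types can be sketched as follows.
The helper name negate is made up for illustration, and std::uint32_t is
assumed to model the 32-bit unsigned type in question.

```cpp
#include <cassert>
#include <cstdint>

// Negation in an unsigned 32-bit type is arithmetic modulo 2^32:
// negate(x) == (2^32 - x) mod 2^32, which can never be negative.
constexpr std::uint32_t negate(std::uint32_t x) {
    return static_cast<std::uint32_t>(0u - x);
}
```

Here negate(1u) is 4294967295 and negate(2147483648u) is 2147483648
again, since 2^31 is its own negative modulo 2^32.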

--
Fergus Henderson <fjh@cs.mu.oz.au>   |  "I have always known that the pursuit
WWW: <http://www.cs.mu.oz.au/~fjh>   |  of excellence is a lethal habit"
PGP: finger fjh@128.250.37.3         |     -- the last words of T. S. Garp.

Author: James Kanze <james-albert.kanze@vx.cit.alcatel.fr>
Date: 1997/04/01

Neal Becker <neal@ctd.comsat.com> writes:

|>  >>>>> "James" == James Kanze <james-albert.kanze@vx.cit.alcatel.fr> writes:
|>
|>
|>      James> The problem is that the value of the expression -2147483648 is not
|>      James> representable in a 32 bit signed int, since the value is the positive
|>      James> number 2147483648.  It would take some extremely tricky special case
|>      James> wording to make it work.
|>
|>  I'm going out on a limb here since I haven't been following this
|>  discussion.  But with 2's complement arithmetic the range that is
|>  represented by N bits is:
|>
|>  [-2^(N-1) ... 2^(N-1) - 1].  Inclusive.
|>
|>  So -2^(31) IS representable within a signed 32 bit integer!

You're missing the context.  The whole raison d'etre of this discussion
is that the constant expression -2147483648 is not an int, because it
requires an intermediate value (2147483648) which cannot be represented.

--
James Kanze      home:     kanze@gabi-soft.fr        +33 (0)1 39 55 85 62
                 office:   kanze@vx.cit.alcatel.fr   +33 (0)1 69 63 14 54
GABI Software, Sarl., 22 rue Jacques-Lemercier, F-78000 Versailles France
     -- Conseils en informatique industrielle --

Author: fjh@mundook.cs.mu.OZ.AU (Fergus Henderson)
Date: 1997/04/02

James Kanze <james-albert.kanze@vx.cit.alcatel.fr> writes:

>fjh@mundook.cs.mu.OZ.AU (Fergus Henderson) writes:
>
>|>  James Kanze <james-albert.kanze@vx.cit.alcatel.fr> writes:
>|>
>|>  >There is no "long long" type in C++, so this is a compiler extension.
>|>  >IMHO, if a compiler offers such an extension, it should also extend
>|>  >preprocessor arithmetic to cover the case.  It would also seem natural
>|>  >that the implicit promotions take this into account, so that 2147483648
>|>  >would have type "long long", and the above would work.
>|>
>|>  But if a compiler did that, then it would no longer be conforming.
>
>Of course not.  A compiler which accepts a program containing "long
>long" is not conforming, period, regardless of what it does with its
>constant expressions.

That's not correct.  So long as the compiler issues a diagnostic,
it is then free to go ahead and accept the program.  In fact,
that is exactly what g++ does in this case.  When you invoke gcc
in pedantic mode, any use of `long long' will result in a warning.

>|>  The standard specifies that integer literals that can fit into an unsigned
>|>  long but not a long should have type unsigned long.  An extension
>|>  which gave such a value the type "long long" would not be a conforming
>|>  extension.  I don't see how "implicit promotions" could fix this.
>
>The standard also specifies that the sequence of tokens "long long" is a
>syntax violation, which requires the compiler to issue a diagnostic.

Yes; so?

For a program that contains `long long', after issuing a diagnostic,
the compiler is free to choose whatever semantics it likes.  However,
for a program that does not contain `long long', but does contain a
literal that will fit in `unsigned long' but not in `long', e.g.
2147483648, the DWP specifies the semantics, and a conforming compiler
is _not_ free to make the type of such literals `long long'.

>I think that the extension is worthwhile.  (Actually, I think it's a
>hack, but for the point of argument...)  But if you are adding integral
>types, which can be used in constant integral expressions, you have to
>redefine the promotion rules in some way anyway.  What do you do if the
>literal is too large to fit in an unsigned long?

Well, in that case the DWP says the program is ill-formed, so the compiler
must issue a diagnostic; but after it has issued the diagnostic,
it can freely give the literal type `long long' and keep going.

>In practice, such extensions should only be active if you have given a
>specific option requesting them.  If I request a long long type, I
>certainly expect it to fit coherently into the type system, and give the
>expected results.

Do you expect this even if doing so would violate standards conformance
for the subset which does _not_ use `long long'?
That might cause _other_ parts of your program, e.g. the header files
in some third-party library that you are using, to break, if they
relied on behaviour specified by the ANSI C standard and the C++ DWP.

In that case, it wouldn't be a true extension, because you would be
changing the semantics of the base language.

>|>  >This is all a quality of implementation issue however.  The standard
>|>  >says nothing about long long.
>|>
>|>  Yes, but it does place restrictions on what constitutes a conforming
>|>  extension, which constrain how a conforming implementation can implement
>|>  `long long'.
>
>Yes.  It says explicitly that a conforming implementation cannot
>implement long long.

No, it does not.  A conforming implementation certainly can implement
`long long'.  For example, `gcc -ansi -pedantic' is (modulo bugs) a
conforming implementation, and it implements `long long'.

What the C standard (and similarly the C++ DWP) says implies that a
conforming implementation can implement `long long', but just not with the
rules for types of integral literals that you would like.

--
Fergus Henderson <fjh@cs.mu.oz.au>   |  "I have always known that the pursuit
WWW: <http://www.cs.mu.oz.au/~fjh>   |  of excellence is a lethal habit"
PGP: finger fjh@128.250.37.3         |     -- the last words of T. S. Garp.

Author: Marcelo Cantos <marcelo@mds.rmit.edu.au>
Date: 1997/04/02

James Kanze <james-albert.kanze@vx.cit.alcatel.fr> writes:

> You're missing the context.  The whole raison d'etre of this discussion
> is that the constant expression -2147483648 is not an int, because it
> requires an intermediate value (2147483648) which cannot be represented.

Has anyone noticed how annoying it is to discuss the number
2147483648?  So much typing!  (Or cutting and pasting.)  I'm just glad
we had this discussion before 64-bit ints became popular; we'd all be
busy bashing out the number 9223372036854775808; or worse still,
128-bit numbers (170141183460469231731687303715884105728)!  :-) We'd
just about have to abandon 80-column displays for 256-bit numbers:
57896044618658097711785492504343953926634992332820282019728792003956564819968
(Now I'm being silly.)

Maybe we should mandate that public schools teach hexadecimal from now
on!  Imagine having to learn the fifteen times tables!

PS: Please don't take this post seriously!!  :-)

--
______________________________________________________________________
Marcelo Cantos, Research Assistant      __/_   marcelo@mds.rmit.edu.au
Multimedia Database Systems Group, RMIT  /       _  Tel 61-3-9282-2497
723 Swanston St, Carlton VIC 3053    Aus/ralia ><_> Fax 61-3-9282-2490
Acknowledgements: errors - me; wisdom - God; funding - RMIT

Author: Ted Clancy <s341282%student.uq.edu.au.SPAM_REPELLENT@nac.no>
Date: 1997/04/06

James Kanze wrote:
>
> willhall@idt.net (Will Hall) writes:
>
>
> I think that there could be a solution: it would represent a radical
> change in the way C/C++ has been described until now, but I don't think
> it would have much significant practical change (except to break
> programs which count on -2^31 being positive:-).)
>
> My idea would be that integer literals be untyped (but signed), and that
> the implementation be required to do integral constant arithmetic in
> "infinite" precision.  (In practice, there must be implementation
> limits, but they should be required to be at least twice the number of
> bits in the largest integral type? or all values in the range
> LONG_MIN...ULONG_MAX?)  Only when the integral literal is used does it
> acquire a type, according to its value and the context.
>
Sounds like a good idea, and not unlike what C++ already does for
expression &func, where func could be one of many overloaded functions.
In this case, &func doesn't have a type until it is assigned to a
variable, or passed to a function.
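
The &func analogy can be sketched like this. The overloads named twice
and the two wrapper functions are made up for illustration.

```cpp
#include <cassert>

// Two overloads sharing one name.
inline int    twice(int x)    { return 2 * x; }
inline double twice(double x) { return 2.0 * x; }

// "&twice" on its own names the whole overload set; the type of the
// pointer being initialised is what selects one overload.
inline int call_int_overload(int x) {
    int (*p)(int) = &twice;
    return p(x);
}

inline double call_double_overload(double x) {
    double (*p)(double) = &twice;
    return p(x);
}
```

In both wrappers the same expression &twice is used, and only the
context determines which function it refers to.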

> This is trickier than it looks: there are several implications to be
> considered:
>
> 1. Function overloading.  I don't think that there are any real problems
> here; the current rules are rather confusing anyway (the actual function
> depends on the value, and may change from one implementation to the
> next).
>
> 2. The use of the U and L suffixes.  This is somewhat more difficult; if
> we don't want to break existing code, the presence of a suffix in the
> constant expression must constrain the type of the expression.
>

This could fit into C++ pretty well. Just consider constant integral
expressions to be of type CONSTEXP. The compiler defines
CONSTEXP operator+(CONSTEXP, CONSTEXP);
CONSTEXP operator-(CONSTEXP, CONSTEXP);
CONSTEXP operator-(CONSTEXP);
etc...

When a CONSTEXP is passed to a function, or used as an operand to
sizeof, its type would be the first of int, unsigned int, long, unsigned
long, which it can fit into, as is the current rule.

You could explicitly define
  int operator+(int, CONSTEXP);
  int operator-(int, CONSTEXP);
  long operator+(long, CONSTEXP);
  long operator-(long, CONSTEXP); //etc...
but these would probably be unnecessary with the above rule, combined with
integral promotion.

The U and L suffixes can be treated as static_cast<unsigned>(CONSTEXP)
and static_cast<long>(CONSTEXP), respectively.

Of course, the time for additions has passed, as you say. Maybe in the
next version of C++.
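
As a rough library-level approximation of the idea (not the language
change itself), one could write something like the following. Everything
here is hypothetical: the ConstExp struct, the Acquired enum and the
acquire function are invented for this sketch, long long stands in for
the "infinite" precision, and only a 32-bit int and unsigned int are
modelled.

```cpp
#include <cassert>

// Hypothetical "untyped" constant: arithmetic is done in a wide signed
// type, so negating 2147483648 gives a genuinely negative value.
struct ConstExp {
    long long value;
};

constexpr ConstExp operator-(ConstExp a) { return ConstExp{-a.value}; }
constexpr ConstExp operator+(ConstExp a, ConstExp b) { return ConstExp{a.value + b.value}; }
constexpr ConstExp operator-(ConstExp a, ConstExp b) { return ConstExp{a.value - b.value}; }

// Limits of the modelled 32-bit types, written as wide constants.
constexpr long long kIntMin  = -2147483647LL - 1;
constexpr long long kIntMax  = 2147483647LL;
constexpr long long kUIntMax = 4294967295LL;

// The type a constant would acquire when it is finally used: the first
// of int, unsigned int that can hold its value.
enum class Acquired { Int, UInt, TooWide };

constexpr Acquired acquire(ConstExp c) {
    return (c.value >= kIntMin && c.value <= kIntMax) ? Acquired::Int
         : (c.value >= 0 && c.value <= kUIntMax)      ? Acquired::UInt
                                                      : Acquired::TooWide;
}
```

In this model ConstExp{2147483648LL} on its own would acquire the
unsigned type, while -ConstExp{2147483648LL} is evaluated first and then
acquires int, which is the behaviour the proposal is after.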

> --
> James Kanze      home:     kanze@gabi-soft.fr        +33 (0)1 39 55 85 62
>                  office:   kanze@vx.cit.alcatel.fr   +33 (0)1 69 63 14 54
> GABI Software, Sarl., 22 rue Jacques-Lemercier, F-78000 Versailles France
>             -- Conseils en informatique industrielle --

--
Ted Clancy,                         | Student of Engineering and Arts
s341282@student.uq.edu.au           | Secretary of UQ-Trek
The University of Queensland.       | The guy who hates macro NULL.

Author: James Kanze <james-albert.kanze@vx.cit.alcatel.fr>
Date: 1997/04/08

fjh@mundook.cs.mu.OZ.AU (Fergus Henderson) writes:

|>  James Kanze <james-albert.kanze@vx.cit.alcatel.fr> writes:
|>
|>  >fjh@mundook.cs.mu.OZ.AU (Fergus Henderson) writes:
|>  >
|>  >|>  James Kanze <james-albert.kanze@vx.cit.alcatel.fr> writes:
|>  >|>
|>  >|>  >There is no "long long" type in C++, so this is a compiler extension.
|>  >|>  >IMHO, if a compiler offers such an extension, it should also extend
|>  >|>  >preprocessor arithmetic to cover the case.  It would also seem natural
|>  >|>  >that the implicit promotions take this into account, so that 2147483648
|>  >|>  >would have type "long long", and the above would work.
|>  >|>
|>  >|>  But if a compiler did that, then it would no longer be conforming.
|>  >
|>  >Of course not.  A compiler which accepts a program containing "long
|>  >long" is not conforming, period, regardless of what it does with its
|>  >constant expressions.
|>
|>  That's not correct.  So long as the compiler issues a diagnostic,
|>  it is then free to go ahead and accept the program.  In fact,
|>  that is exactly what g++ does in this case.  When you invoke gcc
|>  in pedantic mode, any use of `long long' will result in a warning.

So long as the compiler issues a diagnostic, it is also free to go ahead
and reformat your hard disk.  (In comp.std.c, one person once suggested
that the documented diagnostic message could be that the compiler turns
on the light on the disk for 1 sec./Megabyte of storage:-).)

There is obviously a quality of implementation issue involved, but it is
a very poor choice to activate extensions automatically.  If I
accidentally type "long long", my program shouldn't compile *UNLESS*
I've specified that I want the extension.  (FWIW: in normal text,
doubling a word is one of the more frequent typing errors.)

|>  >|>  The standard specifies that integer literals that can fit into an unsigned
|>  >|>  long but not a long should have type unsigned long.  An extension
|>  >|>  which gave such a value the type "long long" would not be a conforming
|>  >|>  extension.  I don't see how "implicit promotions" could fix this.
|>  >
|>  >The standard also specifies that the sequence of tokens "long long" is a
|>  >syntax violation, which requires the compiler to issue a diagnostic.
|>
|>  Yes; so?
|>
|>  For a program that contains `long long', after issuing a diagnostic,
|>  the compiler is free to choose whatever semantics it likes.  However,
|>  for a program that does not contain `long long', but does contain a
|>  literal that will fit in `unsigned long' but not in `long', e.g.
|>  2147483648, the DWP specifies the semantics, and a conforming compiler
|>  is _not_ free to make the type of such literals `long long'.

Correct.  Presumably, to be useful, such a compiler will have two modes,
one which accepts long long, and handles type promotion correctly for
that case, and another which doesn't, and handles type promotion
correctly for C++.  While I would consider it a very bad choice, I
suppose that a compiler could select the "long long" accepting mode
automatically (after issuing a diagnostic) if the program contained
"long long", and the normal mode otherwise.

I believe that g++'s policy is to have the extensions active by default;
you need special options (-ansi -pedantic) for g++ to be a C++ compiler
(as opposed to a compiler for G++, a language which is confusingly
similar to C++, but not identical).  Obviously, it is up to the authors
of G++ (the language) to decide what rules they want to use for type
promotion; since they are, in fact, defining a new language, anything
goes.

Just as obviously, if I use "long long", I WON'T compile my program with
a C++ compiler; I WILL use whatever options are necessary to get the
extension.  And I won't expect the C++ type promotion rules to be
applicable as such, since I am working in a different system of types.
(Because the language is so similar to C++, I would expect the type
promotion rules to conform to the *spirit* of C++.  But conforming to
the spirit of C++ would require 2147483648 to have type "long long".)

|>  >I think that the extension is worthwhile.  (Actually, I think it's a
|>  >hack, but for the point of argument...)  But if you are adding integral
|>  >types, which can be used in constant integral expressions, you have to
|>  >redefine the promotion rules in some way anyway.  What do you do if the
|>  >literal is too large to fit in an unsigned long?
|>
|>  Well, in that case the DWP says the program is ill-formed, so the compiler
|>  must issue a diagnostic; but after it has issued the diagnostic,
|>  it can freely give the literal type `long long' and keep going.

Correct.  And it can freely handle type promotion in a way natural to
the extension it has invoked.

(Again, however, I think that g++ is doing its users a disservice in
making such extensions the default mode.  The only people who don't read
compiler documentation, and just use the default mode, are beginning
students, who also suppose that anything the compiler accepts is
"standard" C++.  As one of the moderators of comp.lang.c++.moderated,
I can say that we've been insulted several times by posters for refusing
postings concerning such "standard" C++ features as "__cdecl" or "__far".
If Linux ever replaces MS-Windows as the standard newbie platform, I
fear that we will get similar insults concerning nested functions et
al.)

|>  >In practice, such extensions should only be active if you have given a
|>  >specific option requesting them.  If I request a long long type, I
|>  >certainly expect it to fit coherently into the type system, and give the
|>  >expected results.
|>
|>  Do you expect this even if doing so would violate standards conformance
|>  for the subset which does _not_ use `long long'?

If I don't use "long long", I won't use the option which turns it on.
Either I'm compiling C++, or I'm compiling a similar (but not identical)
language.

|>  That might cause _other_ parts of your program, e.g. the header files
|>  in some third-party library that you are using, to break, if they
|>  relied on behaviour specified by the ANSI C standard and the C++ DWP.
|>
|>  In that case, it wouldn't be a true extension, because you would be
|>  changing the semantics of the base language.

OK.  I can buy this.  Two points, however:

1. Are you saying that C++ should adapt rules to facilitate such
extensions?

2. I think that there is always some risk of breaking third-party
headers, etc. when using extensions.  In this case, I consider the risk
negligible; the number of cases where the difference is perceptible is
extremely limited, and they mostly occur in code (not in headers), and,
generally, such third party software is targeting multiple platforms,
and cannot make accurate assumptions concerning the type of the constant
anyway.  (The second point may reflect some idealism on my part: third
party software vendors should write high quality portable code.)

|>  >|>  >This is all a quality of implementation issue however.  The standard
|>  >|>  >says nothing about long long.
|>  >|>
|>  >|>  Yes, but it does place restrictions on what constitutes a conforming
|>  >|>  extension, which constrain how a conforming implementation can implement
|>  >|>  `long long'.
|>  >
|>  >Yes.  It says explicitly that a conforming implementation cannot
|>  >implement long long.
|>
|>  No, it does not.  A conforming implementation certainly can implement
|>  `long long'.  For example, `gcc -ansi -pedantic' is (modulo bugs) a
|>  conforming implementation, and it implements `long long'.
|>
|>  What the C standard (and similarly the C++ DWP) says implies that a
|>  conforming implementation can implement `long long', but just not with the
|>  rules for types of integral literals that you would like.

In fact, just not with reasonable rules for type promotion.

What's wrong with just saying: if you want "long long", you don't want
C++?  (You may want something very similar, like G++.)  And what's wrong
with being required to know what language you are compiling in?

--
James Kanze      home:     kanze@gabi-soft.fr        +33 (0)1 39 55 85 62
                 office:   kanze@vx.cit.alcatel.fr   +33 (0)1 69 63 14 54
GABI Software, Sarl., 22 rue Jacques-Lemercier, F-78000 Versailles France
     -- Conseils en informatique industrielle --
---
[ comp.std.c++ is moderated.  To submit articles: Try just posting with your
                newsreader.  If that fails, use mailto:std-c++@ncar.ucar.edu
  comp.std.c++ FAQ: http://reality.sgi.com/austern/std-c++/faq.html
  Moderation policy: http://reality.sgi.com/austern/std-c++/policy.html
  Comments? mailto:std-c++-request@ncar.ucar.edu
]

Author: willhall@idt.net (Will Hall)
Date: 1997/03/27

In article <rf5hghzvxhc.fsf@vx.cit.alcatel.fr>, James Kanze
<james-albert.kanze@vx.cit.alcatel.fr> wrote:

> |>  (1) Require, for all integer literals x>=0 such that the mathematical
> |>  value -x is representable in a given signed type, that x also be
> |>  representable in that type. (This amounts to, essentially, changing the
> |>  range of int and long on 32-bit implementations to be (-2^31 + 1, 2^31 -
> |>  1). )
>
> This doesn't even require a change in the standard;

Explicitly labeling behavior for a number -x as implementation-defined, if
x is non-representable, would constitute a change to the latest draft.

> such an
> implementation is legal now (I'm fairly certain).  In fact, I've
> actually used an implementation that did this: 2's complement, but
> defined INT_MIN to be -INT_MAX.
>
> In practice, it doesn't work out too well.  I've long forgotten the
> exact problems, and maybe I wouldn't even feel them as problems today.

Too bad -- it would be interesting to hear them.

> I suspect that there are too many other things
> that will break if int's really can take on values not in the range of
> INT_MIN...INT_MAX.  Formally, I suspect that they are broken already, in
> the sense that the standard does allow this (and programs which happen
> to use such values invoke undefined behavior), but in practice, they
> work under all current implementations, and *requiring* an
> implementation to break them is probably asking too much.

Having the standard require representability of -x for all representable x
is not the same as requiring an implementation to break for -x s.t. x is
not representable. On the contrary, for implementations on which -2^31
e.g. is already usable  -- if not directly specifiable (I'm sure you'd
agree there are many) -- there would be no noticeable effect.

The effect would be that all signed integer literals _required_ by the
standard would now be directly specifiable (i.e., the direct intuitive
specification would yield the intuitively expected value of the
intuitively expected type).

Just as you, M. Kanze, have stated that you don't mind that `int i =
-2147483648;' has an implementation-defined value, because you trust that
all "real life" implementations will do the right thing, should you not
(by that logic) agree that, if the same expression _still_ works with the
above change to the standard (one which makes the standard more
self-consistent and less counter-intuitive and, after all, only affects
this one particular seldom-used number, as you seem to view it) --
shouldn't your attitude to such a change be indifferent to favorable?

-Will Hall
---

Author: James Kuyper <kuyper@wizard.net>
Date: 1997/03/27

James Kanze wrote:
>
> willhall@idt.net (Will Hall) writes:
...
> |>  "non-portable". To repeat, this interpretation of "portable" is absurdly
> |>  restrictive and precludes sensible discussion of portability between
> |>  conforming 32-bit implementations and conforming 64-bit implementations.
>
> The question is: are we talking about the standard, or not?  A program
> which uses int's to store values like 100000 is not portable, and the
> standard doesn't guarantee it will work.  In fact, I've used a number of
> implementations where it won't work.

Doesn't the standard guarantee that such a program will work on any
implementation where INT_MAX>=100000 ? I've been assuming that it did; I
assumed that this was part of what INT_MAX means. I thought that I could
store any value in the range INT_MIN <= n <= INT_MAX in an integer,
without giving the implementation an excuse for unexpected behavior.
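
One way to make that assumption explicit is sketched below, using only
<climits> and the preprocessor. The helper name store_large_value is
hypothetical and exists only for illustration; the point is that the
program refuses to compile where the assumption does not hold, rather
than relying on every implementation having a large enough int.

```cpp
#include <cassert>
#include <climits>

// Refuse to compile on implementations where int cannot hold 100000
// (the standard only guarantees INT_MAX >= 32767).
#if INT_MAX < 100000
#error "this program assumes INT_MAX >= 100000"
#endif

// Hypothetical helper: on any implementation that gets past the check
// above, 100000 lies within INT_MIN..INT_MAX and is stored exactly.
int store_large_value()
{
    int n = 100000;
    assert(n <= INT_MAX);
    return n;
}
```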

Similarly, until this thread started, I thought that the standard
guaranteed that I could express any value between LONG_MIN and ULONG_MAX
as an integral type literal, even if that literal was not expressible
under all conforming implementations. I understand now that this is not
true, but I don't like it.

Is the following an accurate statement of your point of view?

"If a program is not portable to every conforming implementation, the
standard (does not ? should not ? can not) make any guarantees about its
behavior under any implementation".

If this is not what you believe, can you provide a similarly worded
statement that does justify your statements?
---

Author: James Kanze <james-albert.kanze@vx.cit.alcatel.fr>
Date: 1997/03/28

Christopher Eltschka <celtschk@physik.tu-muenchen.de> writes:

|>  James Kanze wrote:
|>
|>  [...]
|>
|>  > |>  So, we have an example of a (familiar) class of implementations (namely,
|>  > |>  those with 32-bit int and 32-bit long) that have a representation for
|>  > |>  -2147483648 but that do _not_ allow for this value's direct specification.
|>  >
|>  > Except that they do allow it.
|>  >
|>  > If my statement is wrong, it is remarkably easy to disprove: just show
|>  > an implementation where it doesn't work.
|>  >
|>  > I know that this has nothing to do with the standard, but if you use
|>  > such a value, you are outside of the guarantees of the standard anyway.
|>
|>  There can still be problems with this value:
|>  try f.ex. with gcc under Linux
|>
|>  int i=-2147483648;
|>  assert(i<0); /* works */
|>
|>  long long l=-2147483648;
|>  assert(l<0); /* fails */

There is no "long long" type in C++, so this is a compiler extension.
IMHO, if a compiler offers such an extension, it should also extend
preprocessor arithmetic to cover the case.  It would also seem natural
that the implicit promotions take this into account, so that 2147483648
would have type "long long", and the above would work.

This is all a quality of implementation issue however.  The standard
says nothing about long long.

--
James Kanze      home:     kanze@gabi-soft.fr        +33 (0)1 39 55 85 62
                 office:   kanze@vx.cit.alcatel.fr   +33 (0)1 69 63 14 54
GABI Software, Sarl., 22 rue Jacques-Lemercier, F-78000 Versailles France
     -- Conseils en informatique industrielle --
---
[ comp.std.c++ is moderated.  To submit articles: try just posting with      ]
[ your news-reader.  If that fails, use mailto:std-c++@ncar.ucar.edu         ]
[ FAQ:      http://reality.sgi.com/employees/austern_mti/std-c++/faq.html    ]
[ Policy:   http://reality.sgi.com/employees/austern_mti/std-c++/policy.html ]
[ Comments? mailto:std-c++-request@ncar.ucar.edu                             ]

Author: James Kanze <james-albert.kanze@vx.cit.alcatel.fr>
Date: 1997/03/28

willhall@idt.net (Will Hall) writes:

|>  In article <rf5g1xjvwja.fsf@vx.cit.alcatel.fr>, James Kanze
|>  <james-albert.kanze@vx.cit.alcatel.fr> wrote:
|>
|>  > I've addressed the two proposals in another posting, so I won't talk
|>  > about them here.  If you think that the problem is real, and serious,
|>  > make a concrete proposal; if there are no other problems with it, I
|>  > won't oppose it.  (Note that I consider the fact that an int may have a
|>  > value not in the range INT_MIN...INT_MAX unacceptable.
|>
|>  Care to elaborate? I assume you are referring to my suggestion that, in
|>  effect, INT_MIN == -INT_MAX be a property of implementations, and that if
|>  an implementation chooses to allow ints which are < INT_MIN, e.g., then it
|>  is operating in implementation-defined territory. Why is that
|>  unacceptable, exactly?

The problem is that I don't remember what the problems were.  I used
such an implementation once, and I didn't like it at all, but I cannot
remember exactly why.

I can think of vague reasons involving program verification (if x is
declared an int, it is by definition proved that it has a value in the
range INT_MIN...INT_MAX).  In fact, I'm sure that this was not the point
then.  Also, given that overflow is undefined behavior today, anyone
doing program verification will be asserting that results outside
of the range INT_MIN...INT_MAX are not possible anyway.

This is an easy suggestion to test.  Just modify the value of INT_MIN,
etc. in your headers, grab big chunks of existing code, and go to it.
You should soon get a feel of whether there are any real problems or
not.

--
James Kanze      home:     kanze@gabi-soft.fr        +33 (0)1 39 55 85 62
                 office:   kanze@vx.cit.alcatel.fr   +33 (0)1 69 63 14 54
GABI Software, Sarl., 22 rue Jacques-Lemercier, F-78000 Versailles France
     -- Conseils en informatique industrielle --
---

Author: James Kanze <james-albert.kanze@vx.cit.alcatel.fr>
Date: 1997/03/28

willhall@idt.net (Will Hall) writes:

|>  Well then, how about changing the standard to eliminate the "aesthetic"
|>  problem as you call it without generating any new work for lexer writers?
|>  If you're not sure what I mean by this, please see my recent posts on the
|>  matter (I wouldn't want to repeat myself ;-).

Having thought about the problem a little bit more...

I think that there could be a solution: it would represent a radical
change in the way C/C++ has been described until now, but I don't think
it would have much practical effect (except to break
programs which count on -2^31 being positive :-).)

My idea would be that integer literals be untyped (but signed), and that
the implementation be required to do integral constant arithmetic in
"infinite" precision.  (In practice, there must be implementation
limits, but they should be required to be at least twice the number of
bits in the largest integral type? or all values in the range
LONG_MIN...ULONG_MAX?)  Only when the integral literal is used does it
acquire a type, according to its value and the context.

This is trickier than it looks: there are several implications to be
considered:

1. Function overloading.  I don't think that there are any real problems
here; the current rules are rather confusing anyway (the actual function
depends on the value, and may change from one implementation to the
next).

2. The use of the U and L suffixes.  This is somewhat more difficult; if
we don't want to break existing code, the presence of a suffix in the
constant expression must constrain the type of the expression.

There are probably other points I haven't thought of.
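
The overloading point in item 1 can be sketched as follows. This is only
an illustration: the overload set `which' is hypothetical, and it assumes
a compiler that accepts long long (standard from C++11 on). Which
overload a large literal selects depends on the sizes of int and long
and on which literal-typing rules are in force.

```cpp
#include <cassert>
#include <cstring>

// Hypothetical overload set: each overload reports which one was selected.
const char* which(int)           { return "int"; }
const char* which(long)          { return "long"; }
const char* which(unsigned long) { return "unsigned long"; }
const char* which(long long)     { return "long long"; }

// The same spelling of a call can pick different overloads on different
// implementations, because the literal's type depends on its value and on
// the sizes of int and long.
const char* small_literal() { return which(1); }
const char* large_literal() { return which(2147483648); }

void check_overloads()
{
    // 1 always fits in int.
    assert(std::strcmp(small_literal(), "int") == 0);
    // Under C++11 rules an unsuffixed decimal literal is never unsigned;
    // under the draft rules discussed in this thread it could be
    // unsigned long.
    assert(std::strcmp(large_literal(), "unsigned long") != 0);
}
```

With 32-bit int and 64-bit long, large_literal() returns "long"; with
32-bit int and 32-bit long under C++11 rules it returns "long long".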

It is too late in the C++ standardization to consider this.  I think it
is more a C issue anyway; I wouldn't want C and C++ to differ on this.
So if anyone is interested, they should probably make the proposal to
the C standards committee, not to the C++ one.  (I'm not sure, but it
may also be too late for this round of C, as well.)

Anyway: I like the idea.  I think that it could be made to work.  I also
think that the current situation is not a real problem, and that there
are more important things to worry about, so I'm not ready to invest any
real effort in getting the idea adopted.

--
James Kanze      home:     kanze@gabi-soft.fr        +33 (0)1 39 55 85 62
                 office:   kanze@vx.cit.alcatel.fr   +33 (0)1 69 63 14 54
GABI Software, Sarl., 22 rue Jacques-Lemercier, F-78000 Versailles France
     -- Conseils en informatique industrielle --
---

Author: fjh@mundook.cs.mu.OZ.AU (Fergus Henderson)
Date: 1997/03/29

James Kanze <james-albert.kanze@vx.cit.alcatel.fr> writes:

>Christopher Eltschka <celtschk@physik.tu-muenchen.de> writes:
>
>|>  There can still be problems with this value:
>|>  try f.ex. with gcc under Linux
>|>
>|>  int i=-2147483648;
>|>  assert(i<0); /* works */
>|>
>|>  long long l=-2147483648;
>|>  assert(l<0); /* fails */
>
>There is no "long long" type in C++, so this is a compiler extension.
>IMHO, if a compiler offers such an extension, it should also extend
>preprocessor arithmetic to cover the case.  It would also seem natural
>that the implicit promotions take this into account, so that 2147483648
>would have type "long long", and the above would work.

But if a compiler did that, then it would no longer be conforming.
The standard specifies that integer literals that can fit into an unsigned
long but not a long should have type unsigned long.  An extension
which gave such a value the type "long long" would not be a conforming
extension.  I don't see how "implicit promotions" could fix this.

>This is all a quality of implementation issue however.  The standard
>says nothing about long long.

Yes, but it does place restrictions on what constitutes a conforming
extension, which constrain how a conforming implementation can implement
`long long'.

Hopefully C9X will change the rules when it adds `long long', so that
the code fragment above will work in conforming C9X implementations
(even though doing so would technically not be backwards compatible
with C89).
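
For reference, a sketch of how the fragment behaves under the rules that
C99 and, later, C++11 adopted: an unsuffixed decimal literal takes the
first of int, long, long long that can hold it, and never an unsigned
type. The sketch assumes a C++11 or later compiler; the function name is
illustrative only.

```cpp
#include <cassert>
#include <type_traits>

// Under C++11 rules 2147483648 is long or long long (whichever is the
// first to hold it), so the literal itself is signed.
static_assert(std::is_signed<decltype(2147483648)>::value,
              "unsuffixed decimal literals are signed in C++11 and later");

// The fragment from the thread: negating a signed literal gives a
// negative value, so the assertion holds.
long long negated_literal()
{
    long long l = -2147483648;
    assert(l < 0);
    return l;
}
```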

--
Fergus Henderson <fjh@cs.mu.oz.au>   |  "I have always known that the pursuit
WWW: <http://www.cs.mu.oz.au/~fjh>   |  of excellence is a lethal habit"
---

Author: Oleg Zabluda <zabluda@math.psu.edu>
Date: 1997/03/20

James Kanze <james-albert.kanze@vx.cit.alcatel.fr> wrote:
: If INT_MIN (or LONG_MIN) == -2147483647, ...

Small comment:

INT_MIN is an obsolete way to do that. The modern way is:
#include <limits>
numeric_limits<int>::min();

Oleg.
--
Life is a sexually transmitted, 100% lethal disease.
 ==> http://www.math.psu.edu/zabluda/sheep.html before e-mailing about cloning.
---

Author: James Kuyper <kuyper@wizard.net>
Date: 1997/03/21

Will Hall wrote:
>
> In article <rf5k9n4fdez.fsf@vx.cit.alcatel.fr>, James Kanze
> <james-albert.kanze@vx.cit.alcatel.fr> wrote:
...
[re: -2^31 ]
> > ... in real life, the number is portable to all machines on which
> > it is representable.
>
> If you are referring to the number -2^31, you are mistaken.
...
Can you please provide a specific counter-example? There is a loophole
in the standard, and Mr. Kanze does not seem to disagree with you about
that. He is merely claiming that no one has yet produced an
implementation that slips through this particular loophole.

I don't think Mr. Kanze is mistaken, I just don't care about the truth
of his statement. If -2^31 can be represented in a given implementation,
the standard itself should guarantee that I can express it as
-2147483648; I shouldn't have to rely upon the historical accident that
no one has implemented this poorly yet.
---

Author: fjh@mundook.cs.mu.OZ.AU (Fergus Henderson)
Date: 1997/03/21

Oleg Zabluda <zabluda@math.psu.edu> writes:

>James Kanze <james-albert.kanze@vx.cit.alcatel.fr> wrote:
>: If INT_MIN (or LONG_MIN) == -2147483647, ...
>
>Small comment:
>
>INT_MIN is an obsolete way to do that. The modern way is:
>#include <limits>
>numeric_limits<int>::min();

Well, since this is comp._std_.c++, I should point out that that ought
to be

 #include <limits>
 std::numeric_limits<int>::min();
 ^^^^^

Of course, this will work only if your compiler supports namespaces.
`INT_MIN' is going to be more portable for quite some time.
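
A small sketch of the two spellings side by side, assuming an
implementation that provides both <climits> and a namespace-aware
<limits>. The function names are illustrative only.

```cpp
#include <cassert>
#include <climits>
#include <limits>

// The <limits> form needs namespace (and template) support.
int min_from_limits() { return std::numeric_limits<int>::min(); }

// The <climits> macro needs neither.
int min_from_macro() { return INT_MIN; }

// Both spellings name the same value.
void check_min_spellings()
{
    assert(min_from_limits() == min_from_macro());
}
```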

--
Fergus Henderson <fjh@cs.mu.oz.au>   |  "I have always known that the pursuit
WWW: <http://www.cs.mu.oz.au/~fjh>   |  of excellence is a lethal habit"
PGP: finger fjh@128.250.37.3         |     -- the last words of T. S. Garp.
---

Author: willhall@idt.net (Will Hall)
Date: 1997/03/21

In article <01bc3602$4cc5d7c0$1371adce@azguard>, "Bradd W. Szonye"
<bradds@concentric.net> wrote:

> Most implementations I have seen actually define INT_MIN as (-X-1) where X
> depends on the size of the type. Note that a vendor who called -2147483648
> a signed integer (given the ongoing assumptions in this thread--32bit 2'sc.
> int and long) would in fact be implementing it poorly. 2147483648 is an
> unsigned long integer according to 2.13.1,
> so -2147483648 is a signed long integer with implementation-defined value.
     ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

This is false. If you review the language of the draft standard more
carefully, I believe you will find it to be the case that (as has been
spelled out several times so far in this thread):

(1) the literal 2147483648 (synonymous herein with 2^31), too large to be
an int, is taken (2.13.1[2]) as an unsigned long int.

(2) since the literal quantity achieved in (1) is unsigned, unary `-'
(5.3[7]) subtracts it from 2^n (where n == the # of bits in the type of the
operand [unsigned long int] == 32), yielding 2^32 - 2^31 = 2^31 with
type unsigned long int (still from 5.3[7]). (Thus the net effect of the
unary negation is nil.)

In summary, -2147483648 is an unsigned long integer whose value is
mandated by the standard for the subset of implementations under
discussion.
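
Steps (1) and (2) can be sketched as follows. std::uint32_t stands in for
a 32-bit unsigned long; that substitution is an assumption made so the
sketch behaves the same where unsigned long is 64 bits. The function
names are hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// Unsigned negation is arithmetic modulo 2^32: the result is 2^32 - x
// (or 0 when x is 0).
std::uint32_t negate_unsigned(std::uint32_t x)
{
    return static_cast<std::uint32_t>(0u - x);
}

// Widening the still-unsigned result to a larger signed type keeps it
// positive, which is the failure reported elsewhere in the thread for
// `long long l = -2147483648;`.
long long widen(std::uint32_t x)
{
    return static_cast<long long>(x);
}

void check_negation()
{
    // 2^32 - 2^31 == 2^31: negation leaves the value unchanged.
    assert(negate_unsigned(2147483648u) == 2147483648u);
    assert(widen(negate_unsigned(2147483648u)) > 0);
}
```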

> Fortunately, the problem only affects source-code generators and (arguably
> poor) programmers who put such magic numbers in code. The former can check
> for the special case (abs(x) > INT_MAX), and the latter, er, sort of
> deserve what they get.

This is to sidestep arguments I, for one, have made for the desirability
of direct specifiability of all representable numbers, and to implicitly
devalue them without addressing them or responding directly. In fact, the
"deserve what they get" assertion is a classic example of unsound logic in
action and has little or nothing to do with the standard and the question
of whether it should allow for a more direct specification of all
representable values of type int than it does at present.

BTW, would anyone care to provide me with a decent definition of "magic
number"? I can't find the term in the draft standard, but it seems to be
coming up with increasing frequency in this thread. FWIW, here is my guess
at its definition based on the context of the aforementioned usage
examples:

DEFINITION: By a "magic number" we shall mean (for a given implementation
of the C++ standard) any representable value of an integer or floating
type which, however, due to oversight, puzzlement or deep wisdom on the
part of standard-writers to date, is not directly specifiable by the
language. (By directly, we mean using one or fewer unary operators.)
Further, any users of the language who should so much as _wish_ to specify
such numbers directly, let alone _act_ on that wish, shall be deemed to
deserve what they get.  ;-)

-Will Hall
---

Author: willhall@idt.net (Will Hall)
Date: 1997/03/21

In article <3331AE0C.41C6@wizard.net>, James Kuyper <kuyper@wizard.net> wrote:

> Will Hall wrote:
> >
> > In article <rf5k9n4fdez.fsf@vx.cit.alcatel.fr>, James Kanze
> > <james-albert.kanze@vx.cit.alcatel.fr> wrote:
> ...
> [re: -2^31 ]
> > > ... in real life, the number is portable to all machines on which
> > > it is representable.
> >
> > If you are referring to the number -2^31, you are mistaken.
> ...
> Can you please provide a specific counter-example? There is a loophole
> in the standard, and Mr. Kanze does not seem to disagree with you about
> that.

Unless I misunderstood Mr. Kanze (not at all outside the realm of
possibility), he would not agree that there is a loophole in the standard.
In fact, here's a quote from one of his recent posts:

>: The "loophole" which
>: triggers "implementation-defined" is the presence of the value, not the
>: way it is written.

> He is merely claiming that no one has yet produced an
> implementation that slips through this particular loophole.

Though it would be trivial to create such a conforming implementation, I
see the (rather small) distinction you are making and concede that my "you
are mistaken" reply might, rigorously, have been too hasty without our
(Mr. Kanze and I) having first agreed on a definition of "in real life".
However, I don't have the energy at present to explore the fascinating
minutiae that no doubt lie along the path down which that would lead us.
Instead, I'll simply agree with your next statement, prefaced by the two
words "Rigorously speaking":

> I don't think Mr. Kanze is mistaken, I just don't care about the truth
> of his statement. If -2^31 can be represented in a given implementation,
> the standard itself should guarantee that I can express it as
> -2147483648; I shouldn't have to rely upon the historical accident that
> no one has implemented this poorly yet.

In addition, I would like to put forward an informal suggestion which,
though possibly not the best solution, might at least make the standard
more honest about what the language can handle (without, e.g., resorting
to indirect specification of representable integer values) without
requiring additional work for the lexer-writers of the world:

Require that, for signed integer types, for any non-negative value x such
that -x is representable, the value x must also be representable. In
particular, this could be interpreted for two's complement implementations
as meaning that int and long will range from -2^(n-1) + 1 to 2^(n-1) - 1
(where n is 32, 64, etc. depending in practice upon the hardware
architecture).

If, as a non-standard extension, an implementation so elected, it would be
free to allow signed integer types to assume the value -2^(n-1) (which it
would be free to deal with as it saw fit -- or perhaps even in a somewhat
restricted way indicated by the standard).
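
To make "indirect specification" concrete, here is a minimal sketch for a
32-bit type, assuming a C++11 or later compiler (for constexpr and
<cstdint>). The constant and function names are hypothetical.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>

// The most negative 32-bit value, written without the literal 2147483648:
// both operands fit in a 32-bit signed type, and so does the result.
constexpr std::int32_t kMin32 = -2147483647 - 1;

// Under the suggestion above, this would be the lowest value the
// standard requires; it can be written directly.
constexpr std::int32_t kSymmetricMin32 = -2147483647;

void check_indirect_minimum()
{
    assert(kMin32 == std::numeric_limits<std::int32_t>::min());
    assert(kSymmetricMin32 == -std::numeric_limits<std::int32_t>::max());
}
```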

Will Hall
---

Author: hpa@transmeta.com (H. Peter Anvin)
Date: 1997/03/23

Followup to:  <199703211647.LAA22507@u1.farm.idt.net>
By author:    willhall@idt.net (Will Hall)
In newsgroup: comp.std.c++
>
> In addition, I would like to put forward an informal suggestion which,
> though possibly not the best solution, might at least make the standard
> more honest about what the language can handle (without, e.g., resorting
> to indirect specification of representable integer values) without
> requiring additional work for the lexer-writers of the world:
>
> Require that, for signed integer types, for any non-negative value x such
> that -x is representable, the value x must also be representable. In
> particular, this could be interpreted for two's complement implementations
> as meaning that int and long will range from -2^(n-1) + 1 to 2^(n-1) - 1
> (where n is 32, 64, etc. depending in practice upon the hardware
> architecture).
>

This is no better than what we have now.  I would like to propose the
following alternative:

Require that, for any signed integral type, if -x is representable, a
literal constant of value x must be properly handled by the compiler
*if and only if* it is preceded by a unary minus sign, so that the
value of the resulting constant expression is -x.  This works
correctly automatically on two's-complement, one's-complement,
sign-magnitude, BCD and balanced ternary machines (in the latter four
cases since -INT_MIN == INT_MAX, in the former case due to the nature
of the rollover[1].)  For machines with truly *weird* handling of
negative numbers[2] it may require the compiler to keep additional
precision during the translation stages, but it would affect such a
vanishingly small number of machines -- and those that it would affect
would probably have really bizarre compilers anyway -- that I don't
think that is an onerous requirement.

 -hpa

[1] As previously discussed in this thread: -(-INT_MIN) == -INT_MIN ==
    INT_MIN on a 2's complement machine.

[2] If someone can find a real-life example where the compiler would
    have to do something special if this was implemented, I would
    really like to hear about it, just for kicks...
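
A rough sketch of how a program could tell the two families of
representation apart, assuming a C++11 or later compiler. The names are
hypothetical. Note that actually negating INT_MIN on a two's-complement
machine overflows, which is undefined behavior in the language even
where the hardware simply rolls over, so the sketch guards against it.

```cpp
#include <cassert>
#include <climits>

// INT_MIN + INT_MAX cannot overflow, so it is a safe way to classify
// the range:
//    0 -> symmetric range (e.g. one's complement, sign-magnitude, or an
//         implementation that defines INT_MIN as -INT_MAX)
//   -1 -> one extra negative value (two's complement)
constexpr bool kSymmetricRange = (INT_MIN + INT_MAX == 0);
constexpr bool kExtraNegative  = (INT_MIN + INT_MAX == -1);

// Negating any value other than (possibly) INT_MIN is well defined.
bool negation_is_safe(int x)
{
    return kSymmetricRange || x != INT_MIN;
}

int checked_negate(int x)
{
    assert(negation_is_safe(x));
    return -x;
}
```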
--
Always looking for a few good BOsFH.  **  Linux - the OS of global cooperation
        I am Bahá'í -- ask me about it or see http://www.bahai.org/
---