Thread

Topic: Proposal for Addition to STL

Author: thp@cs.ucr.edu (Tom Payne)
Date: 1995/06/11 Raw View

Paul J. Ste. Marie (pstemari@erinet.com) wrote:
: In article <3qv228$j51@galaxy.ucr.edu>, thp@cs.ucr.edu (Tom Payne)
: wrote:
: :[snip]
: :Specifically, signal handlers must be able to:
: :
: :  * read and write global variables (with the usual understandings
: :    that there you may get a stale value due to an outstanding
: :    copy in a register).

: Hmm...wouldn't this be better qualified as "read and write
: _volatile_ global variables"

:  --Paul J. Ste. Marie, pstemari@well.sf.ca.us, pstemari@erinet.com

: The Financial Crimes Enforcement Network claims that they capture every
: public posting that has their name ("FinCEN") in it.  I wish them good hunting.


Yes!!  That would certainly suffice and would be much more reasonable
(i.e., implementable and optimizable).  Unfortunately, per the current
C Standard, reading any global leads to undefined behavior.  The view
is the handlers shoud only set globals, which are then polled by the
underlying program.  Ugh!

Tom Payne

Author: pstemari@erinet.com (Paul J. Ste. Marie)
Date: 1995/06/10 Raw View

In article <3qv228$j51@galaxy.ucr.edu>, thp@cs.ucr.edu (Tom Payne)
wrote:
:[snip]
:Specifically, signal handlers must be able to:
:
:  * read and write global variables (with the usual understandings
:    that there you may get a stale value due to an outstanding
:    copy in a register).

Hmm...wouldn't this be better qualified as "read and write
_volatile_ global variables"

 --Paul J. Ste. Marie, pstemari@well.sf.ca.us, pstemari@erinet.com

The Financial Crimes Enforcement Network claims that they capture every
public posting that has their name ("FinCEN") in it.  I wish them good hunting.

Author: thp@cs.ucr.edu (Tom Payne)
Date: 1995/06/05 Raw View

: >Pre-emptive multithreading seems to present a more difficult problem.

:  No. Not really. The main problem is stupid library
: functions that are not re-entrant.

:  However, threads and tasking are outside the scope
: of the C++ Standard at present.

:  Coroutines are not.


Coroutines + signals + 1500 lines of C++ code == threads + monitors

The problem with the current C Standard for signal handlers is that
their behavior is undefined if they even attempt to read a global
variable.  This polling-only model for asynchrony is marginal even for
standard event-driven programming.

Specifically, signal handlers must be able to:

  * read and write global variables (with the usual understandings
    that there you may get a stale value due to an outstanding copy
    in a register).

  * call a coroutine (with the usual understanding that signals need
    to be blocked at critical times).

It would be convenient, but not necessary, for signals to be able to
throw exceptions, which appears to be compatible with the
implementation strategy for exceptions mentioned on page 397 of D&E.

The necessary norportables functions are the ability to create,
destroy, and call coroutines and to install a given function as a
signal handler.  It looks to me as through we can portably implement
signal blocking with global flags, and from signal blocking we can
get locks in the monoprocessor case.  Multiprocessor locking can
be done portably via the bakery algorithm, but some nonportable
assistance based on the underlying instruction set would improve
efficiency.

What I have in mind is a very simple hardware analog that views a
signal as an involuntary function call on behalf of the currently
running function invocation and gets serviced on the current stack.
(There are various ways to circumvent the old conundrum: "Then where
do you handle stack-overflow signals?")

Of course, for any implementation of threads & monitors to be of much
use, libraries need to be thread safe.  Overloading the global news
and deletes so that they use safe versions of malloc and free is not
difficult, but every class can have its local news and deletes, which
can lead to chaos.


Tom Payne

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/05/28 Raw View

In article <3piugd$1od@galaxy.ucr.edu>, Tom Payne <thp@cs.ucr.edu> wrote:
>
>:  But it _isn't_ clear any "easy to port" implementation
>: in C++ is possible because of EH. C perhaps, since it has no EH.
>: There's no telling what an implementor might do to keep track of
>: what exception to throw and where to find handlers.
>
>No doubt there are ways to implement exceptions that make easy things
>difficult, but an implementation of exceptions can't survive a full
>saving and subsequent restoration of the working registers (including
>stack pointer and base pointer) is likely to have trouble with signals
>and longjmps as well.

 Nope. If you check the CD, setjmp/longjmp doesn't
have to work except in HIGHLY restricted contexts.
In particular, if there is ANY automatic variable on the stack
between the setjmp and longjmp, the results are undefined.

 This is a change from the C Standard.

>For full portability, how about augmenting the standard with two new
>functions the setjmp library?
>
>  * makejmp( jmp_buf, void (*)() )
>    would allocate a new stack (wherever appropriate) and initialize
>    jmp_buf with register settings for that stack and for the
>    parameterless void function as the base function of the resulting
>    thread (coroutine).
>
>  * killjmp( jmp_buf )
>    would deallocate the stack associated with the jmp_buf (presumably
>    not the program's initial stack).
>

 Yes. I agree.

>The obvious question is, "Why mess with setjmp and longjmp rather than
>adding an entirely new mechanism?"  IMHO:
>
>  *  Their semantics is almost exatly that of coroutines, and a full
>     implementation of threads can be built on them (plus locks).

 Yes.

>Pre-emptive multithreading seems to present a more difficult problem.

 No. Not really. The main problem is stupid library
functions that are not re-entrant.

 However, threads and tasking are outside the scope
of the C++ Standard at present.

 Coroutines are not.

--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: thp@cs.ucr.edu (Tom Payne)
Date: 1995/05/19 Raw View

John Max Skaller (maxtal@Physics.usyd.edu.au) wrote:
: In article <3oufem$apf@galaxy.ucr.edu>, Tom Payne <thp@cs.ucr.edu> wrote:
: >John Max Skaller (maxtal@Physics.usyd.edu.au) wrote:
: >: In article <3olo3n$54@galaxy.ucr.edu>, Tom Payne <thp@cs.ucr.edu> wrote:
: >: >Matthew Austern (matt@dogbert.lbl.gov) wrote:
: >: >
: >: >It's not terribly difficult to implement threads, monitors, and
: >: >conditions (along the lines of Hoare's Simula-inspired CACM paper,
: >: >v17, #10) as a simple C++ library.
: >
: >:  A _portable_ implementation?
: >: Which works with Exception Handling?
: >
: >Not totally _portable_ but _easy_to_port_.
: >

:  The question is whether such an implementation would
: work correctly with exceptions. In fact I've just ported
: John English's excellent Borland 3.1 coroutine package to
: Metaware High C/C++ -- and I can report that with some caveats about throwing
: exceptions out of a coroutine -- or destroying the coroutine
: with an exception by unwinding it -- it seems to work.

:  But it _isn't_ clear any "easy to port" implementation
: in C++ is possible because of EH. C perhaps, since it has no EH.
: There's no telling what an implementor might do to keep track of
: what exception to throw and where to find handlers.

No doubt there are ways to implement exceptions that make easy things
difficult, but an implementation of exceptions can't survive a full
saving and subsequent restoration of the working registers (including
stack pointer and base pointer) is likely to have trouble with signals
and longjmps as well.

For full portability, how about augmenting the standard with two new
functions the setjmp library?

  * makejmp( jmp_buf, void (*)() )
    would allocate a new stack (wherever appropriate) and initialize
    jmp_buf with register settings for that stack and for the
    parameterless void function as the base function of the resulting
    thread (coroutine).

  * killjmp( jmp_buf )
    would deallocate the stack associated with the jmp_buf (presumably
    not the program's initial stack).

Wording would have to be added to the standard specifying that idiom

    if( ! setjmp(thread1.context) ) longjmp(thread2.context);

indeed transfers the stream of execution (CPU) from one thread (jmp_buf)
to the other in such a way that exceptions and signals work correctly
as long as certain protocols are observed, e.g.:
  *  no thread may throw an exception back past its creation (makejmp).
  *  no signal handler or any function it calls (even indirectly) may
     throw an exception past its invocation.
And so on.

The obvious question is, "Why mess with setjmp and longjmp rather than
adding an entirely new mechanism?"  IMHO:

  *  Their semantics is almost exatly that of coroutines, and a full
     implementation of threads can be built on them (plus locks).

  *  They already seem to work tolerably well for multithreading in
     many implementations.  (My experiece is restricted to g++ under
     SunOS, Solaris, and Linux, but I've heard rumors of success with
     other systems).

  *  Their original rationale has been pre-empted by exception handling,
     so they currently have no real mission.

  *  Recycling them for this new purpose, via tightened specification,
     would be in keeping with C's economy of concepts (stinginess with
     keywords).

Pre-emptive multithreading seems to present a more difficult problem.
Pre-emption must be initiated by invocations of a signal handler,
e.g., the timer signal for timeslicing.  If the handler directly
transfers the CPU on behalf the interrupted thread, that thread sleeps
on an uncompleted handler invocation, whereas some implementers and
designers of signal protocols have in mind that the last invocation of
a handler will complete before the next one is honored.

The alternatives get quite messy.  The handler must modify the
environment in such a way that after it (the handler) has returned the
interrupted thread will spontaneously transfer execution to another
thread and, upon reawakening from that transfer, will return to the
point of interruption.  For instance one might imagine building a
couple of activation records on the stack on which the handler itself
is executing -- not an easy trick.

Tom Payne

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/05/17 Raw View

In article <3oufem$apf@galaxy.ucr.edu>, Tom Payne <thp@cs.ucr.edu> wrote:
>John Max Skaller (maxtal@Physics.usyd.edu.au) wrote:
>: In article <3olo3n$54@galaxy.ucr.edu>, Tom Payne <thp@cs.ucr.edu> wrote:
>: >Matthew Austern (matt@dogbert.lbl.gov) wrote:
>: >
>: >It's not terribly difficult to implement threads, monitors, and
>: >conditions (along the lines of Hoare's Simula-inspired CACM paper,
>: >v17, #10) as a simple C++ library.
>
>:  A _portable_ implementation?
>: Which works with Exception Handling?
>
>Not totally _portable_ but _easy_to_port_.
>

 The question is whether such an implementation would
work correctly with exceptions. In fact I've just ported
John English's excellent Borland 3.1 coroutine package to
Metaware High C/C++ -- and I can report that with some caveats about throwing
exceptions out of a coroutine -- or destroying the coroutine
with an exception by unwinding it -- it seems to work.

 But it _isn't_ clear any "easy to port" implementation
in C++ is possible because of EH. C perhaps, since it has no EH.
There's no telling what an implementor might do to keep track of
what exception to throw and where to find handlers.

--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/05/19 Raw View

In article <D8qLMK.IwB@ucc.su.OZ.AU>,
John Max Skaller <maxtal@Physics.usyd.edu.au> wrote:
>In article <3oufem$apf@galaxy.ucr.edu>, Tom Payne <thp@cs.ucr.edu> wrote:
>>John Max Skaller (maxtal@Physics.usyd.edu.au) wrote:
>>: In article <3olo3n$54@galaxy.ucr.edu>, Tom Payne <thp@cs.ucr.edu> wrote:
>>: >Matthew Austern (matt@dogbert.lbl.gov) wrote:
>>: >
>>: >It's not terribly difficult to implement threads, monitors, and
>>: >conditions (along the lines of Hoare's Simula-inspired CACM paper,
>>: >v17, #10) as a simple C++ library.
>>
>>:  A _portable_ implementation?
>>: Which works with Exception Handling?
>>
>>Not totally _portable_ but _easy_to_port_.
>>
>
> The question is whether such an implementation would
>work correctly with exceptions. In fact I've just ported
>John English's excellent Borland 3.1 coroutine package to
>Metaware High C/C++ -- and I can report that with some caveats about throwing
>exceptions out of a coroutine -- or destroying the coroutine
>with an exception by unwinding it -- it seems to work.
>
> But it _isn't_ clear any "easy to port" implementation
>in C++ is possible because of EH. C perhaps, since it has no EH.
>There's no telling what an implementor might do to keep track of
>what exception to throw and where to find handlers.

 In fact a bit more testing reveals the EH mechanism
stuffs up the coroutines. Leaving C++ without an important
control structure that C permits (although it doesn't provide).
--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: thp@cs.ucr.edu (Tom Payne)
Date: 1995/05/12 Raw View

John Max Skaller (maxtal@Physics.usyd.edu.au) wrote:
: In article <3olo3n$54@galaxy.ucr.edu>, Tom Payne <thp@cs.ucr.edu> wrote:
: >Matthew Austern (matt@dogbert.lbl.gov) wrote:
: >
: >It's not terribly difficult to implement threads, monitors, and
: >conditions (along the lines of Hoare's Simula-inspired CACM paper,
: >v17, #10) as a simple C++ library.

:  A _portable_ implementation?
: Which works with Exception Handling?



Not totally _portable_ but _easy_to_port_.

Creation, suspension, and resumption of threads may require a bit of
assembly language to initialize, save, and restore contexts.  Often,
however, setjmp and longjmp can be used to transfer execution:

  if( ! setjmp( suspendingContext ) longjmp( resumingContext, 1 );

A few lines of non-portable C++ are still required to initialize the
SP and possibly other fields of a thread's context (jmp_buf).  Also,
if the architecture doesn't support heap-allocated stacks, the OS must
be invoked for the allocation of a thread's stack segment.

If a system already has a threads implementation, e.g., in the OS or a
C-based thread libraries (like Pthreads or Cthreads), an
object-oriented C++ threads library can use those facilities to
create, suspend, and resume threads.  Semaphores initialized to one
can be used as locks, and semaphores initialized to zero can be used
to implement conditions (per Hoare's paper).


I have yet to try threads in the presence of exception handling, but I
take seriously Stroustrup's comment (D&E p.385) that one of the ideals
for C++ exception handling was that it be "A mechanism that by
*default* will work correctly in a multi-threaded program."  [emphasis
added]

The most obvious problem, releasing the appropriate locks when an
exception occurs, can be handled by relegating all lock handling to
local objects (sentries) whose constructor acquires a specified lock
and blocks certain interrupts and signals, and whose destructor
releases the lock and restores signal and interrupt status.  There are
other obvious considerations (e.g., the base function of a thread must
be capable of handling all exceptions that the thread might throw),
and I'd appreciate information about yet others.


Tom Payne

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/05/11 Raw View

In article <MATT.95May8103436@dogbert.lbl.gov>,
Matthew Austern <matt@physics.berkeley.edu> wrote:
>In article <D873A8.7tp@ucc.su.OZ.AU> maxtal@Physics.usyd.edu.au (John Max Skaller) writes:
>
>>  This IS provided by coroutines/threads and channels.
>> Unfortunately, yet again, C++ fails to provide language
>> support for a completely fundamental idiom.
>
>That's an interesting point.  The obvious followup question, then:
>what do you think that language support for threads would look like?

 First you do coroutines. They are much simpler because
control exchange is synchronous.

 In a dumb implementation you need to create a coroutine
with a given stack size, you need a "function" to  begin
executing and a way of exchanging control. You also need to
kill the coroutine (if it doesn't suicide).

 A suitable interface is a few lines of code. It is very
hard to implement in C++ because of exception handling.
In C it is fairly easy -- have a look at John English package
for Borland 3.1 (DOS) called CCL110JE. Or Dag Bruck's
proposed (and then withdrawn) class :-(

 Threads are the same except control can _also_ transfer
asynchronously and so you need to have locking.

 Given that monitors can be implemented, a threadwise
monitor may provide blocking or unblocked transmissions.
There is a C variable called "Alef" which does this in the
core language -- very nice.

>The most obvious answer is just to provide library functions to create
>and kill threads; if you're using a multithreaded operating system
>(OS/2, NT, Solaris, and so on), then your compiler vendor has probably
>already provided a library function giving you access to that
>functionality.  But I imagine that real support for concurrency
>involves something more than that.

 Multi-processing in general is a larger field. Outside the
agreed on scope of the first C++ Standard.

 Coroutines are a fundamental control structure for
synchronous symmetric exchange of a _single_ thread of control.

 That is -- within the scope of the C++ Standard.
--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/05/11 Raw View

In article <3olo3n$54@galaxy.ucr.edu>, Tom Payne <thp@cs.ucr.edu> wrote:
>Matthew Austern (matt@dogbert.lbl.gov) wrote:
>
>It's not terribly difficult to implement threads, monitors, and
>conditions (along the lines of Hoare's Simula-inspired CACM paper,
>v17, #10) as a simple C++ library.

 A _portable_ implementation?
Which works with Exception Handling?

--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: fjh@munta.cs.mu.OZ.AU (Fergus Henderson)
Date: 1995/05/10 Raw View

maxtal@Physics.usyd.edu.au (John Max Skaller) writes:

>Andrew Fitzgibbon <andrewfg@ed.ac.uk> wrote:
>>
>>Derived classes plus RTTI gives you
>>discriminated unions *but for* the size difference, which should only be an
>>issue if you're taking over storage management.
>
> No. Using RTTI gives you MUCH MORE than
>a discriminated union, which is exactly the problem.

Using RTTI to simulate discriminated unions is not as good
as using inheritance from an abstract base class.

>A proper discriminated unions offers RIGID guarrantees:
>
> a) a FINITE set of KNOWN and SPECIFIED types
> b) a compiler error if you access the wrong type
>    (provided you do not change the type inside
>    a case of another type)
> c) a compiler error is you _miss_ a type case
> d) the ability to compose ANY types
>
>With RTTI (ignoring the storage mamagement issues)
>
> a) there is no bound on the possible types
> b) you get a run time error on selecting the wrong type
> c) compile time diagnostics for incorrect use after selection
> d) no verification of completeness of case handling
> e) invasion: you CANNOT compose ANY type, only types
>    with a common base

With abstract base classes as suggested by James Kanze (and perhaps
downcasting with dynamic_cast, but not typeid()), you get

 (a) there can be a bound on the possible types, if you
     want (by making the abstract base class constructor
     private and listing all the possible types as friends
     of the abstract base class), but there doesn't have to be
 (b) you get a compile error on selecting the wrong type
 (c) you get either a compile error if you miss a type case
     (if you use virtual functions for the dispatch) or
     incorrect results at runtime (if you do partial matching
     via downcasting).

but it has the minor disadvantage

 e) invasion: you cannot compose any type, only types
    derived from the abstract base class (so you have to
    do a little extra work to create the derived classes, and
    conversions between a derived class and its data member
    are by default explicit, not implicit).

and the major disadvantage

 f) adding a new operation forces you to recompile everything

--
Fergus Henderson                       | I'll forgive even GNU emacs as
fjh@cs.mu.oz.au                        | long as gcc is available ;-)
http://www.cs.mu.oz.au/~fjh            |             - Linus Torvalds

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/05/07 Raw View

In article <9512603.23750@mulga.cs.mu.OZ.AU>,
Fergus Henderson <fjh@munta.cs.mu.OZ.AU> wrote:
>
>Inheritence from an abstract base class does give you the same sort
>of effect as discriminated unions, but at the expense of having to
>structure your code inside-out (with potentially bad effects on
>recompilation time).

 To me that is a far lesser concern than the fact that
one must turn ones thinking inside out to understand such
perverted design.

 This is sort of like callbacks and event driven
programming -- you end up using objects to partition the
state space but lose the ability to use the natural stacklike
structure which structured programming emphasises. YOu end up
programming state machines manually in the objects. This
is a major step BACKWARDS in software engineering -- it is
equivalent to "flat" global programming like in C or COBOL
or FORTRAN -- but in smaller state space.

 There is a place for callbacks -- but there needs to
be a method of interfacing this control structure with the
_inverted_ structure obtained by polling.

 This IS provided by coroutines/threads and channels.
Unfortunately, yet again, C++ fails to provide language
support for a completely fundamental idiom.

 I call this particular variant of the problem
"control inversion". It is categorically similar to the
union problem above -- there is a time to place
methods inside the data, and a time to place the methods
outside the data.

 It is the mixing and interfacing of these
"inside/outside" problems which constitutes software
engineering and it is the failure of C++ to support
both "inside" and "outside" programming properly --
and the ability to interface them -- which is my
strongest criticism of C++ as a language.

--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/05/04 Raw View

In article <KANZE.95May3122008@slsvhdt.lts.sel.alcatel.de>,
James Kanze US/ESC 60/3/141 #40763 <kanze@lts.sel.alcatel.de> wrote:
>In article <D7zAzJ.EMo@ucc.su.OZ.AU> maxtal@Physics.usyd.edu.au (John
>Max Skaller) writes:
>
>    [concerning: must `vector' be a template...]
>|>  Yes. Which is why I'm right and James is wrong.
>|> There's no requirement that any Standard Library entity I'm
>|> aware of actually be what it is presented as being. The Standard
>|> Library merely leverages off the core language to present a
>|> specification and then permits any mechanism of implementation
>|> at all which conforms to the required behaviour.
>
>|>  For example "memcpy" does not have to be a function.
>|> Some compilers generate code directly. The same applies to
>|> say a vector<T> where T is memcpy-able and it applies
>|> very much in particular <g> to vectors of bool where
>|> major advantages may be gained by directly generating
>|> code using machine instructions which manipulate bits.
>
>This may be just a disagreement concerning terms.

 Yes, but being pedantic is sometimes important.
See below.

>The standard does *not* say how the function must be implemented.  A
>compiler is free to use "magic" to recognise the special case, and
>inline the function if it wants.
>
>Similarly, the C++ standard defines `vector' as a template.

 Yes. But there the similarity ends. Templates
have to have definitions because they are _extremely_ sensitive
to context. It may make a BIG difference what the actual body
of a template definition is. Because of implicit name binding
and instantiation issues, etc.

 Let me explain in more detail. The C++ CD mentions
a thing called the ODR -- One Definition Rule. But the ODR
is not properly specified in the CD.

 I have written a paper proposing a specific rule
for the ODR; I believe it has some support. The wording
of the rule requires certain definitions to be equivalent,
and states exactly what "equivalent" means.

 Equivalence is defined by starting off with a rule

 TE1) The sequence of tokens of the definition or
 expression shall be the same.

So for example the following code BREAKS the ODR:

 // file 1
 inline extern void f(){}

 // file 2
 extern inline void f(){}

because the definitions are NOT equivalent because the tokens "extern"
"inline" are swapped around.

There are more rules, but the point is that one CANNOT SPEAK
ABOUT THE EQUIVALENCE OF GENERATED DEFINITIONS.

Do you see? A definition is a kind of declaration which
IS A SEQUENCE OF TOKENS. Written by the programmer.

The ODR in the form I wrote it CANNOT be applied to template
instances (it CAN be applied to the templates themselves).

It also CANNOT be applied to the Standard Library because
the Standard Library entities DO NOT NECESSARILY HAVE DEFINITIONS.

So when the ODR says "A function shall have exactly one
definition ... a function may have more than one definition"

... it is refering to FUNCTIONS. Which are exclusively things
coded by users. No entity of the Standard Library is
necessarily a function because no entity of the Standard
Library can be required to even HAVE a definition.
Because a definition is a specific sequence of tokens
written by the programmer.

So if the words above are accepted, only users can code functions.
Otherwise the words need to say "user function" (non-generated
non-Standard Library function).

The ODR does not apply to generated definitions and it does
not apply to the Standard Library.

Note that there is no problem except with templates.

A separate TODR (Template ODR) is needed to handle templates.
It will have somehow to work for the "templates" of the
Standard Library, an extra complication.

--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: kanze@lts.sel.alcatel.de (James Kanze US/ESC 60/3/141 #40763)
Date: 1995/05/04 Raw View

In article <D7y986.K4@ucc.su.OZ.AU> maxtal@Physics.usyd.edu.au (John
Max Skaller) writes:

|> In article <KANZE.95Apr27145146@slsvhdt.lts.sel.alcatel.de>,
|> James Kanze US/ESC 60/3/141 #40763 <kanze@lts.sel.alcatel.de> wrote:
|> >
|> >|>  Stl containers in the Standard Library are not templates
|> >|> because they do not "quack" like templates. Real templates
|> >|> "quack" because they have definitions consisting of examinable
|> >|> source code.
|> >
|> >Really.  I have the Rogue Wave components, complete with templates,
|> >delivered with my C++ compiler.  But there is no examinable source
|> >code.
|> >
|> >What makes you state that examinable source code is necessary?

|>  Simple. The WP says so.

Where?  A quick glance at chapter 14 showed no such text.  In fact, if
it did, we would have a problem: the notion of `examinable source
code' is nowhere defined.  (Examinable by whom, for example?  By me,
or only by the compiler?  Consider the Sun distribution of the Rogue
Wave components.  The source code is encrypted.  The compiler can read
it correctly.  I can even examine it.  But what I see when I examine
it will look like a binary file, and not C++ sources.  Does this count
as `examinable source code'?)

|> Specifically, a template is
|> a sequence of tokens contained in a translation unit parsed
|> as a template according to the rules of the grammar and
|> associated constraints and semantic rules.

Strictly speaking, the language doesn't have templates, it has
template declarations and template definitions.  Even if your
interpretation of the above is correct, it simply means that some part
of the implementation, at some time, must have seen the corresponding
sequence of tokens.  It doesn't mean that they are in anyway made
available to me.  It doesn't even mean that they are available on my
machine in source form.  (They could be preloaded directly in internal
formaat into the compiler, for example.  The implementation would have
seen them when the compiler itself was being compiled.)

This (with the exception of the example in parenthese) is valid for
*all* templates, even user defined ones.  The Rogue Wave classes
provided with the Sun compiler are basically an externally written
class library; any class library could be provided in this format
(except that Sun doesn't make the encryption tools available).

Suppose that Sun simply did this automatically.  When you compiled a
template definition, it simply did the required syntax checks, and put
an encrypted copy into the object file.  Are you claiming that this is
illegal, according to the standard.

In the case of the standard library, of course, there is (probably)
even more freedom.  I would contend that the standard doesn't
(shouldn't) even require the library to have ever existed as clear
text.  The implementation should be free to use any desired
meta-magic; the standard defines what *you* can do with the library,
and not how the compiler goes about implementing this.

|>  The behaviour of this entity called a template
|> is (or should be) defined in the CD.

Agreed.  And *only* the behavior.

When I write:

 #include <vector>

 vector< double >    vd ;

in my program, this has a defined meaning.  How the compiler goes
about implementing this meaning is none of the standards business.

|>  Any component not in this form -- or which was not
|> at some stage in this form -- is catgorically NOT a
|> function, template, or any other such thing described
|> in the CD. [Unless it is a library function,
|> which is not a function]

Why are library functions not functions?  Why must library templates
be templates?  IMHO, library functions are functions.  But there is no
requirement that either a function or a template must have, at some
stage, been in the form of ASCII (or any other recognized code set).

I'm not even 100% convinced that the use of meta-magic is forbidden
for functions/classes *not* defined in the standard.  The compiler
need simply provide some way of turning them off.  (Example: most
compilers search for include files in an ordered list of places, with
the place where the "system" include files reside at the end of the
list.  Suppose that when looking for the include file 'hashtbl', if my
compiler finds it in the place where the system include's normally
reside, *and* the file simply contains some meta-magical string, which
is not legal C++, I simply `unhide' a preloaded definition.  This
should be legal.  For the system include files, at least in C, it is
explicitly legal.)

|>  The behviour of a C++ function is determined by
|> its definition which is a sequence of tokens.

Correction: C++ provides one portable way of defining (the behavior
of) a function: inputting a sequence of tokens into the implementation
(compiler or interpreter).  All implementations have been extended in
some way or another to provide other possibilities, at the very least,
to link with programs written in assembler for example.

On my system, I have a header file called unistd.h, which declares a
function called read.  Are you saying that read is not a function,
because it was not written as a sequence of C/C++ tokens?  (In fact,
there is *no* way to write a function in pure C/C++ with the
functionality of read.  It requires an extension, either in the form
of linking with assembler code, inline assembler code, or special
compiler extensions like _input.)

Read is a very real function.  In C or C++.  It was not defined as a
sequence of (ISO standard) C/C++ tokens.

|>  A Standard Library entity that quacks like a function
|> is NOT a function.

|>  Vendors have the freedom to supply things that
|> quack like functions behaviourly but are not in fact functions,
|> in the Standard Library.

|>  Same applies to templates.

OK, so we differ concerning a definition.  But I would argue that not
only are vendors free to supply such things, they also have the
freedom to make extensions which allow the users to supply them.  All
vendors that I know of *do* have such extensions.

|> >|>  Standard library STL containers are not templates
|> >|> because they do not have to have a definition, so there
|> >|> is no way to do the deduction.
|> >
|> >I beg to disagree.  According to the description I read, they *are*
|> >templates.  The working papers define them clearly as templates, e.g.:
|> >section 23.2.8 Template class vector.

|>  No, it doesn't. It _describes_ them as if they
|> were templates for convinence, and if it says anything
|> else it is wrong and must be fixed.

And I think we are just arguing terminology (which *is* important in a
standard).  If you claim that 1) there is no requirement that the
things describes as templates in the working papers have ever existed
as actual template source code, and 2) there is no requirement that
the actual source code for any template be accessible to the
implementation at the moment I present my source code (which uses said
template) to the implementation (or at any time later), then we are
agreed on what I consider the essential.  I would also argue that
there is nothing which prevents an implementation from offering (as an
extension) the possibility for a user to implement something
additional in the same way as 1).

So why not just call it a template.  Since there is *no* practical way
within the implementation that you can distinguish it from a classical
template.  (Being able as a human being to examine the source code is
definitely outside the implementation, unless you consider yourself as
part of the implementation.)
--
James Kanze         Tel.: (+33) 88 14 49 00        email: kanze@gabi-soft.fr
GABI Software, Sarl., 8 rue des Francs-Bourgeois, F-67000 Strasbourg, France
Conseils en informatique industrielle --
                              -- Beratung in industrieller Datenverarbeitung

Author: kanze@lts.sel.alcatel.de (James Kanze US/ESC 60/3/141 #40763)
Date: 1995/05/04 Raw View

In article <D7y164.KEH@aisb.ed.ac.uk> andrewfg@dai.ed.ac.uk (Andrew
Fitzgibbon) writes:

|> In article <D6ytsv.HJC@ucc.su.OZ.AU> maxtal@Physics.usyd.edu.au (John
|> Max Skaller) writes:

|>     [You need discriminated unions for...]
|> |>  2) A grammar (or parse tree).

|> |>  Each node of a grammar is either a terminal or non-terminal.
|> |>  You can't process the grammar without knowing which is which.
|> |>  You have to build a tree of nodes. The node has to be
|> |>  a discrimniated union of the two kinds.

|> Hold on.  This can't be right.  Derived classes plus RTTI gives you
|> discriminated unions *but for* the size difference, which should only be an
|> issue if you're taking over storage management.

Actually, you don't need RTTI, and derived classes can theoretically
require less memory than discriminated unions.  (A union,
discriminated or not, must be as big as its biggest member.  Each
derived class can be only as big as necessary.  Of course, except in
the unlikely case where one node type is significantly larger than all
of the others, this is unlikely to make a significant difference in
practice.)

I state this from experience.  I have written at least two grammars
using these techniques.  In neither did I ever feel the lack of RTTI
(nor did I implement it myself, except insofar as the `dump' function
returned the a symbolic representation of the node type, with
additional data).
--
James Kanze         Tel.: (+33) 88 14 49 00        email: kanze@gabi-soft.fr
GABI Software, Sarl., 8 rue des Francs-Bourgeois, F-67000 Strasbourg, France
Conseils en informatique industrielle --
                              -- Beratung in industrieller Datenverarbeitung

Author: fjh@munta.cs.mu.OZ.AU (Fergus Henderson)
Date: 1995/05/05 Raw View

andrewfg@dai.ed.ac.uk (Andrew Fitzgibbon) writes:

>In article <D6ytsv.HJC@ucc.su.OZ.AU> maxtal@Physics.usyd.edu.au (John
>Max Skaller) writes:
>
>    [You need discriminated unions for...]
>|>  2) A grammar (or parse tree).
>
>|>  Each node of a grammar is either a terminal or non-terminal.
>|>  You can't process the grammar without knowing which is which.
>|>  You have to build a tree of nodes. The node has to be
>|>  a discrimniated union of the two kinds.
>
>Hold on.  This can't be right.  Derived classes plus RTTI gives you
>discriminated unions *but for* the size difference, which should only be an
>issue if you're taking over storage management.

No, derived classes plus RTTI (typeid or dynamic_cast) does not
give you the equivalent of discriminated unions, because the compiler
will not tell you if you have missed a case.

Inheritence from an abstract base class does give you the same sort
of effect as discriminated unions, but at the expense of having to
structure your code inside-out (with potentially bad effects on
recompilation time).

--
Fergus Henderson                       | I'll forgive even GNU emacs as
fjh@cs.mu.oz.au                        | long as gcc is available ;-)
http://www.cs.mu.oz.au/~fjh            |             - Linus Torvalds

Author: fjh@munta.cs.mu.OZ.AU (Fergus Henderson)
Date: 1995/05/05 Raw View

kanze@lts.sel.alcatel.de (James Kanze US/ESC 60/3/141 #40763) writes:

[Regarding discriminated unions:]
>Actually, you don't need RTTI, and derived classes can theoretically
>require less memory than discriminated unions.  (A union,
>discriminated or not, must be as big as its biggest member.  Each
>derived class can be only as big as necessary.  Of course, except in
>the unlikely case where one node type is significantly larger than all
>of the others, this is unlikely to make a significant difference in
>practice.)

I would say that discriminated unions are a mathematical construct, not
an implementation technique.  They do not have to be implemented
naively.  The compiler I'm currently working on (for Mercury) usually
implements discriminated unions as tagged pointers to variable-sized
storage allocated on the heap.

Using inheritence from an abstract base class is not such a bad way
of implementing discriminated unions in C++, but it does fundamentally
structure the program around the different data types, rather than
around the different algorithms; this is not always the best way
of structuring programs, and forces the entire program to be recompiled
whenever you add a new algorithm.

--
Fergus Henderson                       | I'll forgive even GNU emacs as
fjh@cs.mu.oz.au                        | long as gcc is available ;-)
http://www.cs.mu.oz.au/~fjh            |             - Linus Torvalds

Author: fjh@munta.cs.mu.OZ.AU (Fergus Henderson)
Date: 1995/05/05 Raw View

rridge@calum.csclub.uwaterloo.ca (Ross Ridge) writes:

>So what in the standard can be enforced?

Most of it.

>What exactly does
>"enforced" mean anyways?  The question I'm interested in is "is
>it conformant?" and the answer to that can only be "no" or "maybe".

If you can get a "no" answer, then the rule is enforcable.  But the
point is that with some of the complexity requirements (e.g. those that
talk about something being "linear") it is not possible to show that an
implementation does not conform.

--
Fergus Henderson                       | I'll forgive even GNU emacs as
fjh@cs.mu.oz.au                        | long as gcc is available ;-)
http://www.cs.mu.oz.au/~fjh            |             - Linus Torvalds

Author: matt@dogbert.lbl.gov (Matthew Austern)
Date: 1995/05/08 Raw View

In article <D873A8.7tp@ucc.su.OZ.AU> maxtal@Physics.usyd.edu.au (John Max Skaller) writes:

>  This IS provided by coroutines/threads and channels.
> Unfortunately, yet again, C++ fails to provide language
> support for a completely fundamental idiom.

That's an interesting point.  The obvious followup question, then:
what do you think that language support for threads would look like?

The most obvious answer is just to provide library functions to create
and kill threads; if you're using a multithreaded operating system
(OS/2, NT, Solaris, and so on), then your compiler vendor has probably
already provided a library function giving you access to that
functionality.  But I imagine that real support for concurrency
involves something more than that.

I imagine the right answer would go something along the lines of
formalizing the difference between an active and an inactive object,
and allowing the number of active objects to be greater than one and
to change dynamically.  That's a big change conceptually, but I bet it
wouldn't involve adding very many new language constructs or changing
the semantics of very many existing ones.  I bet that an experimental
Concurrent C++ compiler, presumably based on gcc, wouldn't be all that
big a project.

I'm sure the Ada contingent will tell me that Ada has already solved
this problem...  One of these days I really ought to learn more about
that language.
--
Matt Austern          matt@physics.berkeley.edu
http://dogbert.lbl.gov/~matt

Author: thp@cs.ucr.edu (Tom Payne)
Date: 1995/05/08 Raw View

Matthew Austern (matt@dogbert.lbl.gov) wrote:
: In article <D873A8.7tp@ucc.su.OZ.AU> maxtal@Physics.usyd.edu.au (John Max Skaller) writes:

: >  This IS provided by coroutines/threads and channels.
: > Unfortunately, yet again, C++ fails to provide language
: > support for a completely fundamental idiom.

: That's an interesting point.  The obvious followup question, then:
: what do you think that language support for threads would look like?

: The most obvious answer is just to provide library functions to create
: and kill threads; if you're using a multithreaded operating system
[stuff deleted]
: I imagine the right answer would go something along the lines of
: formalizing the difference between an active and an inactive object,
: and allowing the number of active objects to be greater than one and
: to change dynamically.  That's a big change conceptually, but I bet it
[more stuff deleted]


It's not terribly difficult to implement threads, monitors, and
conditions (along the lines of Hoare's Simula-inspired CACM paper,
v17, #10) as a simple C++ library.  The problem seems to be producing
an implementation of the Standard C++ Library that is thread-safe
relative to a given definition of threads and thread coordination.
For that reason (among others), it would be best if there were an
official C++ standard for threads libraries.

Tom Payne

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/05/03 Raw View

In article <DOUG.95Apr27094046@monet.ads.com>,
Doug Morgan <doug@monet.ads.com> wrote:
>In article <KANZE.95Apr27145146@slsvhdt.lts.sel.alcatel.de> kanze@lts.sel.alcatel.de (James Kanze US/ESC 60/3/141 #40763) writes:
>> In article <D7Iw22.HqA@ucc.su.OZ.AU> maxtal@Physics.usyd.edu.au (John
>> Max Skaller) writes:
>>[...]
>> |>  Standard library STL containers are not templates
>> |> because they do not have to have a definition, so there
>> |> is no way to do the deduction.
>>
>> I beg to disagree.  According to the description I read, they *are*
>> templates.  The working papers define them clearly as templates, e.g.:
>> section 23.2.8 Template class vector.  They are thus templates by
>> definition. [...]
>
>That sounds mildly depressing.  By my reading of the orginal STL
>papers, containers were any objects with certain specified semantics.
>Objects of particular templates could be examples of containers, but
>were not all containers.  Under the definitions of the STL papers,
>there could be many other true STL vector containers other than the
>objects of any one particular class template that happens to be named
>vector.
>
>If the standardization process doesn't maintain this distinction in
>some form, then we will be reduced to descriptions like "this object
>presents the same external interface as an instantiation of the
>(one-and-only, language-provided) STL template class vector," rather
>than "this object is an STL vector."  We'll also lose the ability to
>say "this is a vendor-supplied STL vector and these are various kinds
>of user-supplied STL vectors."

 Yes. Which is why I'm right and James is wrong.
There's no requirement that any Standard Library entity I'm
aware of actually be what it is presented as being. The Standard
Library merely leverages off the core language to present a
specification and then permits any mechanism of implementation
at all which conforms to the required behaviour.

 For example "memcpy" does not have to be a function.
Some compilers generate code directly. The same applies to
say a vector<T> where T is memcpy-able and it applies
very much in particular <g> to vectors of bool where
major advantages may be gained by directly generating
code using machine instructions which manipulate bits.

 It isn't clear _any_ portable template implementation
in C++ could be as efficient as this and there is no intention
to prevent such an implementation. On the contrary a major
goal of placing components in the Standard Library is to
enable such optimisation.

 Another example is the numerical array components
which were specifically designed by a physicist (Kent Budge)
to allow parallel processing. (Vectorisation).

 There IS an intention to ensure that it is POSSIBLE to
write a portable Standard Library (or most of it).

 So lets be very clear. The Standard Library is NOT a library
in the sense of a library of user defined components.
It is an _intrinsic_ part of the C++ language which hopefully
CAN be implemented as a library of "user" defined components
(by the vendor) but does NOT have to be.

--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/05/03 Raw View

In article <D7y164.KEH@aisb.ed.ac.uk>,
Andrew Fitzgibbon <andrewfg@ed.ac.uk> wrote:
>In article <D6ytsv.HJC@ucc.su.OZ.AU> maxtal@Physics.usyd.edu.au (John
>Max Skaller) writes:
>
>    [You need discriminated unions for...]
>|>  2) A grammar (or parse tree).
>
>|>  Each node of a grammar is either a terminal or non-terminal.
>|>  You can't process the grammar without knowing which is which.
>|>  You have to build a tree of nodes. The node has to be
>|>  a discrimniated union of the two kinds.
>
>Hold on.  This can't be right.  Derived classes plus RTTI gives you
>discriminated unions *but for* the size difference, which should only be an
>issue if you're taking over storage management.

 No. Using RTTI gives you MUCH MORE than
a discriminated union, which is exactly the problem.
A proper discriminated unions offers RIGID guarrantees:

 a) a FINITE set of KNOWN and SPECIFIED types
 b) a compiler error if you access the wrong type
    (provided you do not change the type inside
    a case of another type)
 c) a compiler error is you _miss_ a type case
 d) the ability to compose ANY types

With RTTI (ignoring the storage mamagement issues)

 a) there is no bound on the possible types
 b) you get a run time error on selecting the wrong type
 c) compile time diagnostics for incorrect use after selection
 d) no verification of completeness of case handling
 e) invasion: you CANNOT compose ANY type, only types
    with a common base

The importance of discriminated unions is that in MANY cases
the alternatives are finite -- a grammar is an example in
which there are exactly TWO types. A "binary tree" has exactly
four types (leaf, two branches, left branch only, right branch only)

The fact is that discriminated unions and structures are
of "equal" importance in programming and programming is not
possible without both any more than you can do logic with
"AND" but without "OR".

In fact, structures (composition) and unions (unification)
are the _fundamental_ constructions of imperative
programming as presented in the language of Category Theory,
in which these constructions correspond to things called
Products (cartesian product) and Sums (disjoint set union).

A finite sum turns out to be useful just like a finite
product: most structs and unions have finite members.

It is possible to build weak representations of
discriminated unions in C++ which do not fully enforce
the constraints. However this lack of enforcement and
the extreme clumbsiness of the constructions makes
the natural use of such dunions unattractive and perverts
coding styles to inappropriate use of inheritance.

It is particularly sad that in a language with type
inference (overloading, templates) there is no
coherent unification operation.
--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: kanze@lts.sel.alcatel.de (James Kanze US/ESC 60/3/141 #40763)
Date: 1995/05/03 Raw View

In article <D7zAzJ.EMo@ucc.su.OZ.AU> maxtal@Physics.usyd.edu.au (John
Max Skaller) writes:

    [concerning: must `vector' be a template...]
|>  Yes. Which is why I'm right and James is wrong.
|> There's no requirement that any Standard Library entity I'm
|> aware of actually be what it is presented as being. The Standard
|> Library merely leverages off the core language to present a
|> specification and then permits any mechanism of implementation
|> at all which conforms to the required behaviour.

|>  For example "memcpy" does not have to be a function.
|> Some compilers generate code directly. The same applies to
|> say a vector<T> where T is memcpy-able and it applies
|> very much in particular <g> to vectors of bool where
|> major advantages may be gained by directly generating
|> code using machine instructions which manipulate bits.

This may be just a disagreement concerning terms.  The C standard
states explicitly that `memcpy' must be either a function or a macro;
the standard also defines how the function may be accessed in the case
where it is a macro (i.e.: the function must exist).

The standard does *not* say how the function must be implemented.  A
compiler is free to use "magic" to recognise the special case, and
inline the function if it wants.

Similarly, the C++ standard defines `vector' as a template.  In time,
I would imagine that most C++ compilers will use meta-magic to handle
this particular template in a special way.  Note that the name
`vector' *is* a reserved word.  In this sense, vector is different
from any template that a user can write, and the compiler `knows' the
semantics of the template, and is free to know this knowledge.  This
does not in any way stop it from being a template, however, anymore
than the right to use such knowledge concerning `memcpy' means that
`memcpy' is not a function.
--
James Kanze         Tel.: (+33) 88 14 49 00        email: kanze@gabi-soft.fr
GABI Software, Sarl., 8 rue des Francs-Bourgeois, F-67000 Strasbourg, France
Conseils en informatique industrielle --
                              -- Beratung in industrieller Datenverarbeitung

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/05/03 Raw View

In article <3o3jsi$44o@calum.csclub.uwaterloo.ca>,
Ross Ridge <rridge@calum.csclub.uwaterloo.ca> wrote:
>John Max Skaller <maxtal@Physics.usyd.edu.au> wrote:
>> As it stands, certain complexity requirements
>>cannot be enforced because they cannot be measured
>>or because they are limiting phenomena (which are inherently
>>untestable)
>
>So what in the standard can be enforced?  What exactly does
>"enforced" mean anyways?  The question I'm interested in is "is
>it conformant?" and the answer to that can only be "no" or "maybe".

 Normative rules of an ISO Standard are "enforceable"
in the sense that test methods exist which can determine
by experiment repeatable by different parties, that a particular
product is not conforming.

 Statements of the Standard fail to be normative
if they cannot be enforced because the Standard -- like any
standard -- is something against which measurements are taken.

 If it cannot be measured, it cannot be a normative
part of a Standard.

 Some Standards state what kinds of measurements are
permitted (e.g. C++) and other specify precisely minimal
test methods (like POSIX and many safety related Standards).

 In C++ what is measured is called "behaviour" --
of both the translator and the executing program.
Behaviour is defined as the sequence of calls to Standard
Library functions and accesses to volatile memory locations
(if I remember rightly).

 No permission is granted to measure performance or
memory utilisation.

 Here's a case in point: the ARM says something about
"not eliding global objects with side-effects even if they
are not refered to". No normative rule of the Standard
can require objects _without_ side effects being elided
because if there are no side effects there is no way to
tell if the object has been elided or not.

 Of course this matters for people building libraries.
The C++ Standard has nothing to say about libraries.
It speaks only of programs.

 Another example is the template instantiation request.
It actually has semantics and we do NOT want it to.
We want the normative semantics to be "none".
(Unfortunately where you instantiate a template _can_
make a difference)

--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: andrewfg@dai.ed.ac.uk (Andrew Fitzgibbon)
Date: 1995/05/02 Raw View

In article <D6ytsv.HJC@ucc.su.OZ.AU> maxtal@Physics.usyd.edu.au (John
Max Skaller) writes:

    [You need discriminated unions for...]
|>  2) A grammar (or parse tree).

|>  Each node of a grammar is either a terminal or non-terminal.
|>  You can't process the grammar without knowing which is which.
|>  You have to build a tree of nodes. The node has to be
|>  a discrimniated union of the two kinds.

Hold on.  This can't be right.  Derived classes plus RTTI gives you
discriminated unions *but for* the size difference, which should only be an
issue if you're taking over storage management.

A.

--
Andrew Fitzgibbon (Research Associate),                     andrewfg@ed.ac.uk
Artificial Intelligence, Edinburgh University.               +44 031 650 4504
<a href=http://www.dai.ed.ac.uk/staff/personal_pages/andrewfg> Home Page </a>
                         "Never say there is no way" -- me.

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/05/02 Raw View

In article <KANZE.95Apr27145146@slsvhdt.lts.sel.alcatel.de>,
James Kanze US/ESC 60/3/141 #40763 <kanze@lts.sel.alcatel.de> wrote:
>
>|>  Stl containers in the Standard Library are not templates
>|> because they do not "quack" like templates. Real templates
>|> "quack" because they have definitions consisting of examinable
>|> source code.
>
>Really.  I have the Rogue Wave components, complete with templates,
>delivered with my C++ compiler.  But there is no examinable source
>code.
>
>What makes you state that examinable source code is necessary?

 Simple. The WP says so. Specifically, a template is
a sequence of tokens contained in a translation unit parsed
as a template according to the rules of the grammar and
associated constraints and semantic rules.

 The behaviour of this entity called a template
is (or should be) defined in the CD.

 Any component not in this form -- or which was not
at some stage in this form -- is catgorically NOT a
function, template, or any other such thing described
in the CD. [Unless it is a library function,
which is not a function]

 The behviour of a C++ function is determined by
its definition which is a sequence of tokens.

 A Standard Library entity that quacks like a function
is NOT a function.

 Vendors have the freedom to supply things that
quack like functions behaviourly but are not in fact functions,
in the Standard Library.

 Same applies to templates.

>|>  Standard library STL containers are not templates
>|> because they do not have to have a definition, so there
>|> is no way to do the deduction.
>
>I beg to disagree.  According to the description I read, they *are*
>templates.  The working papers define them clearly as templates, e.g.:
>section 23.2.8 Template class vector.

 No, it doesn't. It _describes_ them as if they
were templates for convinence, and if it says anything
else it is wrong and must be fixed.

--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: kanze@lts.sel.alcatel.de (James Kanze US/ESC 60/3/141 #40763)
Date: 1995/05/02 Raw View

In article <DOUG.95Apr27094046@monet.ads.com> doug@monet.ads.com (Doug
Morgan) writes:

|> In article <KANZE.95Apr27145146@slsvhdt.lts.sel.alcatel.de> kanze@lts.sel.alcatel.de (James Kanze US/ESC 60/3/141 #40763) writes:
|> > In article <D7Iw22.HqA@ucc.su.OZ.AU> maxtal@Physics.usyd.edu.au (John
|> > Max Skaller) writes:
|> >[...]
|> > |>  Standard library STL containers are not templates
|> > |> because they do not have to have a definition, so there
|> > |> is no way to do the deduction.
|> >
|> > I beg to disagree.  According to the description I read, they *are*
|> > templates.  The working papers define them clearly as templates, e.g.:
|> > section 23.2.8 Template class vector.  They are thus templates by
|> > definition. [...]

|> That sounds mildly depressing.  By my reading of the orginal STL
|> papers, containers were any objects with certain specified semantics.
|> Objects of particular templates could be examples of containers, but
|> were not all containers.  Under the definitions of the STL papers,
|> there could be many other true STL vector containers other than the
|> objects of any one particular class template that happens to be named
|> vector.

Sorry for the confusion.  I thought that John was talking about the
STL containers provided in the standard library.

There is no requirement, for example, that the containers passed to
the functions in the algorithmic section be templates.  Only that they
meet the requirements for the specific function.

There is a requirement that the vector type provided by the
implementation (that is, the type whose name is the reserved word
`vector') be a template.
--
James Kanze         Tel.: (+33) 88 14 49 00        email: kanze@gabi-soft.fr
GABI Software, Sarl., 8 rue des Francs-Bourgeois, F-67000 Strasbourg, France
Conseils en informatique industrielle --
                              -- Beratung in industrieller Datenverarbeitung

Author: tseaver@neosoft.com (Tres Seaver)
Date: 1995/05/01 Raw View

In <ncmD7tHzr.17F@netcom.com>, ncm@netcom.com (Nathan Myers) writes:
>>In article <ncmD7DHFK.H8y@netcom.com>, Nathan Myers <ncm@netcom.com> wrote:
>>>If it quacks like a template, and instantiates like a template,
>>>it's a template.
>>>
>>>In particular, when I instantiate it,
>>>it had better use my operator<().  If it doesn't, that's
>>>not conforming; if it does I can count calls to it.
>
>In article <D7Iw22.HqA@ucc.su.oz.au>,
>John Max Skaller <maxtal@Physics.usyd.edu.au> wrote:
>> I agree. I modify my previous claim: I was not
>>thinking of counts of calls to user defined functions
>>but the more generic complexity requirements which
>>are not enforeable.
>>
>> What I mean is that requirements expressed in terms
>>of algorithmic limiting complexity cannot be enforced.
>>Requirements expressed as exact values or upper bounds
>>on calls to use defined functions can be.
>>
>> For example a requirement that "insert" be constant
>>time is not enforceable because it requires using
>>a clock to measure performance of an infinite sequences
>>of operations on increasingly large containers.
>
>Skaller's modified claim, I'm afraid, holds no more water than
>before.  Like Fergus Henderson's statement that "linear time
>complexity" is untestable, it's belied simply by counting
>all operations on the element type instantiated on.  If the
>number of accesses, comparisons, assignments, copy constructions,
>etc. exceeds linear behavior, the implementation is not conforming.
>
>None of this requires inspecting implementation code.  None
>of this requires "infinite" containers.  The reason is that
>it is impossible even in principle to show that an implementation
>conforms; all you can do, and all you need to do, is show that
>one does not.  For that I have already demonstrated that two
>small sequences are sufficient.
>
>Nathan Myers
>myersn@roguewave.com

Surely you do not mean to imply that measuring counts for two small test cases is
sufficient to establish non-linearity, as any two points will establish the constants
which determine the line.

Some curves appear linear within measurable precision over a signinficant span,
and yet diverge from the asymptote as they depart that span -- I fail to see that
a standard which requires "linear" behavior of a function, without specifying
test cases to exercise the entire range of the function, can be enforceable.


Tres Seaver                    tseaver@neosoft.com
MACRO Enterprises, Inc.        Vox: (713) 827-7273
Houston, Texas, USA            Fax: (&13) 827-7278

Author: rridge@calum.csclub.uwaterloo.ca (Ross Ridge)
Date: 1995/05/01 Raw View

John Max Skaller <maxtal@Physics.usyd.edu.au> wrote:
> As it stands, certain complexity requirements
>cannot be enforced because they cannot be measured
>or because they are limiting phenomena (which are inherently
>untestable)

So what in the standard can be enforced?  What exactly does
"enforced" mean anyways?  The question I'm interested in is "is
it conformant?" and the answer to that can only be "no" or "maybe".

      Ross Ridge

--
 l/  //   Ross Ridge -- The Great HTMU, Ook                    +1 519 883 4329
[oo][oo]  rridge@csclub.uwaterloo.ca      http://csclub.uwaterloo.ca/u/rridge/
-()-/()/
 db  //

Author: doug@monet.ads.com (Doug Morgan)
Date: 1995/04/27 Raw View

In article <KANZE.95Apr27145146@slsvhdt.lts.sel.alcatel.de> kanze@lts.sel.alcatel.de (James Kanze US/ESC 60/3/141 #40763) writes:
> In article <D7Iw22.HqA@ucc.su.OZ.AU> maxtal@Physics.usyd.edu.au (John
> Max Skaller) writes:
>[...]
> |>  Standard library STL containers are not templates
> |> because they do not have to have a definition, so there
> |> is no way to do the deduction.
>
> I beg to disagree.  According to the description I read, they *are*
> templates.  The working papers define them clearly as templates, e.g.:
> section 23.2.8 Template class vector.  They are thus templates by
> definition. [...]

That sounds mildly depressing.  By my reading of the orginal STL
papers, containers were any objects with certain specified semantics.
Objects of particular templates could be examples of containers, but
were not all containers.  Under the definitions of the STL papers,
there could be many other true STL vector containers other than the
objects of any one particular class template that happens to be named
vector.

If the standardization process doesn't maintain this distinction in
some form, then we will be reduced to descriptions like "this object
presents the same external interface as an instantiation of the
(one-and-only, language-provided) STL template class vector," rather
than "this object is an STL vector."  We'll also lose the ability to
say "this is a vendor-supplied STL vector and these are various kinds
of user-supplied STL vectors."

Doug
----------
Doug Morgan, doug@ads.com
Booz-Allen & Hamilton
1500 Plymouth St.
Mountain View, CA 94043-1230
     (415) 960-7444
FAX: (415) 960-7500
----------

Author: ncm@netcom.com (Nathan Myers)
Date: 1995/04/29 Raw View

>In article <ncmD7DHFK.H8y@netcom.com>, Nathan Myers <ncm@netcom.com> wrote:
>>If it quacks like a template, and instantiates like a template,
>>it's a template.
>>
>>In particular, when I instantiate it,
>>it had better use my operator<().  If it doesn't, that's
>>not conforming; if it does I can count calls to it.

In article <D7Iw22.HqA@ucc.su.oz.au>,
John Max Skaller <maxtal@Physics.usyd.edu.au> wrote:
> I agree. I modify my previous claim: I was not
>thinking of counts of calls to user defined functions
>but the more generic complexity requirements which
>are not enforeable.
>
> What I mean is that requirements expressed in terms
>of algorithmic limiting complexity cannot be enforced.
>Requirements expressed as exact values or upper bounds
>on calls to use defined functions can be.
>
> For example a requirement that "insert" be constant
>time is not enforceable because it requires using
>a clock to measure performance of an infinite sequences
>of operations on increasingly large containers.

Skaller's modified claim, I'm afraid, holds no more water than
before.  Like Fergus Henderson's statement that "linear time
complexity" is untestable, it's belied simply by counting
all operations on the element type instantiated on.  If the
number of accesses, comparisons, assignments, copy constructions,
etc. exceeds linear behavior, the implementation is not conforming.

None of this requires inspecting implementation code.  None
of this requires "infinite" containers.  The reason is that
it is impossible even in principle to show that an implementation
conforms; all you can do, and all you need to do, is show that
one does not.  For that I have already demonstrated that two
small sequences are sufficient.

Nathan Myers
myersn@roguewave.com

Author: kanze@lts.sel.alcatel.de (James Kanze US/ESC 60/3/141 #40763)
Date: 1995/04/27 Raw View

In article <D7Iw22.HqA@ucc.su.OZ.AU> maxtal@Physics.usyd.edu.au (John
Max Skaller) writes:

|> In article <ncmD7DHFK.H8y@netcom.com>, Nathan Myers <ncm@netcom.com> wrote:
|> >
|> >> They _would_ be were STL components required
|> >>to be actual templates <they're not> with sources
|> >>which vendors had to submit to conformance testers
|> >><which is not a requirement of the proposed Standard>.
|> >
|> >If it quacks like a template, and instantiates like a template,
|> >it's a template.

|>  Stl containers in the Standard Library are not templates
|> because they do not "quack" like templates. Real templates
|> "quack" because they have definitions consisting of examinable
|> source code.

Really.  I have the Rogue Wave components, complete with templates,
delivered with my C++ compiler.  But there is no examinable source
code.

What makes you state that examinable source code is necessary?

|>  Standard library STL containers are not templates
|> because they do not have to have a definition, so there
|> is no way to do the deduction.

I beg to disagree.  According to the description I read, they *are*
templates.  The working papers define them clearly as templates, e.g.:
section 23.2.8 Template class vector.  They are thus templates by
definition.  (I.e.: if the standards committee were to adapt a rule
requiring the sources of templates to be available and human-readable,
then automatically, a conforming implementation must make the sources
of vector available in a human-readable form.  In fact, the standards
committee has not adapted such a rule, and I hold it for highly
unlikely that they will adapt such a rule.)
--
James Kanze         Tel.: (+33) 88 14 49 00        email: kanze@gabi-soft.fr
GABI Software, Sarl., 8 rue des Francs-Bourgeois, F-67000 Strasbourg, France
Conseils en informatique industrielle --
                              -- Beratung in industrieller Datenverarbeitung

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/04/24 Raw View

In article <9511222.24198@mulga.cs.mu.OZ.AU>,
Fergus Henderson <fjh@munta.cs.mu.OZ.AU> wrote:
>>
>> I'm sorry Nathan but you are not right here.
>
>Actually I agree with Nathan!  At least with respect to those complexity
>requirements that are expressed in terms of counts.

 Agreed. I wasn't thinking in terms of "counts" of calls
to user supplied function arguments to algorithms. I adjust
my position accordingly. Such specification can be normative.

>(As I outlined
>in another post, I still disagree with respect to those complexity
>requirements that are stated as an operation being "linear", "constant
>time"", etc.)

 So it seems we agree.

>> There is no requirement that vendors supply
>>C++ source code for STL parts of the C++ Standard for the
>>inspection of conformance testers.
>
>Correct.
>
>> Experimental measurements cannot either confirm or
>>deny compliance.
>
>They can't confirm compliance, but they in some circumstances can deny
>it.  For example, when instantiating a standard library template with
>a user-defined type, you can put code in your copy constructors and
>comparison operators which counts the number of times they are invoked,
>and compare this with the number of times the standard specifies.

 It is not clear you are correct in the case of copy
constructors. I agree in the case of comparisons though.

[The issue of copy constructors relates to an issue raised
by you (Fergus) in the Australian National Body Comments on CDR
which should be followed up -- when exactly are temporaries
created and copy constructors called?]
--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/04/24 Raw View

In article <ncmD7DHFK.H8y@netcom.com>, Nathan Myers <ncm@netcom.com> wrote:
>
>> They _would_ be were STL components required
>>to be actual templates <they're not> with sources
>>which vendors had to submit to conformance testers
>><which is not a requirement of the proposed Standard>.
>
>If it quacks like a template, and instantiates like a template,
>it's a template.

 Stl containers in the Standard Library are not templates
because they do not "quack" like templates. Real templates
"quack" because they have definitions consisting of examinable
source code.

>In particular, when I instantiate it,
>it had better use my operator<().  If it doesn't, that's
>not conforming; if it does I can count calls to it.

 I agree. I modify my previous claim: I was not
thinking of counts of calls to user defined functions
but the more generic complexity requirements which
are not enforeable.

 What I mean is that requirements expressed in terms
of algorithmic limiting complexity cannot be enforced.
Requirements expressed as exact values or upper bounds
on calls to use defined functions can be.

 For example a requirement that "insert" be constant
time is not enforceable because it requires using
a clock to measure performance of an infinite sequences
of operations on increasingly large containers.

 Limits of sequences cannot be measured,
only deduced from the definition.

 Standard library STL containers are not templates
because they do not have to have a definition, so there
is no way to do the deduction.

 We could require that the STL containers actually
be templates and then such deduction would be possible.
This would be interesting in that it would be possible
to confirm (or deny) compliance unequivocably, rather than
the usual case where compliance can not be positively
confirmed, only denied.

 As it stands, certain complexity requirements
cannot be enforced because they cannot be measured
or because they are limiting phenomena (which are inherently
untestable)

--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: fjh@munta.cs.mu.OZ.AU (Fergus Henderson)
Date: 1995/04/22 Raw View

kanze@us-es.sel.de (James Kanze US/ESC 60/3/141 #40763) writes:

>In all of the grammars I've written, I've found polymorphism
>(derivation) to be the ideal solution here.  In fact, back when I
>still worked in C, my standard solution was to have each node type
>start with a pointer to a node descriptor.  Typically, the node
>descriptor contains a variety of information; for anything complex
>(say calculate the Sethi-Ulmann number), it contained a pointer to a
>function.  Given this structure in C, you can imagine how quickly I
>switched to C++.

Could you explain in a bit more detail?  In your C++ code, do you have
a class for each node type, and bunch of virtual functions for the
operations on nodes, or is it vice versa?  How do you organize your
source files - one file per operation, or one file per node?  When you
come to add a new operation, how many source files do you have to
modify?  How many do you have to recompile?  How about when adding a
new node?  Is the tree traversal code reused or duplicated?

(I'm genuinely curious.)

--
Fergus Henderson            | Tell you what: go write a 100x100 matrix multiply
fjh@cs.mu.oz.au             | of integers in both languages and then let's talk
http://www.cs.mu.oz.au/~fjh | about speed, ok? - Tom Christiansen.

Author: fjh@munta.cs.mu.OZ.AU (Fergus Henderson)
Date: 1995/04/22 Raw View

maxtal@Physics.usyd.edu.au (John Max Skaller) writes:
>Nathan Myers <ncm@netcom.com> wrote:
>>Fergus Henderson <fjh@munta.cs.mu.OZ.AU> wrote:
>>>I think it's clear that complexity requirements are unenforceable.
>>
>>On the contrary.  The complexity specifications are in terms
>>of the number of operations -- calls to copy constructors
>>and comparison operators -- which can be counted.
>
> I'm sorry Nathan but you are not right here.

Actually I agree with Nathan!  At least with respect to those complexity
requirements that are expressed in terms of counts.  (As I outlined
in another post, I still disagree with respect to those complexity
requirements that are stated as an operation being "linear", "constant
time"", etc.)

> There is no requirement that vendors supply
>C++ source code for STL parts of the C++ Standard for the
>inspection of conformance testers.

Correct.

> Experimental measurements cannot either confirm or
>deny compliance.

They can't confirm compliance, but they in some circumstances can deny
it.  For example, when instantiating a standard library template with
a user-defined type, you can put code in your copy constructors and
comparison operators which counts the number of times they are invoked,
and compare this with the number of times the standard specifies.

--
Fergus Henderson            | Tell you what: go write a 100x100 matrix multiply
fjh@cs.mu.oz.au             | of integers in both languages and then let's talk
http://www.cs.mu.oz.au/~fjh | about speed, ok? - Tom Christiansen.

Author: kanze@us-es.sel.de (James Kanze US/ESC 60/3/141 #40763)
Date: 1995/04/21 Raw View

In article <DOUG.95Apr20094823@monet.ads.com> doug@monet.ads.com (Doug
Morgan) writes:

|> In article <D7BtJy.7EG@ucc.su.OZ.AU> maxtal@Physics.usyd.edu.au (John Max Skaller) writes:
|> > ...
|> >  In particular, this is kind of tricky to do and
|> > I suspect there is an "error" in the current WP:
|> > a dereferenced iterator to a container of T does NOT
|> > have to be an lvalue of type T (even if the WP says that).

|> This is certainly an error.  I remember the original STL documents as
|> being pretty clear that iterators had to dereference to something that
|> is convertible to a T.  There was a similar error (to the one
|> apparently now in the WP) in table 10 of the May 31, 1994 version of
|> the STL document.  It specified that sequence's a.front(), a.back(),
|> and a[n] return references.  Of course this was an oversight (and even
|> the example vector<bool> did not return references).  The next version
|> of the document was corrected.  Hopefully, the WP can be corrected
|> soon, too.

This is an important point, since it is a decernable difference.  The
"as if" rule cannot be used to allow the iterator to dereference to an
object which converts to an lvalue of type T, since this would
introduce a user defined conversion.
--
James Kanze         Tel.: (+33) 88 14 49 00        email: kanze@gabi-soft.fr
GABI Software, Sarl., 8 rue des Francs-Bourgeois, F-67000 Strasbourg, France
Conseils en informatique industrielle --
                              -- Beratung in industrieller Datenverarbeitung

Author: kanze@us-es.sel.de (James Kanze US/ESC 60/3/141 #40763)
Date: 1995/04/21 Raw View

In article <19950419.194958.67@morpheus.demon.co.uk>
gustav@morpheus.demon.co.uk (Paul Moore) writes:

|> In message <KANZE.95Apr19150630@slsvhdt.us-es.sel.de> James Kanze US/ESC 60/3/141
|> #40763 wrote:

|> > In article <9510113.6681@mulga.cs.mu.OZ.AU> fjh@munta.cs.mu.OZ.AU
|> > (Fergus Henderson) writes:
|> >
|> > |> Yes.  I agree entirely that discriminated unions are the right way of
|> > |> handling this sort of thing.  The only problem is that C++ doesn't support
|> > |> discriminated unions very well.  If you follow this argument to it's
|> > |> logical conclusion, you have to say that even null pointers were a mistake.
|> >
|> > Aren't they?  I've stopped using them, in favor of `Fallible' (e.g.: a
|> > function will return `Fallible< char* >', rather than simply `char*',
|> > if it can fail).
|> >

|> I've never seen "Fallible". Can you point me to a description, and
|> possibly an implementation?

Barton and Nackmann.  I don't have my copy here, so I cannot give the
exact page number, but it is in the index.

The idea is really pretty simple.  The class `template< class T >
class Fallible' associates a bool defining the validity with a value
of type T.  The default constructor constructs an `invalid' Fallible,
a constructor from `T const&' constructs a valid Fallible.  Assigning
a T to a Fallible also makes it valid.

The class contains a conversion operator for T, which throws an
exception (or has an assertion failure) if the value is not valid.
There is also a function to test validity.

As a simple example, imagine strchr using Fallible:

 Fallible< char* >
 strchr( char* p , char c )
 {
     Fallible< char* >   result ;
     while ( ! result.isValid() && *p != '\0' )
         if ( *p == c )
             result = p ;
     return result ;
 }

In a simple case like the above, a null pointer does an acceptible
job, and will certainly be more efficient in terms of runtime.  But in
the more general case, Fallible is probably more understandable (since
it doesn't require a `unique' value), and the efficiency differences
will be almost negligible.

Even in the above case, Fallible has the advantage that attempting to
use the pointer returned by strchr in the case where the character was
not found will result in an exception, rather than undefined behavior.

The following code is off the top of my head, so some of the details
may not be right, but it should be enough to get the general idea:

 template< class T >
 class Fallible
 {
 public :
                         Fallible() ;
                         Fallible( T const& val ) ;
     Fallible< T >&      operator=( T const& val ) ;

     bool                isValid() const ;
                         operator T() const ;
 private :
     T                   value ;
     bool                valid ;
 } ;

 template< class T >
 Fallible< T >::Fallible()
     :   valid( false )
 {
 }

 template< class T >
 Fallible< T >::Fallible( T const& val )
     :   value( val )
     ,   valid( true )
 {
 }

 template< class T >
 Fallible< T >&
 Fallible< T >::operator=( T const& val )
 {
     value = val ;
     valid = true ;
 }

 template< class T >
 bool
 Fallible< T >::isValid() const
 {
     return valid ;
 }

 template< class T >
 Fallible< T >::operator T() const
 {
     assert( valid ) ;
     return value ;
 }
--
James Kanze         Tel.: (+33) 88 14 49 00        email: kanze@gabi-soft.fr
GABI Software, Sarl., 8 rue des Francs-Bourgeois, F-67000 Strasbourg, France
Conseils en informatique industrielle --
                              -- Beratung in industrieller Datenverarbeitung

Author: ncm@netcom.com (Nathan Myers)
Date: 1995/04/21 Raw View

>>In article <9510023.17400@mulga.cs.mu.oz.au>,
>>Fergus Henderson <fjh@munta.cs.mu.OZ.AU> wrote:
>>>I think it's clear that complexity requirements are unenforceable.

>In article <ncmD6wpGv.4p1@netcom.com>, Nathan Myers <ncm@netcom.com> wrote:
>>On the contrary.  The complexity specifications are in terms
>>of the number of operations -- calls to copy constructors
>>and comparison operators -- which can be counted.

In article <D7BsGI.3KB@ucc.su.oz.au>,
John Max Skaller <maxtal@Physics.usyd.edu.au> wrote:
> I'm sorry Nathan but you are not right here.
>
> There is no requirement that vendors supply
>C++ source code for STL parts of the C++ Standard for the
>inspection of conformance testers.

Source code is irrelevant. The question is, does the
implementation behave as specified?

> Experimental measurements cannot either confirm or
>deny compliance.  Hence ...the requirements are not enforceable.

Of course they can.  If I run an experiment in which
sort() fails to sort elements, the implementation doesn't
conform.  If it calls operator<() too many times, it
doesn't conform.  I've already outlined what "too many
times" means, and it's the conventional definition.

Of course it's not possible to confirm compliance in any case,
as I've (also) already outlined.

> They _would_ be were STL components required
>to be actual templates <they're not> with sources
>which vendors had to submit to conformance testers
><which is not a requirement of the proposed Standard>.

If it quacks like a template, and instantiates like a template,
it's a template.  In particular, when I instantiate it,
it had better use my operator<().  If it doesn't, that's
not conforming; if it does I can count calls to it.

> Do you desire to change the conformance requirements
>of the proposed C++ Standard so that in fact the complexity
>requirements are normative (enforceable)?

No, I find the present form entirely satisfactory, thank you.

Author: fjh@munta.cs.mu.OZ.AU (Fergus Henderson)
Date: 1995/04/21 Raw View

kanze@us-es.sel.de (James Kanze US/ESC 60/3/141 #40763) writes:

>> fjh@munta.cs.mu.OZ.AU (Fergus Henderson) writes:
>>
>> |> If you follow this argument to it's logical conclusion,
>> |> you have to say that even null pointers were a mistake.
>>
>> Aren't they?  I've stopped using them, in favor of `Fallible'

An optimized FalliblePtr (you could make it a template specialization
for Fallible<T*>, I guess) and conditional compilation could give you
optimal performance when you needed it and guaranteed error-checking
when you don't.  I think something like that is a good idea, although I
don't use it myself.  But there are some draw-backs: you probably run
into the "only one user-defined conversion" limit sooner; of course
compilation time is worse; and some C++ compilers don't do a good job of
inlining templates.  In C++, pointers are the way everyone does things.
I prefer not to rock the boat so much unless the benefit is large.

BTW, similar techniques have been around in other languages for
a long time.  There is a type in the ML standard library, I think
think it is called "maybe", which does a similar job to "Fallible".
Of course the details are different.  I think the ML definition
is one line of code rather than 50, although perhaps it doesn't
provide quite as much functionality.

--
Fergus Henderson            | Tell you what: go write a 100x100 matrix multiply
fjh@cs.mu.oz.au             | of integers in both languages and then let's talk
http://www.cs.mu.oz.au/~fjh | about speed, ok? - Tom Christiansen.

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/04/20 Raw View

In article <ncmD6wpGv.4p1@netcom.com>, Nathan Myers <ncm@netcom.com> wrote:
>In article <9510023.17400@mulga.cs.mu.oz.au>,
>Fergus Henderson <fjh@munta.cs.mu.OZ.AU> wrote:
>>I think it's clear that complexity requirements are unenforceable.
>
>On the contrary.  The complexity specifications are in terms
>of the number of operations -- calls to copy constructors
>and comparison operators -- which can be counted.

 I'm sorry Nathan but you are not right here.

 There is no requirement that vendors supply
C++ source code for STL parts of the C++ Standard for the
inspection of conformance testers.

 Experimental measurements cannot either confirm or
deny compliance.

 Hence Fergus is right -- the requirements are not
enforceable.

 They _would_ be were STL components required
to be actual templates <they're not> with sources
which vendors had to submit to conformance testers
<which is not a requirement of the proposed Standard>.

 Do you desire to change the conformance requirements
of the proposed C++ Standard so that in fact the complexity
requirements are normative (enforceable)?

--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/04/20 Raw View

In article <ncmD6wpGv.4p1@netcom.com>, Nathan Myers <ncm@netcom.com> wrote:
>In article <9510023.17400@mulga.cs.mu.oz.au>,
[enforcement of STL complexity requirements ..]

>This "unenforceable" myth keeps popping up, and I don't know why.

 The reason  is a large number of committee members
and non-members -- including if not especially those on the
Library Working Group -- do not seem to understand that the
C++ Standard Library is NOT a user defined library provided
as sources, compiled object modules, or whatever, but is
an intrinsic and inalienable part of the C++ core language.

 Just because such components are characterised in terms
of the C++ language itself "as if" they were user written
does not mean that they must be.

 That is, there is no requirement any library component
have, or ever did have, a declaration, definition, or any other
such thing associated with all user defined types and functions.

 This is very important for two reasons -- the first
is optimisation, compilers can do things with Standard Library
components that would be impossible with user defined entities.

 The second reason is that some library entities
cannot possibly be written as portable user defined entities,
and others -- such as malloc -- which can be often are not.

 The notion -- in the proposed C++ Standard  -- and
most other languages -- of a "standard library" is merely
a way of leveraging the description of the formal semantics
of the entities described off the rest of the "core" language.

 In fact, the STL description (of the original paper)
takes some pains to describe behaviour in terms of the syntax
used rather than the entities that such usages denotes.
For example an "iterator" is a piece of syntax with certain
properties.

 In particular, this is kind of tricky to do and
I suspect there is an "error" in the current WP:
a dereferenced iterator to a container of T does NOT
have to be an lvalue of type T (even if the WP says that).

 It is only necessary it look like one, it may
in fact be an object that converts to one on any use,
but is not in itself such an lvalue.

 The example in mind is an iterator to a vector
of bool specialised to use 1 bit per value. The dereferenced
iterator might well be an rvalue of some handle class
which will accept any valid syntax that might be applied
to a bool lvalue -- but actually isn't such an lvalue.

 Or, the iterator may just be notation the compiler
converts _directly_ into appropriate machine instructions.

 In fact, the WP description can be rescued by
noting that entities denoted by STL Standard Library
components do not have to be actual classes or even types:
all that is required is that the _look as if_ they are.

 This makes it important NOT to require
these components be actual classes or types -- and indeed
at present the conformance model cannot require that
even if we wanted it to or actually said it.

 [And all of this clearly excludes any possibility
of complexity requirements being normative :-)

--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: fjh@munta.cs.mu.OZ.AU (Fergus Henderson)
Date: 1995/04/20 Raw View

ncm@netcom.com (Nathan Myers) writes:

>In article <9510023.17400@mulga.cs.mu.oz.au>,
>Fergus Henderson <fjh@munta.cs.mu.OZ.AU> wrote:
>>I think it's clear that complexity requirements are unenforceable.
>
>On the contrary.  The complexity specifications are in terms
>of the number of operations -- calls to copy constructors
>and comparison operators -- which can be counted.

In the current draft, the complexity of the various STL algorithms
is specified in terms of counts, but the complexity of the iterator
and container operations is simply specified as "constant time (amortized)",
"linear", etc.

I agree with you in part, and must retract part of my claim.
For the algorithms, where the draft states "Complexity: Exactly <n>
applications of the corresponding predicate." or "Complexity: At most
<n> assignments" or something similar, the complexity requirements are
enforceable.  But for containers and iterators, where the draft just
states that the complexity is "linear" or "constant", etc., the complexity
requirements are not enforceable.

>The "constant
>factor" can be bounded simply by requiring (for instance) that
>a list of 2N elements take no more than twice as long to
>process as a list of N elements, for a linear algorithm; no
>more than four times as long, for a quadratic algoritm; and
>so on.

The standard _could_ require that, but unless I am missing something,
it currently doesn't.  To do so would be contrary to the usual meaning
of "linear" and "quadratic". Furthermore, such a requirement
would make the implementations that I have seen non-conforming!
Memory hierarchy effects mean that there is almost invariably
some N for which processing a list of 2N elements thrashes the
cache or main memory, causing it to take considerably more than
twice as long than for process a list of N elements, even though
the algorithm is linear.

>This "unenforceable" myth keeps popping up, and I don't know why.
>It's easy to show that a library doesn't meet such a requirement
>for some input [...]

That is only true if complexity requirements are stated in terms
of maximum number of invokations of some operation, which is true
of only some of the complexity requirements in the current draft.

Furthermore, the sort of complexity requirements that are enforceable
don't actually give you any guarantees about the actual time
requirements of the program.

[Note: all the above does not mean that I think complexity requirements
in the standard are not a good idea.  I think they are a great idea!
For all practical purposes, they achieve their aim.  But I don't think
we should be deluded into thinking that they prevent peversely low-quality
implementations from executing our programs as slowly as they want.]

--
Fergus Henderson            | As practiced by computer science, the study of
fjh@cs.mu.oz.au             | programming is an unholy mixture of mathematics,
http://www.cs.mu.oz.au/~fjh | literary criticism, and folklore. - B. A. Sheil

Author: doug@monet.ads.com (Doug Morgan)
Date: 1995/04/20 Raw View

In article <D7BtJy.7EG@ucc.su.OZ.AU> maxtal@Physics.usyd.edu.au (John Max Skaller) writes:
> ...
>  In particular, this is kind of tricky to do and
> I suspect there is an "error" in the current WP:
> a dereferenced iterator to a container of T does NOT
> have to be an lvalue of type T (even if the WP says that).

This is certainly an error.  I remember the original STL documents as
being pretty clear that iterators had to dereference to something that
is convertible to a T.  There was a similar error (to the one
apparently now in the WP) in table 10 of the May 31, 1994 version of
the STL document.  It specified that sequence's a.front(), a.back(),
and a[n] return references.  Of course this was an oversight (and even
the example vector<bool> did not return references).  The next version
of the document was corrected.  Hopefully, the WP can be corrected
soon, too.

> ...
>  This makes it important NOT to require
> these components be actual classes or types -- and indeed
> at present the conformance model cannot require that
> even if we wanted it to or actually said it.

All correct.  I hope that STL as an abstract interface standard isn't
wiped out (with the best of misguided intentions) by committees that
don't understand what they are standardizing.

Doug
----------
Doug Morgan, doug@ads.com
Booz-Allen & Hamilton
1500 Plymouth St.
Mountain View, CA 94043-1230
     (415) 960-7444
FAX: (415) 960-7500
----------

Author: kanze@us-es.sel.de (James Kanze US/ESC 60/3/141 #40763)
Date: 1995/04/19 Raw View

In article <9510113.6681@mulga.cs.mu.OZ.AU> fjh@munta.cs.mu.OZ.AU
(Fergus Henderson) writes:

|> Yes.  I agree entirely that discriminated unions are the right way of
|> handling this sort of thing.  The only problem is that C++ doesn't support
|> discriminated unions very well.  If you follow this argument to it's
|> logical conclusion, you have to say that even null pointers were a mistake.

Aren't they?  I've stopped using them, in favor of `Fallible' (e.g.: a
function will return `Fallible< char* >', rather than simply `char*',
if it can fail).

The one advantage of null pointers in this regard is that most
compilers will return a single pointer in a register, whereas I know
of no compiler which currently optimizes class type return values into
a register when they fit.  (And I will be the first to admit that I
don't expect to see compilers returning a Fallible in registers,
although such an optimization is both possible and useful.)

Since this advantage is not valid for class types, I see *no* reason
to try and perpetuate this hack for iterators.  (Not to mention John's
point that you often need more than one error code.)
--
James Kanze         Tel.: (+33) 88 14 49 00        email: kanze@gabi-soft.fr
GABI Software, Sarl., 8 rue des Francs-Bourgeois, F-67000 Strasbourg, France
Conseils en informatique industrielle --
                              -- Beratung in industrieller Datenverarbeitung

Author: kanze@us-es.sel.de (James Kanze US/ESC 60/3/141 #40763)
Date: 1995/04/19 Raw View

In article <3m55ss$fl2@eclipse.eng.sc.rolm.com>
eddy@clipper.robadome.com (eddy Gorsuch) writes:

|> OK, I think I understand what you are trying to do.
|> You want your iterator to serve two purposes:
|> 1. Be an iterator
|> 2. Be an indication that some routine failed miserably.

|> I agree with John Max Skaller that you should really be returning 2
|> different results. Since you don't want to use exceptions here, could you
|> change your Dicts::look_up_word() to return a
|> pair<error_indicator, Dicts::iterator> instead of just an iterator?

Agreed, 100%.  I would strongly recommend reading Barton and Nackmann.
In particular, look at their `Fallible' class.
--
James Kanze         Tel.: (+33) 88 14 49 00        email: kanze@gabi-soft.fr
GABI Software, Sarl., 8 rue des Francs-Bourgeois, F-67000 Strasbourg, France
Conseils en informatique industrielle --
                              -- Beratung in industrieller Datenverarbeitung

Author: kanze@us-es.sel.de (James Kanze US/ESC 60/3/141 #40763)
Date: 1995/04/19 Raw View

In article <D6ytsv.HJC@ucc.su.OZ.AU> maxtal@Physics.usyd.edu.au (John
Max Skaller) writes:

    [Concerning discriminated unions...]
|>  Here are two examples where you need them:

|>  1) A transaction processing system.

|>  There are a finite set of transactions.
|>  No polymorphism is involved, you have to process each
|>  one differently according to fixed rules dependent on
|>  the type.

The type of the transaction, or the type of the object.  Generally, I
suspect both.  Which means multiple dispatch (not discriminate
unions).  Although C++ doesn't support multiple dispatch, there is
actually a fairly good work-around for the case where the number of
variants of one of the objects is closed, as is the case here.

|>  The effects of a transaction permeates the system wholistically,
|>  and cannot be encapsulated into a single notion.

|>  2) A grammar (or parse tree).

|>  Each node of a grammar is either a terminal or non-terminal.
|>  You can't process the grammar without knowing which is which.
|>  You have to build a tree of nodes. The node has to be
|>  a discrimniated union of the two kinds.

Typically, you have many more than two types of nodes.

In all of the grammars I've written, I've found polymorphism
(derivation) to be the ideal solution here.  In fact, back when I
still worked in C, my standard solution was to have each node type
start with a pointer to a node descriptor.  Typically, the node
descriptor contains a variety of information; for anything complex
(say calculate the Sethi-Ulmann number), it contained a pointer to a
function.  Given this structure in C, you can imagine how quickly I
switched to C++.
--
James Kanze         Tel.: (+33) 88 14 49 00        email: kanze@gabi-soft.fr
GABI Software, Sarl., 8 rue des Francs-Bourgeois, F-67000 Strasbourg, France
Conseils en informatique industrielle --
                              -- Beratung in industrieller Datenverarbeitung

Author: gustav@morpheus.demon.co.uk (Paul Moore)
Date: 1995/04/19 Raw View

In message <KANZE.95Apr19150630@slsvhdt.us-es.sel.de> James Kanze US/ESC 60/3/141
#40763 wrote:

> In article <9510113.6681@mulga.cs.mu.OZ.AU> fjh@munta.cs.mu.OZ.AU
> (Fergus Henderson) writes:
>
> |> Yes.  I agree entirely that discriminated unions are the right way of
> |> handling this sort of thing.  The only problem is that C++ doesn't support
> |> discriminated unions very well.  If you follow this argument to it's
> |> logical conclusion, you have to say that even null pointers were a mistake.
>
> Aren't they?  I've stopped using them, in favor of `Fallible' (e.g.: a
> function will return `Fallible< char* >', rather than simply `char*',
> if it can fail).
>

I've never seen "Fallible". Can you point me to a description, and
possibly an implementation?

Gustav.

--
------------------------------------------------------------------------
Paul Moore                                   gustav@morpheus.demon.co.uk
------------------------------------------------------------------------

... Pets just die on you, where's the fun in that?

Author: kanze@us-es.sel.de (James Kanze US/ESC 60/3/141 #40763)
Date: 1995/04/19 Raw View

In article <FENSTER.95Apr4213620@ground.cs.columbia.edu>
fenster@ground.cs.columbia.edu (Sam Fenster) writes:

|> It's often good design to let an object have an `invalid' state.

Is it?  It's often an acceptable compromize, which will allow
returning a built-in type in a register, rather than a class.  (With
most compilers, returning a class will result in extra overhead, even
if the class would fit in a register easily.)  But it tends to be
limited, and to cause problems in the long run.

Consider the simple case of the Unix system call: signal.  Its second
parameter is a pointer (to a function which handles the signal).  It
uses a special value to signal the default handling, *and* a second
special value to signal ignore.  Oops, maybe we need both NULL and
NULL2 values, and not just NULL.
--
James Kanze         Tel.: (+33) 88 14 49 00        email: kanze@gabi-soft.fr
GABI Software, Sarl., 8 rue des Francs-Bourgeois, F-67000 Strasbourg, France
Conseils en informatique industrielle --
                              -- Beratung in industrieller Datenverarbeitung

Author: perkinsp@nando.net (Paul A. Perkins)
Date: 1995/04/16 Raw View

In article <D6ytsv.HJC@ucc.su.OZ.AU>, John Max Skaller says...

>        1) A transaction processing system.
>
>        There are a finite set of transactions.
>        No polymorphism is involved, you have to process each
>        one differently according to fixed rules dependent on
>        the type.

I see. You want polymophism, but you don't want to CALL it polymophism.
Suit yourself, I guess.

>Note that inheritance and polymorphism are exactly the
>wrong thing to use when unification and discrimination
>is required -- the whole point of an abstract type
>with polymorphic subtypes is that the subtypes ARE
>the supertype by definition.

Subtypes ARE the supertype???!!! I hope this is just a typo!

--
Paul A. Perkins
perkinsp@nando.net
(All things are Fire)

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/04/16 Raw View

In article <3mmpph$iae@highway.LeidenUniv.nl>,
Jan-Peter de Ruiter <ruiter@ruls41.LeidenUniv.nl> wrote:
>John Max Skaller (maxtal@Physics.usyd.edu.au) wrote:
>
>: The key idea is that you have to "extend" STL.
>: (Actually, you are not extending it but introducing a "subtype")
>: That does not mean adding a template class. It means
>: adding extra PROTOCOL. STL already shows how to specify
>: protocol. It is a good model. Learning how to extend it
>: is a whole new ballgame.
>:
>
>Sure, but if everyone is going to write his or her own 'safe'
>layer on STL, we are not having a standard anymore.

 Do you see a way around this?

 We standardise C++, everyone is going to write their
own programs. They're not "standard". The direct advantage to the
reusable components market is  zero.

 We standardise a set of 10 library components,
everyone uses them differently and continues to invent their own.
The advantage to the reusable components market is 10.

 We standardise a _protocol_ like STL and at least
there is an opportunity for YOUR layers and extensions
to cooperate with mine. At worst your LAYER containers
and iterators will still work with STL itself. So I can
use them even if I do not use your protocol. Remember,
your extensions still have to be STL compliant.

 Advantage to reusable components industry?
What a silly question. This CREATES a reusable components industry
which cannot possibly exist with a mere 10 standard components or
just a standardised language.

 Example: there is a PD hash table extension of STL.
Will you write your own or just use it? I will at least
_try_ it out.

 Is there work to be done, and products to produce which
may succeed or fail? Yes of course.

 How successful will STL be? It remains to be seen.

 I can tell you it has improved my own productivity
enormously -- and I'm only just starting to use it.

>I still don't see how an important library like STL can allow
>the following code to compile and run without even as much as a
>warning:
>
>#include <iostream.h>
>#include <list.h>
>
>int array [] = { 1,2,3,4,5 };
>
>int main ()
>{
>  list<int> l1 (array, array + 5);
>  list<int>::iterator i1 = l1.begin ();
>  while (i1 != l1.end ())
>    cout << *i1++ << endl;
>  int test = *i1; // i1 is now pointing at random nonsense
>  cout << test; // OK, on _some_ systems test will be 0.
>  return 0;
>}
>

 Let me turn your question around. I don't see how
an important library like STL can _prevent_ the code above
from compiling, or even crashing, without imposing a performance penalty
on code not containing such errors.

 However, I'm told by Alex the idea is that STL based
code might be statically checked by tools.

 C++ as a language is not capable of that. Remember in
modern terms, it isn't a very secure language in the first place.

 So as more and more STL code is written, and more
and more people discover particular classes of bugs that are
common -- and experts begin to discover how to recognize these
bugs -- then you can expect tools to be developed to help
you write safer code. (And changes to the C++ language eventually
to accomodate this)

----------------------------------------------------------------
Now let me make a suggestion. Rewrite your code:

  {for
  (
     list<int>::iterator i1 = l1.begin ();
     i1 != l1.end ();
     i1++
  )
  {
    cout << *i1 << endl;
  }}
  int test = *i1; // i1 is now pointing at random nonsense
  // COMPILER ERROR: i1 not declared

When your compiler supports proper scoping of conditionals,
you won't need the { and } around the for statement.

This is an _indication_ of the kind of thing needed for
safe programming -- unfortunately C++ will never
have constructions properly designed to support correctness,
it is based on C which is far too archaic, and was designed
for hand optimisation rather than verification.

However, it is generally known for statements are superior
to while statements because the iteration control is all
in one place. In Pascal they're even better, since you
cannot modify the control variable at all in the body,
and because the increment and compare are generated
correctly by the compiler. That is, in Pascal, for
statements are syntactically guarranteed to be secure.

Similarly, array index violations are impossible in Pascal.
Of course, it _is_ possible to get an error converting
a type to a subrange. So the advantages are small.

However there is a body of theory on checking these things.
I'm told by a professor at QUT that about 90% of all
this class of error can be detected in typical programs.
That is given some integer calculations

 var i : 10..20;
      j : 0..99;
 ...
 j += i + i*i;
 ...
 array[j] ...

it is possible to compute the possible range of values
so that required run time checks can be elided because
it can be proved they will always succeed.

This kind of work indicates that with appropriate language
constructions and coding styles, checked conversions
may be possible without significant loss of efficiency
in future languages. (Well, in this case the work is
being done for Modula II, which is an ISO Standard language)

The point? What you want is being worked on but it is HARD.
There is no magic.

In the end, "correctness" cannot be assured. Testing
is never going to go out of style :-)

--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: horstman@sjsumcs.sjsu.edu (Cay Horstmann)
Date: 1995/04/17 Raw View

John Max Skaller (maxtal@Physics.usyd.edu.au) wrote:
: In article <3mmpph$iae@highway.LeidenUniv.nl>,
: Jan-Peter de Ruiter <ruiter@ruls41.LeidenUniv.nl> wrote:
: >John Max Skaller (maxtal@Physics.usyd.edu.au) wrote:
: >
: >: The key idea is that you have to "extend" STL.
: >: (Actually, you are not extending it but introducing a "subtype")
: >: That does not mean adding a template class. It means
: >: adding extra PROTOCOL. STL already shows how to specify
: >: protocol. It is a good model. Learning how to extend it
: >: is a whole new ballgame.
: >:
: >
: >Sure, but if everyone is going to write his or her own 'safe'
: >layer on STL, we are not having a standard anymore.

[a few lines deleted]

:  We standardise a _protocol_ like STL and at least
: there is an opportunity for YOUR layers and extensions
: to cooperate with mine.

So it is a PROTOCOL. No wonder some people thought that STL had
less than they expected. They expected a LIBRARY. But what you say
makes a lot of sense. If you want to templatize, you need a protocol
so that the string replacement in the templates yields something
that actually instantiates. That explains the crazy names, like
push_back to append an element to the end of a list.

A few people had disagreed with the "T" in STL, suggesting that
"standard container library" would have been a better choice. It seems
the "T" is wholly appropriate, but the "L" should have been a "P"
--the Standard Template Protocol.

Cay

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/04/13 Raw View

In article <0iJjvA2NBh107h@double.actrix.gen.nz>,
Chris Double <chris@double.actrix.gen.nz> wrote:
>In <9510113.6681@mulga.cs.mu.OZ.AU> fjh@munta.cs.mu.OZ.AU (Fergus Henderson) writes:
>>Yes.  I agree entirely that discriminated unions are the right way of
>>handling this sort of thing.  The only problem is that C++ doesn't support
>>discriminated unions very well.  If you follow this argument to it's
>>logical conclusion, you have to say that even null pointers were a mistake.
>
>What are discriminated unions? At a guess it sounds like a union of types
>and some sort of identifier saying which type is actually stored in the
>union. Is this correct?

 In a nutshell, yes.
>
>What sort of area you would use this in?

 Everywhere. The notion is as general and important
as inheritance and of equal utility. It is like AND and OR
of logic. The idea of not having both is just absurd.

>I remember John Skaller saying
>in a previous post that people don't use them enough. Where should they
>be used? Is there an idiom that you can use in C++ to support
>discriminated unions?

 No, there isn't. Not really. So when you need them,
it is very hard to get around this problem. dynamic_cast
with a dummy "object" works, but is invasive.

 Here are two examples where you need them:

 1) A transaction processing system.

 There are a finite set of transactions.
 No polymorphism is involved, you have to process each
 one differently according to fixed rules dependent on
 the type.

 The effects of a transaction permeates the system wholistically,
 and cannot be encapsulated into a single notion.

 2) A grammar (or parse tree).

 Each node of a grammar is either a terminal or non-terminal.
 You can't process the grammar without knowing which is which.
 You have to build a tree of nodes. The node has to be
 a discrimniated union of the two kinds.

Put it this way: subtyping (nice inheritance) takes a subset,
it is the AND operation. Discrimination is by polymorhism.

Unification is an OR operation, it makes a new type which is
the union of two others. Discrimination is subtraction --
a way of getting the original type of an object of a unified
type.

Type casing is a kind of selection on type like a switch.
Indeed when the type discriminant is represented as
testable state, this is how you do the discrimination.

In fact, one can view any subset of states of a type
as another type -- and so EVERY time you make a decision
in a program -- by a switch or an if/then/else -- you are
type casing on an implied type.

Discriminated unions allow "packaging" of these distinctions
with semantics for unification and discrimination.
C unions only permit unification, you have to do the
discrimination manually.

Note that inheritance and polymorphism are exactly the
wrong thing to use when unification and discrimination
is required -- the whole point of an abstract type
with polymorphic subtypes is that the subtypes ARE
the supertype by definition.

Discriminiated unions are the opposite: unification
is imposed afterwards, it isn't inherent.

--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: ruiter@ruls41.LeidenUniv.nl (Jan-Peter de Ruiter)
Date: 1995/04/14 Raw View

John Max Skaller (maxtal@Physics.usyd.edu.au) wrote:

: The key idea is that you have to "extend" STL.
: (Actually, you are not extending it but introducing a "subtype")
: That does not mean adding a template class. It means
: adding extra PROTOCOL. STL already shows how to specify
: protocol. It is a good model. Learning how to extend it
: is a whole new ballgame.
:

Sure, but if everyone is going to write his or her own 'safe'
layer on STL, we are not having a standard anymore.

I still don't see how an important library like STL can allow
the following code to compile and run without even as much as a
warning:

#include <iostream.h>
#include <list.h>

int array [] = { 1,2,3,4,5 };

int main ()
{
  list<int> l1 (array, array + 5);
  list<int>::iterator i1 = l1.begin ();
  while (i1 != l1.end ())
    cout << *i1++ << endl;
  int test = *i1; // i1 is now pointing at random nonsense
  cout << test; // OK, on _some_ systems test will be 0.
  return 0;
}

Author: fjh@munta.cs.mu.OZ.AU (Fergus Henderson)
Date: 1995/04/12 Raw View

maxtal@Physics.usyd.edu.au (John Max Skaller) writes:

>Fergus Henderson <fjh@munta.cs.mu.OZ.AU> wrote:
>>
>>I think it's clear that complexity requirements are unenforceable.
>
> I don't think thats quite correct, in a sense I think
>you would agree with -- it is possible to determine in some cases
>what the complexity of a piece of code is.

My point was in fact it is possible to determine the complexity of
_all_ operations (that terminate and that don't involve I/O), but that
in fact worst-case complexity analysis if carried to extremes can give
you results which are simply not at all useful.  As soon as you try to
enforce them, you realize that for enforcement purposes the complexity
requirements are meaningless.

> The point is you have to do this by reading and analysing
>the code, it isn't something that is really "testable" in the sense
>of executing the code and recording and analysing behaviour.
>This would imply that in order to met test requirements, vendors
>would be _required_ to supply actual sources for, say, STL
>Standard Library entities.

An alternative would be to require implementations to document the
appropriate constants.  For example, if some operation was supposed to
be at worst linear in N, the implementation could be required to
document an N0 and a K such that the time for the operation was no
worse than K*max(N,N0) for all N.  That would be quite testable,
although the documentation requirements would probably be too onerous.
But it would not be at all useful, since implementations would supply
ridiculously high values for the appropriate constants.

BTW, the draft implies that the complexity of all operations on iterators
is amortized constant time.  What does this mean when applied to input
iterators, e.g. istream_iterator<char>(cin)?  If it's constant time,
what is the constant?

--
Fergus Henderson            | As practiced by computer science, the study of
fjh@cs.mu.oz.au             | programming is an unholy mixture of mathematics,
http://www.cs.mu.oz.au/~fjh | literary criticism, and folklore. - B. A. Sheil

Author: fjh@munta.cs.mu.OZ.AU (Fergus Henderson)
Date: 1995/04/12 Raw View

chris@double.actrix.gen.nz (Chris Double) writes:

>What are discriminated unions? At a guess it sounds like a union of types
>and some sort of identifier saying which type is actually stored in the
>union. Is this correct?

Yes, roughly speaking, and that's one way of implementing discriminated
unions in C++.

Mathematically, if you consider types as sets of values,
then a discriminated union D of types T_1, T_2, ..., T_n is the set

 D = ({ tag_1 } x T_1) U ({ tag_2 } x T2 } U ... U ({ tag_n } x T_n)

where `tag_1' thru `tag_n' are distinct (but otherwise arbitrary) constants,
(`x' denotes cross-product, `U' denotes set union, and `{ X }' denotes
the set containing just X.)

Note that enumerations are equivalent to a special case of discriminated
unions, when all the T_i are the unit type.

Another way of implementing discriminated unions in C++ is with inheritence.
All the types T_1 ... T_n are derived from a common abstract base class.
An element of the discriminated union is represented by a pointer or
reference to the base class, and you use RTTI to determine which of
the different possibilities it actually refers to.

>What sort of area you would use this in?

Whenever a piece of data could be one of a finite, stable set of alternatives.

If it could be one of an unbounded or unstable set of alternatives, then
you should generally use subtyping implemented as inheritence from an
abstract base class (and in this case, you should generally _not_ use RTTI).

In languages with good support for discriminated unions, the most
commonly used examples are types like lists and trees.  For example, a
list of integers is either empty (`nil'), or it is constructed (`cons')
from an integer followed by another intlist:

 datatype intlist = nil
    | cons of int * intlist
    ;

I've borrowed SML syntax.  Note that the `*' in `int * intlist' is a
cross-product, i.e.  the type `int * intlist' is just a pair consisting
of an int and an intlist.

With the above definition, you can write expressions of type intlist
such as `nil', which represents the empty list, and
`cons(1, cons(2, cons(3, nil)))', which represents the list 1, 2, 3.
You can also write functions which act on this recursive data structure,
such as a function to compute the length of a list:

 fun length (list) =
  case list of
   nil =>
    0
   cons (head, tail) =>
    length (tail) + 1;

Another example is a 2-3 tree, which can contain nodes with either two
or three children:

 datatype tree = empty
   | two_node of tree * int * tree
   | three_node of tree * int * tree * int * tree
   ;

Another example, from a hypothetical card came:

 datatype suit = spades | clubs | diamonds | hearts ;
 datatype card = card of suit * rank | joker ;

--
Fergus Henderson            | As practiced by computer science, the study of
fjh@cs.mu.oz.au             | programming is an unholy mixture of mathematics,
http://www.cs.mu.oz.au/~fjh | literary criticism, and folklore. - B. A. Sheil

Author: harinath@hecto.cs.umn.edu (Raja R Harinath)
Date: 1995/04/12 Raw View

In article <D6oI19.ILL@reston.icl.com>,
Stephen Carlson <scc@reston.icl.com> wrote:
>In article <D6DwEJ.5M2@ucc.su.OZ.AU> maxtal@Physics.usyd.edu.au (John Max Skaller) writes:
>>In article <D6482o.KtJ@reston.icl.com>,
>>Stephen Carlson <scc@reston.icl.com> wrote:
>>>In article <KINNUCAN.95Mar22150251@candide.hq.ileaf.com> kinnucan@hq.ileaf.com (Paul Kinnucan) writes:
>>>>   If iterators are to be a generalization of pointers, shouldn't one of
>>>>   their most important behaviors (the ability to be null) also be part
>>>>   of the generalization?
>>>>No.
>>>Would you care to explain this answer?  Why should a "generalization of
>>>pointers" not include an important property of pointers?
>>
>> Because that property is not useful for making the
>>_algorithms_ of STl work.
>
>Your answer explains why STL Iterators have been misbilled (in the
>documentation no less).  There is no property about generic pointers
>that is being generalized, yet one property, being NULL, has been
>taken away.

>IMHO, the proper way to bill STL Iterators is as a "generalization of
>array pointers."  This simultaneously (a) shows what is being generalized
>("array" to "Container") and (b) explains why there are no null iterators:
>array pointers are simply not null.

True.

The C/C++ pointer is unique; it is a language construct which provides
_two_ mostly disjoint abstractions:

1. the Pascal like pointer, or the `link' of a linked data structure.
This abstraction requires a NULL value.

2. the array iterator model, unique to C/C++, which definitely doesn't
require a NULL value.

In addition to which, it provides:

3. a back-door for pass-by-reference in a (mostly) pass-by-value
language.  Again, it doesn't require a NULL value (or am I missing
something).

The main feature common to all three is that they can be dereferenced.

I can't come up with a scenario where a single pointer simultaneously
acts as both (1) and (2) above.

STL iterators have little, if nothing to do with (1) above, the only
case where a NULL is required.  STL iterators generalize (2) above.

The usage of NULL in other `pointer' contexts is mainly due to a
confusion of abstractions.  We should _not_ carry the same confusion
to the STL iterator, which provides one and only one abstraction.

- Hari
--
--Raja R Harinath-----------`finger harinath@cs.umn.edu' for more info
   "Come to think of it, there already _are_ a million monkeys on a
     million typewriters, but Usenet is _nothing_ like Shakespeare."
                                                  -- Blaire Houghton

Author: fjh@munta.cs.mu.OZ.AU (Fergus Henderson)
Date: 1995/04/11 Raw View

maxtal@Physics.usyd.edu.au (John Max Skaller) writes:

>A null value to express "failure" fails to specify what _kind_
>of failure. The correct way to do this is with discriminated union:
>
> enum {success, notfound, emptyrange, duplicatesfound,
>  illformedquery} ..
>
>that is, adding a mere "boolean" flag to an iterator is refusing
>to recognize that the return value of a function might
>be a valid iterator OR any other state information.

Yes.  I agree entirely that discriminated unions are the right way of
handling this sort of thing.  The only problem is that C++ doesn't support
discriminated unions very well.  If you follow this argument to it's
logical conclusion, you have to say that even null pointers were a mistake.

--
Fergus Henderson            | As practiced by computer science, the study of
fjh@cs.mu.oz.au             | programming is an unholy mixture of mathematics,
http://www.cs.mu.oz.au/~fjh | literary criticism, and folklore. - B. A. Sheil

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/04/12 Raw View

In article <9510113.6681@mulga.cs.mu.OZ.AU>,
Fergus Henderson <fjh@munta.cs.mu.OZ.AU> wrote:
>maxtal@Physics.usyd.edu.au (John Max Skaller) writes:
>
>>A null value to express "failure" fails to specify what _kind_
>>of failure. The correct way to do this is with discriminated union:
>>
>> enum {success, notfound, emptyrange, duplicatesfound,
>>  illformedquery} ..
>>
>>that is, adding a mere "boolean" flag to an iterator is refusing
>>to recognize that the return value of a function might
>>be a valid iterator OR any other state information.
>
>Yes.  I agree entirely that discriminated unions are the right way of
>handling this sort of thing.  The only problem is that C++ doesn't support
>discriminated unions very well.  If you follow this argument to it's
>logical conclusion, you have to say that even null pointers were a mistake.

 Not necessarily: something that I learned from someone else
and which has really stuck in my mind is the answer to the following
question:

 "What is the difference between state and type?"

The answer is:

 "Engineering"

(Substitute "art" if you like :-)

--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: chris@double.actrix.gen.nz (Chris Double)
Date: 1995/04/12 Raw View

In <9510113.6681@mulga.cs.mu.OZ.AU> fjh@munta.cs.mu.OZ.AU (Fergus Henderson) writes:
>Yes.  I agree entirely that discriminated unions are the right way of
>handling this sort of thing.  The only problem is that C++ doesn't support
>discriminated unions very well.  If you follow this argument to it's
>logical conclusion, you have to say that even null pointers were a mistake.

What are discriminated unions? At a guess it sounds like a union of types
and some sort of identifier saying which type is actually stored in the
union. Is this correct?

What sort of area you would use this in? I remember John Skaller saying
in a previous post that people don't use them enough. Where should they
be used? Is there an idiom that you can use in C++ to support
discriminated unions?

Regards,
Chris Double.

Author: ncm@netcom.com (Nathan Myers)
Date: 1995/04/12 Raw View

In article <9510023.17400@mulga.cs.mu.oz.au>,
Fergus Henderson <fjh@munta.cs.mu.OZ.AU> wrote:
>I think it's clear that complexity requirements are unenforceable.

On the contrary.  The complexity specifications are in terms
of the number of operations -- calls to copy constructors
and comparison operators -- which can be counted.  The "constant
factor" can be bounded simply by requiring (for instance) that
a list of 2N elements take no more than twice as long to
process as a list of N elements, for a linear algorithm; no
more than four times as long, for a quadratic algoritm; and
so on.

This "unenforceable" myth keeps popping up, and I don't know why.
It's easy to show that a library doesn't meet such a requirement
for some input, even if you can't prove that it satisfies for all
possible input.  This only means that nobody can certify that a
library does conform; but nobody can do that anyway, because they
can't prove there are no bugs.

Nathan Myers
myersn@roguewave.com

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/04/12 Raw View

In article <9510023.17400@mulga.cs.mu.OZ.AU>,
Fergus Henderson <fjh@munta.cs.mu.OZ.AU> wrote:
>maxtal@Physics.usyd.edu.au (John Max Skaller) writes:
>
>> It is uncertain if, in fact, such complexity requirements
>>are "normative" since they are deducible from code -- but
>>cannot be measured.
>
>I think it's clear that complexity requirements are unenforceable.

 I don't think thats quite correct, in a sense I think
you would agree with -- it is possible to determine in some cases
what the complexity of a piece of code is.

 The point is you have to do this by reading and analysing
the code, it isn't something that is really "testable" in the sense
of executing the code and recording and analysing behaviour.
This would imply that in order to met test requirements, vendors
would be _required_ to supply actual sources for, say, STL
Standard Library entities.

 That would deny existing permissions that such actual
codes are an implementation detail at best -- and may not
exist.
--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/04/12 Raw View

To those who want NULL iterators, try this:

Define:

 enum zero_t { zero=0; };

Specify:

 A type X providing the syntax

 x == zero

 with a type bool is said to be _pointed_ (mathematical
 term for "have a distinguished value")

Now you can write algorithms and state in the requirements

 "This algorithm requires the type of the input range
 iterators to be pointed".

Thats it. You just extended STL.

Now prove it is useful. If you can others might use your extension.
You'd then have established a defacto standard.

--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: chris@double.actrix.gen.nz (Chris Double)
Date: 1995/04/09 Raw View

In <D6ntAG.9qu@ucc.su.OZ.AU> maxtal@Physics.usyd.edu.au (John Max Skaller) writes:

> Sigh. And I'd rather have referential transparency.
>Because I want to write algorithms that are correct and
>operate whether or not the container supports the semantics
>efficiently.

> So it seems I need to derive :

> template<class T, template<class> class Container>
> struct IndexedContainer: Container<T> {
>  T& operator[](int) {
>   list<T>::iterator i = begin();
>   while(i--) i++;
>   return i;
>  }
> }

>[which as written has the serious problem of being LESS efficient
>for a container like vector]

Can't you do something like:

  T& operator[](int index) {
   list<T>::iterator n = begin();
   advance(n, index);
   return *n;
  }

Using advance will be efficient for vectors and not for lists.

Regards,
Chris.

Author: fjh@munta.cs.mu.OZ.AU (Fergus Henderson)
Date: 1995/04/10 Raw View

ruiter@ruls41.LeidenUniv.nl (Jan-Peter de Ruiter) writes:

>Paul Kinnucan (kinnucan@hq.ileaf.com) wrote:
>
>: I see, so the standards committee should add
>: a feature to an already complex language solely on the basis that it
>: will make some programmers happier?
>
>Indeed, it should. Unless it makes other programmers' life
>harder, and that is definitely not the case with null valued iterators.

Unfortunately complexity makes _everyone's_ life harder.
That's why we have to make trade-offs.

--
Fergus Henderson - fjh@munta.cs.mu.oz.au

Author: fjh@munta.cs.mu.OZ.AU (Fergus Henderson)
Date: 1995/04/10 Raw View

maxtal@Physics.usyd.edu.au (John Max Skaller) writes:

> It is uncertain if, in fact, such complexity requirements
>are "normative" since they are deducible from code -- but
>cannot be measured.

I think it's clear that complexity requirements are unenforceable.
After all, all real machines have only a finite amount of memory, and
hence only a finite number of different states.  This implies that the
time to complete any action not involving I/O is bounded by the product
of the number of states and the state transition time (presuming that
the action does terminate at all).  Thus any action not involving I/O
is constant-time.  Of course, the constant may be considerably longer
than the lifetime of the universe, so this conclusion isn't useful for
anything other than proving conformance to the letter of some standard,
or pointing out some inherent limitations in complexity analysis.

In fact the draft seems to be only half-serious about complexity anyway -
for allocators, it says that all operations are "expected" to have
amortized constant time.  That doesn't sound like a normative requirement
to me.

Note that a standard doesn't have to be enforceable in order to be
useful.  For example, the Ada standard has lots of useful
"Implementation Advice" sections, which state that implementations
"should" do such-and-such.

--
Fergus Henderson - fjh@munta.cs.mu.oz.au

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/04/11 Raw View

In article <3m4mam$2le@jupiter.SJSU.EDU>,
Cay Horstmann <horstman@sjsumcs.sjsu.edu> wrote:
>
>In contrast, STL purposefully chose [to do]
>nothing to prevent iterator mishandling.
>I don't think they are wrong, it just is
>different from my own efficiency/safety tradeoff model.

 Why don't we tack this issue head on? Here's a
proposition for you:

 1) STL is right as it is (not providing iterator safety)

 2) STL is powerful and extensible enough to allow
 YOU to provide a layer that does provide iterator safety.

As I write I do not _know_ whether these propositions hold water
or not. But I'd like to find out. In particular, I start
with the proposition as an mental assumption and attempt to
prove it by constructively -- by actually writing a safe iterator layer.

I may need help doing that. Anyhow, lets start:

First, what does "safe" mean? Well, some possibilities are:

 1) An iterator pair representing a range both denote
 the same container

 2) An iterator pair representing a range not only denote
 the same container, they denote a valid subsequence
 of the container (possibly empty)

 3) An iterator requiring dereferening is dereferenceable

Now, to check (1) we require a function

 container_of(iterator)

which tells to what container (if any) an iterator refers.
To tell if an iterator is valid we could require a function

 valid_iterator(iterator)

and to tell if dereferenceable we could define

 dereferenceable(iterator it) {
  return
   valid_iterator(it) &&
   iterator != container_of(itertor) -> end();
 }

To tell if a range is valid is easy if slow:

 valid_range(iterator it1, iterator2 it2) {
  if (container_of(it1) == container_of(it2))
  {
   int len = container_of(it1)->size();
   while(len--) if(it1++==it2) return true;
  }
  return false;
 }

In fact this idea can be used to implement:

 valid_iterator(iterator)

by simply checking all possible iterators.

Now, of COURSE this is inefficient. But the above are templates
and so can be specialised.

Implementing "container_of" can be done by delegation:

 class iterator {
  delegate it;
  container *cont;
  ....
  // use delegation here, possibly with checks
 }

OK, that is only an outline not real code. But it _looks_
possible.

The key idea is that you have to "extend" STL.
(Actually, you are not extending it but introducing a "subtype")
That does not mean adding a template class. It means
adding extra PROTOCOL. STL already shows how to specify
protocol. It is a good model. Learning how to extend it
is a whole new ballgame.

--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: pstemari@erinet.com (Paul J. Ste. Marie)
Date: 1995/04/08 Raw View

In article <FENSTER.95Apr4213620@ground.cs.columbia.edu>,
   fenster@ground.cs.columbia.edu (Sam Fenster) wrote:
:I agree.  An STL container should always hand back an iterator
:guaranteed to point to valid data, or to an end iterator.  It
:shouldn't require error checking.  But it would be useful if
:*other* functions could return Null iterators, and if iterators
:could be initialized to Null.

Perhaps.  But I don't see the utility of bundling a error state
that has nothing to do with an STL container with an STL iterator,
which, after all, exists to support STL containers.  To me it
smacks of data bundling, which is a bad idea in procedural code and
doesn't get any better in OO code.

:It's often good design to let an object have an `invalid' state.

I don't know about that.  It's better than an object that can have
an invalid state and doesn't tell you about it, but in general I'd
rather have objects that never wind up in invalid states.

:I agree that STL containers should not return invalid iterators.
:But functions which are not part of the STL, and have unrelated or
:expanded functionality, can hand back iterators defining an STL
:range.  (This is similar to how a function that does something
:unrelated to string handling can return a string!)  Null would
:be a particularly compact and convenient way to indicate the
:function's failure.

But again, why burden STL with keeping around error flags for
non-STL functions?  If this is really a problem, why can't the
client class define its own error state and return that, ala:

class STLClient {
    public:
 ErrorState func(iterator& arg);
    };

What could you do with a simple null state, anyway?  Terminate?  At
that point you might as well throw an exception.

 --Paul J. Ste. Marie, pstemari@well.sf.ca.us, pstemari@erinet.com

The Financial Crimes Enforcement Network claims that they capture every
public posting that has their name ("FinCEN") in it.  I wish them good hunting.

Author: fenster@ground.cs.columbia.edu (Sam Fenster)
Date: 1995/04/08 Raw View

Perhaps Null iterators have a drawback as a technique, but not for any of
Paul's three reasons:

> > [Someone else wrote:]
> >   Can you give me a good reason not to define a singular value for an
> >   iterator?  Whether I choose to call it null seems immaterial.

> Paul Kinnucan <kinnucan@hq.ileaf.com> wrote:
> >1. Redundant.
> >You can't do anything with null-valued iterators that you can't
> >do just as easily, clearly, and economically without them.

I think the alternatives are less easy and less economical.  Perhaps they'd be
more clear, but also more verbose and klunky.

> >2. Dangerous.
> >Encourages careless programmers to think that they can safely incr/decr
> >isolated iterators, for example, as one post to this thread
> >suggested, to explore the region around an isolated iterator.

I don't see how it encourages that.  The kind of carelessness you mention
seems unrelated to Null iterators.

> >3. Complex.
> >Null iterators introduce additional (and needless) complexity
> >into the design and implementation of STL containers and applications.

Hardly any complexity at all, and it simplifies users' lives by allowing them
to signal invalidity.

However, Max's criticism makes more sense:

maxtal@Physics.usyd.edu.au (John Max Skaller) writes:
> 4) Inadequate.
> A null value to express "failure" fails to specify what _kind_ of
> failure. The correct way to do this is with discriminated union...

> Certainly a boolean is most commonly useful, but it lacks generality
> appropriate to a Standard Library.
>
> Requiring a returned iterator be valid is therefore a sensible cut off point
> -- the returned value can be immediately reused WITHOUT checking by another
> algorithm....

> IMHO it is a good thing STL does NOT try to make this decision for you.

Author: horstman@sjsumcs.sjsu.edu (Cay Horstmann)
Date: 1995/04/08 Raw View

Matthew Hannigan (matth@extro.ucc.su.OZ.AU) wrote:
: horstman@sjsumcs.sjsu.edu (Cay Horstmann) writes:
: > [ .. ]
: >It is weird that STL is very defensive in one regard (worst-case
: >running time of algorithms) and very rough-and-ready in another
: >(no testability of iterator state).
: > [ .. ]

: Surely that's because the user can do something about the latter
: but not usually about the former.  (without writing another
: implementation)

In theory, "the user" can write perfect programs and never make a pointer/
iterator bug. In theory, "the user" can always pick wonderful hash
functions. In practice, that doesn't seem to pan out, and pragmatically
speaking I'd put "user makes iterator error" into the same category as
"user chooses crappy hash function". And then I'd spend my time defending
against the iterator errors first because their impact on my program is more
dramatic. In contrast, STL purposefully chose not to include hashing
because people might choose crappy hash functions, and they do nothing to
prevent iterator mishandling. I don't think they are wrong, it just is
different from my own efficiency/safety tradeoff model.

Cay

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/04/07 Raw View

In article <3lp3ko$di5@calum.csclub.uwaterloo.ca>,
Ross Ridge <rridge@calum.csclub.uwaterloo.ca> wrote:
>
>STL already says that.  But I wouldn't slow index operations added to
>standard STL containers, I'd rather have the static checking that tells
>me that I'm violating an assumption.

 Sigh. And I'd rather have referential transparency.
Because I want to write algorithms that are correct and
operate whether or not the container supports the semantics
efficiently.

 So it seems I need to derive :

 template<class T, template<class> class Container>
 struct IndexedContainer: Container<T> {
  T& operator[](int) {
   list<T>::iterator i = begin();
   while(i--) i++;
   return i;
  }
 }

[which as written has the serious problem of being LESS efficient
for a container like vector]

My reason: I want to delay fixing implementation details like this
until I have sorted out the interface design.

This is _exactly_ what using abstract classes does for you --
you can use any working implementation for prototyping and
speed it up by specialisation afterwards (overriding virtual
functions).

In particular you may be wise to do some profiling to balance the
speed/memory tradeoff of your application sensibly.

In a GUI for example, in managing window lists does not matter
as much as blitting pixels around FAST. There's a lot more
pixels than windows :-)

However, in this case lists have a semantic advantage over
vectors -- iterators remain valid in lists after insertion or
deletion, but not in vectors. I still _need_ to index
the windows. It doesn't matter how long it takes, I have to
do it anyhow.

So -- the STL is correct because you can have what you want
and so can I. I just have some more work to do. :-)

--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: scc@reston.icl.com (Stephen Carlson)
Date: 1995/04/07 Raw View

In article <D6DwEJ.5M2@ucc.su.OZ.AU> maxtal@Physics.usyd.edu.au (John Max Skaller) writes:
>In article <D6482o.KtJ@reston.icl.com>,
>Stephen Carlson <scc@reston.icl.com> wrote:
>>In article <KINNUCAN.95Mar22150251@candide.hq.ileaf.com> kinnucan@hq.ileaf.com (Paul Kinnucan) writes:
>>>   If iterators are to be a generalization of pointers, shouldn't one of
>>>   their most important behaviors (the ability to be null) also be part
>>>   of the generalization?
>>>No.
>>Would you care to explain this answer?  Why should a "generalization of
>>pointers" not include an important property of pointers?
>
> Because that property is not useful for making the
>_algorithms_ of STl work.

Your answer explains why STL Iterators have been misbilled (in the
documentation no less).  There is no property about generic pointers
that is being generalized, yet one property, being NULL, has been
taken away.

IMHO, the proper way to bill STL Iterators is as a "generalization of
array pointers."  This simultaneously (a) shows what is being generalized
("array" to "Container") and (b) explains why there are no null iterators:
array pointers are simply not null.

This model, a generalization of array pointers, is a good one:

-  Those who know that it is unsafe to use a single array pointer to
   navigate an array without length/end information, use pairs of iterators.

-  Those who want the convenience and compactness of using a single array
   pointer to navigate an array, terminate the Container with a null value
   (i.e., NULL for a Container of pointers).

-  Those who want to indicate the existance and identity of an element,
   use a pointer.

I submit that most of the impetus for the null pointer position is due
to a desire to make iterators live up to its billing.  Unfortunately,
the billing does not explain the model.

Stephen Carlson
--
Stephen Carlson     :  Poetry speaks of aspirations,  : ICL, Inc.
scc@reston.icl.com  :  and songs chant the words.     : 11490 Commerce Park Dr.
(703) 648-3330      :                 Shujing 2:35    : Reston, VA  22091   USA

Author: eddy@clipper.robadome.com (eddy Gorsuch)
Date: 1995/04/07 Raw View

OK, I think I understand what you are trying to do.
You want your iterator to serve two purposes:
1. Be an iterator
2. Be an indication that some routine failed miserably.

I agree with John Max Skaller that you should really be returning 2
different results. Since you don't want to use exceptions here, could you
change your Dicts::look_up_word() to return a
pair<error_indicator, Dicts::iterator> instead of just an iterator? Or if
that is not acceptable, add a isValid() method to your iterator. Or even
define your iterators such that there you can assign them to NULL when you
want to indicate that they are invalid- as long as your code makes sure
never to pass a NULL iterator to any STL compliant algorithm there is
nothing wrong with this. There are many ways you can implement that test
without placing additional requirements on the C++ standard.

Nobody has said that your iterators must be limited to the interface
defined by STL. The STL specification defines a minimal set of interfaces
that guarentee interoperability between containers and algorithms. If you
follow the rule that you can only pass valid iterators to the algorithms,
you will only get valid iterators back. If you require that all iterators
support NULL, then you also need to say that all of the STL algorithms need
to check for this NULL value (which adds overhead, which will cause people
to not use these algorithms, which will make STL not as useful as it is).

I believe that the minimal interface requirements of the STL components is
one of the things that makes STL so wonderful. There is nothing preventing
anyone from adding on to the interface when special functions are needed
(which is where I'd classify your examples). In fact, STL uses this model
itself: Forward iterators only require the operations (==, !=, *, pre and
post ++). Bidirectional iterators add to this set the operations (pre and
post --). Random access iterators add even more required operations (+=, +,
-=, -, [], <, <=, >, >=). You have a requirement that your iterators have a
testable invalid state. None of the algorithms or containers that come with
STL require this, and I believe that adding the NULL value to the C++
standard is unnecessary.

Another problem is that STL imposes no restrictions on how iterators can be
implemented. An iterator could be a C style pointer, an enum, an integer, a
complex structure, or anything else. How do you create one value that can
generically be compared against any implementation of an iterator? (I don't
think that this problem can't be overcome, I just don't think that it needs
to be solved.)

eddy
--
ed.dy \'ed-e-\ n [ME (Sc dial.) ydy, prob. fr. ON itha; akin to OHG ith-
   again], L et and 1a: a current of water or air running contrary to the main
   current; esp)X : a small whirlpool 1b: a substance moving similarly  2: a
   contrary or circular current  - eddy vb

Author: eddy@clipper.robadome.com (eddy Gorsuch)
Date: 1995/04/07 Raw View

In article <FENSTER.95Apr4160955@ground.cs.columbia.edu>,
Sam Fenster <fenster@ground.cs.columbia.edu> wrote:
>> Sam Fenster <fenster@ground.cs.columbia.edu> wrote:
>>>
>>>   #include "dictionaries.h"   // Defines namespace Dicts.
>>>   int main () {
>>>   // Dicts is a module that accesses global containers and files that we
>>>   // can't see.  It is *not* a single container.  Its data is *not* public.
>>>
>>>   Dicts::iterator it1 = Dicts::look_up_word ("the"); // "The" starts with T
>>>   if (it1 == Dicts::Null) error();
>>>
>>>   Dicts::iterator it2 = Dicts::end_of_section (it1); // End of the T's
>>>   if (it2 == Dicts::Null) error();
>>>
>>>   for (Dicts::iterator i=it1; i!=it2; ++i)  output (*i); // "The"..."Tzar"
>>>   return 0;}
[...]
>>> The availability of Null would make the design much cleaner.
[...]
>The Dict routines *do* return a range within an STL container.  Dict, as a
>whole, manages many STL containers.  Dict doesn't have to *be* an STL
>container.  And it doesn't have to give direct access to any STL container
>beyond returning a pair of iterators.

OK, say that you really do need NULL. Are you allowed to pass this NULL
value back to Dicts? In your example, Dicts::end_of_section() takes as a
parameter an iterator it created (in Dicts::look_up_word()). Is
Dicts::end_of_section(Dicts::look_up_word("the")) valid in your model, or is
it required that every call to Dicts::look_up_word() be followed by a check?

By saying that interators can have a valid value that is NULL, you are sort
of saying that every function that takes an iterator needs to make sure the
value is not NULL before operating on that iterator. (There is another
thread in comp.std.c++ going on about the difference between references and
pointers where this comes up.) I don't see how making everyone check for
NULL makes the _system_ design any cleaner. It might make the design of
Dicts::look_up_word() easier, but it muddies up the design of other
components of the system.

If you _must_ have testable invalid iterators, I still think that a member
function of the iterator (something like it1.isValid()) makes the design
cleaner than comparing the iterator against NULL.

--
ed.dy \'ed-e-\ n [ME (Sc dial.) ydy, prob. fr. ON itha; akin to OHG ith-
   again], L et and 1a: a current of water or air running contrary to the main
   current; esp)X : a small whirlpool 1b: a substance moving similarly  2: a
   contrary or circular current  - eddy vb

Author: rridge@calum.csclub.uwaterloo.ca (Ross Ridge)
Date: 1995/04/03 Raw View

John Max Skaller <maxtal@Physics.usyd.edu.au> wrote:
> But the problem is that well defined operations you may
>need cannot always be linear or meet STL requirements --
>and then you find NO standard container that meets your needs.
>You may well be quite happy with a list with slow indexing,
>the problem is that you have to WRITE the extra code yourself.
>
> It would be easy enough to say: "The performance of
>this algorithm is logarithmic provided the indexing operation
>is constant time" -- and provide the slower indexing operation
>anyhow.

STL already says that.  But I wouldn't slow index operations added to
standard STL containers, I'd rather have the static checking that tells
me that I'm violating an assumption.

      Ross Ridge

--
 l/  //   Ross Ridge -- The Great HTMU, Ook                    +1 519 883 4329
[oo][oo]  rridge@csclub.uwaterloo.ca      http://csclub.uwaterloo.ca/u/rridge/
-()-/()/
 db  //

Author: pete@borland.com (Pete Becker)
Date: 1995/04/03 Raw View

In article <D6BME0.Ezr@cdf.toronto.edu>, g2devi@cdf.toronto.edu (Robert N. Deviasse) says:
>
>In article <3lfjcl$qvt@druid.borland.com>,
>Pete Becker <pete@borland.com> wrote:
>>In article <D688s0.60A@cdf.toronto.edu>, g2devi@cdf.toronto.edu (Robert N. Deviasse) says:
>>>
>>>BTW, one of the reason people wanted to have a null value is to be able to
>>>check if an iterator has been initialized or made invalid, assuming that
>>>the programmer follows defensive a programming style. For example:
>>>
>>>      Container c;
>>>      Iter i=Null<Iter>();
>>>      ... // (*)
>>>      // we assume that 'i' has been set to a value in 'c'
>>>      assert(i!=Null<Iter>());
>>>      ... // (**)
>>>      invalidate(c);           // append or some other operation
>>>      DEBUG(i=Null<Iter>());   // used only for defensive programming
>>>      ...
>>>
>>>I can't see how we can emulate this with the end() iterator.
>>
>>       No, I don't see offhand how to do it, either. Again, I'll ask: why
>>do you want to do this? There are much safer ways of programming this
>>sort of thing. Don't create the iterator until you are ready to initialize
>>it.
>
>Unfortunately this isn't always possible. Consider this the following:
>      Container c;
>      Iter i=Null<Iter>();
>      if (condition1()){
>         ... set i wrt c appropriately
>      }else if (condition2()){
>         ... set i wrt c appropriately
>      } else {
>         ... set i wrt c appropriately
>      }
>
>      // As a sanity check I want to ensure that i has been initialized.
>      // This also provides *executable* documentation on my expectations at this point.
>      assert(i!=Null<Iter>());
>
>The problem is that sometimes the calculation of an initialization value is
>conditional upon the state of the system at the time of initialization. Given the
>nature of conditionals, how can you *practically* get around this problem?
>

Call a function. Use its result to initialize the iterator.

 Iter Initialize( Container& c )
 {
       if (condition1()){
         ... return iterator set wrt c appropriately
       }else if (condition2()){
         ... return iterator set wrt c appropriately
       } else {
         ... return iterator set wrt c appropriately
       }
 Container c;
       Iter i=Initialize(c);

>> Make sure it goes out of scope when it becomes invalid.
>
>Is this always possible? Remember that if an iterator can become invalid when
>Consider this example:    (Note DEBUG(x) is a macro that expands to x)
>
>    Container c;
>    ...
>    Iter i=c.start();
>    ...
>    Container d;
>    ...
>    Iter j=c.start();
>    do_something_that_changes_a_Container_structure(c);   // now i is invalid
>    DEBUG(i=Null<Iter>());                                // document that i is invalid


"document that i is invalid"? Sounds like a comment. I don't see the point of this code.
It seems at best circular: we need null iterators so that I can write code that uses
null iterators.
>
>At this point, i is invalid, but j is still needed, so we can't just end the scope
>here.
>

 Well, yes, it is certainly possible to write code snippets that appear to
make it hard add blocks that make things out of scope. Rather than respond with a trivial
re-ordering of these statements which makes it simple to add a block, let me repeat my
usual question: can someone provide a fairly complete example that requires this?

>> I don't think that's really the case: it certainly hasn't worked
>>out that way with pointers.
>
>I don't follow. You mean that you've never seen or used assert statements to ensure
>that your pointers are non-null?
>
>> What is behind this urge to import all the
>>known dangers of null pointers into STL?
>
>Dangers? How can null pointers be any more dangerous than an out of boundary pointer
>(which is the alternative)? Neither of these can be safely dereferenced. At least
>with null pointers you can *test* if it is dereferenceable. How is this more dangerous?

Precisely because it encourages programmers to test for validity rather than assure
validity. Write the code so that it is impossible (or very difficult) to dereference a
null pointer. Then you won't have to test for them.
 -- Pete

Author: ruiter@ruls41.LeidenUniv.nl (Jan-Peter de Ruiter)
Date: 1995/04/03 Raw View

Tony Cook (tony@online.tmx.com.au) wrote:

: a) is covered well by the way you should be using STL - using start
: and end iterators.  I've never seen a pointer incremented to null
: and I don't really believe it should be.

Well, OK, maybe it shouldn't be, but in C this is or course frequently
happening using strings.

: b) has a stronger root in the common "C" pattern of return null to
: indicate an error (malloc and fopen for example), but this has been
: overshadowed in C++ by exceptions.  Is there some reason you can't
: use exceptions for this?  If you can't have you considered, that if
: you need such a validity test in your own iterators, that you can
: use some sort of test member function, like good() in IOStreams?

I agree with this in the sense that I would like to be able to test
(e.g. using good() ) the validity of an iterator _before_ any exception
is thrown. Whether that is by using a null value or some test function
doesn't matter to me. Our own library does both, and it really is handy
to be able to test validity, especially during debugging.

Greetings,

JP

Author: fenster@ground.cs.columbia.edu (Sam Fenster)
Date: 1995/04/03 Raw View

> fenster@ground.cs.columbia.edu (Sam Fenster) writes:
> |> You compare an iterator against an end iterator when it's
> |> traversing/searching a pre-existing range.  Pre-existing ranges *have*
> |> end iterators.  When someone hands you an iterator that helps *define* a
> |> range, it's serving a different purpose.  There may be no other range in
> |> sight to compare it to.  Then you need Null as a standard way to signal
> |> invalidity.

swf@elsegundoca.ncr.com (Stan Friesen) writes:
> You missed it again!
>
> The way to signal this is to use a begin-end pair where the begin iterator
> tests equal to the end iterator.  This represents an empty range, since
> the end iterator points "one past the end".

You missed it again!

An empty range can be a perfectly valid range.  It is not the same as an error
condition.  As I've pointed out several times already.

Author: eddy@clipper.robadome.com (eddy Gorsuch)
Date: 1995/04/04 Raw View

In article <3lk13b$7cb@druid.borland.com>,
Pete Becker <pete@borland.com> wrote:
>>        OK, now consider the code fragment
>>
>>        my_container<int>       c;
>>        my_container<int>::iterator     it = c.begin();
>>        c.insert(5);
>>        // does *it == 5 ? Of course not!
>>        // ++it will also be undefined.
>>
>
>Yes. That is true for all STL containers. Don't save off iterators then
>modify the container. Grab the iterators when they are needed. It can be
>very expensive to write iterators that "know" when their container has
>been modified.
[...]
>Seems to me that mentioning NULL pointers doesn't help your case. NULL
>pointers cause many programming problems. It is not at all clear to me
>that they solve more problems than they cause.

I could see NULL being useful to indicate that the container under the
iterator has changed, which invalidates the iterator. But this isn't
practical with STL (as Pete points out, it can get very expensive). There
is nothing stopping anyone from writing STL compliant containers/iterators
that can have the iterator "know" when the container has changed (and set
the iterator to NULL), but I'd prefer to see this as an extension to STL,
rather than an additional requirement on the current STL. The current STL
makes a nice base from which to work.

If you need NULL iterators, build them on top of STL. Just remember that
passing a NULL iterator to one of the STL algorithms will have undefined
behavior (but then passing any "singular" iterator has the same effect).

eddy

--
ed.dy \'ed-e-\ n [ME (Sc dial.) ydy, prob. fr. ON itha; akin to OHG ith-
   again], L et and 1a: a current of water or air running contrary to the main
   current; esp)X : a small whirlpool 1b: a substance moving similarly  2: a
   contrary or circular current  - eddy vb

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/04/04 Raw View

In article <KINNUCAN.95Mar31170947@candide.hq.ileaf.com>,
Paul Kinnucan <kinnucan@hq.ileaf.com> wrote:
>
>   Can you give me a good reason not to define a singular value for an
>   iterator?  Whether I choose to call it null seems immaterial.
>
>He already has.  In fact, three good reasons have been stated by
>Pete and others in this thread:
>
>1. Redundant.
>
>You can't do anything with null-valued iterators that you can't
>do just as easily, clearly, and economically without them.
>
>2. Dangerous.
>
>Encourages careless programmers to think that they can safely incr/decr
>isolated iterators, for example, as one post to this thread
>suggested, to explore the region around an isolated iterator.
>
>3. Complex.
>
>Null iterators introduce additional (and needless) complexity
>into the design and implementation of STL containers and applications.

 Let me add:

4) Inadequate.

A null value to express "failure" fails to specify what _kind_
of failure. The correct way to do this is with discriminated union:

 enum {success, notfound, emptyrange, duplicatesfound,
  illformedquery} ..

that is, adding a mere "boolean" flag to an iterator is refusing
to recognize that the return value of a function might
be a valid iterator OR any other state information.

Certainly a boolean is most commonly useful, but it lacks
generality appropriate to a Standard Library.

Requiring a returned iterator be valid is therefore a sensible
cut off point -- the returned value can be immediately reused
WITHOUT checking by another algorithm.

If this is not enough it is up to you to return your
own kind of data structure and test is before the iterator
is used -- OR simply throw an exception.

Which choice you make depends on whether you consider the kind
of "failure" involved an error or a valid return. (Throw
exceptions in the former case, and a discriminated union in the
latter)

IMHO it is a good thing STL does NOT try to make this decision for you.

--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/04/04 Raw View

In article <3lop14$92e@vbohub.vbo.dec.com>,
Ian Johnston <johnston@caiman.enet.dec.com> wrote:
>>
>> But it is my guess there needs to be a layer
>>on top of STL that provides referential transparency
>>_irrespective_ of performance. Because in 90% of code,
>>performance just doesn't matter. Correctness matters
>>in 100% of code :-)
>>
>
>Agreed 110%. (:-))
>
>But seriously, this is absolutely right. I still think STL is fine for
>experts, but will prove less than robust in the hands of less expert
>programmers.
>
>That's a shame.
>
>Yes, I can implement something on top for those less expert programmers.
>
>But then, it would be nice not to have to.

 Yes, but if you think about it it is necessary. In the
next Standard we might standardise such a set of application
layers. But it is _surely_ way too soon to do that now.

 I personally do not yet know exactly what the "best"
way to extend and wrap STL is. I'm still doing basic stuff
like

 a) using the supplied containers

 b) adding my own containers and iterators

 c) fiddling to get my compiler to work with STL at all

 d) doing lots of thinking about extending the
    STL Standard -- ie the protocol -- in various
    ways. For example -- extension to partial
    orderings is obvious. Extension to several
    dimensions is necessary for graphics.

 So what I'm saying is that while I agree with you,
there is no magic. I think STL is pitched at _exactly_ the
right level for this point in time and for this version
of the Standard.

 I'm finding more deficiencies in the C++ language
by using STL than in STL itself. For example, there is NO WAY
to declare a variable of the type of a member of a parameter:

 template<class T> void f(T t) {
  _Typeof<t.m> x = t.m;
 };

I can do this in Metaware HighC/C++ and am doing it. You can use
'typeof()' in GNU. The compiler knows the type of t.m and it can
do overloading on it but you can't make a variable??

In fact what we need is:

 define x = expr;

which means

 typeof(expr) x = expr;

 My point -- the C++ language is far more deficient itself
than STL. STL was designed. C++ was grown, pruned, and hacked.
--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: pstemari@erinet.com (Paul J. Ste. Marie)
Date: 1995/04/04 Raw View

In article <3loejf$sv7@highway.LeidenUniv.nl>,
   ruiter@ruls41.LeidenUniv.nl (Jan-Peter de Ruiter) wrote:

:Tony Cook (tony@online.tmx.com.au) wrote:
:
:: a) is covered well by the way you should be using STL - using
:: start and end iterators.  I've never seen a pointer incremented
:: to null and I don't really believe it should be.
:
:Well, OK, maybe it shouldn't be, but in C this is or course
:frequently happening using strings.

No, the pointer didn't increment to null, *pointer became null.
This idiom requires an endmarker in the container.  Null is
convienent if the container contains characters or pointers, but is
not so convienent if, for example, it contains floats.

:: b) has a stronger root in the common "C" pattern of return null
:: to indicate an error (malloc and fopen for example), but this
:: has been overshadowed in C++ by exceptions.  Is there some
:: reason you can't use exceptions for this?  If you can't have you
:: considered, that if you need such a validity test in your own
:: iterators, that you can use some sort of test member function,
:: like good() in IOStreams?

:I agree with this in the sense that I would like to be able to
:test (e.g. using good() ) the validity of an iterator _before_ any
:exception is thrown. Whether that is by using a null value or some
:test function doesn't matter to me. Our own library does both, and
:it really is handy to be able to test validity, especially during
:debugging.

The headache with this style of programming is that every line of
code winds up being an if statement.  I'd rather have containers
that were guaranteed to give back a valid iterator than ones which
decided whether or not to hand me back a usable iterator depending
on the phase of the moon or the drizzle in Redmond.

Think about it. A container isn't interfacing with a user or
(in most cases) doing I/O.  All it does is hold the data you put
into it.  It should never have to hand back a bad iterator, and
from a container client perspective, the code is cleaner if flags
indicating various odd states on the part of the client are
seperate and not conflating into the iterator, which could care
less.

 --Paul J. Ste. Marie, pstemari@well.sf.ca.us, pstemari@erinet.com

The Financial Crimes Enforcement Network claims that they capture every
public posting that has their name ("FinCEN") in it.  I wish them good hunting.

Author: pstemari@erinet.com (Paul J. Ste. Marie)
Date: 1995/04/04 Raw View

In article <3lp2u3$f7u@druid.borland.com>,
   pete@borland.com (Pete Becker) wrote:
[...]
:Precisely because it encourages programmers to test for validity
:rather than assure validity. Write the code so that it is
:impossible (or very difficult) to dereference a null pointer. Then
:you won't have to test for them.

Exactly.  Deal with the problem when it occurs, and you won't need
to replicate the same error-handling code in every last module of
your program.

 --Paul J. Ste. Marie, pstemari@well.sf.ca.us, pstemari@erinet.com

The Financial Crimes Enforcement Network claims that they capture every
public posting that has their name ("FinCEN") in it.  I wish them good hunting.

Author: maxtal@Physics.usyd.edu.au (John Max Skaller)
Date: 1995/04/04 Raw View

In article <ncmD6EIC8.Cvy@netcom.com>, Nathan Myers <ncm@netcom.com> wrote:
>
>You can make operator!() start up a Doom session if you like.

 OH! We definitely SHOULD Standardise that. I'll write
a proposal immediately.

--
        JOHN (MAX) SKALLER,         INTERNET:maxtal@suphys.physics.su.oz.au
 Maxtal Pty Ltd,
        81A Glebe Point Rd, GLEBE   Mem: SA IT/9/22,SC22/WG21
        NSW 2037, AUSTRALIA     Phone: 61-2-566-2189

Author: fenster@ground.cs.columbia.edu (Sam Fenster)
Date: 1995/04/04 Raw View

> Sam Fenster <fenster@ground.cs.columbia.edu> wrote:
>>
>>   #include "dictionaries.h"   // Defines namespace Dicts.
>>   int main () {
>>   // Dicts is a module that accesses global containers and files that we
>>   // can't see.  It is *not* a single container.  Its data is *not* public.
>>
>>   Dicts::iterator it1 = Dicts::look_up_word ("the"); // "The" starts with T
>>   if (it1 == Dicts::Null) error();
>>
>>   Dicts::iterator it2 = Dicts::end_of_section (it1); // End of the T's
>>   if (it2 == Dicts::Null) error();
>>
>>   for (Dicts::iterator i=it1; i!=it2; ++i)  output (*i); // "The"..."Tzar"
>>   return 0;}

eddy@clipper.robadome.com (eddy Gorsuch) writes:
> Q1: If I change the second call to:
>     Dicts::iterator it2 = Dicts::look_up_word ("zoo");
>     is it2 reachable from it1? (That is, is
>        while (it1 != it2) {++it1;};
>     guarenteed to stop?

No.  The words may be found in different dictionaries.

>     If so, then your dictionary is "logically" one container (even if it
>     might be implemented as many containers).

> Q2: OK, if you still don't think the dictionary is one container, how is
>     your use of Dict::Null any different than using Dict::end() in your
>     above example. The current STL definition uses container::end() the
>     exact same way that you are using Dict::Null in the above code.

No.  `Dicts' is not a single container.  It's just a namespace in which all
the names in a module reside.  The operations in it give access to any number
of privately held containers.  If I were to define end(), it would not be the
end of any particular container.  Note three points here: (1) The semantics of
the interface displayed above do not require knowledge of any larger container
than what is defined by the returned bounds.  (2) If end() were defined, it
could not actually be the end of any single container.  (3) Null is not being
used to indicate the end of anything.  It's an invalid iterator, being used to
indicate a failed operation.

>     (i.e. STL uses the end of the container to indicate an "invalid"
>     iterator...

There is no "*the* container."  There is no "end of *the* container."

>> If someone hands me two iterators, it is useful to be able to check them
>> for validity.  Do you claim that in every such situation, there will be a
>> third iterator somewhere in my environment, the end of some larger range?
>
> If you get 2 STL iterators as a range and the iterators are equal, then
> there are no items in the "container" that is returned. This means that the
> iterators are valid (they point into some container), but that there is no
> valid data to be returned. Why do you need a third iterator?

I don't.  I need Null.  If the iterators are equal, this does not tell me that
the operation failed.  It tells me that the operation succeeded, and the
resulting range exists in a particular container, and it is empty.  If I never
initialized Dict, or if it could not find the dictionaries on disk, or if it
couldn't find a range in any of the dictionaries that satisfied the specified
criteria, I need to return an invalid iterator.  Not an empty range in a
particular container.

Since people are making me repeat myself, I figure I'll repeat the following
as well.  I'm interested in people's thoughts, and no one's addressed it:

> Pete Becker <pete@borland.com> writes:
>> Use exceptions to indicate errors.

Sam Fenster <fenster@ground.cs.columbia.edu> wrote:
> Unfortunately, there is no way to stop exceptions from causing program
> termination, so they should be avoided, except for extreme errors.  If your
> exception-throwing function ever gets called by a destructor during stack
> unwinding for an unrelated exception, for instance, your program will
> terminate.  Or if the copy constructor of the thrown object throws an
> unrelated exception....The more unrelated subsystems use exceptions for
> non-extreme errors, the more likely your program is to terminate
> unexpectedly.

If it were not for these drawbacks in the C++ definition of exceptions, I
would indeed advocate exceptions as *by far* the best way of signaling errors.
There would then be no need for return values indicating invalidity.  There
might still be some need for object states indicating invalidity.  I'm sorely
disappointed that exceptions can't be used safely.  But even if they could,
they are not the only style of programming, or error handling, out there.
(In C++, they don't even have any established practice yet!)

Back to our regularly scheduled programming:

> Sam Fenster <fenster@ground.cs.columbia.edu> wrote:
>> Modeling my collection of dictionaries as a single large container doesn't
>> have any desirable semantics for me!  Why impose an ordering between the
>> last word in one dictionary and the first word in some other dictionary?
>>
>> The availability of Null would make the design much cleaner.

eddy@clipper.robadome.com (eddy Gorsuch) writes:
> But STL is a library of containers (and things you can do with those
> containers). If your programming model doesn't fit this (your dictionary is
> not a container, then it doesn't fit the STL model, and you should not be
> using the STL semantics if they don't fit your model).

The Dict routines *do* return a range within an STL container.  Dict, as a
whole, manages many STL containers.  Dict doesn't have to *be* an STL
container.  And it doesn't have to give direct access to any STL container
beyond returning a pair of iterators.

(See the end of this post for a repost of another example -- a database
query.)

> The example you posted above does (as far as I can tell) fall into the STL
> model. Let me change it slightly:
>
>    Dicts::iterator it1 = Dicts::look_up_word ("the"); // "The" starts with T
>    if (it1 == Dicts::end_of_section(it1)) error();
>    // or maybe: if (it1 == Dicts::end_of_section("the")) error();
>
> it1 is a single value. I can check whether it is a valid item in the
> container of T's by checking it against the end of section of T's.

it1 can only be used to identify a particular container (and its end) if
look_up_word() succeeds.  look_up_word ("#%&*") will not be able to find any
particular container with that string in it.  Should I make a fake container
just so all failed operations can point at its end?  That's a pretty ugly
design.  Why don't I just make up a single fake iterator value instead?  One
that doesn't point into a particular container?  Called, say, `Null'?

> >    Dicts::iterator it2 = Dicts::end_of_section (it1); // End of the T's
> >    if (it2 == Dicts::Null) error();
>
> How can your end_of_section return an invalid iterator? Remember that STL
> iterators point into a container, and possibly one place past the end of the
> container. (i.e. your Dicts::end_of_section() for T's should not return
> "Tzar", but "Tzar" + 1.)

Yes, that's what it would do to indicate the range ["the".."tzar"].

end_of_section(it1) might fail if Dicts wasn't properly initialized, or if its
argument pointed at invalid data.  It would then not be able to return an
iterator pointing to the end of any particular container.  In fact, if it1
were in the `Z' section, it would return the end iterator of a particular
dictionary (or section -- it's all relative) on *success*.  How do I indicate
failure?  Null would be most welcome here.

As a postscript, here's a repost of another example in which Null presents a
solution where an end iterator is not applicable.  Keep in mind the points
I've already made -- (1) A valid, empty returned range is not the same as an
error, and (2) interfaces which return iterator pairs do not need to be
containers themselves:

Sam Fenster <fenster@ground.cs.columbia.edu> wrote:
> I pass a function a database query string.  It passes me back an iterator
> pair which lets me traverse (or search) the list of query results:
>
> #include "db.h"
> int main ()
> {
>    DB_Handle h = db_open("personnel");  // A database of tables
>    if (!h) {error("Couldn't open database"); exit(1);}
>
>    string query;
>    cin.getline(query);  // "select name from employee where salary > 50000"
>
>    DB_iterator_pair p = db_query(h, query);
>    if (p.i1 == Null<DB_iterator>())  {error("Invalid query"); exit(2);}
>
>    for (DB_iterator i=p.i1; i!=p.i2; ++i)  cout << *i << endl;
>    return 0;
> }

Author: fenster@ground.cs.columbia.edu (Sam Fenster)
Date: 1995/04/05 Raw View

> :Tony Cook (tony@online.tmx.com.au) wrote:
> :: b) has a stronger root in the common "C" pattern of return null to
> :: indicate an error (malloc and fopen for example), but this has been
> :: overshadowed in C++ by exceptions.

Not yet, it hasn't.

> :: Is there some reason you can't use exceptions for this?

In a different thread, Sam Fenster <fenster@ground.cs.columbia.edu> wrote
something relevant here:
> Unfortunately, there is no way to stop exceptions from causing program
> termination, so they should be avoided, except for extreme errors.  If any
> exception-throwing function ever gets called by a destructor during stack
> unwinding for an unrelated exception, for instance, your program will
> terminate.  Or if the copy constructor of the thrown object throws an
> unrelated exception.  The more unrelated subsystems use exceptions for
> non-extreme errors, the more likely your program is to terminate
> unexpectedly.

> If it were not for these drawbacks in the C++ definition of exceptions, I
> would indeed advocate exceptions as *by far* the best way of signaling
> errors.  There would then be no need for return values indicating
> invalidity.  There might still be some need for object states indicating
> invalidity.  I'm sorely disappointed that exceptions can't be used safely.
> But even if they could, they are not the only style of programming, or error
> handling, out there.  (In C++, they don't even have any established practice
> yet!)

> :: If you can't [use exceptions,] have you considered, that if you need such
> :: a validity test in your own iterators, that you can use some sort of test
> :: member function, like good() in IOStreams?

pstemari@erinet.com (Paul J. Ste. Marie) writes:
> The headache with this style of programming is that every line of code winds
> up being an if statement.  I'd rather have containers that were guaranteed
> to give back a valid iterator than ones which decided whether or not to hand
> me back a usable iterator depending on the phase of the moon or the drizzle
> in Redmond.
>
> Think about it. A container isn't interfacing with a user or (in most cases)
> doing I/O.  All it does is hold the data you put into it.  It should never
> have to hand back a bad iterator,

I agree.  An STL container should always hand back an iterator guaranteed to
point to valid data, or to an end iterator.  It shouldn't require error
checking.  But it would be useful if *other* functions could return Null
iterators, and if iterators could be initialized to Null.

This is similar to how the address of an array element is guaranteed non-null.
No error checking is required.  But a function may return null, and a pointer
may be initialized to null.

> and from a container client perspective, the code is cleaner if flags
> indicating various odd states on the part of the client are seperate and not
> conflating into the iterator, which could care less.

It's often good design to let an object have an `invalid' state.  I agree that
STL containers should not return invalid iterators.  But functions which are
not part of the STL, and have unrelated or expanded functionality, can hand
back iterators defining an STL range.  (This is similar to how a function that
does something unrelated to string handling can return a string!)  Null would
be a particularly compact and convenient way to indicate the function's
failure.

Like `bool', it would be useful if Null were standard, because its use would
be common, and you don't want every library incompatibly defining its own Null
template.

Author: mpl@pegasus.bl-els.att.com (-Michael P. Lindner)
Date: 1995/04/05 Raw View

In article <3lk13b$7cb@druid.borland.com>,
Pete Becker <pete@borland.com> wrote:
>In article <D6BL0K.1w8@nntpa.cb.att.com>, mpl@pegasus.bl-els.att.com (-Michael P. Lindner) says:
>>
>>I promised myself I wasn't going to post any more followups to this
>>thread, but here goes.
>
> Don't quit now! You've posted an example that shows what you're
>trying to do. That makes it a lot easier to understand than simply
>describing it in words.

The reason I promised myself this was because this thread (more like a
knot by now :^) has lost all sense of a logical, calm discussion of
technical merit.  It has degraded into name calling and religious debate.
I feel ashamed that my posts have contributed to the resulting
discussions.

>This problem is not unique to iterators. It can apply to any object that
>you create before you use it. In general, the way I handle this is to
>intialize something with a function call. So, in a perhaps overly simple
>case:

 ... example deleted ...

Sorry, not applicable.  The iterator data member exists to remember the
state of the iterator, so you can't just set it from a function every
time.

>>        my_container<int>       c;
>>        my_container<int>::iterator     it = c.begin();
>>        c.insert(5);
>>        // does *it == 5 ? Of course not!
>>        // ++it will also be undefined.
>
>Yes. That is true for all STL containers. Don't save off iterators then
>modify the container. Grab the iterators when they are needed.

No, that's not true for all STL containers.  To quote from "The Standard
Template Library" by Stepanov and Lee, February 7, 1995 (caps for
emphasis are mine):

Vector:
 "insert causes reallocation if the new size is greater than the
 old capacity. If no reallocation happens ALL THE ITERATORS AND
 REFERENCES BEFORE THE INSERTION POINT REMAIN VALID."

List:
 "INSERT DOES NOT AFFECT THE VALIDITY OF ITERATORS AND
 REFERENCES."

Deque:
 "insert and push invalidate all the iterators and references to
 the deque."

Associative containers:
 not specified in the document, but in practice, INSERT DOES NOT
 AFFECT THE VALIDITY OF IOTERATORS AND REFERENCES.

--
Mike Lindner
mikel@attmail.com
mpl@cmprime.attpls.com
mpl@pegasus.att.com

Author: Duncan@rcp.co.uk (Duncan Booth)
Date: 1995/04/05 Raw View

In article <FENSTER.95Apr4160955@ground.cs.columbia.edu>,
fenster@ground.cs.columbia.edu (Sam Fenster) wrote:
... a long message about hist Dict iterator example including:
> I don't.  I need Null.  If the iterators are equal, this does not tell me that
> the operation failed.  It tells me that the operation succeeded, and the
> resulting range exists in a particular container, and it is empty.

Your Dict class does not appear to be a template. It appears to be a
specific class returning a specific type that is an STL iterator. You
are free to define NULL for that type if you wish.

If, in the general case, you define a template Dict<T> class then you
are free to require that iterator<T> has a NULL value defined. No
changes in STL are required to do this. You are free to define a
class that only works with a subset of iterators and iterators with
NULL defined are a genuine (and arguably useful) subset of iterators.

I think the problem is that the STL is a very general collection of
algorithms and conventions and to add something like NULL restricts
the application of STL without adding anything to STL itself. You are
free to define NULL for any iterator class you create, it is only the
abstract concept of an iterator that lacks a NULL value.

Is there any good reason to define an abstract iterator_with_null?
STL itself does not need it, but you and several other people would
obviously like one. If everyone is going to reinvent it then perhaps
it should be standardised. Ideally if your iterator is a class it
should define the null value within the class 'iter.null()', but this
syntax is not pointer compatible.

Reserving a special value such as 0 would be a pain. The best way I
can think of would be to require that for each iterator type ITER
there should be a function 'nulliterator<ITER>()' that returns the
null value. Since this is needed only for your subset of iterators it
should be part of your library documentation (put it in a namespace?)
and not part of the standard.

--
Duncan Booth                                             duncan@rcp.co.uk
int month(char *p){return(124864/((p[0]+p[1]-p[2]&0x1f)+1)%12)["\5\x8\3"
"\6\7\xb\1\x9\xa\2\0\4"];} // Who said my code was obscure?
           A little inaccuracy sometimes saves tons of explanation.

Author: Duncan@rcp.co.uk (Duncan Booth)
Date: 1995/04/05 Raw View

In article <D6J4Dz.B7x@ucc.su.OZ.AU>,
maxtal@Physics.usyd.edu.au (John Max Skaller) wrote:
>
>  I'm finding more deficiencies in the C++ language
> by using STL than in STL itself. For example, there is NO WAY
> to declare a variable of the type of a member of a parameter:
>
>  template<class T> void f(T t) {
>   _Typeof<t.m> x = t.m;
>  };
>
> I can do this in Metaware HighC/C++ and am doing it. You can use
> 'typeof()' in GNU. The compiler knows the type of t.m and it can
> do overloading on it but you can't make a variable??
>
> In fact what we need is:
>
>  define x = expr;
>
> which means
>
>  typeof(expr) x = expr;
>
Now don't get me wrong. I completely agree with you about needing
something like typeof() or your define keyword, but can't you do
something like the tricks STL plays to 'fix' your example?

I haven't tried this, and I rather suspect my compiler might crash,
but:

template<class T, class M> void f2(T t, M) {
        M x = t.m;
}

template<class T> void f(T t) { f2(t, t.m) }



>  My point -- the C++ language is far more deficient itself
> than STL. STL was designed. C++ was grown, pruned, and hacked.

And regular pruning ensures a good bushy growth :-)

--
Duncan Booth                                             duncan@rcp.co.uk
int month(char *p){return(124864/((p[0]+p[1]-p[2]&0x1f)+1)%12)["\5\x8\3"
"\6\7\xb\1\x9\xa\2\0\4"];} // Who said my code was obscure?
              Are you still here?  The message is over.  Go away!

Author: rridge@calum.csclub.uwaterloo.ca (Ross Ridge)
Date: 1995/04/05 Raw View

Jan-Peter de Ruiter <ruiter@ruls41.LeidenUniv.nl> wrote:
>However, this debate tends to become almost as silly as the hashtable
>debate, and it will probably end the same way too.

Why?  Because neither proposal will become part of the standard?

>What I find sad is that I have the strong suspicion that the opponents
>of hash tables and null valued iterators are not open minded about it.

Someone other than me opposed hash tables being added to the standard?
As near as I can tell I was only one who did.  Are you really so niave
as to think that absolutely everyone would agree with you?  I'm
surprised you even think my opinion matters so much.

>They cling to STL as if any change in it will mean public humiliation
>for the people who designed and supported STL.

I haven't even made up my mind if I like STL yet.

>That attitude is not very productive.

Neither are speculative attacks, but hey, don't worry, I don't
expect a lot productivity on Usenet.

       Ross Ridge

--
 l/  //   Ross Ridge -- The Great HTMU, Ook                    +1 519 883 4329
[oo][oo]  rridge@csclub.uwaterloo.ca      http://csclub.uwaterloo.ca/u/rridge/
-()-/()/
 db  //

Author: g2devi@cdf.toronto.edu (Robert N. Deviasse)
Date: 1995/04/05 Raw View

In article <3lp2u3$f7u@druid.borland.com>, pete@borland.com (Pete Becker) writes:
> In article <D6BME0.Ezr@cdf.toronto.edu>, g2devi@cdf.toronto.edu (Robert N. Deviasse) says:
> >
> >In article <3lfjcl$qvt@druid.borland.com>,
> >Pete Becker <pete@borland.com> wrote:
> >> What is behind this urge to import all the
> >>known dangers of null pointers into STL?
> >
> >Dangers? How can null pointers be any more dangerous than an out of boundary pointer
> >(which is the alternative)? Neither of these can be safely dereferenced. At least
> >with null pointers you can *test* if it is dereferenceable. How is this more dangerous?
>
> Precisely because it encourages programmers to test for validity rather than assure
> validity. Write the code so that it is impossible (or very difficult) to dereference a
> null pointer. Then you won't have to test for them.

I'm curious. What is your opinion of exceptions in C++? A great many languages
don't need them, and since you appear to believe that prevention is *always*
preferable to being able to deal with recovery. Surely exceptions encourage
programmers to write sloppy code that can generate exceptions rather than
just prevent the error from ever occuring.

IMO, the problem is that we have to live with tradeoffs. When you redesign
your code to prevent certain types of errors instead of checking for them,
you change, among other things, the readability and maintainability of the
code. Quite often, you can increase the total quality of the code and all
is good. Often you make tradeoffs, but their small and it's more than worth
it. There are times however when it's not worth the tradeoffs. This is the
case with null iterators as well as exceptions. And why all the fuss about
testing for validity. You *already* have to. You can't dereference an
iterator to container.end(). Sure you can (IMO always) avoid the problem
(just provide a function that counts the number of elements in the data
structure and never have the iterator incremented more than that number
of times), but is it always worth it?

John recognized the problem and provided, IMO, a better solution than null
iterators. I concede that using exceptions is better than null iterators.
They are self-monitoring (i.e. they will be checked even if I forget to or I
can't because the code is in some library code), they can say more about
an error than just say that it's invalid, and they can be specific to the
iterator in question. Given this, I don't see a reason for null iterators.

>  -- Pete
>
>
>

Take care
    Robert
--
/----------------------------------+------------------------------------------\
| Robert N. Deviasse               |"If we have to re-invent the wheel,       |
| EMAIL: g2devi@cdf.utoronto.ca    |  can we at least make it round this time"|
+----------------------------------+------------------------------------------/

Author: matth@extro.ucc.su.OZ.AU (Matthew Hannigan)
Date: 1995/04/06 Raw View

horstman@sjsumcs.sjsu.edu (Cay Horstmann) writes:
> [ .. ]
>It is weird that STL is very defensive in one regard (worst-case
>running time of algorithms) and very rough-and-ready in another
>(no testability of iterator state).
> [ .. ]

Surely that's because the user can do something about the latter
but not usually about the former.  (without writing another
implementation)

--
 -Matt Hannigan

Author: pete@borland.com (Pete Becker)
Date: 1995/04/06 Raw View

In article <D6Ks8J.Eo8@nntpa.cb.att.com>, mpl@pegasus.bl-els.att.com (-Michael P. Lindner) says:
>
>In article <3lk13b$7cb@druid.borland.com>,
>Pete Becker <pete@borland.com> wrote:
>>This problem is not unique to iterators. It can apply to any object that
>>you create before you use it. In general, the way I handle this is to
>>intialize something with a function call. So, in a perhaps overly simple
>>case:
>
>        ... example deleted ...
>
>Sorry, not applicable.  The iterator data member exists to remember the
>state of the iterator, so you can't just set it from a function every
>time.
>

I don't understand. Wasn't the example about initialization? I suppose there's
a natural extension of the initialization argument, which suggests that you
ought to be able to explicitly test for an invalid state at any time, even
after the iterator has been set to a valid state, but that really doesn't sound
like the iterator model that STL uses.

>>>        my_container<int>       c;
>>>        my_container<int>::iterator     it = c.begin();
>>>        c.insert(5);
>>>        // does *it == 5 ? Of course not!
>>>        // ++it will also be undefined.
>>
>>Yes. That is true for all STL containers. Don't save off iterators then
>>modify the container. Grab the iterators when they are needed.
>
>No, that's not true for all STL containers.  To quote from "The Standard
>Template Library" by Stepanov and Lee, February 7, 1995 (caps for
>emphasis are mine):
>

 Yes, there are cases where it is permissible to save iterators into
particular types of containers. I should have been clearer: I don't think this
is a good design policy. I think it's much more important to be able to
substitute a different container when application profiling reveals that that
the original choice is less than optimal. Relying on properties that change
from container to container makes this sort of substitution much harder.
 -- Pete

Author: tony@online.tmx.com.au (Tony Cook)
Date: 1995/04/06 Raw View

Jan-Peter de Ruiter (ruiter@ruls41.LeidenUniv.nl) wrote:
: Tony Cook (tony@online.tmx.com.au) wrote:

: : a) is covered well by the way you should be using STL - using start
: : and end iterators.  I've never seen a pointer incremented to null
: : and I don't really believe it should be.

: Well, OK, maybe it shouldn't be, but in C this is or course frequently
: happening using strings.

Remember though, with strings you are testing the current item under
the iterator (pointer) - not the pointer itself.  There's nothing
stopping you having an iterator for a collection of type T where T
has a conversion to bool.

: : b) has a stronger root in the common "C" pattern of return null to
: : indicate an error (malloc and fopen for example), but this has been
: : overshadowed in C++ by exceptions.  Is there some reason you can't
: : use exceptions for this?  If you can't have you considered, that if
: : you need such a validity test in your own iterators, that you can
: : use some sort of test member function, like good() in IOStreams?

: I agree with this in the sense that I would like to be able to test
: (e.g. using good() ) the validity of an iterator _before_ any exception
: is thrown. Whether that is by using a null value or some test function
: doesn't matter to me. Our own library does both, and it really is handy
: to be able to test validity, especially during debugging.

The point of exceptions is that they remove the _need_ to worry
about detecting failures of this sort, but you can still do it if
you wish:
 // rough syntax here only
 Dict<T>::iterator s, e;
 try {
  s = d.begin();
  e = d.end();

  while (s < e)
   // do something with it
 }
 catch (...)
 {
  // Oh! It failed - do something about it
 }
--
        Tony Cook - tony@online.tmx.com.au
                    100237.3425@compuserve.com

Author: tholaday@jpmorgan.com (Thomas Holaday,COMM)
Date: 1995/03/31 Raw View

In article 7pc@druid.borland.com, pete@borland.com (Pete Becker) writes:

>    for (Dicts::iterator i=it1; i!=it2; ++i)  output (*i); // "The"..."Tzar"

I'm fonder of:

 extern ostream_iterator &output;
 copy(it1, it2, output) ;


---
~THol()

Thomas Holaday
holaday_thomas@jpmorgan.com
tlhol@ibm.net
70407.534@compuserve.com

Author: pete@borland.com (Pete Becker)
Date: 1995/04/01 Raw View

In article <3lh8bv$edh@jupiter.SJSU.EDU>, horstman@sjsumcs.sjsu.edu (Cay Horstmann) says:
>
>I do
>not know if anyone has ever thought of organizing the iterator hierarchy
>to have "nice" iterators and "classic" iterators (pointers into C
>arrays).

 A caution on terminology here: STL does not have an "iterator
hierarchy" in the sense of having iterator types that inherit from other
iterator types. It has a family of iterators with increasing power, but
there is no requirement that there be any inheritance involved, and the
only use of inheritance in the iterators as implemented by HP is to
reduce the amount of boilerplate code. That is strictly an implementation
technique and a convenience; STL can be implemented in full conformance
to its specification without using these bases.
 In the broader sense of "heirarchy" as a graded series, it is an
appropriate description of STL iterator types.
 -- Pete

Author: pete@borland.com (Pete Becker)
Date: 1995/04/01 Raw View

In article <D6BL0K.1w8@nntpa.cb.att.com>, mpl@pegasus.bl-els.att.com (-Michael P. Lindner) says:
>
>I promised myself I wasn't going to post any more followups to this
>thread, but here goes.
>

 Don't quit now! You've posted an example that shows what you're
trying to do. That makes it a lot easier to understand than simply
describing it in words.

>Saying "your problem is that 'Dicts' is not STL compliant" is not a
>viable answer.  Making "Dicts" STl compliant simply pushes the iteration
>problem to the implementor of "Dicts" rather than the caller.
>

Yes. That's where it belongs. The goal here is to provide a uniform way
of accessing containers. That makes writing the containers harder, but
ultimately makes containers easier to use. Since a particular container,
if successful, will only be written once but used many times, this is the
right way to allocate the effort.

>In fact, my quest for the null-valued iterator came about from
>implementing complex STL-compliant containers which were aggregates of
>simpler ones.  The ugliness of not having a null has two forms.
>
>1. Testing code correctness:
>   Since the default constructor leaves the iterator in an undefined
>   state, there is no way to define an iterator with a known value.
>
>   This makes code clumsier to debug and makes some assertions difficult
>   (i.e. impossible without defining dummy containers) to write.
>
>        iterator        it;
>        // complex code which _should_ assign a value to it
>        assert(/* it has been assigned a value? */);
>

This problem is not unique to iterators. It can apply to any object that
you create before you use it. In general, the way I handle this is to
intialize something with a function call. So, in a perhaps overly simple
case:

 iterator it;
 if( condition() )
  {
  // do some computations
  it = resultOfComputations;
  }
 else
  {
  // do some other computations
  it = resultOfOtherComputations;
  }

this becomes:

 iterator getIterator()
 {
 if( condition() )
  {
  // do some computations
  return resultOfComputations;
  }
 else
  {
  // do some other computations
  return resultOfOtherComputations;
  }
 }

 iterator it = getIterator();

This way there are no uninitialized objects hanging around, and no need to
test whether something has been initialized. It seems to me that this is
much cleaner and much less error prone than having objects that may have
a real meaning or may be serving as flags that indicate that they do not
have a real meaning, and having to remember to check the flag before
using the object.

>[example omitted]
>
>        OK, now consider the code fragment
>
>        my_container<int>       c;
>        my_container<int>::iterator     it = c.begin();
>        c.insert(5);
>        // does *it == 5 ? Of course not!
>        // ++it will also be undefined.
>

Yes. That is true for all STL containers. Don't save off iterators then
modify the container. Grab the iterators when they are needed. It can be
very expensive to write iterators that "know" when their container has
been modified.

>
>In summary, just as you can program with pointers without using NULL,
>you can program with iterators without needing a null value.

Seems to me that mentioning NULL pointers doesn't help your case. NULL
pointers cause many programming problems. It is not at all clear to me
that they solve more problems than they cause.
 -- Pete

Author: ncm@netcom.com (Nathan Myers)
Date: 1995/04/02 Raw View

>I just don't understand this mulish reluctance to define singular
>iterator values.  ...
>Can you give me a good reason not to define a singular value for an
>iterator?  Whether I choose to call it null seems immaterial.

Neither STL, nor the C++ Standard Library, imposes any restriction on
defining singular values for your iterators.  In fact, it imposes very
few restrictions of any kind.  (You can make operator!() start up
a Doom session if you like.)  What it does say is that you cannot
count on any random iterator having any given singular value,
and that no conforming algorithm will check for them.

Nathan Myers
myesn@roguewave.com

Author: ruiter@ruls41.LeidenUniv.nl (Jan-Peter de Ruiter)
Date: 1995/04/02 Raw View

Paul Kinnucan (kinnucan@hq.ileaf.com) wrote:

: I see, so the standards committee should add
: a feature to an already complex language solely on the basis that it
: will make some programmers happier?

Indeed, it should. Unless it makes other programmers' life
harder, and that is definitely not the case with null valued iterators.

BTW, on what other basis would a standard committee add features?
Platonic beauty? The Truth? The U.S. deficit? I Ching hexagrams?

:    However, this debate tends to become almost as silly as the hashtable
:    debate, and it will probably end the same way too. What I find sad is
:    that I have the strong suspicion that the opponents of hash tables and
:    null valued iterators are not open minded about it. They cling to STL
:    as if any change in it will mean public humiliation for the people
:    who designed and supported STL.

:    That attitude is not very productive.

: Oh, yes, if all else fails, deplore their motives, shedding a mock tear
: or two.

Well, after reading what you seem to think about the purpose of a
standard committee, I feel like shedding another tear. But of course,
you are right that shedding tears will not work. The question is, what
would? There have been numerous code examples around in this thread,
indicating clearly that null values for iterators are often very
convenient.

JP