Topic: A small holiday gift, I hope


Author: david_abrahams@my-deja.com
Date: 24 Dec 2000 08:42:32 -0500
Raw View
The standard library provides the following powerful binary search
algorithms which operate on sorted ranges. Each of these functions
takes a range parameter, a value parameter, and is overloaded to take
an optional comparison function:

  binary_search  Find an element matching a certain value
  lower_bound    Find a value's first order-preserving position
  upper_bound    Find a value's last order-preserving position
  equal_range    make_pair(lower_bound(...), upper_bound(...))

If the comparison function is omitted, the search uses the less-than
operator to compare the supplied value to elements of the range.

Suppose you tried to use these functions to implement a dictionary
lookup. A dictionary is composed of entries which contain a word plus
the word's definition. Well, you'd like to be able to look up a word's
dictionary entry given just the word, and not be forced to build an
entire dictionary entry just to do the search, since the definition
part of the dictionary entry won't be used at all. To experienced
users of hand-crafted binary searches, this usage is certainly
familiar and reliable. For example:

// UNTESTED CODE FOLLOWS. FOR EXPOSITION PURPOSES ONLY
typedef std::string Word;
typedef std::pair<Word, Word> DictionaryEntry
typedef std::vector<DictionaryEntry> Dictionary;

// Almost exactly like std::lower_bound()
Dictionary::const_iterator word_position(
    const Dictionary& d,
    const Word& word)
{
  // binary search implementation
  Dictionary::const_iterator first = d.begin();
  std::size_t len = d.size();
  while (len > 0) {
    const std::size_t half = len >> 1;
    const Dictionary::const_iterator middle = first + half;
    if (*middle < word) {
      first = middle;
      ++first;
      len -= half + 1;
    }
    else {
      len = half;
    }
  }
  return first;
}

// Return a pointer to the definition of the given word, or 0 if the
// word doesn't appear in the dictionary

const Word*
find_definition(const Dictionary& d, const Word& word)
{
  Dictionary::const_iterator p = word_position(d, word);
  return (p == d.end() || p->first != word) ? 0 : &p->second;
}

// Define a word in the dictionary or throw if already defined
void define_word(
  Dictionary& d,
  const Word& word,
  const Word& definition)
{
  Dictionary::const_iterator p = word_position(d, word);
  if (p != d.end() && p->first == word) {
    throw std::exception("duplicate definition");
  }
  else {
    d.insert(d.begin() + (p - d.begin()),
             DictionaryEntry(word, definition));
  }
}

The question is, instead of writing the word_position() function
above, which is tedious and error-prone, can we reuse the generic
algorithms in the standard library? This is what Scott Meyers was
trying to accomplish in the comp.std.c++ thread entitled "Emulating
map with a sorted vector" [I'm not sure if this will work for you, but
<http://www.deja.com/[ST_rn=fs]/viewthread.xp?
AN=675191351&group=comp.std.c%2B%2B>
takes me to the page].

Certainly, with nearly all implementations** of the standard library,
we can use the following comparison function object with
std::lower_bound(), and it will give us the expected results:

// A "heterogeneous comparison object"
struct CompareEntryWord1
{
  bool operator()(const DictionaryEntry& e, const Word& w) const
    { return e.first < w; }
};

But is it legal? The standard's position on this question is not
encouraging. For one thing, 25.3 says that for the algorithms to work
correctly, the comparison object has to induce a strict weak ordering
on the values. If we take "the values" to mean the elements of the
iterator range, then our comparison function clearly fails: you can't
use it when both arguments are DictionaryEntrys. The standard also
says the algorithms "assume that the sequence being searched is in
order according to the implied or explicit comparison function," which
makes little sense when the comparison function can't compare the
sequence elements.

Technically, though, we can satisfy the standard's requirements for
the comparison function by adding an overloaded operator()() to "keep
the language lawyers happy":

struct CompareEntryWord2
{
  // Heterogeneous comparison actually gets used
  bool operator()(const DictionaryEntry& e, const Word& w) const
    { return e.first < w; }

  // Homogeneous comparison just to satisfy the legal requirements.
  bool operator()(
    const DictionaryEntry& e, const DictionaryEntry&& w) const
    { return e.first < w.first; }
};

This version is arguably legal, but it subverts the intent of the
standard. The authors of that text clearly never meant to leave this
loophole in there for us. One dead giveaway is that the EFFECTS:
clause for lower_bound says "Finds the first position into which value
can be inserted without violating the ordering." Clearly, when the
value doesn't have the same type as the rest of the range, the clause
becomes nonsensical***.

Dietmar Kuehl has suggested an alternative which doesn't suffer these
problems: define a new iterator which wraps
Dictionary::const_iterator, but returns a const Word& (instead of a
const DictionaryEntry&) when dereferenced. This approach has two new
problems, though:

  1. It won't work for many similar cases where the value to be
     compared with must be computed from the elements of the range,
     because an iterator is required to return a reference type when
     dereferenced. It may well be that no appropriate lvalue exists to
     which a reference can be returned.

  2. it requires much more code than simply re-writing the binary
     search algorithm
     (<http://x52.deja.com/[ST_rn=fs]/getdoc.xp?
AN=674894959&search=thread&CONTEXT=977625247.1661403165&hitnum=5>
     shows an example), and is correspondingly more error-prone.

I think "the right answer" in the long run is to figure out how to
loosen the standard's requirements so that CompareEntryWord1 can be
guaranteed to work as expected. Matt Austern made a first stab at it
in
<http://x52.deja.com/[ST_rn=fs]/getdoc.xp?
AN=674946713&search=thread&CONTEXT=977625247.1661403165&hitnum=9>,
but was discouraged to find that his first attempt, though probably
already too complicated, wasn't complicated enough to do the job (see
<http://x52.deja.com/[ST_rn=fs]/getdoc.xp?
AN=675191351&search=thread&CONTEXT=977625247.1661403165&hitnum=12>).
This is the formulation he ended up with:

    comp is a function object whose first argument type is V
    and whose second argument type is T, and where comp(x, y)
    is equivalent to comp'(pi(x), y), where comp' is some
    strict weak ordering on T and where pi is some
    homomorphism from V to T.  The sequence [first, last)
    must be sorted in ascending order by comp'', where
    comp''(x, y) is equivalent to comp'(pi(x), pi(y)).

Even if this is formally correct, it is probably beyond the ken of
most committee members to verify its correctness, and beyond the ken
of even most expert programmers to verify that their comparison
functions satisfy these criteria. That makes it a bad choice for the
standard on both counts.

The problem with both of these early attempts is that they focus on
the sort order of the range. This is an obvious way to think of things
if the range elements and the search key are the same type, but I
think to solve the problem for the case we're interested in, a shift
in thinking is required. Strict weak ordering is a great concept for
sorting, but maybe it's not appropriate for searching.

Suppose as a simplifying assumption, we think about the search key as
though it were bound to one of the arguments of the comparison
function (say, using std::bind2nd for lower_bound()). That gives us a
simple unary comparison function object operating on elements of the
range and returning bool. In the lower_bound algorithm, we are
searching for the first element for which the unary function object
returns false (or the end position if no such element exists). For
this to work, of course, the unary function object must return true
for zero or more initial elements, and false for all the rest. That
is, the comp(e, value), where value is the search key, must induce a
/partition/ on the elements of the sequence. I believe this
formulation captures what's actually going on with binary_search more
generally, and to boot, is simpler to express.

I'll post my suggested changes to the standard text tomorrow, in a
followup article. So long as the standard remains as is, however, we
will need another solution. I also plan to contribute a small,
open-source binary search library to boost (www.boost.org) which is
guaranteed to work with these heterogeneous comparison functions.

-Dave

**SGI's library implementation actually has some fancy "concept checks"
which try to make sure you're following all the rules at compile time,
and it would fail to compile the use of the above comparison object
with lower_bound.

***On the other hand, the EFFECTS: clause is arguably redundant, since
the result of the algorithm is much more clearly specified by the
RETURNS: clause, which still makes perfect sense:

   Returns: The furthermost iterator i in the range [first, last] such
   that for any iterator j in the range [first, i) the following
   corresponding conditions hold: *j < value or comp(*j, value) !=
   false


Sent via Deja.com
http://www.deja.com/
---
[ comp.std.c++ is moderated.  To submit articles, try just posting with ]
[ your news-reader.  If that fails, use mailto:std-c++@ncar.ucar.edu    ]
[              --- Please see the FAQ before posting. ---               ]
[ FAQ: http://www.research.att.com/~austern/csc/faq.html                ]





Author: Valentin.Bonnard@free.fr (Valentin Bonnard)
Date: Mon, 25 Dec 2000 21:56:55 GMT
Raw View
> The question is, instead of writing the word_position() function
> above, which is tedious and error-prone, can we reuse the generic
> algorithms in the standard library? This is what Scott Meyers was
> trying to accomplish in the comp.std.c++ thread entitled "Emulating
> map with a sorted vector" [I'm not sure if this will work for you,
but
> <http://www.deja.com/[ST_rn=fs]/viewthread.xp?
> AN=675191351&group=comp.std.c%2B%2B>
> takes me to the page].
>
> Certainly, with nearly all implementations** of the standard library,
> we can use the following comparison function object with
> std::lower_bound(), and it will give us the expected results:
>
> // A "heterogeneous comparison object"
> struct CompareEntryWord1
> {
>   bool operator()(const DictionaryEntry& e, const Word& w) const
>     { return e.first < w; }
> };
>
> But is it legal? The standard's position on this question is not
> encouraging. For one thing, 25.3 says that for the algorithms to work
> correctly, the comparison object has to induce a strict weak ordering
> on the values. If we take "the values" to mean the elements of the
> iterator range, then our comparison function clearly fails: you can't
> use it when both arguments are DictionaryEntrys. The standard also
> says the algorithms "assume that the sequence being searched is in
> order according to the implied or explicit comparison function,"
which
> makes little sense when the comparison function can't compare the
> sequence elements.

Indeed.

> Technically, though, we can satisfy the standard's requirements for
> the comparison function by adding an overloaded operator()() to "keep
> the language lawyers happy":
>
> struct CompareEntryWord2
> {
>   // Heterogeneous comparison actually gets used
>   bool operator()(const DictionaryEntry& e, const Word& w) const
>     { return e.first < w; }
>
>   // Homogeneous comparison just to satisfy the legal requirements.
>   bool operator()(
>     const DictionaryEntry& e, const DictionaryEntry&& w) const
>     { return e.first < w.first; }
> };
>
> This version is arguably legal,

No, it is not legal.

> One dead giveaway is that the EFFECTS:
> clause for lower_bound says "Finds the first position into which
value
> can be inserted without violating the ordering." Clearly, when the
> value doesn't have the same type as the rest of the range, the clause
> becomes nonsensical***.

And EFFECT clauses are required to be make sens.

> Dietmar Kuehl has suggested an alternative which doesn't suffer these
> problems: define a new iterator

If we have to define a new view-iterator (an iterator providing
a different view of objects) just to apply a standard algorithm
in a very common case, then the standard library has a serious
design flaw.

> I think "the right answer" in the long run is to figure out how to
> loosen the standard's requirements so that CompareEntryWord1 can be
> guaranteed to work as expected.

I strongly disagree.

> Matt Austern made a first stab at it
> in
> <http://x52.deja.com/[ST_rn=fs]/getdoc.xp?
> AN=674946713&search=thread&CONTEXT=977625247.1661403165&hitnum=9>,
> but was discouraged to find that his first attempt, though probably
> already too complicated, wasn't complicated enough to do the job (see
> <http://x52.deja.com/[ST_rn=fs]/getdoc.xp?
> AN=675191351&search=thread&CONTEXT=977625247.1661403165&hitnum=12>).
> This is the formulation he ended up with:
>
>     comp is a function object whose first argument type is V
>     and whose second argument type is T, and where comp(x, y)
>     is equivalent to comp'(pi(x), y), where comp' is some
>     strict weak ordering on T and where pi is some
>     homomorphism from V to T.  The sequence [first, last)
>     must be sorted in ascending order by comp'', where
>     comp''(x, y) is equivalent to comp'(pi(x), pi(y)).

Ho ! This seems pretty involve to do something so simple.

> Even if this is formally correct,

Actually I don't even understand what it means. What's pi ?
A homomorphism wrt to what ? We have no function internal
to V. The only thing we can say about V objects is how they
compare to some T (via comp), but we have

                 comp(x, y) = comp'(pi(x), y)

by definition, so yes, pi is obviously a homomorphism,
with the following definition of homomorphism [*]: f is
a homomorphism from X to Y wrt a property P on X and a
property P' on Y iff for any x in X, P(x) <=> P'(f(x)).

[*] It is not the usual definition of homomorphism, and I
doupt any mathematician would call it a homomorphism.

>     The sequence [first, last)
>     must be sorted in ascending order by comp'', where
>     comp''(x, y) is equivalent to comp'(pi(x), pi(y)).

It would be clearer IMHO to say that the sequence [first, last)
is sorted in ascending order wrt comp'(pi(x), pi(y)). (Avoiding
the introduction of another notation.)

(Note that if comp' is a strict weak order, we need absolutly
no condition on pi for ((x,y)->comp'(pi(x), pi(y))) to be a
strict weak order too.)

> it is probably beyond the ken of most committee members to verify
> its correctness,

I believe that many people in the committe have implemented
complicated algorithm, and can handle much more complex math
arguments that this one.

> and beyond the ken
> of even most expert programmers to verify that their comparison
> functions satisfy these criteria.

The definition seems very involved and is almost unreadable.
The underlying theory is trivial. The idea behind it is
simple and intuitive.

> That makes it a bad choice for the standard on both counts.

IMO it is a bad choice for the standard primarilly because
it makes no sens.

> The problem with both of these early attempts is that they focus on
> the sort order of the range. This is an obvious way to think of
things
> if the range elements and the search key are the same type, but I
> think to solve the problem for the case we're interested in, a shift
> in thinking is required. Strict weak ordering is a great concept for
> sorting, but maybe it's not appropriate for searching.
>
> Suppose as a simplifying assumption, we think about the search key as
> though it were bound to one of the arguments of the comparison
> function (say, using std::bind2nd for lower_bound()). That gives us a
> simple unary comparison function object operating on elements of the
> range and returning bool.

Yes, and it is a much simplier view. We are talking about a
fast ``find (range, Predicate)'' function, where Predicate
is known to be monotonic, and the function can thus use Newton's
algorithm:

#include <algorithm>

// returns the first ``it'' in [first, last) such that ``pred(*it)''
// returns ``last'' if, for each it in [first, last), ``!pred (*it)''
template <typename FwdIt, typename AscPred>
FwdIt find_first_ascending (FwdIt first, FwdIt last, AscPred pred)
{
  using namespace std;
  if (last == first)
    return last;
  FwdIt med = first;
  advance (med, distance (first, last)/2);
  if (med == first)
    return pred (*med) ? med : last;
  if (pred (*med))
    return find_first_ascending (first, med, pred);
  else
    return find_first_ascending (med, last, pred);
}

This implementation isn't the most efficient, but it seems correct.

The implementation of the ``dual'' function, find_last_descending,
is a bit more difficult because, unlike find_first_ascending, it
isn't monotonic wrt pred.

// returns the last ``it'' in [first, last) such that ``pred(*it)''
// returns ``last'' if, for each it in [first, last), ``!pred (*it)''
template <typename FwdIt, typename DescPred>
FwdIt find_last_descending (FwdIt first, FwdIt last, DescPred pred)
{
  using namespace std;
  if (last == first)
    return last;
  FwdIt end = last;
  while (distance (first, end) > 1)
    {
      FwdIt med = first;
      advance (med, distance (first, end)/2);
      if (pred (*med))
 first = med;
      else
 end = med;
    }
  return pred (first) ? first : last;
}

> I'll post my suggested changes to the standard text tomorrow, in a
> followup article. So long as the standard remains as is, however, we
> will need another solution. I also plan to contribute a small,
> open-source binary search library to boost (www.boost.org) which is
> guaranteed to work with these heterogeneous comparison functions.

Another solution is to just rewrite the whole 'sorted data'
part of the standard to take a pi ``projection'' function:

#include <functional>

template <typename FwdIt, typename T, typename Compare, typename Pi>
BidIt lower_bound (FwdIt first, FwdIt last, T x, Compare comp, Pi pi)
{
  using namespace std;
  return find_first_ascending (first, last,
          not1 (compose1 (bind2nd (comp,x), pi)));
}

template <typename FwdIt, typename T, typename Compare, typename Pi>
BidIt upper_bound (FwdIt first, FwdIt last, T x, Compare comp, Pi pi)
{
  using namespace std;
  return find_first_ascending (first, last,
          compose1 (bind1st (comp,x), pi));
}

etc...

Note: It would be a _simplification_ of the SGI STL, since all
the associative containers are already implemented that way --
they are just undocumented.

  -- Valentin Bonnard, who hopes you'll appreciate
                       this Christmas gift

---
[ comp.std.c++ is moderated.  To submit articles, try just posting with ]
[ your news-reader.  If that fails, use mailto:std-c++@ncar.ucar.edu    ]
[              --- Please see the FAQ before posting. ---               ]
[ FAQ: http://www.research.att.com/~austern/csc/faq.html                ]
[ Note that the FAQ URL has changed!  Please update your bookmarks.     ]





Author: david_abrahams@my-deja.com
Date: Tue, 26 Dec 2000 17:57:55 GMT
Raw View
The standard library provides the following powerful binary search
algorithms which operate on sorted ranges. Each of these functions
takes a range parameter, a value parameter, and is overloaded to take
an optional comparison function:

  binary_search  Find an element matching a certain value
  lower_bound    Find a value's first order-preserving position
  upper_bound    Find a value's last order-preserving position
  equal_range    make_pair(lower_bound(...), upper_bound(...))

If the comparison function is omitted, the search uses the less-than
operator to compare the supplied value to elements of the range.

Suppose you tried to use these functions to implement a dictionary
lookup. A dictionary is composed of entries which contain a word plus
the word's definition. Well, you'd like to be able to look up a word's
dictionary entry given just the word, and not be forced to build an
entire dictionary entry just to do the search, since the definition
part of the dictionary entry won't be used at all. To experienced
users of hand-crafted binary searches, this usage is certainly
familiar and reliable. For example:

// UNTESTED CODE FOLLOWS. FOR EXPOSITION PURPOSES ONLY
typedef std::string Word;
typedef std::pair<Word, Word> DictionaryEntry
typedef std::vector<DictionaryEntry> Dictionary;

// Almost exactly like std::lower_bound()
Dictionary::const_iterator word_position(
    const Dictionary& d,
    const Word& word)
{
  // binary search implementation
  Dictionary::const_iterator first = d.begin();
  std::size_t len = d.size();
  while (len > 0) {
    const std::size_t half = len >> 1;
    const Dictionary::const_iterator middle = first + half;
    if (*middle < word) {
      first = middle;
      ++first;
      len -= half + 1;
    }
    else {
      len = half;
    }
  }
  return first;
}

// Return a pointer to the definition of the given word, or 0 if the
// word doesn't appear in the dictionary

const Word*
find_definition(const Dictionary& d, const Word& word)
{
  Dictionary::const_iterator p = word_position(d, word);
  return (p == d.end() || p->first != word) ? 0 : &p->second;
}

// Define a word in the dictionary or throw if already defined
void define_word(
  Dictionary& d,
  const Word& word,
  const Word& definition)
{
  Dictionary::const_iterator p = word_position(d, word);
  if (p != d.end() && p->first == word) {
    throw std::exception("duplicate definition");
  }
  else {
    d.insert(d.begin() + (p - d.begin()),
             DictionaryEntry(word, definition));
  }
}

The question is, instead of writing the word_position() function
above, which is tedious and error-prone, can we reuse the generic
algorithms in the standard library? This is what Scott Meyers was
trying to accomplish in the comp.std.c++ thread entitled "Emulating
map with a sorted vector" [I'm not sure if this will work for you, but
<http://www.deja.com/[ST_rn=fs]/viewthread.xp?
AN=675191351&group=comp.std.c%2B%2B>
takes me to the page].

Certainly, with nearly all implementations** of the standard library,
we can use the following comparison function object with
std::lower_bound(), and it will give us the expected results:

// A "heterogeneous comparison object"
struct CompareEntryWord1
{
  bool operator()(const DictionaryEntry& e, const Word& w) const
    { return e.first < w; }
};

But is it legal? The standard's position on this question is not
encouraging. For one thing, 25.3 says that for the algorithms to work
correctly, the comparison object has to induce a strict weak ordering
on the values. If we take "the values" to mean the elements of the
iterator range, then our comparison function clearly fails: you can't
use it when both arguments are DictionaryEntrys. The standard also
says the algorithms "assume that the sequence being searched is in
order according to the implied or explicit comparison function," which
makes little sense when the comparison function can't compare the
sequence elements.

Technically, though, we can satisfy the standard's requirements for
the comparison function by adding an overloaded operator()() to "keep
the language lawyers happy":

struct CompareEntryWord2
{
  // Heterogeneous comparison actually gets used
  bool operator()(const DictionaryEntry& e, const Word& w) const
    { return e.first < w; }

  // Homogeneous comparison just to satisfy the legal requirements.
  bool operator()(
    const DictionaryEntry& e, const DictionaryEntry&& w) const
    { return e.first < w.first; }
};

This version is arguably legal, but it subverts the intent of the
standard. The authors of that text clearly never meant to leave this
loophole in there for us. One dead giveaway is that the EFFECTS:
clause for lower_bound says "Finds the first position into which value
can be inserted without violating the ordering." Clearly, when the
value doesn't have the same type as the rest of the range, the clause
becomes nonsensical***.

Dietmar Kuehl has suggested an alternative which doesn't suffer these
problems: define a new iterator which wraps
Dictionary::const_iterator, but returns a const Word& (instead of a
const DictionaryEntry&) when dereferenced. This approach has two new
problems, though:

  1. It won't work for many similar cases where the value to be
     compared with must be computed from the elements of the range,
     because an iterator is required to return a reference type when
     dereferenced. It may well be that no appropriate lvalue exists to
     which a reference can be returned.

  2. it requires much more code than simply re-writing the binary
     search algorithm
     (<http://x52.deja.com/[ST_rn=fs]/getdoc.xp?
AN=674894959&search=thread&CONTEXT=977625247.1661403165&hitnum=5>
     shows an example), and is correspondingly more error-prone.

I think "the right answer" in the long run is to figure out how to
loosen the standard's requirements so that CompareEntryWord1 can be
guaranteed to work as expected. Matt Austern made a first stab at it
in
<http://x52.deja.com/[ST_rn=fs]/getdoc.xp?
AN=674946713&search=thread&CONTEXT=977625247.1661403165&hitnum=9>,
but was discouraged to find that his first attempt, though probably
already too complicated, wasn't complicated enough to do the job (see
<http://x52.deja.com/[ST_rn=fs]/getdoc.xp?
AN=675191351&search=thread&CONTEXT=977625247.1661403165&hitnum=12>).
This is the formulation he ended up with:

    comp is a function object whose first argument type is V
    and whose second argument type is T, and where comp(x, y)
    is equivalent to comp'(pi(x), y), where comp' is some
    strict weak ordering on T and where pi is some
    homomorphism from V to T.  The sequence [first, last)
    must be sorted in ascending order by comp'', where
    comp''(x, y) is equivalent to comp'(pi(x), pi(y)).

Even if this is formally correct, it is probably beyond the ken of
most committee members to verify its correctness, and beyond the ken
of even most expert programmers to verify that their comparison
functions satisfy these criteria. That makes it a bad choice for the
standard on both counts.

The problem with both of these early attempts is that they focus on
the sort order of the range. This is an obvious way to think of things
if the range elements and the search key are the same type, but I
think to solve the problem for the case we're interested in, a shift
in thinking is required. Strict weak ordering is a great concept for
sorting, but maybe it's not appropriate for searching.

Suppose as a simplifying assumption, we think about the search key as
though it were bound to one of the arguments of the comparison
function (say, using std::bind2nd for lower_bound()). That gives us a
simple unary comparison function object operating on elements of the
range and returning bool. In the lower_bound algorithm, we are
searching for the first element for which the unary function object
returns false (or the end position if no such element exists). For
this to work, of course, the unary function object must return true
for zero or more initial elements, and false for all the rest. That
is, the comp(e, value), where value is the search key, must induce a
/partition/ on the elements of the sequence. I believe this
formulation captures what's actually going on with binary_search more
generally, and to boot, is simpler to express.

I'll post my suggested changes to the standard text tomorrow, in a
followup article. So long as the standard remains as is, however, we
will need another solution. I also plan to contribute a small,
open-source binary search library to boost (www.boost.org) which is
guaranteed to work with these heterogeneous comparison functions.

-Dave

**SGI's library implementation actually has some fancy "concept checks"
which try to make sure you're following all the rules at compile time,
and it would fail to compile the use of the above comparison object
with lower_bound.

***On the other hand, the EFFECTS: clause is arguably redundant, since
the result of the algorithm is much more clearly specified by the
RETURNS: clause, which still makes perfect sense:

   Returns: The furthermost iterator i in the range [first, last] such
   that for any iterator j in the range [first, i) the following
   corresponding conditions hold: *j < value or comp(*j, value) !=
   false


Sent via Deja.com
http://www.deja.com/

---
[ comp.std.c++ is moderated.  To submit articles, try just posting with ]
[ your news-reader.  If that fails, use mailto:std-c++@ncar.ucar.edu    ]
[              --- Please see the FAQ before posting. ---               ]
[ FAQ: http://www.research.att.com/~austern/csc/faq.html                ]
[ Note that the FAQ URL has changed!  Please update your bookmarks.     ]





Author: "David Abrahams" <abrahams@mediaone.net>
Date: Wed, 27 Dec 2000 01:42:55 GMT
Raw View
"Valentin Bonnard" <Valentin.Bonnard@free.fr> wrote in message
news:928ffj$1q82$1@nef.ens.fr...
> > Technically, though, we can satisfy the standard's requirements for
> > the comparison function by adding an overloaded operator()() to "keep
> > the language lawyers happy":
> >
> > struct CompareEntryWord2
> > {
> >   // Heterogeneous comparison actually gets used
> >   bool operator()(const DictionaryEntry& e, const Word& w) const
> >     { return e.first < w; }
> >
> >   // Homogeneous comparison just to satisfy the legal requirements.
> >   bool operator()(
> >     const DictionaryEntry& e, const DictionaryEntry&& w) const
> >     { return e.first < w.first; }
> > };
> >
> > This version is arguably legal,
>
> No, it is not legal.

To say something is "arguably legal" is to say that "a reasonable person
could make the argument that it is legal". I am not making that argument
myself; I am only trying to leave a "trail of breadcrumbs" for anyone who
wants to try to follow what I'm thinking.

> > Dietmar Kuehl has suggested an alternative which doesn't suffer these
> > problems: define a new iterator
>
> If we have to define a new view-iterator (an iterator providing
> a different view of objects) just to apply a standard algorithm
> in a very common case, then the standard library has a serious
> design flaw.

Granted.

> > I think "the right answer" in the long run is to figure out how to
> > loosen the standard's requirements so that CompareEntryWord1 can be
> > guaranteed to work as expected.
>
> I strongly disagree.

On what basis?

> It would be clearer IMHO to say that the sequence [first, last)
> is sorted in ascending order wrt comp'(pi(x), pi(y)). (Avoiding
> the introduction of another notation.)

It turns out that any sort ordering between sequence elements is really
altogether beside the point. All that's important is that the comp function
divide the sequence into two (possibly empty) parts when used with the
search key, where false is returned for one part and true is returned for
the other. Granted, this does imply a certain trivial "sortedness", but any
formulation of the requirements in terms of sort order only seems to
complicate things.

> > it is probably beyond the ken of most committee members to verify
> > its correctness,
>
> I believe that many people in the committe have implemented
> complicated algorithm, and can handle much more complex math
> arguments that this one.

I agree. All the same, I think if a majority of committee members were able
to cope with it someone would have tried to shoot it down on theoretical
grounds before you did.

> The definition seems very involved and is almost unreadable.
> The underlying theory is trivial. The idea behind it is
> simple and intuitive.
>
> > That makes it a bad choice for the standard on both counts.
>
> IMO it is a bad choice for the standard primarilly because
> it makes no sens.

If you are right, then you have found another good reason not to use it.

> Yes, and it is a much simplier view. We are talking about a
> fast ``find (range, Predicate)'' function, where Predicate
> is known to be monotonic, and the function can thus use Newton's
> algorithm:

I believe Newton's algorithm is a specific application of binary searching
for finding roots. It depends on a sufficiently accurate initial guess for
convergence. I'm just talking about plain vanilla binary searching. Why
bring up Newton's algorithm?

> Another solution is to just rewrite the whole 'sorted data'
> part of the standard to take a pi ``projection'' function:

<snip implementations of new>

I don't understand why you'd go to these lengths when the only known
implementations of the standard-supplied algorithms already work, at least
in the absence of strict concept checking which is designed only to make
them break in these cases.

> Another solution is to just rewrite the whole 'sorted data'
> part of the standard to take a pi ``projection'' function:

<snip another implementation>

Okay, but why bother?

-Dave



---
[ comp.std.c++ is moderated.  To submit articles, try just posting with ]
[ your news-reader.  If that fails, use mailto:std-c++@ncar.ucar.edu    ]
[              --- Please see the FAQ before posting. ---               ]
[ FAQ: http://www.research.att.com/~austern/csc/faq.html                ]
[ Note that the FAQ URL has changed!  Please update your bookmarks.     ]





Author: "Alf P. Steinbach" <alf_steinbach@my-deja.com>
Date: 29 Dec 00 03:24:30 GMT
Raw View
In article <92489g$es2$1@nnrp1.deja.com>,
  david_abrahams@my-deja.com wrote:
> ...
>
> The question is, instead of writing the word_position() function
> above, which is tedious and error-prone, can we reuse the generic
> algorithms in the standard library?

In my experience if you can do approach (A) easily, but even after
long and hard study, research, tearing out of hair etc. still wonder
whether approach (B) is at all possible, then approach (A) is
generally the least tedious and error prone... ;-)

I agree, if the STL can be modified in some way to make it easy
to do *all* (or at least more!) of the things it probably was
intended for, then that would be wonderful!

Have a merry christmas (well, post-christmas) and a fruitful &
happy new year!


- Alf

--
alf_DOT_steinbach_AT_ac_DOT_com (clean the address before replying)
After 1. january 2000: alf_DOT_steinbach_AT_mogul_DOT_com


Sent via Deja.com
http://www.deja.com/

      [ Send an empty e-mail to c++-help@netlab.cs.rpi.edu for info ]
      [ about comp.lang.c++.moderated. First time posters: do this! ]

[ comp.std.c++ is moderated.  To submit articles, try just posting with ]
[ your news-reader.  If that fails, use mailto:std-c++@ncar.ucar.edu    ]
[              --- Please see the FAQ before posting. ---               ]
[ FAQ: http://reality.sgi.com/austern_mti/std-c++/faq.html              ]