C++ Programme By-bjarne stroustrup | More Info | Notesale | Buy and Sell Study Notes Online | Extra Student Income | University Notes

Search for notes by fellow students, in your own course and all over the country.

Browse our notes for titles which look like what you need, you can preview any of the notes via a sample of the contents. After you're happy these are the notes you're after simply pop them into your shopping cart.

My Basket

Buy These Notes

You have nothing in your shopping cart yet.

Title: C++ Programme By-bjarne stroustrup
Description: This Book contains all the necessary details of C++ Programme and.......all the information about C++ are present in this book......this book a contains all the important programmes that a computer programmer should know............so..........i hope this book is very beneficial for you...........Thank you........

Buy These Notes Preview

Document Preview

Extracts from the notes are below, to see the PDF you'll receive please use the links above

The
C++
Programming
Language
Fourth Edition

Bjarne Stroustrup

Upper Saddle River, NJ • Boston • Indianapolis • San Francisco
New York • Totonto • Montreal • London • Munich • Paris • Madrid
Capetown • Sydney • Tokyo • Singapore • Mexico City

Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks
...

The author and publisher have taken care in the preparation of this book, but make no expressed or implied warranty of any
kind and assume no responsibility for errors or omissions
...

The publisher offers excellent discounts on this book when ordered in quantity for bulk purchases or special sales, which
may include electronic versions and/or custom covers and content particular to your business, training goals, marketing
focus, and branding interests
...
S
...
com
For sales outside the United States, please contact:
International Sales
international@pearsoned
...
com/aw
Library of Congress Cataloging-in-Publication Data
Stroustrup, Bjarne
...
—Fourth edition
...

ISBN 978-0-321-56384-2 (pbk
...
paper)—ISBN 0-321-56384-0 (pbk
...
paper)
1
...
Title
...
73
...
13’3—dc23

2013002159

Copyright © 2013 by Pearson Education, Inc
...
Printed in the United States of America
...
To obtain permission to use material
from this work, please submit a written request to Pearson Education, Inc
...

This book was typeset in Times and Helvetica by the author
...

Second printing, June 2013

Contents

Contents

iii

Preface

v
Preface to the Fourth Edition
...
ix
Preface to the Second Edition
...
xii

Part I: Introductory Material
1
...

3
...

5
...
3
A Tour of C++: The Basics
...
59
A Tour of C++: Containers and Algorithms
...
111

Part II: Basic Facilities
6
...

8
...

10
...
135
Pointers, Arrays, and References
...
201
Statements
...
241

133

iv

Contents

11
...

13
...

15
...
273
Functions
...
343
Namespaces
...
419

Part III: Abstraction Mechanisms
16
...

18
...

20
...

22
...

24
...

26
...

28
...

Classes
...
481
Overloading
...
549
Derived Classes
...
613
Run-Time Type Information
...
665
Generic Programming
...
721
Instantiation
...
759
Metaprogramming
...
827

Part IV: The Standard Library
30
...

32
...

34
...

36
...

38
...

40
...

42
...

44
...
859
STL Containers
...
927
STL Iterators
...
973
Utilities
...
1033
Regular Expressions
...
1073
Locales
...
1159
Concurrency
...
1209
The C Standard Library
...
1267
1281

Preface
All problems in computer science
can be solved by another level of indirection,
except for the problem of too many layers of indirection
...
Wheeler

C++ feels like a new language
...
Furthermore, the resulting programs are better
checked by the compiler and run faster
...
I describe every language feature and standard-library
component that a professional programmer is likely to need
...

• Examples: How can it be used well by itself and in combination with other features? What
are the key techniques and idioms? What are the implications for maintainability and performance?
The use of C++ has changed dramatically over the years and so has the language itself
...
The current ISO
standard C++ (ISO/IEC 14882-2011, usually called C++11) is simply a far better tool for writing
quality software than were previous versions
...
Many answers are not the
same as you would ﬁnd with 1985, 1995, or 2005 vintage C++: progress happens
...
It is particularly suited for resource-constrained applications, such as
those found in software infrastructures
...
C++ is a language for someone who takes the task of programming seriously
...

There are billions of lines of C++ deployed
...
However, for all applications,
you can do better with modern C++; if you stick to older styles, you will be writing lower-quality
and worse-performing code
...
All code in this book conforms
to the 2011 ISO C++ standard
...

Naturally, these three groups are not disjoint – a professional software developer masters more than
just one programming language
...
If you ask, ‘‘What’s a for-loop?’’ or
‘‘What’s a compiler?’’ then this book is not (yet) for you; instead, I recommend my Programming:
Principles and Practice Using C++ to get started with programming and C++
...
If you ask ‘‘Why bother testing?’’
or say, ‘‘All languages are basically the same; just show me the syntax’’ or are conﬁdent that there
is a single language that is ideal for every task, this is not the book for you
...
Language and standard-library facilities for doing systemslevel concurrent programming (e
...
, using multicores)
...

General and uniform initialization, a simpler for-statement, move semantics, basic Unicode support,
lambdas, general constant expressions, control over class defaults, variadic templates, user-deﬁned
literals, and more
...
They are meant to be used in combination –
as bricks in a building set – rather than to be used individually in relative isolation to solve a speciﬁc problem
...
In particular,
C++’s design aims to be sufﬁciently ﬂexible and general to cope with future problems undreamed
of by its designers
...
Boehm, Marshall Clow, Jonathan Coe, Lawrence Crowl,
Walter Daugherty, J
...
Without their help this book would have been much poorer
...

Andrew Sutton is the author of the Origin library, which was the testbed for much of the discussion of emulating concepts in the template chapters, and of the matrix library that is the topic of
Chapter 29
...
’’
Thanks to my graduate design class for ﬁnding more problems with the ‘‘tour chapters’’ than
anyone else
...
Every expert
reviewer suggested adding technical details, advanced examples, and many useful development
conventions; every novice reviewer (or educator) suggested adding examples; and most reviewers
observed (correctly) that the book may be too long
...
Brian
Kernighan, for hosting me for part of the sabbatical that gave me time to write this book
...
Andy Hopper, for hosting me for part of the sabbatical that gave me time to write this book
...

College Station, Texas

Bjarne Stroustrup

This page intentionally left blank

Preface to the Third Edition
Programming is understanding
...
C++’s support for design and programming has
improved dramatically over the years, and lots of new helpful techniques have been developed for
its use
...
Ordinary practical programmers have achieved signiﬁcant
improvements in productivity, maintainability, ﬂexibility, and quality in projects of just about any
kind and scale
...

This book introduces standard C++† and the key programming and design techniques supported
by C++
...
New language features such as namespaces, exceptions,
templates, and run-time type identiﬁcation allow many techniques to be applied more directly than
was possible before, and the standard library allows the programmer to start from a much higher
level than the bare language
...
This
third edition is the result of a rewrite of even larger magnitude
...
The explosion of C++ use and the massive amount of experience accumulated as a result makes this possible
...
As before, this book presents C++ independently of any particular implementation,
and as before, the tutorial chapters present language constructs and concepts in a ‘‘bottom up’’
order so that a construct is used only after it has been deﬁned
...
Therefore, the standard library can be used to provide realistic and interesting examples well before a reader can be
assumed to understand its inner workings
...

This book presents every major C++ language feature and the standard library
...
However, features are presented in the context of their use
...

x

Preface to the Third Edition

That is, the focus is on the language as the tool for design and programming rather than on the language in itself
...
Except where illustrating technicalities, examples are
taken from the domain of systems software
...

The primary aim of this book is to help the reader understand how the facilities offered by C++
support key programming techniques
...
Only a good understanding of the ideas behind the language facilities leads to
mastery
...
The hope is that this book will help the reader gain
new insights and become a better programmer and designer
...
Without their help and suggestions, this book would have been harder to understand, contained more
errors, been slightly less complete, and probably been a little bit shorter
...
It is slightly unfair to single out individuals, but it would be even more unfair not to mention anyone, so I’d like to especially mention
Mike Ball, Dag Br¨ ck, Sean Corﬁeld, Ted Goldstein, Kim Knuttila, Andrew Koenig, Dmitry
u
Lenkov, Nathan Myers, Martin O’Riordan, Tom Plum, Jonathan Shopiro, John Spicer, Jerry
Schwarz, Alex Stepanov, and Mike Vilot, as people who each directly cooperated with me over
some part of C++ and its standard library
...
I have been able to accommodate many of their suggestions within
the framework of the book so that later printings beneﬁtted signiﬁcantly
...
In response to requests from readers, I
have added appendices D and E
...
Sevinc, Andy Tenne-Sens, Shoichi Uchida,
u
¸
Ping-Fai (Mike) Yang, and Dennis Yelle
...

– Bilbo Baggins

As promised in the ﬁrst edition of this book, C++ has been evolving to meet the needs of its users
...
The C++ user-community has grown a hundredfold during the
six years since the ﬁrst edition of this book; many lessons have been learned, and many techniques
have been discovered and/or validated by experience
...

The primary aim of the language extensions made in the last six years has been to enhance C++
as a language for data abstraction and object-oriented programming in general and to enhance it as
a tool for writing high-quality libraries of user-deﬁned types in particular
...
In this context, safe means that a class provides a speciﬁc
type-safe interface between the users of the library and its providers; efﬁcient means that use of the
class does not impose signiﬁcant overheads in run-time or space on the user compared with handwritten C code
...
Chapters 1 through 10 give a tutorial introduction; Chapters 11 through 13 provide a discussion of design and software development issues; and,
ﬁnally, the complete C++ reference manual is included
...
They include reﬁned
overloading resolution, memory management facilities, and access control mechanisms, type-safe
linkage, const and static member functions, abstract classes, multiple inheritance, templates, and
exception handling
...
In addition, C++ is successfully used in many application areas
that are not covered by this label
...
Consequently,
this book describes the C++ language itself without trying to explain a particular implementation,
programming environment, or library
...
’’ This style of exposition allows general principles and useful techniques to stand out more
clearly than they would in a fully elaborated program, where they would be buried in details
...
, are available in ‘‘bulletproof ’’ and/or ‘‘goldplated’’ versions from a
wide variety of commercial and non-commercial sources
...

This edition provides a greater emphasis on tutorial aspects than did the ﬁrst edition of this
book
...
The discussion of design issues has been greatly
expanded to reﬂect the demand for information beyond the description of language features and
their immediate use
...
The reference manual, in particular, represents many years of work in this direction
...
In
other words, this book presents the C++ language, its fundamental principles, and the key techniques needed to apply it
...
Many people inﬂuenced the development of C++ from 1985
to 1991
...
Also thanks to the many participants of the ‘‘external reviews’’ of the reference manual
drafts and to the people who suffered through the ﬁrst year of X3J16
...

– B
...
Whorf

C++ is a general purpose programming language designed to make programming more enjoyable
for the serious programmer
...
In addition to the facilities provided by C, C++ provides ﬂexible and efﬁcient facilities for
deﬁning new types
...
This technique for program construction is often called data abstraction
...

Such objects can be used conveniently and safely in contexts in which their type cannot be determined at compile time
...
When
used well, these techniques result in shorter, easier to understand, and easier to maintain programs
...
A class is a user-deﬁned type
...
C++ provides
much better facilities for type checking and for expressing modularity than C does
...
C++ retains C’s ability to deal efﬁciently with the fundamental
objects of the hardware (bits, bytes, words, addresses, etc
...

C++ and its standard libraries are designed for portability
...
C libraries can be used from a C++ program, and most tools that
support programming in C can be used with C++
...
It provides a complete description of C++, many complete examples, and many
more program fragments
...
In particular, Tom Cargill, Jim Coplien, Stu Feldman, Sandy Fraser,
Steve Johnson, Brian Kernighan, Bart Locanthi, Doug McIlroy, Dennis Ritchie, Larry Rosler, Jerry
Schwarz, and Jon Shopiro provided important ideas for development of the language
...

In addition, hundreds of people contributed to the development of C++ and its compiler by
sending me suggestions for improvements, descriptions of problems they had encountered, and
compiler errors
...

Many people have also helped with the production of this book, in particular, Jon Bentley,
Laura Eaves, Brian Kernighan, Ted Kowalski, Steve Mahaney, Jon Shopiro, and the participants in
the C++ course held at Bell Labs, Columbus, Ohio, June 26-27, 1985
...
It also provides an overview of this book
and explains the approach taken to the description of the language facilities and their
use
...

Chapters
1
2
3
4
5

Notes to the Reader
A Tour of C++: The Basics
A Tour of C++: Abstraction Mechanisms
A Tour of C++: Containers and Algorithms
A Tour of C++: Concurrency and Utilities

2

Introduction

Part I

‘‘
...
Be many people
...
You
have worried too much about Marcus Cocoza, so that you have been really his slave
and prisoner
...
You were always much afraid that Marcus
might do a stupid thing, or be bored
...
I should like you to be easy, your little heart to
be light again
...
’’
– Karen Blixen,
The Dreamers from Seven Gothic Tales (1934)

1
Notes to the Reader
Hurry Slowly
(festina lente)
...
1 The Structure of This Book
A pure tutorial sorts its topics so that no concept is used before it has been introduced; it must be
read linearly starting with page one
...
A pure tutorial can in principle be read without prerequisites – it carefully describes all
...

This book combines aspects of both
...
If not, you can start at the beginning, but try not to
get bogged down in details
...

4

Notes to the Reader

Chapter 1

Making parts of the book relatively self-contained implies some repetition, but repetition also
serves as review for people reading the book linearly
...
Experienced programmers can read the (relatively) quick
‘‘tour’’ of C++ to gain the overview needed to use the book as a reference
...
Chapters 2-5 give a quick introduction to the C++ language
and its standard library
...

Part III
Abstraction Mechanisms: Chapters 16-29 describe C++’s abstraction mechanisms and their use for object-oriented and generic programming
...

1
...
1 Introduction
This chapter, Chapter 1, provides an overview of this book, some hints about how to use it, and
some background information about C++ and its use
...
Please do not feel
obliged to read it all carefully before proceeding
...

Chapter 3
A Tour of C++: Abstraction Mechanisms presents the language features supporting data abstraction, object-oriented programming, and generic programming
...

Chapter 5
A Tour of C++: Concurrency and Utilities outlines the standard-library utilities
related to resource management, concurrency, mathematical computation, regular expressions, and more
...
In particular, it should convince readers that C++ has come a long way since the ﬁrst, second, and third editions of this book
...
1
...
It introduces the notions of type, object, scope, and storage
...
Modularity – as supported
by namespaces, source ﬁles, and exception handling – is also discussed:
Chapter 6
Types and Declarations: Fundamental types, naming, scopes, initialization, simple type deduction, object lifetimes, and type aliases

Section 1
...
2

Basic Facilities

5

Chapter 7
Chapter 8
Chapter 9

Pointers, Arrays, and References
Structures, Unions, and Enumerations
Statements: Declarations as statements, selection statements (if and switch), iteration statements (for, while, and do), goto, and comments
Chapter 10 Expressions: A desk calculator example, survey of operators, constant expressions, and implicit type conversion
...
For example,
I explain the C++ facilities for expressing recursion and iteration, but I do not go into technical
details or spend much time explaining how these concepts are useful
...
Many programmers lack experience with exceptions or
got their experience from languages (such as Java) where resource management and exception handling are not integrated
...
It goes into some detail
about strategy with a focus on the ‘‘Resource Acquisition Is Initialization’’ technique (RAII)
...
1
...
The chapters fall into three rough categories: classes, class hierarchies, and templates
...

Chapter 17 Construction, Cleanup, Copy, and Move shows how a programmer can deﬁne the
meaning of creation and initialization of objects of a class
...

Chapter 18 Operator Overloading presents the rules for giving meaning to operators for
user-deﬁned types with an emphasis on conventional arithmetic and logical operators, such as +, ∗, and &
...
’’

6

Notes to the Reader

Chapter 1

Classes can be organized into hierarchies:
Chapter 20 Derived Classes presents the basic language facilities for building hierarchies out
of classes and the fundamental ways of using them
...

The C++ model for access control (public, protected, and private) is presented
...
It also
presents the notion of multiple inheritance, that is, a class having more than one
direct base class
...
We can use dynamic_cast to inquire whether an object of
a base class was deﬁned as an object of a derived class and use the typeid to gain
minimal information from an object (such as the name of its class)
...
Class
templates, function templates, and template aliases are presented
...
The technique of lifting an abstract algorithm from a number of concrete
code examples is central, as is the notion of concepts specifying a generic algorithm’s requirements on its arguments
...

Chapter 26 Instantiation focuses on the rules for name binding
...

Chapter 28 Metaprogramming explores how templates can be used to generate programs
...

Chapter 29 A Matrix Design gives a longish example to show how language features can be
used in combination to solve a complex design problem: the design of an Ndimensional matrix with near-arbitrary element types
...
The presentation technique in Part III differs from that of Part II in that I don’t assume that
the reader knows the techniques described
...
1
...
In particular, they are meant to be
read in any order and can be used as a user-level manual for the library components:
Chapter 30 Standard-Library Overview gives an overview of the standard library, lists the
standard-library headers, and presents language support and diagnostics support,
such as exception and system_error
...

Section 1
...
4

Chapter 32
Chapter 33
Chapter 34

Chapter 35
Chapter 36
Chapter 37

Chapter 38
Chapter 39

Chapter 40
Chapter 41
Chapter 42

Chapter 43
Chapter 44

The Standard Library

7

STL Algorithms presents the algorithms from the STL, including ﬁnd(), sort(),
and merge()
...

Memory and Resources presents utility components related to memory and
resource management, such as array, bitset, pair, tuple, unique_ptr, shared_ptr,
allocators, and the garbage collector interface
...

Strings documents the string library, including the character traits that are the
basis for the use of different character sets
...

I/O Streams documents the stream I/O library
...

Locales describes class locale and its various facets that provide support for the
handling of cultural differences in character sets, formatting of numeric values,
formatting of date and time, and more
...

Concurrency presents the C++ basic memory model and the facilities offered for
concurrent programming without locks
...

The C Standard Library documents the C standard library (including printf() and
clock()) as incorporated into the C++ standard library
...

1
...
5 Examples and References
This book emphasizes program organization rather than the design of algorithms
...
A trivial algorithm is typically better suited to
illustrate an aspect of the language deﬁnition or a point about program structure
...
Often, reimplementation with a
more suitable algorithm is an exercise
...

Textbook examples necessarily give a warped view of software development
...
I see no substitute for
writing realistically sized programs in order to get an impression of what programming and a

8

Notes to the Reader

Chapter 1

programming language are really like
...
These are the basic techniques from which every program is composed
...

The selection of examples reﬂects my background in compilers, foundation libraries, and simulations
...
Examples are simpliﬁed versions of what is found in real code
...
My ideal is the shortest and clearest example that
illustrates a design principle, a programming technique, a language construct, or a library feature
...
For purely language-technical
examples, I use variables named x and y, types called A and B, and functions called f() and g()
...
The language features presented and the detail in which
they are described roughly reﬂect my view of what is needed for effective use of C++
...
An
understanding of every language-technical detail of a language feature or library component is neither necessary nor sufﬁcient for writing good programs
...
What is
needed is an understanding of design and programming techniques together with an appreciation of
application domains
...
The ﬁnal arbiter of language and
standard-library rules is the ISO C++ standard [C++,2011]
...
3
...
5
...
1 (ISO C++ standard, §5
...
1)
...
g
...
g
...

To save a few trees and to simplify additions, the hundreds of exercises for this book have been
moved to the Web
...
stroustrup
...

The language and library used in this book are ‘‘pure C++’’ as deﬁned by the C++ standard
[C++,2011]
...
The
major program fragments in this book were tried using several C++ implementations
...
However, I
see no point in mentioning which implementations failed to compile which examples
...
See Chapter 44 for suggestions on how to
cope with older C++ compilers and with code written for C compilers
...
For example, I prefer
{}-style initializers and using for type aliases
...
’’ However, being startled is often a good way to start reviewing material
...

Obviously, if you have to use a pre-C++11 compiler (say, because some of your customers have
not yet upgraded to the current standard), you have to refrain from using novel features
...

§44
...

Section 1
...
2 The Design of C++
The purpose of a programming language is to help express ideas in code
...
The ﬁrst purpose ideally requires a language that is ‘‘close to the
machine’’ so that all important aspects of a machine are handled simply and efﬁciently in a way
that is reasonably obvious to the programmer
...
The second purpose ideally requires a language that is ‘‘close to the problem to be solved’’
so that the concepts of a solution can be expressed directly and concisely
...
Thus, C++ is based on the
idea of providing both
• direct mappings of built-in operations and types to hardware to provide efﬁcient memory
use and efﬁcient low-level operations, and
• affordable and ﬂexible abstraction mechanisms to provide user-deﬁned types with the same
notational support, range of uses, and performance as built-in types
...
Over the years, further application
of these simple ideals resulted in a far more general, efﬁcient, and ﬂexible set of facilities
...

The design of C++ has focused on programming techniques dealing with fundamental notions
such as memory, mutability, abstraction, resource management, expression of algorithms, error handling, and modularity
...

By deﬁning libraries of classes, class hierarchies, and templates, you can write C++ programs at
a much higher level than the one presented in this book
...
4
...
For high-level applications programming to be effective and convenient, we need libraries
...
That’s true for every general-purpose
language
...

My standard introduction of C++ used to start:
• C++ is a general-purpose programming language with a bias toward systems programming
...
What has changed over the years is an increase in the importance, power, and
ﬂexibility of C++’s abstraction mechanisms:
• C++ is a general-purpose programming language providing a direct and efﬁcient model of
hardware combined with facilities for deﬁning lightweight abstractions
...

By general-purpose programming language I mean a language designed to support a wide variety
of uses
...
No language is ideal for every application and every programmer,
but the ideal for C++ is to support the widest possible range of application areas well
...
In particular, the implementation of
software infrastructure (e
...
, device drivers, communications stacks, virtual machines, operating
systems, operations systems, programming environments, and foundation libraries) is mostly systems programming
...

Of course, you can also program in ways that completely hide hardware, use expensive abstractions (e
...
, every object on the free store and every operation a virtual function), use inelegant styles
(e
...
, overabstraction), or use essentially no abstractions (‘‘gloriﬁed assembly code’’)
...

The Design and Evolution of C++ book [Stroustrup,1994] (known as D&E) outlines the ideas
and design aims of C++ in greater detail, but two principles should be noted:
• Leave no room for a lower-level language below C++ (except for assembly code in rare
cases)
...

• What you don’t use you don’t pay for
...
Therefore, a language feature and
a fundamental abstraction must be designed not to waste a single byte or a single processor
cycle compared to equivalent alternatives
...

These are Draconian principles, but essential in some (but obviously not all) contexts
...
The STL is an example (§4
...
1, §4
...
5, Chapter 31, Chapter 32,
Chapter 33)
...

1
...
1 Programming Style
Languages features exist to provide support for programming styles
...

The general ideals for design and programming can be expressed simply:
• Express ideas directly in code
...

• Represent relationships among ideas directly in code
...

• Express simple ideas simply
...
A fundamental reason for that is that a language embodies a set of engineering tradeoffs
reﬂecting differing needs, tastes, and histories of various individuals and communities
...

Section 1
...
1

Programming Style

11

The C++ language features most directly support four programming styles:
• Procedural programming
• Data abstraction
• Object-oriented programming
• Generic programming
However, the emphasis is on the support of effective combinations of those
...
) solution to most nontrivial problems tends to be one
that combines aspects of these styles
...
For example, what I
refer to as a ‘‘programming style,’’ others call a ‘‘programming technique’’ or a ‘‘paradigm
...
I feel
uncomfortable with the word ‘‘paradigm’’ as pretentious and (from Kuhn’s original deﬁnition) having implied claims of exclusivity
...

• Procedural programming: This is programming focused on processing and the design of
suitable data structures
...
C++’s support comes in the form of the built-in types, operators, statements, functions, structs, unions, etc
...
Compared to C, C++ provides further support for procedural programming in the
form of many additional language constructs and a stricter, more ﬂexible, and more supportive type system
...
C++ supports concrete and
abstract classes
...
The notion of an
abstract class provides direct support for complete data hiding
...
In addition to allowing the deﬁnition lattices of classes, C++
provides a variety of features for navigating class lattices and for simplifying the deﬁnition
of a class out of existing ones
...
3
...
2) and encapsulation (§20
...
5)
...
Here, ‘‘general’’ means that an algorithm can be designed to accept a
wide variety of types as long as they meet the algorithm’s requirements on its arguments
...
Templates provide (compiletime) parametric polymorphism
...
Thus, C++ could be (and has been) called class oriented
...

Focusing exclusively on one of these styles is a mistake: except for toy examples, doing so leads to
wasted development effort and suboptimal (inﬂexible, verbose, poorly performing, unmaintainable,
etc
...

12

Notes to the Reader

Chapter 1

I wince when someone characterizes C++ exclusively through one of these styles (e
...
, ‘‘C++ is
an object-oriented language’’) or uses a term (e
...
, ‘‘hybrid’’ or ‘‘mixed paradigm’’) to imply that a
more restrictive language would be preferable
...
The styles mentioned are not distinct alternatives: each contributes techniques to a more
expressive and effective style of programming, and C++ provides direct language support for their
use in combination
...

Even the earliest published account of C++ [Stroustrup,1982] presents examples that use these different styles in combination and presents language features aimed at supporting such combinations:
• Classes support all of the mentioned styles; all rely on the user representing ideas as userdeﬁned types or objects of user-deﬁned types
...

• Member functions, constructors, destructors, and user-deﬁned assignment provide a clean
functional interface to objects as needed by data abstraction and object-oriented programming
...
More
general overloading had to wait until 1984 and uniform initialization until 2010
...
They are necessary for overloading
...

• Generic functions and parameterized types (generated from functions and classes using
macros) support generic programming
...

• Base and derived classes provide the foundation for object-oriented programming and some
forms of data abstraction
...

• Inlining made the use of these facilities affordable in systems programming and for building
run-time and space efﬁcient libraries
...
Today’s C++ provides much better support for design and programming based on
lightweight abstraction, but the aim of elegant and efﬁcient code was there from the very beginning
...

The fundamental object in C++ has identity; that is, it is located in a speciﬁc location in memory and can be distinguished from other objects with (potentially) the same value by comparing
addresses
...
4)
...
In C++11, this notion of rvalue
has been developed into a notion of a value that can be moved around cheaply (§3
...
2, §6
...
1,
§7
...
2)
...
This nicely complements the techniques and language features (e
...
, lambda expressions) developed primarily for
generic programming
...
g
...

Section 1
...
1

Programming Style

13

From the very earliest days, C++ programs and the design of C++ itself have been concerned
about resource management
...
2),
• perfect (no leaks are acceptable), and
• statically type-safe
...
Foundation and application libraries beyond the standard
provided many more examples, such as Matrix and Widget
...
This was soon backed with the ability to control copy by deﬁning
assignment as well as copy constructors
...
3) in C++11 completes this line of thinking by allowing cheap movement of potentially
large objects from scope to scope (§3
...
2) and to simply control the lifetime of polymorphic or
shared objects (§5
...
1)
...
Any class that establishes and maintains an invariant relies on a subset of those features
...
2
...
For this reason, restricting language features with the intent of eliminating programmer errors is, at best, dangerous
...
Good design and the
absence of errors cannot be guaranteed merely by the presence or absence of speciﬁc language features
...

The notion of static types and compile-time type checking is central to effective use of C++
...
Following Simula, the design of user-deﬁned types with interfaces that are checked at compile time is key to the
expressiveness of C++
...

C++ type-checking and data-hiding features rely on compile-time analysis of programs to prevent accidental corruption of data
...
They can, however, be used freely without incurring run-time or space overheads
...

C++’s static type system is ﬂexible, and the use of simple user-deﬁned types implies little, if
any overhead
...
A type-rich style of programming makes code more

14

Notes to the Reader

Chapter 1

readable, maintainable, and analyzable
...
C++ compilers and development tools support such type-based analysis [Stroustrup,2012]
...
However, my ideal is (and always was) complete type safety
...
’’ Note that Simula was both
type-safe and ﬂexible
...
’’ However, the list of solid reasons against basing my work on type-safe
Algol68 [Woodward,1974] was long and painful
...
But it is an ideal that C++ programmers (especially library
builders) can strive for
...
Outside of low-level sections of code (hopefully
isolated by type-safe interfaces), code that interfaces to code obeying different language conventions (e
...
, an operating system call interface), and the implementations of fundamental abstractions
(e
...
, string and vector), there is now little need for type-unsafe code
...
2
...
The main reasons for relying on C were to build on a proven set of low-level language
facilities and to be part of a technical community
...
The continuing, more or less parallel evolution of C
and C++ has been a constant source of concern and requires constant attention [Stroustrup,2002]
...
In particular, there are differences in opinion as
to the value of compatibility, differences in opinion on what constitutes good programming, and
differences in opinion on what support is needed for good programming
...

One hundred percent C/C++ compatibility was never a goal for C++ because that would compromise type safety and the smooth integration of user-deﬁned and built-in types
...
C++98 adopted many details from C89 (§44
...
1)
...
C’s facilities for low-level systems programming tasks are retained and enhanced; for
example, see inlining (§3
...
1
...
1
...
2
...
2
...
4, §12
...
6)
...
g
...

The deﬁnition of C++ has been revised to ensure that a construct that is both legal C and legal
C++ has the same meaning in both languages (§44
...

One of the original aims for C was to replace assembly coding for the most demanding systems
programming tasks
...
2
...
The difference between C and C++ is primarily in the degree of emphasis on types and structure
...
Through extensive use of the type system, C++ is even more
expressive without loss of performance
...
Programming in C encourages many techniques and tricks that are rendered unnecessary by C++ language features
...
3
...
However, good
C programs tend to be C++ programs
...
Experience with
any statically typed language will be a help when learning C++
...
2
...
C++ has no built-in high-level data types
and no high-level primitive operations
...
If a user wants such
a type, it can be deﬁned in the language itself
...
A well-designed userdeﬁned type differs from a built-in type only in the way it is deﬁned, not in the way it is used
...
) provides many examples
of such types and their uses
...
Except for a few unfortunate and unimportant historical accidents, the C++ standard library is written in C++
...
This ensures that they can be used in large systems that typically consist of layer
upon layer of abstraction
...
For
example, constructs that would make it necessary to store ‘‘housekeeping information’’ in every
object were rejected, so if a user declares a structure consisting of two 16-bit quantities, that structure will ﬁt into a 32-bit register
...
This
can be essential for embedded and high-performance applications
...
So, programmers of such applications don’t have to work with a low-level
(error-prone, impoverished, and unproductive) set of language features
...
Fortunately, C++ was never restricted
to UNIX; it simply used UNIX and C as a model for the relationships among language, libraries,
compilers, linkers, execution environments, etc
...
There are, however, good reasons for using C++ in environments that provide signiﬁcantly more run-time support
...

16

Notes to the Reader

Chapter 1

Not every piece of code can be well structured, hardware-independent, easy to read, etc
...
It also possesses facilities for hiding
such code behind elegant and safe interfaces
...
C++’s emphasis on modularity, strongly typed interfaces, and ﬂexibility pays off here
...

This book emphasizes techniques for providing general-purpose facilities, generally useful
types, libraries, etc
...
Furthermore, because all nontrivial programs consist of many semi-independent parts, the techniques for writing such parts serve programmers of all applications
...

This introduces library components and their underlying design concepts and implementation techniques
...

However, if the standard library provides a component that addresses a problem, it is almost always
better to use that component than to build your own
...
Over the
longer term, the standard component (possibly accessed through a convenient custom interface) is
likely to lower long-term maintenance, porting, tuning, and education costs
...
With C++,
this is not so
...
, is typically a
bit shorter than the equivalent C program not using these facilities
...

C++ supports systems programming
...
The idea of writing all software in a
single language is a fantasy
...
By that, I meant that a C++, C, assembler, or Fortran
function could call functions in the other languages without extra overhead or conversion of data
structures passed among them
...
The use of multiple processes and
multiple address spaces relied on (extralinguistic) operating system support
...
Initially, I relied on the UNIX Shell for that, but just about
any ‘‘scripting language’’ will do
...
C++ was designed to be part of large, concurrent, multilanguage systems
...
3

Learning C++

17

1
...
Fortunately, a programming language does not have to be
perfect to be a good tool for building great systems
...
What is perfect for one task is
often seriously ﬂawed for another because perfection in one area implies specialization
...

Not everything can be expressed directly using the built-in features of a language
...
Language features exist to support a variety of programming styles and techniques
...

Writing programs is essential; understanding a programming language is not just an intellectual
exercise
...

In practical programming, there is little advantage in knowing the most obscure language features or using the largest number of features
...
Only in the context provided by techniques and by other features does the feature acquire
meaning and interest
...

No signiﬁcant system is built exclusively in terms of the language features themselves
...
We use libraries to improve maintainability, portability, and performance
...
g
...
Many of the most fundamental programming concepts are represented in the standard
library
...
The standard library
is the repository of much hard-earned knowledge of how to use C++ well
...
This has surprised some who – correctly – point
out that C++ isn’t the smallest or cleanest language ever designed
...

The most important thing to do when learning C++ is to focus on fundamental concepts (such
as type safety, resource management, and invariants) and programming techniques (such as
resource management using scoped objects and the use of iterators in algorithms) and not get lost in
language-technical details
...
For this, an appreciation of programming and design techniques is far more
important than understanding all the details
...

18

Notes to the Reader

Chapter 1

C++ programming is based on strong static type checking, and most techniques aim at achieving a high level of abstraction and a direct representation of the programmer’s ideas
...
To gain the beneﬁts of C++, programmers coming to it from a different language must
learn and internalize idiomatic C++ programming style and technique
...

Thoughtlessly applying techniques effective in one language to another typically leads to awkward, poorly performing, and hard-to-maintain code
...
’’ You can write in the style of Fortran, C, Lisp, Java,
etc
...
Every language can be a fertile source of ideas about how to write C++ programs
...
Over the basic type system of a language, only
Pyrrhic victories are possible
...
C++ is safer and more expressive, and it reduces the need to
focus on low-level techniques
...
Chapter 44 is
a guide for programmers going from C++ to C, say, to deal with legacy code
...

There are several independently developed implementations of C++
...
To help master all of this you
can ﬁnd textbooks, manuals, and a bewildering variety of online resources
...
Each has its own
emphasis and bias, so use at least two
...
3
...
Imitate good writing
...

The main ideal for C++ programming – as for programming in most higher-level languages – is
to express concepts (ideas, notions, etc
...
We try to ensure that the
concepts we talk about, represent with boxes and arrows on our whiteboard, and ﬁnd in our (nonprogramming) textbooks have direct and obvious counterparts in our programs:
[1] Represent ideas directly in code
...
g
...

[3] Represent independent ideas independently in code
...

Section 1
...
1

Programming in C++

19

More speciﬁcally:
[5] Prefer statically type-checked solutions (when applicable)
...
g
...

[7] Don’t overabstract (i
...
, don’t generalize, introduce class hierarchies, or parameterize
beyond obvious needs and experience)
...
3
...

1
...
2 Suggestions for C++ Programmers
By now, many people have been using C++ for a decade or two
...
Often, what an experienced C++ programmer has failed to notice over the
years is not the introduction of new features as such, but rather the changes in relationships
between features that make fundamental new programming techniques feasible
...
You ﬁnd out only by reexamining the basics
...
If you already know the contents of a chapter, you can be
done in minutes
...
I learned a fair bit writing this book, and I suspect that hardly any C++ programmer knows
every feature and technique presented
...
Through its organization and examples,
this book offers such a perspective
...
4
...
2, §13
...
2
...

[2] Use constructor/destructor pairs to simplify resource management (RAII; §5
...
3)
...
2
...
2, §11
...
1)
...
4, §4
...
4, Chapter 32)
...
2
...

[6] Use exceptions, rather than error codes, to report errors that cannot be handled locally
(§2
...
3, §13
...

[7] Use move semantics to avoid copying large objects (§3
...
2, §17
...
2)
...
2
...

[9] Use shared_ptr to reference shared objects, that is, objects without a single owner that is
responsible for their destruction (§5
...
1)
...
2)
...
3
...
3
...

1
...
3 Suggestions for C Programmers
The better one knows C, the harder it seems to be to avoid writing C++ in C style, thereby losing
many of the potential beneﬁts of C++
...

20

Notes to the Reader

Chapter 1

[1]

Don’t think of C++ as C with a few features added
...
To get really major advantages from C++ as compared to C, you need to
apply different design and implementation styles
...

[3] Use the C++ standard library as a teacher of new techniques and programming styles
...
g
...

[4] Macro substitution is almost never necessary in C++
...
5), constexpr (§2
...
3,
§10
...
4) to deﬁne manifest constants, inline (§12
...
5) to avoid
function-calling overhead, templates (§3
...
4
...
3
...

[5] Don’t declare a variable before you need it, and initialize it immediately
...
3), in for-statement initializers (§9
...
4
...

[6] Don’t use malloc()
...
2) does the same job better, and instead of
realloc(), try a vector (§3
...
2)
...
2
...
2, §11
...
1)
...
Their use limits the support you can get from the type system and can harm performance
...
If you must use an
explicit type conversion, try using one of the named casts (e
...
, static_cast; §11
...
2) for a
more precise statement of what you are trying to do
...
C++ standard-library strings (§4
...
2
...
4
...
In general, try not to build yourself what has
already been provided by the standard library
...
g
...

[10] Do not assume that something laboriously written in C style (avoiding C++ features such
as classes, templates, and exceptions) is more efﬁcient than a shorter alternative (e
...
,
using standard-library facilities)
...

To obey C linkage conventions, a C++ function must be declared to have C linkage (§15
...
5)
...
3
...
Their aims are signiﬁcantly different and so are many of their application domains
...
To use C++
well, you need to adopt programming and design techniques appropriate to C++, rather than trying
to write Java in C++
...

Section 1
...
4

Suggestions for Java Programmers

21

[2]

Use the C++ abstraction mechanisms (e
...
, classes and templates): don’t fall back to a C
style of programming out of a false feeling of familiarity
...

[4] Don’t immediately invent a unique base for all of your classes (an Object class)
...

[5] Minimize the use of reference and pointer variables: use local and member variables
(§3
...
1
...
2, §16
...
4, §17
...

[6] Remember: a variable is never implicitly a reference
...

[8] A function is not virtual by default
...

[9] Use abstract classes as interfaces to class hierarchies; avoid ‘‘brittle base classes,’’ that is,
base classes with data members
...

[11] Use a constructor to establish a class invariant (and throw an exception if it can’t)
...
g
...
Don’t imitate ﬁnally (doing so is more ad hoc and in the longer run far
more work than relying on destructors)
...
g
...
g
...

[14] Use freestanding functions (nonmember functions) to minimize coupling (e
...
, see the
standard algorithms), and use namespaces (§2
...
2, Chapter 14) to limit the scope of freestanding functions
...
5
...
1)
...

[17] C++ offers only the most minimal run-time reﬂection: dynamic_cast and typeid (Chapter
22)
...
g
...

Most of this advice applies equally to C# programmers
...
4 History
I invented C++, wrote its early deﬁnitions, and produced its ﬁrst implementation
...

C++ was designed to provide Simula’s facilities for program organization [Dahl,1970]
[Dahl,1972] together with C’s efﬁciency and ﬂexibility for systems programming [Kernighan,1978]
[Kernighan,1988]
...
The class concept (with derived classes and virtual functions) was borrowed from it
...

22

Notes to the Reader

Chapter 1

The evolution of C++ was always in the context of its use
...
In particular, my colleagues at
AT&T Bell Laboratories were essential for the growth of C++ during its ﬁrst decade
...
Furthermore, it does not go into details
...
My two papers from the ACM History of Programming Languages conference and my
Design and Evolution of C++ book (known as ‘‘D&E’’) describe the design and evolution of C++
in detail and document inﬂuences from other programming languages
...
In my FAQ, I try to maintain a connection between the standard facilities and the people
who proposed and reﬁned those facilities [Stroustrup,2010]
...

1
...
1 Timeline
The work that led to C++ started in the fall of 1979 under the name ‘‘C with Classes
...
The initial feature set included classes and derived
classes, public/private access control, constructors and destructors, and function declarations with argument checking
...

1984 ‘‘C with Classes’’ was renamed to C++
...

1985 First commercial release of C++ (October 14)
...

1985 The C++ Programming Language (‘‘TC++PL,’’ October 14) [Stroustrup,1986]
...

1991 The C++ Programming Language, Second Edition [Stroustrup,1991], presenting generic
programming using templates and error handling based on exceptions (including the
‘‘Resource Acquisition Is Initialization’’ general resource management idiom)
...
The standard
library added the STL framework of generic containers and algorithms
...

2002 Work on a revised standard, colloquially named C++0x, started
...
A C++ Technical Report
introduced new standard-library components, such as regular expressions, unordered containers (hash tables), and resource management pointers, which later became part of
C++0x
...

Section 1
...
1

Timeline

23

2009 C++0x was feature complete
...
The standard library added several components, including
threads, locks, and most of the components from the 2003 Technical Report
...

2012 The ﬁrst complete C++11 implementations emerged
...

2013 The C++ Programming Language, Fourth Edition introduced C++11
...
As is not uncommon in large projects, we
were overly optimistic about the completion date
...
4
...
For that, I needed some event-driven simulations for which Simula would have been
ideal, except for performance considerations
...
The result of adding Simula-style
classes to C, ‘‘C with Classes,’’ was used for major projects in which its facilities for writing programs that use minimal time and space were severely tested
...
The ﬁrst
use of C++ outside a research organization started in July 1983
...
The name signiﬁes the evolutionary nature of the changes from C; ‘‘++’’ is the C increment operator
...
Connoisseurs
of C semantics ﬁnd C++ inferior to ++C
...
For yet another interpretation of the name
C++, see the appendix of [Orwell,1949]
...
Its main purpose was to make writing good
programs easier and more pleasant for the individual programmer
...
There was
no ‘‘C++ project’’ either, or a ‘‘C++ design committee
...

1
...
2
...
I used
macros to provide primitive parameterization
...
Late that year, I was

24

Notes to the Reader

Chapter 1

able to present a set of language facilities supporting a coherent set of programming styles; see
§1
...
1
...

In the terminology of the time, ‘‘a constructor creates the execution environment for the member
functions and the destructor reverses that
...
If there were other languages at the time that supported multiple constructors capable of executing general code, I didn’t (and don’t) know of them
...

C++ was released commercially in October 1985
...
1
...
2
...
2
...
5, §16
...
9), function overloading (§12
...
7), operator
overloading (§3
...
1
...
2
...
3
...
Of these
features, support for run-time polymorphism in the form of virtual functions was by far the most
controversial
...
Systems programmers tended to view indirect function
calls with suspicion, and people acquainted with other languages supporting object-oriented programming had a hard time believing that virtual functions could be fast enough to be useful in systems code
...
The resistance to virtual functions may be related to a resistance to
the idea that you can get better systems through more regular structure of code supported by a programming language
...
My view was (and is) that
we need every bit of help we can get from languages and tools: the inherent complexity of the systems we are trying to build is always at the edge of what we can express
...
In the early years,
the feedback from Stu Feldman, Alexander Fraser, Steve Johnson, Brian Kernighan, Doug McIlroy,
and Dennis Ritchie was invaluable
...
The most important of those were templates [Stroustrup,1988] and exception handling
[Koenig,1990], which were considered experimental at the time the standards effort started
...

At the time, nobody knew how to simultaneously get all three, and to compete with C-style code
for demanding systems applications, I felt that I had to choose the ﬁrst two properties
...
The design of exceptions focused on
multilevel propagation of exceptions, the passing of arbitrary information to an error handler, and
the integrations between exceptions and resource management by using local objects with destructors to represent and release resources (what I clumsily called ‘‘Resource Acquisition Is Initialization’’; §13
...

I generalized C++’s inheritance mechanisms to support multiple base classes [Stroustrup,1987a]
...
I
considered it far less important than templates or exceptions
...

Section 1
...
2
...
For example, I designed the complex [Stroustrup,1984], vector, stack, and (I/O) stream
[Stroustrup,1985] classes together with the operator overloading mechanisms
...
Jonathan’s
string and list classes were the ﬁrst to see extensive use as part of a library
...
The task library described in [Stroustrup,1987b] was part of the ﬁrst ‘‘C with Classes’’ program ever written in 1980
...
Unfortunately, we had to wait until 2011
(30 years!) to get concurrency support standardized and universally available (§1
...
4
...
3, Chapter 41)
...

C++ grew up in an environment with a multitude of established and experimental programming
languages (e
...
, Ada [Ichbiah,1979], Algol 68 [Woodward,1974], and ML [Paulson,1996])
...
However, the determining inﬂuences always came from
the applications I encountered
...

1
...
3 The 1998 Standard
The explosive growth of C++ use caused some changes
...
The result was a conscious effort to maintain contact
between implementers of C++ compilers and major users
...

AT&T Bell Labs made a major contribution to C++ and its wider community by allowing me to
share drafts of revised versions of the C++ reference manual with implementers and users
...
A less enlightened company
could have caused major problems of language fragmentation simply by doing nothing
...
Their names can be found in The Annotated C++ Reference Manual (‘‘the
ARM’’) [Ellis,1989]
...
In June 1991, this ANSI (American national) standardization of C++
became part of an ISO (international) standardization effort for C++ and named WG21
...
I served on these committees throughout
...
An initial draft standard for public review was produced in April 1995
...
A ‘‘bug
ﬁx release’’ of this standard was issued in 2003, so you sometimes hear people refer to C++03, but
that is essentially the same language as C++98
...
4
...
1 Language Features
By the time the ANSI and ISO standards efforts started, most major language features were in place
and documented in the ARM [Ellis,1989]
...
The template mechanisms, in particular, beneﬁted from much
detailed work
...
At the initiative of Dmitry Lenkov from Hewett-Packard, minimal facilities to use run-time type information (RTTI; Chapter 22) were introduced
...
I tried to get a facility
for optional conservative garbage collection accepted, but failed
...

Clearly, the 1998 language was far superior in features and in particular in the detail of speciﬁcation to the 1989 language
...
In addition to the
inevitable minor mistakes, two major features were added that in retrospect should not have been:
• Exception speciﬁcations provide run-time enforcement of which exceptions a function is
allowed to throw
...
Exception speciﬁcations turned out to be worse than useless for improving readability, reliability, and performance
...
The 2011 standard introduced noexcept (§13
...
1
...

• It was always obvious that separate compilation of templates and their uses would be ideal
[Stroustrup,1994]
...
After a long debate in the committee, a compromise was
reached and something called export templates were speciﬁed as part of the 1998 standard
...
We are still looking for a solution
...
Thus, export solved the wrong problem
...
3) may help by providing precise speciﬁcation of template requirements
...

1
...
3
...
4, §4
...
It was the work of Alex Stepanov (with Dave Musser, Meng Le, and others) based
on more than a decade’s work on generic programming
...
The STL has been massively inﬂuential
within the C++ community and beyond
...
I had failed to ship a sufﬁciently large foundation library with Release 1
...
0
...
4
...
2

The Standard Library

27

time the standards work started
...
g
...

The standard-library string (§4
...
The valarray library for numerical computation (§40
...
Jerry Schwarz transformed my streams library (§1
...
2
...
3, Chapter 38) using Andrew Koenig’s manipulator technique (§38
...
5
...
The
iostreams library was further reﬁned during standardization, where the bulk of the work was done
by Jerry Schwarz, Nathan Myers, and Norihiro Kumagai
...
For example, there is no standard
GUI, database access library, or Web application library
...
The reasons for that are practical and commercial, rather than technical
...

1
...
4 The 2011 Standard
The current C++, C++11, known for years as C++0x, is the work of the members of WG21
...
These processes probably led to a better (and more rigorous) speciﬁcation, but they also limited innovation
[Stroustrup,2007]
...
The second
ISO C++ standard (ISO/IEC 14882-2011) [C++,2011] was ratiﬁed by a 21-0 national vote in
August 2011
...
Consequently, serious work on
new language features did not start until 2002
...
In terms of pages of standards text, the language grew by
about 30% and the standard library by about 100%
...
Also, the work on a new C++ standard obviously had
to take great care not to compromise older code through incompatible changes
...

The overall aims for the C++11 effort were:
• Make C++ a better language for systems programming and library building
...

The aims are documented and detailed in [Stroustrup,2007]
...

This involved a memory model (§41
...
3),
which is primarily the work of Hans Boehm, Brian McKnight, and others
...
Pete Becker, Peter Dimov, Howard Hinnant, William Kempf, Anthony
Williams, and others did massive amounts of work on that
...
3
...
Concurrency is an
area where a complete and detailed listing of who did what and why would require a very long
paper
...

1
...
4
...
2
...
By ‘‘better’’ I mean easier to read, easier to write, more elegant, less error-prone, more maintainable, faster-running, consuming fewer resources, etc
...
3
...
6
...
6
...

• Deducing the type of an object from its initializer, auto: §2
...
2, §6
...
6
...

I ﬁrst designed and implemented auto in 1983 but had to remove it because of C compatibility problems
...
2
...
4, §12
...
6; Gabriel Dos Reis and Bjarne Stroustrup [DosReis,2010]
...
4
...

• Inheriting constructors: §20
...
5
...

• Lambda expressions, a way of implicitly deﬁning function objects at the point of their use in
an expression: §3
...
3, §11
...

• Move semantics, a way of transmitting information without copying: §3
...
2, §17
...
2;
Howard Hinnant
...
5
...
1; David Abrahams, Rani Sharoni, and Doug Gregor
...
2
...

• The range-for statement: §2
...
5, §9
...
1; Thorsten Ottosen and Bjarne Stroustrup
...
3
...
Alisdair Meredith, Chris Uzdavinis, and Ville
Voutilainen
...
In particular, a
way of deﬁning a template by binding some arguments of another template: §3
...
5, §23
...

• Typed and scoped enumerations: enum class: §8
...
1; David E
...

• Universal and uniform initialization (including arbitrary-length initializer lists and protection against narrowing): §2
...
2, §3
...
1
...
3
...
3
...
3
...

• Variadic templates, a mechanism for passing an arbitrary number of arguments of arbitrary
types to a template: §3
...
4, §28
...

Section 1
...
4
...
The technical reports to the
committee [WG21] and my C++11 FAQ [Stroustrup,2010a] give many of the names
...
The reason my name appears so often is (I
hope) not vanity, but simply that I chose to work on what I consider important
...
Their major role is to ﬂesh out the C++ feature set to better
support programming styles (§1
...
1)
...

Much work went into a proposal that did not make it into the standard
...
g
...
It was designed, speciﬁed, implemented, and tested, but by a large majority the committee
decided that the proposal was not yet ready
...
However, the committee decided against ‘‘concepts’’ on the grounds of complexity, difﬁculty of use, and compile-time performance [Stroustrup,2010b]
...
’’ This is
currently a ﬁeld of active research and design [Sutton,2011] [Stroustrup,2012a]
...
4
...
2 Standard Library
The work on what became the C++11 standard library started with a standards committee technical
report (‘‘TR1’’)
...

As for language features, I’ll only list a few standard-library components with references to the
text and the names of the individuals most closely associated with them
...
2
...
Some components, such as unordered_map (hash tables), were ones we simply didn’t
manage to ﬁnish in time for the C++98 standard
...
Boost is a volunteer organization
created to provide useful library components based on the STL [Boost]
...
4
...

• The basic concurrency library components, such as thread, mutex, and lock: §5
...
2; Pete
Becker, Peter Dimov, Howard Hinnant, William Kempf, Anthony Williams, and more
...
3
...
4
...

• The garbage collection interface: §34
...

• A regular expression library, regexp: §5
...

• A random number library: §5
...
3, §40
...
It was about time
...

Several utility components were tried out in Boost:
• A pointer for simply and efﬁciently passing resources, unique_ptr: §5
...
1, §34
...
1; Howard
E
...
This was originally called move_ptr and is what auto_ptr should have been had
we known how to do so for C++98
...
2
...
3
...
A
successor to the C++98 counted_ptr proposal from Greg Colvin
...
4
...
5, §34
...
4
...
They credit a
long list of contributors, including Doug Gregor, David Abrahams, and Jeremy Siek
...
5
...
His acknowledgments list a veritable who’s who
of Boost (including Doug Gregor, John Maddock, Dave Abrahams, and Jaakko Jarvi)
...
5
...
He credits William
Kempf and others with contributions
...
4
...
You don’t usually see it
...

C++ is used by millions of programmers in essentially every application domain
...
This massive use is supported by
half a dozen independent implementations, many thousands of libraries, hundreds of textbooks, and
dozens of websites
...

Early applications tended to have a strong systems programming ﬂavor
...
Many current ones (e
...
, Windows, Apple’s OS,
Linux, and most portable-device OSs) have key parts done in C++
...
I consider uncompromising low-level efﬁciency essential
for C++
...
In such code, predictability of performance
is at least as important as raw speed
...
C++
was designed so that every language feature is usable in code under severe time and space constraints (§1
...
4) [Stroustrup,1994,§4
...

Some of today’s most visible and widely used systems have their critical parts written in C++
...
Many other programming languages
and technologies depend critically on C++’s performance and reliability in their implementation
...
g
...
g
...
g
...
g
...
NET Web
services framework)
...

Most applications have sections of code that are critical for acceptable performance
...
For most code, maintainability, ease of extension, and ease of testing are key
...
Examples
are ﬁnancial systems, telecommunications, device control, and military applications
...
S
...
e
...
Many
such applications are large and long-lived
...
Multimillion-line C++ programs are common
...
4
...
Thus, games has been
another major applications area for C++
...
g
...
g
...
g
...
g
...

C++ wasn’t speciﬁcally designed with numerical computation in mind
...
A major reason for this is that traditional numerical work must often be combined with graphics and with computations relying on
data structures that don’t ﬁt into the traditional Fortran mold (e
...
, [Root,1995])
...

C++’s ability to be used effectively for applications that require work in a variety of application
areas is an important strength
...
Traditionally, such application
areas were considered distinct and were served by distinct technical communities using a variety of
programming languages
...
It is
designed so that C++ code can coexist with code written in other languages
...
Furthermore, no really major system is written 100% in a single language
...

Major applications are not written in just the raw language
...
There are many thousands of
C++ libraries, so keeping up with them all is impossible
...
5 Advice
Each chapter contains an ‘‘Advice’’ section with a set of concrete recommendations related to its
contents
...
A piece of advice
should be applied only where reasonable
...

I ﬁnd rules of the form ‘‘never do this’’ unhelpful
...
Negative suggestions tend not to be phrased as absolute prohibitions
and I try to suggest alternatives
...
The ‘‘Advice’’ sections do not contain explanations
...

For starters, here are a few high-level recommendations derived from the sections on design,
learning, and history of C++:

32

Notes to the Reader

Chapter 1

[1]

Represent ideas (concepts) directly in code, for example, as a function, a class, or an enumeration; §1
...

[2] Aim for your code to be both elegant and efﬁcient; §1
...

[3] Don’t overabstract; §1
...

[4] Focus design on the provision of elegant and efﬁcient abstractions, possibly presented as
libraries; §1
...

[5] Represent relationships among ideas directly in code, for example, through parameterization or a class hierarchy; §1
...
1
...
2
...

[7] C++ is not just object-oriented; §1
...
1
...
2
...

[9] Prefer solutions that can be statically checked; §1
...
1
...
2
...
4
...
1
...
2
...

[12] Use libraries, especially the standard library, rather than trying to build everything from
scratch; §1
...
1
...
2
...

[14] Low-level code is not necessarily efﬁcient; don’t avoid classes, templates, and standardlibrary components out of fear of performance problems; §1
...
4, §1
...
3
...
3
...

[16] C++ is not just C with a few extensions; §1
...
3
...
You are not going to get
it right the ﬁrst time
...
6 References
[Austern,2003]

[Barron,1963]
[Barton,1994]

[Berg,1995]
[Boehm,2008]
[Boost]
[Budge,1992]

Matt Austern et al
...
Software – Practice & Experience
...

November 2003
...
W
...
: The main features of CPL
...
6 (2):
134
...
comjnl
...
org/content/6/2/134
...
pdf+html
...
J
...
R
...
Addison-Wesley
...
1994
...

William Berg, Marshall Cline, and Mike Girou: Lessons Learned from the
OS/400 OO Project
...
Vol
...
10
...

Hans-J
...
Adve: Foundations of the C++ concurrency
memory model
...

The Boost library collection
...
boost
...

Kent Budge, J
...
Perry, and A
...
Robinson: High-Performance Scientiﬁc
Computation Using C++
...
USENIX C++ Conference
...
August 1992
...
6

[C,1990]

[C,1999]
[C,2011]
[C++,1998]
[C++Math,2010]
[C++,2011]
[Campbell,1987]
[Coplien,1995]
[Cox,2007]
[Czarnecki,2000]

[Dahl,1970]
[Dahl,1972]
[Dean,2004]

[Dechev,2010]

[DosReis,2006]
[DosReis,2010]

[DosReis,2011]

[Ellis,1989]
[Freeman,1992]
[Friedl,1997]:

References

33

X3 Secretariat: Standard – The C Language
...
ISO Standard
ISO/IEC 9899-1990
...
Washington, DC
...
Standard – The C Language
...

ISO/IEC 9899
...
X3J11/90-013-2011
...

ISO/IEC 14882:1998
...
ISO/IEC 29124:2010
...

ISO/IEC 14882:2011
...
: The Design of a Multiprocessor Operating System
...
USENIX C++ Conference
...
November 1987
...
Coplien: Curiously Recurring Template Patterns
...

February 1995
...
January
2007
...
com/˜rsc/regexp/regexp1
...

K
...
Eisenecker: Generative Programming: Methods, Tools,
and Applications
...
Reading, Massachusetts
...
ISBN
0-201-30977-7
...
Dahl, B
...
Nygaard: SIMULA Common Base Language
...
Oslo, Norway
...

O-J
...
A
...
Hoare: Hierarchical Program Construction in Structured Programming
...
New York
...

J
...
Ghemawat: MapReduce: Simpliﬁed Data Processing on Large
Clusters
...
2004
...
Dechev, P
...
Stroustrup: Understanding and Effectively
Preventing the ABA Problem in Descriptor-based Lock-free Designs
...
May 2010
...

POPL06
...

Gabriel Dos Reis and Bjarne Stroustrup: General Constant Expressions for
System Programming Languages
...
The 25th ACM Symposium
On Applied Computing
...

Gabriel Dos Reis and Bjarne Stroustrup: A Principled, Complete, and Efﬁcient Representation of C++
...

Vol
...
2011
...
Ellis and Bjarne Stroustrup: The Annotated C++ Reference
Manual
...
Reading, Mass
...
ISBN 0-201-51459-1
...
Prentice
Hall
...
1992
...

Jeffrey E
...
Friedl: Mastering Regular Expressions
...

Sebastopol, California
...
ISBN 978-1565922570
...
: Design Patterns: Elements of Reusable Object-Oriented
Software
...
Reading, Massachusetts
...
ISBN
0-201-63361-2
...
: Concepts: Linguistic Support for Generic Programming in C++
...

John L
...
Patterson: Computer Architecture, Fifth Edition: A Quantitative Approach
...
San Francisco, California
...
ISBN 978-0123838728
...
Ichbiah et al
...
SIGPLAN Notices
...
14, No
...
June 1979
...
Kamath, Ruth E
...
Smith: Reaping Beneﬁts
with Object-Oriented Technology
...
Vol
...
5
...

Brian W
...
Ritchie: The C Programming Language
...
Englewood Cliffs, New Jersey
...

Brian W
...
Ritchie: The C Programming Language,
Second Edition
...
Englewood Cliffs, New Jersey
...
ISBN
0-13-110362-8
...
Knuth: The Art of Computer Programming
...

Reading, Massachusetts
...

Andrew Koenig and Bjarne Stroustrup: C++: As close to C as possible – but
no closer
...
Vol
...
7
...

A
...
Koenig and B
...

Proc USENIX C++ Conference
...

Joseph C
...
NASA/TM-2002-211716
...
Addison-Wesley
...

ISBN 978-0201183955
...
McKenney: Is Parallel Programming Hard, And, If So, What Can
You Do About It? kernel
...
Corvallis, Oregon
...

http://kernel
...
html
...
Regex
...
boost
...
2009
...
Secker and Warburg
...
1949
...
Paulson: ML for the Working Programmer
...
Cambridge
...
ISBN 0-521-56543-X
...
Pirkelbauer, Y
...
Stroustrup: Design and Evaluation of
C++ Open Multi-Methods
...
Elsevier
Journal
...
doi:10
...
scico
...
06
...

Martin Richards and Colin Whitby-Strevens: BCPL – The Language and Its
Compiler
...
Cambridge
...
ISBN
0-521-21965-5
...
root
...
ch
...
6

[Rozier,1988]
[Siek,2000]

[Solodkyy,2012]
[Stepanov,1994]
[Stewart,1998]
[Stroustrup,1982]

[Stroustrup,1984]

[Stroustrup,1985]
[Stroustrup,1986]
[Stroustrup,1987]
[Stroustrup,1987b]

[Stroustrup,1988]
[Stroustrup,1991]
[Stroustrup,1993]

[Stroustrup,1994]
[Stroustrup,1997]

[Stroustrup,2002]

[Stroustrup,2007]

References

35

Web address
...
Rozier et al
...
Computing Systems
...
1, No
...
Fall 1988
...
Siek and Andrew Lumsdaine: Concept checking: Binding parametric polymorphism in C++
...
First Workshop on C++ Template Programming
...
2000
...
Solodkyy, G
...
Stroustrup: Open and Efﬁcient Type Switch
for C++
...
OOPSLA’12
...
HP
Labs Technical Report HPL-94-34 (R
...
1994
...
W
...
Basic Decompositions
...

Philadelphia, Pennsylvania
...

B
...

Sigplan Notices
...
The ﬁrst public description of ‘‘C with
Classes
...
Stroustrup: Operator Overloading in C++
...
IFIP WG2
...

September 1984
...
Stroustrup: An Extensible I/O Facility for C++
...
Summer 1985
USENIX Conference
...
Stroustrup: The C++ Programming Language
...
Reading, Massachusetts
...
ISBN 0-201-12078-X
...
Stroustrup: Multiple Inheritance for C++
...
EUUG Spring Conference
...

B
...
Shopiro: A Set of C Classes for Co-Routine Style Programming
...
USENIX C++ Conference
...

November 1987
...
Stroustrup: Parameterized Types for C++
...
USENIX C++ Conference, Denver
...

B
...
Addison-Wesley
...
1991
...

B
...
Proc
...
ACM Sigplan Notices
...
1993
...
Stroustrup: The Design and Evolution of C++
...
Reading, Mass
...
ISBN 0-201-54330-3
...
Stroustrup: The C++ Programming Language, Third Edition
...
Reading, Massachusetts
...
ISBN 0-201-88954-4
...
2000
...

B
...
The C/C++ Users Journal
...
www
...
com/papers
...

B
...
ACM HOPL-III
...

36

Notes to the Reader

[Stroustrup,2008]

Chapter 1

B
...
Addison-Wesley
...
ISBN 0-321-54372-6
...
Stroustrup: The C++11 FAQ
...
stroustrup
...
html
...
Stroustrup: The C++0x ‘‘Remove Concepts’’ Decision
...
Dobb’s Journal
...

[Stroustrup,2012a] B
...
Sutton: A Concept Design for the STL
...
January 2012
...
Stroustrup: Software Development for Infrastructure
...
January
2012
...
1109/MC
...
353
...
Sutton and B
...
Proc
...

July 2011
...
Tanenbaum: Modern Operating Systems, Third Edition
...
Upper Saddle River, New Jersey
...
ISBN 0-13-600663-9
...
: Minimizing Dependencies within Generic Classes for
Faster and Smaller Programs
...
October 2009
...
0
...
Reading, Massachusetts
...
ISBN 0-201-48345-9
...
Research Version,
Tenth Edition
...
February
1985
...
Josuttis: C++ Templates: The Complete
Guide
...
2002
...

[Veldhuizen,1995]
Todd Veldhuizen: Expression Templates
...
June 1995
...
Veldhuizen: C++ Templates are Turing Complete
...
2003
...
ACM Transactions on Mathematical Software, Vol
...
1
...

[WG21]
ISO SC22/WG21 The C++ Programming Language Standards Committee:
Document Archive
...
open-std
...

[Williams,2012]
Anthony Williams: C++ Concurrency in Action – Practical Multithreading
...
ISBN 978-1933988771
...
Wilson and Paul Lu (editors): Parallel Programming Using C++
...
Cambridge, Mass
...
ISBN 0-262-73118-5
...
Addison-Wesley
...
1999
...

[Woodward,1974]
P
...
Woodward and S
...
Bond: Algol 68-R Users Guide
...
London
...

2
A Tour of C++: The Basics
The ﬁrst thing we do, let’s
kill all the language lawyers
...
1 Introduction
The aim of this chapter and the next three is to give you an idea of what C++ is, without going into
a lot of details
...
These are the language facilities supporting the styles most often seen in C and sometimes called procedural programming
...
Chapter 4 and
Chapter 5 give examples of standard-library facilities
...
If not, please consider reading a textbook, such as Programming: Principles and Practice Using C++ [Stroustrup,2009], before continuing here
...
If you ﬁnd this ‘‘lightning tour’’
confusing, skip to the more systematic presentation starting in Chapter 6
...
For example, loops are not
discussed in detail until Chapter 10, but they will be used in obvious ways long before that
...

As an analogy, think of a short sightseeing tour of a city, such as Copenhagen or New York
...
You do not know the city after such a
tour
...
To really know a city, you have to live in
it, often for years
...
After the tour, the
real exploration can begin
...
Consequently, it
does not identify language features as present in C, part of C++98, or new in C++11
...
4 and Chapter 44
...
2 The Basics
C++ is a compiled language
...
A
C++ program typically consists of many source code ﬁles (usually simply called source ﬁles)
...
When we talk about portability of C++ programs, we usually
mean portability of source code; that is, the source code can be successfully compiled and run on a
variety of systems
...
g
...
g
...
g
...
g
...
That is, the C++ standard library can be implemented in C++ itself (and is with very
minor uses of machine code for things such as thread context switching)
...

C++ is a statically typed language
...
g
...
The type of an object determines
the set of operations applicable to it
...
2
...
2
...
4)
...
Here, they indicate the start and end of the function
body
...
A comment is for
the human reader; the compiler ignores comments
...
The program starts
by executing that function
...
’’ If no value is returned, the system will receive a value indicating successful completion
...
Not every operating system and execution
environment make use of that return value: Linux/Unix-based environments often do, but Windows-based environments rarely do
...
Here is a program that writes Hello, World!:
#include
int main()
{
std::cout << "Hello, World!\n";
}

The line #include instructs the compiler to include the declarations of the standard
stream I/O facilities as found in iostream
...
The operator << (‘‘put to’’) writes its second argument onto its ﬁrst
...
A string
literal is a sequence of characters surrounded by double quotes
...
’’ In this case, \n is the
newline character, so that the characters written are Hello, World! followed by a newline
...
4
...
I usually leave out the std:: when discussing standard features; §2
...
2 shows how to
make names from a namespace visible without explicit qualiﬁcation
...
For example:
#include
using namespace std;
double square(double x)
{
return x∗x;
}

// make names from std visible without std:: (§2
...
2)
// square a double precision ﬂoating-point number

40

A Tour of C++: The Basics

Chapter 2

void print_square(double x)
{
cout << "the square of " << x << " is " << square(x) << "\n";
}
int main()
{
print_square(1
...
234 is 1
...

2
...
2 Types, Variables, and Arithmetic
Every name and every expression has a type that determines the operations that may be performed
on it
...

A declaration is a statement that introduces a name into the program
...

• An object is some memory that holds a value of some type
...

• A variable is a named object
...
For example:
bool
char
int
double

// Boolean, possible values are true and false
// character, for example, 'a', ' z', and '9'
// integer, for example, 1, 42, and 1066
// double-precision ﬂoating-point number, for example, 3
...
0

Each fundamental type corresponds directly to hardware facilities and has a ﬁxed size that determines the range of values that can be stored in it:
bool:
char:
int:
double:

A char variable is of the natural size to hold a character on a given machine (typically an 8-bit
byte), and the sizes of other types are quoted in multiples of the size of a char
...
e
...

The arithmetic operators can be used for appropriate combinations of these types:

Section 2
...
2

x+y
+x
x−y
−x
x∗y
x/y
x%y

Types, Variables, and Arithmetic

41

// plus
// unar y plus
// minus
// unar y minus
// multiply
// divide
// remainder (modulus) for integers

So can the comparison operators:
x==y
x!=y
xx>y
x<=y
x>=y

// equal
// not equal
// less than
// greater than
// less than or equal
// greater than or equal

In assignments and in arithmetic operations, C++ performs all meaningful conversions (§10
...
3)
between the basic types so that they can be mixed freely:
void some_function()
{
double d = 2
...

C++ offers a variety of notations for expressing initialization, such as the
universal form based on curly-brace-delimited initializer lists:

=

used above, and a

double d1 = 2
...
3};
complex z = 1;
complex z2 {d1,d2};
complex z3 = {1,2};

// a complex number with double-precision ﬂoating-point scalars

vector v {1,2,3,4,5,6};

// a vector of ints

// the = is optional with {
...
3
...
2)
...
5):
int i1 = 7
...
2};
int i3 = {7
...
2
...
Don’t introduce a name until you have a suitable value for it
...
2
...
1)
...
2;
auto z = sqrt(y);

// a bool
// a char
// an int
// a double
// z has the type of whatever sqr t(y) returns

With auto, we use the = syntax because there is no type conversion involved that might cause problems (§6
...
6
...

We use auto where we don’t have a speciﬁc reason to mention the type explicitly
...

• We want to be explicit about a variable’s range or precision (e
...
, double rather than ﬂoat)
...
This is especially important in
generic programming where the exact type of an object can be hard for the programmer to know
and the type names can be quite long (§4
...
1)
...
3), C++ offers more speciﬁc operations for modifying a variable:
x+=y
++x
x−=y
−−x
x∗=y
x/=y
x%=y

// x = x+y
// increment: x = x+1
// x = x-y
// decrement: x = x-1
// scaling: x = x*y
// scaling: x = x/y
// x = x%y

These operators are concise, convenient, and very frequently used
...
2
...
5):
• const: meaning roughly ‘‘I promise not to change this value’’ (§7
...
This is used primarily
to specify interfaces, so that data can be passed to functions without fear of it being modiﬁed
...

• constexpr: meaning roughly ‘‘to be evaluated at compile time’’ (§10
...
This is used primarily to specify constants, to allow placement of data in memory where it is unlikely to be corrupted, and for performance
...
4∗square(dmv);
constexpr double max2 = 1
...
4∗square(var);

// dmv is a named constant
// var is not a constant
// OK if square(17) is a constant expression
// error : var is not a constant expression
// OK, may be evaluated at run time

Section 2
...
3

double sum(const vector&);
vector v {1
...
4, 4
...
2
...
For example:
constexpr double square(double x) { return x∗x; }

To be

constexpr, a function must be rather simple: just a return-statement computing a value
...
We allow a constexpr function to be called with non-constant-expression argu-

ments in contexts that do not require constant expressions, so that we don’t have to deﬁne essentially the same function twice: once for constant expressions and once for variables
...
g
...
2
...
3), case labels (§2
...
4, §9
...
2), some template arguments (§25
...
In other cases, compile-time evaluation is important for performance
...
4)
...
2
...
For example,
here is a simple function that prompts the user and returns a Boolean indicating the response:
bool accept()
{
cout << "Do you want to proceed (y or n)?\n";
char answer = 0;
cin >> answer;

// write question

// read answer

if (answer == 'y') return true;
return false;
}

To match the << output operator (‘‘put to’’), the >> operator (‘‘get from’’) is used for input; cin is
the standard input stream
...
The \n character at the end
of the output string represents a newline (§2
...
1)
...
\n";
return false;
}
}

A switch-statement tests a value against a set of constants
...
If no default is provided, no
action is taken if the value doesn’t match any case constant
...
For example, we might like to give the user a few tries
to produce acceptable input:
bool accept3()
{
int tries = 1;
while (tries<4) {
cout << "Do you want to proceed (y or n)?\n";
char answer = 0;
cin >> answer;

// write question
// read answer

switch (answer) {
case 'y':
return true;
case 'n':
return false;
default:
cout << "Sorry, I don't understand that
...
\n";
return false;
}

The while-statement executes until its condition becomes false
...
2
...
’’ All arrays have

0

as their lower

Section 2
...
5

Pointers, Arrays, and Loops

45

bound, so v has six elements, v[0] to v[5]
...
2
...
A pointer variable can hold the address of an object of the appropriate type:
char∗ p = &v[3];
char x = ∗p;

// p points to v’s four th element
// *p is the object that p points to

In an expression, preﬁx unary ∗ means ‘‘contents of’’ and preﬁx unary & means ‘‘address of
...

}

This for-statement can be read as ‘‘set i to zero; while i is not 10, copy the ith element and increment
i
...
C++ also offers
a simpler for-statement, called a range-for-statement, for loops that traverse a sequence in the simplest way:
void print()
{
int v[] = {0,1,2,3,4,5,6,7,8,9};
for (auto x : v)
cout << x << '\n';

// for each x in v

for (auto x : {10,21,32,43,54,65})
cout << x << '\n';
//
...
’’ Note that we don’t have to specify an array bound when we initialize it
with a list
...
4
...

If we didn’t want to copy the values from v into the variable x, but rather just have x refer to an
element, we could write:

46

A Tour of C++: The Basics

Chapter 2

void increment()
{
int v[] = {0,1,2,3,4,5,6,7,8,9};
for (auto& x : v)
++x;
//
...
’’ A reference is similar to a pointer,
except that you don’t need to use a preﬁx ∗ to access the value referred to by the reference
...
When used in declarations, operators (such as &, ∗, and []) are called declarator operators:
T a[n];
T∗ p;
T& r;
T f(A);

// T[n]: array of n Ts (§7
...
2)
// T&: reference to T (§7
...
2
...
When
we don’t have an object to point to or if we need to represent the notion of ‘‘no object available’’
(e
...
, for an end of a list), we give the pointer the value nullptr (‘‘the null pointer’’)
...

The deﬁnition of count_x() assumes that the char∗ is a C-style string, that is, that the pointer
points to a zero-terminated array of char
...
2
...
However, using nullptr
eliminates potential confusion between integers (such as 0 or NULL) and pointers (such as nullptr)
...
3

User-Deﬁned Types

47

2
...
2
...
2
...
2
...
C++’s set of built-in types and operations is
rich, but deliberately low-level
...
However, they don’t provide the programmer with high-level facilities to conveniently write advanced applications
...
The C++ abstraction mechanisms are primarily designed to let programmers design
and implement their own types, with suitable representations and operations, and for programmers
to simply and elegantly use such types
...
They are referred to as classes and enumerations
...
The rest of
this chapter presents the simplest and most fundamental facilities for that
...

Chapter 4 and Chapter 5 present an overview of the standard library, and since the standard library
mainly consists of user-deﬁned types, they provide examples of what can be built using the language facilities and programming techniques presented in Chapter 2 and Chapter 3
...
3
...

A variable of type Vector can be deﬁned like this:
Vector v;

However, by itself that is not of much use because v’s elem pointer doesn’t point to anything
...
For example, we can construct a Vector like this:
void vector_init(Vector& v, int s)
{
v
...
sz = s;
}

That is, v’s elem member gets a pointer produced by the new operator and v’s size member gets the
number of elements
...
2
...
7); that way, vector_init() can modify the vector passed to it
...
2)
...
elem[i];
// read into elements
double sum = 0;
for (int i=0; i!=s; ++i)
sum+=v
...

In particular, a user of Vector has to know every detail of Vector’s representation
...

Chapter 4 presents the standard-library vector, which contains many nice improvements, and Chapter 31 presents the complete vector in the context of other standard-library facilities
...

Don’t reinvent standard-library components, such as vector and string; use them
...
(dot) to access struct members through a name (and through a reference) and −> to
access struct members through a pointer
...
sz;
// access through name
int i2 = rv
...
3
...
However, a tighter connection between the representation and the
operations is needed for a user-deﬁned type to have all the properties expected of a ‘‘real type
...
To do that we have
to distinguish between the interface to a type (to be used by all) and its implementation (which has
access to the otherwise inaccessible data)
...
A
class is deﬁned to have a set of members, which can be data, function, or type members
...
For example:

Section 2
...
2

Classes

class Vector {
public:
Vector(int s) :elem{new double[s]}, sz{s} { }
double& operator[](int i) { return elem[i]; }
int size() { return sz; }
private:
double∗ elem; // pointer to the elements
int sz;
// the number of elements
};

49

// construct a Vector
// element access: subscripting

Given that, we can deﬁne a variable of our new type Vector:
Vector v(6);

// a Vector with 6 elements

We can illustrate a Vector object graphically:
Vector:
elem:
sz:

0:

1:

2:

3:

4:

5:

6

Basically, the Vector object is a ‘‘handle’’ containing a pointer to the elements (elem) plus the number of elements (sz)
...
2
...
3)
...
This is the basic technique for
handling varying amounts of information in C++: a ﬁxed-size handle referring to a variable amount
of data ‘‘elsewhere’’ (e
...
, on the free store allocated by new; §11
...
How to design and use such
objects is the main topic of Chapter 3
...
The read_and_sum()
example from §2
...
1 simpliﬁes to:
double read_and_sum(int s)
{
Vector v(s);
for (int i=0; i!=v
...
size(); ++i)
sum+=v[i];
return sum;

// make a vector of s elements
// read into elements

// take the sum of the elements

}

A ‘‘function’’ with the same name as its class is called a constructor, that is, a function used to construct objects of a class
...
3
...
Unlike an
ordinary function, a constructor is guaranteed to be used to initialize objects of its class
...

50

A Tour of C++: The Basics

Chapter 2

Vector(int) deﬁnes how objects of type Vector are constructed
...
That integer is used as the number of elements
...
Then, we initialize sz to s
...
It returns a reference
to the appropriate element (a double&)
...

Obviously, error handling is completely missing, but we’ll return to that in §2
...
3
...
2
...
2
shows how to use a destructor to elegantly do that
...
3
...
g
...
For example, Color::red is Color’s red
which is different from Trafﬁc_light::red
...
They are used to make code
more readable and less error-prone than it would have been had the symbolic (and mnemonic) enumerator names not been used
...
Being separate types, enum classes help prevent accidental misuses of constants
...
4
...

By default, an enum class has only assignment, initialization, and comparisons (e
...
, == and <;
§2
...
2) deﬁned
...
3
...
4
...

2
...
2
...
3, §3
...
2
...
4, Chapter 23)
...
The ﬁrst and most important step is to distinguish between the interface to a part and
its implementation
...
A declaration speciﬁes all that’s needed to use a function or a type
...
’’ For this
example, we might like for the representation of Vector to be ‘‘elsewhere’’ also, but we will deal
with that later (abstract types; §3
...
2)
...
algorithm as found in math textbook
...
However,
that makes no real difference: a library is simply some ‘‘other code we happen to use’’ written with
the same language facilities as we use
...
4
...
The deﬁnitions of those types and functions are in separate source ﬁles and compiled separately
...
Such separation can be used to minimize compilation times and to strictly enforce separation of logically distinct parts of a program (thus minimizing the chance of errors)
...
g
...

Typically, we place the declarations that specify the interface to a module in a ﬁle with a name
indicating its intended use
...
h:
class Vector {
public:
Vector(int s);
double& operator[](int i);
int size();
private:
double∗ elem;
// elem points to an array of sz doubles
int sz;
};

This declaration would be placed in a ﬁle
ﬁle, to access that interface
...
h,

and users will include that ﬁle, called a header

// user
...
h"
#include
using namespace std;

// get Vector’s interface
// get the the standard-librar y math function interface including sqrt()
// make std members visible (§2
...
2)

Section 2
...
1

Separate Compilation

double sqrt_sum(Vector& v)
{
double sum = 0;
for (int i=0; i!=v
...
h ﬁle providing its interface:

...
cpp:
#include "Vector
...
cpp and Vector
...
h,
but the two ﬁles are otherwise independent and can be separately compiled
...
h:
Vector

user
...
h"
use Vector

interface
Vector
...
h"
deﬁne Vector

Strictly speaking, using separate compilation isn’t a language issue; it is an issue of how best to
take advantage of a particular language implementation
...
The best approach is to maximize modularity, represent that modularity logically through
language features, and then exploit the modularity physically through ﬁles for effective separate
compilation (Chapter 14, Chapter 15)
...
4
...
2
...
3
...
4),
C++ offers namespaces (Chapter 14) as a mechanism for expressing that some declarations belong
together and that their names shouldn’t clash with other names
...
2
...
1, §18
...
4):
namespace My_code {
class complex { /*
...

int main();
}
int My_code::main()
{
complex z {1,2};
auto z2 = sqrt(z);
std::cout << '{' << z2
...
imag() << "}\n";
//
...
1
...
The precaution is wise, because the standard
library does provide support for complex arithmetic (§3
...
1
...
4)
...
g
...
The ‘‘real main()’’ is deﬁned in the global namespace,
that is, not local to a deﬁned namespace, class, or function
...
2
...
They
simplify the composition of a program out of separately developed parts
...
4
...
However, C++ provides a few features to
help
...
Instead of painstakingly building up our applications
from the built-in types (e
...
, char, int, and double) and statements (e
...
, if, while, and for), we build
more types that are appropriate for our applications (e
...
, string, map, and regex) and algorithms
(e
...
, sort(), ﬁnd_if(), and draw_all())
...
g
...
4
...
The majority of C++ constructs are
dedicated to the design and implementation of elegant and efﬁcient abstractions (e
...
, user-deﬁned
types and algorithms using them)
...
As programs grow, and especially when libraries are used extensively,
standards for handling errors become important
...
4
...
1 Exceptions
Consider again the Vector example
...
3
...

• The user of Vector cannot consistently detect the problem (if the user could, the out-of-range
access wouldn’t happen in the ﬁrst place)
...
The user can then take appropriate action
...
To do that, the implementation will unwind the
function call stack as needed to get back to the context of that caller (§13
...
1)
...

try { // exceptions here are handled by the handler deﬁned below
v[v
...
handle range error
...

}

We put code for which we are interested in handling exceptions into a try-block
...
size()] will fail
...
The out_of_range type is deﬁned in the standard library and is in fact used by some
standard-library container access functions
...
See Chapter 13 for further discussion, details, and examples
...
4
...
2 Invariants
The use of exceptions to signal out-of-range access is an example of a function checking its argument and refusing to act because a basic assumption, a precondition, didn’t hold
...
Whenever we deﬁne a
function, we should consider what its preconditions are and if feasible test them (see §12
...
4)
...
In particular, we did say ‘‘elem points to
an array of sz doubles’’ but we only said that in a comment
...
It is the job of a constructor
to establish the invariant for its class (so that the member functions can rely on it) and for the member functions to make sure that the invariant holds when they exit
...
It properly initialized the Vector members, but it failed to check
that the arguments passed to it made sense
...

Here is a more appropriate deﬁnition:
Vector::Vector(int s)
{
if (s<0) throw length_error{};
elem = new double[s];
sz = s;
}

I use the standard-library exception length_error to report a non-positive number of elements
because some standard-library operations use that exception to report problems of this kind
...
We can now write:
void test()
{
try {
Vector v(−27);
}
catch (std::length_error) {
// handle negative size
}
catch (std::bad_alloc) {
// handle memory exhaustion
}
}

You can deﬁne your own classes to be used as exceptions and have them carry arbitrary information
from a point where an error is detected to a point where it can be handled (§13
...

Often, a function has no way of completing its assigned task after an exception is thrown
...
5
...
1)
...
4
...
2

Invariants

57

The notion of invariants is central to the design of classes, and preconditions serve a similar role
in the design of functions
...

The notion of invariants underlies C++’s notions of resource management supported by constructors (§2
...
2) and destructors (§3
...
1
...
2)
...
4, §16
...
1, and §17
...

2
...
3
...
If an error can be found at compile time, it is usually
preferable to do so
...
However, we can also perform simple checks on other properties that are known at compile time and report failures as compiler error messages
...
We call such statements of expectations assertions
...
2
...
4)
...
458;

// km/s

void f(double speed)
{
const double local_max = 160
...
0/(60*60) km/s

static_assert(speedstatic_assert(local_max
// error : speed must be a constant
// OK

//
...

The most important uses of static_assert come when we make assertions about types used as
parameters in generic programming (§5
...
2, §24
...

For runtime-checked assertions, see §13
...

2
...

Those are the parts of C++ that underlie all programming techniques and styles supported by C++
...

58

A Tour of C++: The Basics

2
...
1
...
3
...

Focus on programming techniques, not on language features; §2
...

Chapter 2

3
A Tour of C++: Abstraction Mechanisms
Don’t Panic!
– Douglas Adams

•
•
•
•
•

Introduction
Classes
Concrete Types; Abstract Types; Virtual Functions; Class Hierarchies
Copy and Move
Copying Containers; Moving Containers; Resource Management; Suppressing Operations
Templates
Parameterized Types; Function Templates; Function Objects; Variadic Templates; Aliases
Advice

3
...
It informally presents ways of deﬁning and using new types
(user-deﬁned types)
...
Templates are
introduced as a mechanism for parameterizing types and algorithms with (other) types and algorithms
...
These are the language facilities supporting
the programming styles known as object-oriented programming and generic programming
...

The assumption is that you have programmed before
...
Even if you have programmed before, the language you used or the applications you
wrote may be very different from the style of C++ presented here
...

60

A Tour of C++: Abstraction Mechanisms

Chapter 3

As in Chapter 2, this tour presents C++ as an integrated whole, rather than as a layer cake
...
Such historical information can be found in §1
...

3
...
A class is a user-deﬁned type provided to represent a concept in the code of a program
...
, we try to represent it as a class in the program so that the idea is there in the code,
rather than just in our head, in a design document, or in some comments
...
In particular, classes are often what libraries offer
...
By ‘‘better,’’ I mean more correct,
easier to maintain, more efﬁcient, more elegant, easier to use, easier to read, and easier to reason
about
...
The needs and tastes of programmers vary immensely
...
Here, we will just consider the basic support for three important kinds of
classes:
• Concrete classes (§3
...
1)
• Abstract classes (§3
...
2)
• Classes in class hierarchies (§3
...
4)
An astounding number of useful classes turn out to be of these three kinds
...

3
...
1 Concrete Types
The basic idea of concrete classes is that they behave ‘‘just like built-in types
...
Similarly, a vector and a string are much
like built-in arrays, except that they are better behaved (§4
...
3
...
4
...

The deﬁning characteristic of a concrete type is that its representation is part of its deﬁnition
...
That allows implementations to be optimally efﬁcient in time and space
...
4
...
g
...
3
...
3)
...
3
...
Therefore, if the representation changes in any signiﬁcant way, a
user must recompile
...
2
...
For types that don’t change often, and where local variables provide much-needed clarity
and efﬁciency, this is acceptable and often ideal
...
That’s the way vector and string are implemented; they can
be considered resource handles with carefully crafted interfaces
...
2
...
1 An Arithmetic Type
The ‘‘classical user-deﬁned arithmetic type’’ is complex:
class complex {
double re, im; // representation: two doubles
public:
complex(double r, double i) :re{r}, im{i} {}
complex(double r) :re{r}, im{0} {}
complex() :re{0}, im{0} {}

// construct complex from two scalars
// construct complex from one scalar
// default complex: {0,0}

double real() const { return re; }
void real(double d) { re=d; }
double imag() const { return im; }
void imag(double d) { im=d; }
complex& operator+=(complex z) { re+=z
...
im; return ∗this; }

// add to re and im
// and return the result

complex& operator−=(complex z) { re−=z
...
im; return ∗this; }
complex& operator∗=(complex);
complex& operator/=(complex);

// deﬁned out-of-class somewhere
// deﬁned out-of-class somewhere

};

This is a slightly simpliﬁed version of the standard-library complex (§40
...
The class deﬁnition
itself contains only the operations requiring access to the representation
...
For practical reasons, it has to be compatible with what Fortran provided 50
years ago, and we need a conventional set of operators
...
This implies that simple operations must be inlined
...
Functions deﬁned in a class are inlined by default
...

A constructor that can be invoked without an argument is called a default constructor
...
By deﬁning a default constructor you eliminate the possibility of uninitialized variables of that type
...

Many useful operations do not require direct access to the representation of complex, so they
can be deﬁned separately from the class deﬁnition:

62

A Tour of C++: Abstraction Mechanisms

complex operator+(complex a, complex b) { return a+=b; }
complex operator−(complex a, complex b) { return a−=b; }
complex operator−(complex a) { return {−a
...
imag()}; }
complex operator∗(complex a, complex b) { return a∗=b; }
complex operator/(complex a, complex b) { return a/=b; }

Chapter 3

// unar y minus

Here, I use the fact that an argument passed by value is copied, so that I can modify an argument
without affecting the caller’s copy, and use the result as the return value
...
real()==b
...
imag()==b
...

Class complex can be used like this:
void f(complex z)
{
complex a {2
...
3,0
...
3
complex b {1/a};
complex c {a+z∗complex{1,2
...

if (c != b)
c = −(b/a)+2∗b;
}

The compiler converts operators involving complex numbers into appropriate function calls
...

User-deﬁned operators (‘‘overloaded operators’’) should be used cautiously and conventionally
...
Also, it is not possible to change
the meaning of an operator for built-in types, so you can’t redeﬁne + to subtract ints
...
2
...
2 A Container
A container is an object holding a collection of elements, so we call Vector a container because it is
the type of objects that are containers
...
3
...
4
...
2), provides rangechecked access (§2
...
3
...
However, it
does have a fatal ﬂaw: it allocates elements using new but never deallocates them
...
5), it is not

Section 3
...
1
...
In some environments you can’t use a collector, and sometimes you prefer more precise control of destruction
(§13
...
4) for logical or performance reasons
...
Vector’s constructor allocates some memory on the free store (also
called the heap or dynamic store) using the new operator
...
This is all done without intervention by users of Vector
...
For example:
void fct(int n)
{
Vector v(n);
//
...

{
Vector v2(2∗n);
//
...

} // v2 is destroyed here
//
...

} // v is destroyed here

obeys the same rules for naming, scope, allocation, lifetime, etc
...
For details on how to control the lifetime of an object, see §6
...
This Vector
has been simpliﬁed by leaving out error handling; see §2
...
3
...
In particular, it
is the basis for most C++ general resource management techniques (§5
...
3)
...
The destructor deallocates the elements
...
The technique of acquiring resources in a
constructor and releasing them in a destructor, known as Resource Acquisition Is Initialization or
RAII, allows us to eliminate ‘‘naked new operations,’’ that is, to avoid allocations in general code
and keep them buried inside the implementation of well-behaved abstractions
...
Avoiding naked new and naked delete makes code far less
error-prone and far easier to keep free of resource leaks (§5
...

3
...
1
...
We can handle that by creating a Vector with an appropriate number of elements and
then assigning to them, but typically other ways are more elegant
...

• push_back(): Add a new element at the end (at the back of) the sequence
...

void push_back(double);
//
...
For example:
Vector read(istream& is)
{
Vector v;
for (double d; is>>d;)
v
...
Until that happens, each number read is added to the Vector so that at the end, v’s size is the number of elements read
...
The implementation of push_back() is discussed in §13
...
4
...
The way to provide Vector with
a move constructor, so that returning a potentially huge amount of data from read() is cheap, is
explained in §3
...
2
...
2
...
3

Initializing Containers

65

The std::initializer_list used to deﬁne the initializer-list constructor is a standard-library type
known to the compiler: when we use a {}-list, such as {1,2,3,4}, the compiler will create an object of
type initializer_list to give to the program
...
23, 3
...
7, 8};
Vector’s

// v1 has 5 elements
// v2 has 4 elements

initializer-list constructor might be deﬁned like this:

Vector::Vector(std::initializer_list lst)
// initialize with a list
:elem{new double[lst
...
size()}
{
copy(lst
...
end(),elem);
// copy from lst into elem
}

3
...
2 Abstract Types
Types such as complex and Vector are called concrete types because their representation is part of
their deﬁnition
...
In contrast, an abstract type is a type that
completely insulates a user from implementation details
...
Since we don’t know anything about
the representation of an abstract type (not even its size), we must allocate objects on the free store
(§3
...
1
...
2) and access them through references or pointers (§2
...
5, §7
...
7)
...
2
...
1)
// destructor (§3
...
1
...
The word virtual means ‘‘may be
redeﬁned later in a class derived from this one
...
A class derived from Container provides an implementation for the Container interface
...
Thus, it is not possible to deﬁne an object that is just a
Container; a Container can only serve as the interface to a class that implements its operator[]() and
size() functions
...

This Container can be used like this:
void use(Container& c)
{
const int sz = c
...
It
uses size() and [] without any idea of exactly which type provides their implementation
...
3
...

As is common for abstract classes, Container does not have a constructor
...
On the other hand, Container does have a destructor and that destructor
is virtual
...
2
...

A container that implements the functions required by the interface deﬁned by the abstract class
Container could use the concrete class Vector:
class Vector_container : public Container { // Vector_container implements Container
Vector v;
public:
Vector_container(int s) : v(s) { }
// Vector of s elements
˜Vector_container() {}
double& operator[](int i) { return v[i]; }
int size() const { return v
...
’’ Class Vector_container is said to
be derived from class Container, and class Container is said to be a base of class Vector_container
...
The derived class is said to inherit members from its base class, so the use of base and
derived classes is commonly referred to as inheritance
...
3
...
The destructor (˜Vector_container()) overrides the base class destructor
(˜Container())
...

For a function like use(Container&) to use a Container in complete ignorance of implementation
details, some other function will have to make an object on which it can operate
...
For example:
class List_container : public Container { // List_container implements Container
std::list ld;
// (standard-librar y) list of doubles (§4
...
2)
public:
List_container() { }
// empty List
List_container(initializer_list il) : ld{il} { }
˜List_container() {}

Section 3
...
2

Abstract Types

67

double& operator[](int i);
int size() const { return ld
...
Usually, I would not implement a container with a subscript operation using a list, because performance of list subscripting is atrocious
compared to vector subscripting
...

A function can create a List_container and have use() use it:
void h()
{
List_container lc = { 1, 2, 3, 4, 5, 6, 7, 8, 9 };
use(lc);
}

The point is that use(Container&) has no idea if its argument is a Vector_container, a List_container,
or some other kind of container; it doesn’t need to know
...
It knows
only the interface deﬁned by Container
...

The ﬂip side of this ﬂexibility is that objects must be manipulated through pointers or references
(§3
...
4)
...
2
...
size();
for (int i=0; i!=sz; ++i)
cout << c[i] << '\n';
}

How is the call c[i] in use() resolved to the right operator[]()? When h() calls use(), List_container’s
operator[]() must be called
...
To
achieve this resolution, a Container object must contain information to allow it to select the right
function to call at run time
...
That table is usually

68

A Tour of C++: Abstraction Mechanisms

Chapter 3

called the virtual function table or simply the vtbl
...
This can be represented graphically like this:
vtbl:
Vector_container:
Vector_container::operator[]()
v
Vector_container::size()
Vector_container::˜Vector_container()

List_container:

vtbl:
List_container::operator[]()

ld

List_container::size()
List_container::˜List_container()

The functions in the vtbl allow the object to be used correctly even when the size of the object and
the layout of its data are unknown to the caller
...
This virtual call mechanism can be made almost as efﬁcient as the ‘‘normal function call’’
mechanism (within 25%)
...

3
...
4 Class Hierarchies
The Container example is a very simple example of a class hierarchy
...
g
...
We use class hierarchies to represent concepts that have hierarchical relationships, such as ‘‘A ﬁre engine is a kind of a truck
which is a kind of a vehicle’’ and ‘‘A smiley face is a kind of a circle which is a kind of a shape
...
As a semirealistic classic example, let’s consider shapes on a screen:
Shape

Circle

Triangle

Smiley

The arrows represent inheritance relationships
...
To represent that simple diagram in code, we must ﬁrst specify a class that deﬁnes the general properties of all shapes:

Section 3
...
4

Class Hierarchies

class Shape {
public:
virtual Point center() const =0;
virtual void move(Point to) =0;

69

// pure virtual

virtual void draw() const = 0;
virtual void rotate(int angle) = 0;

// draw on current "Canvas"

virtual ˜Shape() {}
//
...
Given this deﬁnition, we can
write general functions manipulating vectors of pointers to shapes:
void rotate_all(vector& v, int angle) // rotate v’s elements by angle degrees
{
for (auto p : v)
p−>rotate(angle);
}

To deﬁne a particular shape, we must say that it is a
(including its virtual functions):
class Circle : public Shape {
public:
Circle(Point p, int rr);

Shape

and specify its particular properties

// constructor

Point center() const { return x; }
void move(Point to) { x=to; }
void draw() const;
void rotate(int) {}
private:
Point x; // center
int r;
// radius
};

// nice simple algorithm

So far, the

Shape and Circle example provides
Vector_container example, but we can build further:

nothing new compared to the

class Smiley : public Circle { // use the circle as the base for a face
public:
Smiley(Point p, int r) : Circle{p,r}, mouth{nullptr} { }
˜Smiley()
{
delete mouth;
for (auto p : eyes) delete p;
}

Container

and

70

A Tour of C++: Abstraction Mechanisms

Chapter 3

void move(Point to);
void draw() const;
void rotate(int);
void add_eye(Shape∗ s) { eyes
...

private:
vector eyes;
Shape∗ mouth;
};

// usually two eyes

The push_back() member function adds its argument to the vector (here, eyes), increasing that
vector’s size by one
...
Shape’s destructor is virtual and Smiley’s destructor overrides it
...
In particular, it may be deleted through a pointer to
a base class
...

That destructor then implicitly invokes the destructors of its bases and members
...

We can add data members, operations, or both as we deﬁne a new class by derivation
...
See Chapter
21
...
That is, the base class acts as an interface for the derived class
...
Such classes are often abstract classes
...
Smiley’s uses of Circle’s constructor and of Circle::draw()
are examples
...

Concrete classes – especially classes with small representations – are much like built-in types: we
deﬁne them as local variables, access them using their names, copy them around, etc
...
2
...
For example, consider a function that reads data describing
shapes from an input stream and constructs the appropriate Shape objects:
enum class Kind { circle, triangle, smiley };
Shape∗ read_shape(istream& is) // read shape descriptions from input stream is
{
//
...

switch (k) {
case Kind::circle:
// read circle data {Point,int} into p and r
return new Circle{p,r};
case Kind::triangle:
// read triangle data {Point,Point,Point} into p1, p2, and p3
return new Triangle{p1,p2,p3};
case Kind::smiley:
// read smiley data {Point,int,Shape,Shape,Shape} into p, r, e1 ,e2, and m
Smiley∗ ps = new Smiley{p,r};
ps−>add_eye(e1);
ps−>add_eye(e2);
ps−>set_mouth(m);
return ps;
}
}

A program may use that shape reader like this:
void user()
{
std::vector v;
while (cin)
v
...
The user()
code can be compiled once and later used for new Shapes added to the program
...
This is done
with the delete operator and relies critically on Shape’s virtual destructor
...
This is crucial because a derived
class may have acquired all kinds of resources (such as ﬁle handles, locks, and output streams) that
need to be released
...

Experienced programmers will notice that I left open two obvious opportunities for mistakes:
• A user might fail to delete the pointer returned by read_shape()
...

In that sense, functions returning a pointer to an object allocated on the free store are dangerous
...
2
...

// §5
...
1

}
void user()
{
vector> v;
while (cin)
v
...

For the unique_ptr version of user() to work, we need versions of draw_all() and rotate_all() that
accept vector>s
...
4
...

3
...
This is true for objects of user-deﬁned types as well as for builtin types
...
For example,
using complex from §3
...
1
...

}

// copy initialization
// copy assignment

Now z1, z2, and z3 have the same value because both the assignment and the initialization copied
both members
...
For
simple concrete types, memberwise copy is often exactly the right semantics for copy
...

Section 3
...
1

Copying Containers

73

3
...
1 Copying Containers
When a class is a resource handle, that is, it is responsible for an object accessed through a pointer,
the default memberwise copy is typically a disaster
...
4
...
2)
...
6)
...

Copying of an object of a class is deﬁned by two members: a copy constructor and a copy
assignment:
class Vector {
private:
double∗ elem; // elem points to an array of sz doubles
int sz;
public:
Vector(int s);
// constructor: establish invariant, acquire resources
˜Vector() { delete[] elem; }
// destructor: release resources
Vector(const Vector& a);
Vector& operator=(const Vector& a);

// copy constructor
// copy assignment

double& operator[](int i);
const double& operator[](int i) const;
int size() const;
};

A suitable deﬁnition of a copy constructor for Vector allocates the space for the required number of
elements and then copies the elements into it, so that after a copy each Vector has its own copy of
the elements:

74

A Tour of C++: Abstraction Mechanisms

Vector::Vector(const Vector& a)
:elem{new double[sz]},
sz{a
...
elem[i];
}

Chapter 3

// copy constructor
// allocate space for elements

// copy elements

The result of the v2=v1 example can now be presented as:
v1:

v2:

4

2

4

3

Of course, we need a copy assignment in addition to the copy constructor:
Vector& Vector::operator=(const Vector& a)
{
double∗ p = new double[a
...
sz; ++i)
p[i] = a
...
sz;
return ∗this;
}

// copy assignment

The name this is predeﬁned in a member function and points to the object for which the member
function is called
...

3
...
2 Moving Containers
We can control copying by deﬁning a copy constructor and a copy assignment, but copying can be
costly for large containers
...
size()!=b
...
size());
for (int i=0; i!=a
...
3
...
We might use this + like this:

res

75

and into some place

void f(const Vector& x, const Vector& y, const Vector& z)
{
Vector r;
//
...

}

That would be copying a Vector at least twice (one for each use of the + operator)
...
The most embarrassing part is that res in
operator+() is never used again after the copy
...
Fortunately, we can
state that intent:
class Vector {
//
...
This means that r=x+y+z will involve no copying of Vectors
...

As is typical, Vector’s move constructor is trivial to deﬁne:
Vector::Vector(Vector&& a)
:elem{a
...
sz}
{
a
...
sz = 0;
}

The && means ‘‘rvalue reference’’ and is a reference to which we can bind an rvalue (§6
...
1)
...
’’ So an rvalue is – to a ﬁrst approximation – a value
that you can’t assign to, such as an integer returned by a function call, and an rvalue reference is a
reference to something that nobody else can assign to
...

A move constructor does not take a const argument: after all, a move constructor is supposed to
remove the value from its argument
...

A move operation is applied when an rvalue reference is used as an initializer or as the righthand side of an assignment
...
Typically, we should also allow assignment to a moved-from object (§17
...
6
...

Where the programmer knows that a value will not be used again, but the compiler can’t be
expected to be smart enough to ﬁgure that out, the programmer can be speciﬁc:
Vector f()
{
Vector x(1000);
Vector y(1000);
Vector z(1000);
//
...

return z;
};

// we get a copy
// we get a move
// we get a move

The standard-library function move() returns an rvalue reference to its argument
...

y:

0

1000

1

2

...

3
...
3 Resource Management
By deﬁning constructors, copy operations, move operations, and a destructor, a programmer can
provide complete control of the lifetime of a contained resource (such as the elements of a container)
...
That way, objects that we cannot or would not want to copy out of a scope can be
simply and cheaply moved out instead
...
3
...
We can’t copy the former and don’t want to
copy the latter
...
push_back(move(t));
//
...

// run hear tbeat concurrently (on its own thread)
// move t into my_threads

Section 3
...
3

Resource Management

77

Vector vec(n);
for (int i=0; i ...
In fact, the standard-library ‘‘smart pointers,’’ such as unique_ptr, are themselves resource
handles (§5
...
1)
...
4
...

In very much the same way as new and delete disappear from application code, we can make
pointers disappear into resource handles
...
In particular, we can achieve strong resource safety; that is, we can
eliminate resource leaks for a general notion of a resource
...

3
...
4 Suppressing Operations
Using the default copy or move for a class in a hierarchy is typically a disaster: given only a pointer
to a base, we simply don’t know what members the derived class has (§3
...
2), so we can’t know
how to copy them
...

};

Now an attempt to copy a Shape will be caught by the compiler
...
2
...

In this particular case, if you forgot to delete a copy or move operation, no harm is done
...
Furthermore, the generation of copy operations is deprecated in this case (§44
...
3)
...
2
...

A base class in a class hierarchy is just one example of an object we wouldn’t want to copy
...
2, §17
...
2)
...
6
...

78

A Tour of C++: Abstraction Mechanisms

Chapter 3

3
...
A vector is a general
concept, independent of the notion of a ﬂoating-point number
...
A template is a class or a function that we parameterize with a set of types or values
...

3
...
1 Parameterized Types
We can generalize our vector-of-doubles type to a vector-of-anything type by making it a
and replacing the speciﬁc type double with a parameter
...
copy and move operations
...
It is C++’s version of the mathematical ‘‘for all T’’ or more precisely ‘‘for all types T
...
4
...
It is not (as in C++98) necessary to place a space between the two >s
...
size(); ++i)
cout << vs[i] << '\n';
}

// Vector of some strings

To support the range-for loop for our Vector, we must deﬁne suitable begin() and end() functions:
template
T∗ begin(Vector& x)
{
return &x[0];
// pointer to ﬁrst element
}
template
T∗ end(Vector& x)
{
return x
...
size(); // pointer to one-past-last element
}

Given those, we can write:
void f2(const Vector& vs) // Vector of some strings
{
for (auto& s : vs)
cout << s << '\n';
}

Similarly, we can deﬁne lists, vectors, maps (that is, associative arrays), etc
...
4,
§23
...

Templates are a compile-time mechanism, so their use incurs no run-time overhead compared to
‘‘handwritten code’’ (§23
...
2)
...
4
...
In
particular, they are extensively used for parameterization of both types and algorithms in the standard library (§4
...
5, §4
...
5)
...
0);
// the sum of a vector of ints (add doubles)
double dd = sum(ld,0
...
0,0
...
Note how the types of the template arguments for sum are deduced from the function
arguments
...

This sum() is a simpliﬁed version of the standard-library accumulate() (§40
...

3
...
3 Function Objects
One particularly useful kind of template is the function object (sometimes called a functor), which
is used to deﬁne objects that can be called like functions
...

We can deﬁne named variables of type Less_than for some argument type:
Less_than lti {42};
// lti(i) will compare i to 42 using < (i<42)
Less_than lts {"Backus"}; // lts(s) will compare s to "Backus" using < (s<"Backus")

We can call such an object, just as we call a function:
void fct(int n, const string & s)
{
bool b1 = lti(n);
// true if n<42
bool b2 = lts(s);
// true if s<"Backus"
//
...
4
...
For example, we can count the
occurrences of values for which a predicate returns true:
template
int count(const C& c, P pred)
{
int cnt = 0;
for (const auto& x : c)
if (pred(x))
++cnt;
return cnt;
}

A predicate is something that we can invoke to return true or false
...
The beauty of these
function objects is that they carry the value to be compared against with them
...
Also, for a simple function object like Less_than inlining is simple,
so that a call of Less_than is far more efﬁcient than an indirect function call
...

Function objects used to specify the meaning of key operations of a general algorithm (such as
Less_than for count()) are often referred to as policy objects
...
That could be seen as inconvenient
...
4)
...
The [&] is a capture list specifying that local names used
(such as x) will be passed by reference
...
Had we wanted to give the generated object a copy of x, we could have said so: [=x]
...

Using lambdas can be convenient and terse, but also obscure
...

In §3
...
4, we noticed the annoyance of having to write many functions to perform operations on
elements of vectors of pointers and unique_ptrs, such as draw_all() and rotate_all()
...

First, we need a function that applies an operation to each object pointed to by the elements of a
container of pointers:
template
void for_all(C& c, Oper op)
// assume that C is a container of pointers
{
for (auto& x : c)
op(∗x);
// pass op() a reference to each element pointed to
}

Now, we can write a version of user() from §3
...
4 without writing a set of _all functions:
void user()
{
vector> v;
while (cin)
v
...
draw(); });
for_all(v,[](Shape& s){ s
...
In particular, those for_all() calls would still work if I changed v
to a vector
...
4
...
Such a
template is called a variadic template
...
Tail>
void f(T head, Tail
...
); // try again with tail
}
void f() { }

// do nothing

The key to implementing a variadic template is to note that when you pass a list of arguments to it,

Section 3
...
4

Variadic Templates

83

you can separate the ﬁrst argument from the rest
...
The ellipsis,
...
Eventually, of course, tail will become empty and we need a separate
function to deal with that
...
2,"hello");
cout << "\nsecond: "
f(0
...
2,"hello"), which will call f(2
...
What might the call g(head) do? Obviously, in a real program it will do whatever we wanted
done to each argument
...
2 hello
second: 0
...

The strength of variadic templates (sometimes just called variadics) is that they can accept any
arguments you care to give them
...
For details, see §28
...
For examples, see §34
...
4
...

3
...
5 Aliases
Surprisingly often, it is useful to introduce a synonym for a type or a template (§6
...
For example,
the standard header contains a deﬁnition of the alias size_t, maybe:
using size_t = unsigned int;

The actual type named size_t is implementation-dependent, so in another implementation size_t
may be an unsigned long
...

It is very common for a parameterized type to provide an alias for types related to their template
arguments
...

};

In fact, every standard-library container provides value_type as the name of its value type (§31
...
1)
...
For
example:
template
using Element_type = typename C::value_type;
template
void algo(Container& c)
{
Vector> vec; // keep results here
//
...
For example:
template
class Map {
//
...
6
...
5 Advice
[1]
[2]
[3]
[4]
[5]
[6]
[7]

Express ideas directly in code; §3
...

Deﬁne classes to represent application concepts directly in code; §3
...

Use concrete classes to represent simple concepts and performance-critical components;
§3
...
1
...
2
...
2
...
2
...
2
...
2
...

Use class hierarchies to represent concepts with inherent hierarchical structure; §3
...
4
...
5

[8]
[9]
[10]
[11]
[12]
[13]
[14]
[15]

Advice

85

When designing a class hierarchy, distinguish between implementation inheritance and interface inheritance; §3
...
4
...
3
...
3
...

Provide strong resource safety; that is, never leak anything that you think of as a resource;
§3
...
3
...
4
...

Use function templates to represent general algorithms; §3
...
2
...
4
...

Use type and template aliases to provide a uniform notation for types that may vary among
similar types or among implementations; §3
...
5
...
1 Libraries
No signiﬁcant program is written in just a bare programming language
...
These then form the basis for further work
...

Continuing from Chapters 2 and 3, this chapter and the next give a quick tour of key standardlibrary facilities
...
If not, please consider reading a
textbook, such as Programming: Principles and Practice Using C++ [Stroustrup,2009], before
continuing
...
If you ﬁnd this ‘‘lightning tour’’
confusing, you might skip to the more systematic and bottom-up language presentation starting in
Chapter 6
...

88

A Tour of C++: Containers and Algorithms

Chapter 4

I very brieﬂy present useful standard-library types, such as string, ostream, vector, map (this
chapter), unique_ptr, thread, regex, and complex (Chapter 5), as well as the most common ways of
using them
...
As in Chapter
2 and Chapter 3, you are strongly encouraged not to be distracted or discouraged by an incomplete
understanding of details
...

The speciﬁcation of the standard library is almost two thirds of the ISO C++ standard
...
Much though have gone into its design, more still into
its implementations, and much effort will go into its maintenance and extension
...
In addition to the standard-library components, most implementations offer ‘‘graphical user
interface’’ systems (GUIs), Web interfaces, database interfaces, etc
...
Here, I do not describe such systems and libraries
...
Naturally, a programmer is encouraged to
explore the more extensive facilities available on most systems
...
1
...
g
...
3
...

• Strings and I/O streams (with support for international character sets and localization); see
Chapter 36, Chapter 38, and Chapter 39
...

• A framework of containers (such as vector and map) and algorithms (such as ﬁnd(), sort(),
and merge()); see §4
...
5, Chapters 31-33
...

• Support for numerical computation (such as standard mathematical functions, complex
numbers, vectors with arithmetic operations, and random number generators); see §3
...
1
...

• Support for regular expression matching; see §5
...

• Support for concurrent programming, including threads and locks; see §5
...

The concurrency support is foundational so that users can add support for new models of
concurrency as libraries
...
g
...
4
...
2
...
4),
STL-style generic programming (e
...
, pair; §5
...
3, §34
...
4
...
g
...
4
...
2)
...
g
...
2
...
3)
and an interface to garbage collectors (§34
...

• Special-purpose containers, such as array (§34
...
1), bitset (§34
...
2), and tuple (§34
...
4
...

Section 4
...
1

Standard-Library Overview

89

The main criteria for including a class in the library were that:
• it could be helpful to almost every C++ programmer (both novices and experts),
• it could be provided in a general form that did not add signiﬁcant overhead compared to a
simpler version of the same facility, and
• that simple uses should be easy to learn (relative to the inherent complexity of their task)
...

4
...
2 The Standard-library Headers and Namespace
Every standard-library facility is provided through some standard header
...

The standard library is deﬁned in a namespace (§2
...
2, §14
...
1) called
library facilities, the std:: preﬁx can be used:

std
...
Neither will I always
#include the necessary headers explicitly
...
4
...
5
...
2) and make the names
they declare accessible
...

However, in this book, I use the standard library almost exclusively and it is good to know what it
offers
...
Nor do I #include the
appropriate headers in every example
...

Here is a selection of standard-library headers, all supplying declarations in namespace std:
Selected Standard Library Headers (continues)

copy(), ﬁnd(), sort()
array
duration, time_point
sqrt(), pow()
complex, sqrt(), pow()
fstream, ifstream, ofstream
future, promise
istream, ostream, cin, cout

§32
...
2
...
2
§40
...
4
§38
...
1
§5
...
5
§38
...
25
§iso
...
3
...
20
...
2
§iso
...
8
§iso
...
8
§iso
...
9
...
30
...
27
...
4
...
2
...
7
Chapter 37
Chapter 36
§31
...
3
§38
...
2
§5
...
1
§31
...
3
...
5
§31
...
23
...
4
§iso
...
6
§iso
...
5
§iso
...
8
§iso
...
3
§iso
...
4
...
27
...
30
...
23
...
4
§iso
...
1
§iso
...
3
...
2 for more information
...
2 Strings
The standard library provides a string type to complement the string literals
...
For example:

string

type pro-

string compose(const string& name, const string& domain)
{
return name + '@' + domain;
}
auto addr = compose("dmr","bell−labs
...
com
...
You can concatenate a string, a string literal, a C-style string, or a character to a
string
...
3
...

In many applications, the most common form of concatenation is adding something to the end
of a string
...
For example:
void m2(string& s1, string& s2)
{
s1 = s1 + '\n'; // append newline
s2 += '\n';
// append newline
}

The two ways of adding to the end of a string are semantically equivalent, but I prefer the latter
because it is more explicit about what it does, more concise, and possibly more efﬁcient
...
In addition to = and +=, subscripting (using []) and substring operations are
supported
...
Among other useful features, it
provides the ability to manipulate substrings
...
2

Strings

91

string name = "Niels Stroustrup";
void m3()
{
string s = name
...
replace(0,5,"nicholas");
name[0] = toupper(name[0]);
}

// s = "Stroustrup"
// name becomes "nicholas Stroustrup"
// name becomes "Nicholas Stroustrup"

The substr() operation returns a string that is a copy of the substring indicated by its arguments
...
Since indexing starts from 0, s gets the value Stroustrup
...
In this case, the substring starting at 0
with length 5 is Niels; it is replaced by nicholas
...
Thus, the ﬁnal value of name is Nicholas Stroustrup
...

Naturally, strings can be compared against each other and against string literals
...

}
//
...
The most common techniques for implementing
example (§19
...

4
...

The input operations are typed and extensible to handle user-deﬁned types
...

Other forms of user interaction, such as graphical I/O, are handled through libraries that are not
part of the ISO standard and therefore not described here
...
3
...
Further, it is easy to deﬁne output of a
user-deﬁned type (§4
...
3)
...

By default, values written to cout are converted to a sequence of characters
...

Equivalently, we could write:
void g()
{
int i {10};
cout << i;
}

Output of different types can be combined in the obvious way:
void h(int i)
{
cout << "the value of i is ";
cout << i;
cout << '\n';
}

For h(10), the output will be:
the value of i is 10

People soon tire of repeating the name of the output stream when outputting several related items
...
For example:
void h2(int i)
{
cout << "the value of i is " << i << '\n';
}

This h2() produces the same output as h()
...
Note that a character is output as
a character rather than as a numerical value
...

Section 4
...
2

Input

93

4
...
2 Input
The standard library offers istreams for input
...

The operator >> (‘‘get from’’) is used as an input operator; cin is the standard input stream
...
For example:
void f()
{
int i;
cin >> i;
double d;
cin >> d;

// read an integer into i

// read a double-precision ﬂoating-point number into d

}

This reads a number, such as 1234, from the standard input into the integer variable i and a ﬂoatingpoint number, such as 12
...

Often, we want to read a sequence of characters
...
For example:
void hello()
{
cout << "Please enter your name\n";
string str;
cin >> str;
cout << "Hello, " << str << "!\n";
}

If you type in Eric the response is:
Hello, Eric!

By default, a whitespace character (§7
...
2), such as a space, terminates the read, so if you enter Eric
pretending to be the ill-fated king of York, the response is still:

Bloodaxe

Hello, Eric!

You can read a whole line (including the terminating newline character) using the getline() function
...

The standard strings have the nice property of expanding to hold what you put in them; you
don’t have to precalculate a maximum size
...

4
...
3 I/O of User-Deﬁned Types
In addition to the I/O of built-in types and standard strings, the iostream library allows programmers
to deﬁne I/O for their own types
...
name << "\", " << e
...
See §38
...
2 for details
...
Note: formatted with { " " , and }
{
char c, c2;
if (is>>c && c=='{' && is>>c2 && c2=='"') { // star t with a { "
string name;
// the default value of a string is the empty string: ""
while (is
...
setf(ios_base::failbit);
return is;

// register the failure in the stream

}

An input operation returns a reference to its

istream

which can be used to test if the operation

Section 4
...
3

I/O of User-Deﬁned Types

95

succeeded
...
get(c) does not, so that this Entry-input operator
ignores (skips) whitespace outside the name string, but not within it
...
4
...
See §5
...

4
...

Reading characters into a string and printing out the string is a simple example
...
Providing suitable containers for
a given task and supporting them with useful fundamental operations are important steps in the
construction of any program
...
This is the kind of program for which different approaches appear ‘‘simple and
obvious’’ to people of different backgrounds
...
3
...
Here, we deliberately ignore many real-world complexities, such as the
fact that many phone numbers do not have a simple representation as a 32-bit int
...
4
...
A vector is a sequence of elements of a given
type
...
2
...
4 give an idea of the implementation of vector and §13
...
4 provide an exhaustive discussion
...
size(); ++i)
cout << book[i] << '\n';
}

As usual, indexing starts at 0 so that book[0] holds the entry for David Hume
...

The elements of a vector constitute a range, so we can use a range-for loop (§2
...
5):
void print_book(const vector& book)
{
for (const auto& x : book)
// for "auto" see §2
...
2
cout << x << '\n';
}

When we deﬁne a vector, we give it an initial size (initial number of elements):
vector v1 = {1, 2, 3, 4};
vector v2;
vector v3(23);
vector v4(32,9
...
9

An explicit size is enclosed in ordinary parentheses, for example, (23), and by default the elements
are initialized to the element type’s default value (e
...
, nullptr for pointers and 0 for numbers)
...
g
...
9 for the 32 elements of v4)
...
One of the most useful operations on a vector is push_back(),
which adds a new element at the end of a vector, increasing its size by one
...
push_back(e);
}

This reads Entrys from the standard input into phone_book until either the end-of-input (e
...
, the
end of a ﬁle) is reached or the input operation encounters a format error
...

A vector can be copied in assignments and initializations
...
4
...
3
...
Thus, after the initialization
of book2, book2 and phone_book hold separate copies of every Entry in the phone book
...
Where copying is undesirable, references or pointers (§7
...
7) or move operations (§3
...
2,
§17
...
2) should be used
...
4
...
1 Elements
Like all standard-library containers, vector is a container of elements of some type T, that is, a
vector
...
When you insert a new element, its value is copied
into the container
...
The element is not a reference or a pointer to some object
containing 7
...
For people who care about
memory sizes and run-time performance this is critical
...
4
...
2 Range Checking
The standard-library vector does not guarantee range checking (§31
...
2)
...
size()]
...

}

// book
...
This is
undesirable, and out-of-range errors are a common problem
...
3
...
1
T& operator[](int i)
{ return vector::at(i); }
const T& operator[](int i) const
{ return vector::at(i); }

// range check

// range check const objects; §3
...
1
...
The at() operation is a vector subscript operation that throws an exception of type
out_of_range if its argument is out of the vector’s range (§2
...
3
...
2
...

Vec

98

A Tour of C++: Containers and Algorithms

Chapter 4

For Vec, an out-of-range access will throw an exception that the user can catch
...
size()] = {"Joe",999999};
//
...
4
...
1, Chapter 13)
...
One way to minimize surprises from uncaught exceptions is to use a main()
with a try-block as its body
...
) {
cerr << "unknown exception thrown\n";
}

This provides default exception handlers so that if we fail to catch some exception, an error message is printed on the standard error-diagnostic output stream cerr (§38
...

Some implementations save you the bother of deﬁning Vec (or equivalent) by providing a rangechecked version of vector (e
...
, as a compiler option)
...
4
...
Insertion and deletion of phone book entries could be common, so a list could be appropriate for representing a simple phone book
...
4
...
Instead, we might search the list looking for an element with a given value
...
5:
int get_number(const string& s)
{
for (const auto& x : phone_book)
if (x
...
number;
return 0; // use 0 to represent "number not found"
}

The search for s starts at the beginning of the list and proceeds until s is found or the end of
phone_book is reached
...
For example, we may want to delete it or
insert a new entry before it
...
Every standard-library container provides the functions begin() and end(), which return an iterator to the ﬁrst and to one-past-the-last
element, respectively (§4
...
1
...
Using iterators explicitly, we can – less elegantly – write the
get_number() function like this:
int get_number(const string& s)
{
for (auto p = phone_book
...
end(); ++p)
if (p−>name==s)
return p−>number;
return 0; // use 0 to represent "number not found"
}

In fact, this is roughly the way the terser and less error-prone range-for loop is implemented by the
compiler
...
m
...
insert(p,ee);
// add ee before the element referred to by p
phone_book
...
3
...

These list examples could be written identically using vector and (surprisingly, unless you
understand machine architecture) perform better with a small vector than with a small list
...
Unless
you have a reason not to, use a vector
...
g
...
g
...

100

A Tour of C++: Containers and Algorithms

Chapter 4

4
...
3 map
Writing code to look up a name in a list of (name,number) pairs is quite tedious
...
The standard library offers a search tree (a redblack tree) called map:
map:

links
4

links
key:
value:

links
links

In other contexts, a map is known as an associative array or a dictionary
...

The standard-library map (§31
...
3) is a container of pairs of values optimized for lookup
...
4
...
4
...
For example:
int get_number(const string& s)
{
return phone_book[s];
}

In other words, subscripting a map is essentially the lookup we called get_number()
...
The default value for an integer
type is 0; the value I just happened to choose represents an invalid telephone number
...
4
...
1)
...
4
...
That’s pretty
good
...
However, in many cases, we can do better by using a hashed
lookup rather than comparison using an ordering function, such as <
...
4
...
4
...
4)
...

If necessary, you

4
...
5 Container Overview
The standard library provides some of the most general and useful container types to allow the programmer to select a container that best serves the needs of an application:
Standard Container Summary
vector
list
forward_list
deque
set
multiset
map
multimap
unordered_map
unordered_multimap
unordered_set
unordered_multiset

A variable-size vector (§31
...
4
...
4
...
2)
A set (§31
...
3)
A set in which a value can occur many times (§31
...
3)
An associative array (§31
...
3)
A map in which a key can occur many times (§31
...
3)
A map using a hashed lookup (§31
...
3
...
4
...
2)
A set using a hashed lookup (§31
...
3
...
4
...
2)

The unordered containers are optimized for lookup with a key (often a string); in other words, they
are implemented using hash tables
...
4
...
(§4
...
2, §30
...
In addition, the standard
library provides container adaptors queue (§31
...
2), stack (§31
...
1), deque (§31
...
5
...
The standard library also provides more specialized container-like
types, such as a ﬁxed-size array array (§34
...
1) and bitset (§34
...
2)
...
Furthermore, the meanings of the operations are equivalent for the various containers
...
For example:
• begin() and end() give iterators to the ﬁrst and one-beyond-the-last elements, respectively
...

• size() returns the number of elements
...
The range-checked vector, Vector
(§2
...
2, §2
...
3
...
The uniformity of container interfaces also allows us to
specify algorithms independently of individual container types
...
For example, subscripting and traversing a vector is cheap and easy
...
Please note that a vector is usually more efﬁcient than a list for short sequences of small
elements (even for insert() and erase())
...

4
...
To use one, we need operations for basic access such as adding and removing elements (as is provided for list and vector)
...
We sort them, print them, extract subsets,
remove elements, search for objects, etc
...
For
example, the following sorts a vector and places a copy of each unique vector element on a list:
bool operator<(const Entry& x, const Entry& y)
// less than
{
return x
...
name;
// order Entrys by their names
}
void f(vector& vec, list& lst)
{
sort(vec
...
end());
unique_copy(vec
...
end(),lst
...
They are expressed in terms of sequences of
elements
...
5

Algorithms

iterators:

begin()

103

end()

elements:
In the example, sort() sorts the sequence deﬁned by the pair of iterators vec
...
end() –
which just happens to be all the elements of a vector
...
If more than one element is written, the elements following that initial element will be overwritten
...

If we wanted to place the unique elements in a new container, we could have written:
list f(vector& vec)
{
list res;
sort(vec
...
end());
unique_copy(vec
...
end(),back_inserter(res)); // append to res
return res;
}

A back_inserter() adds elements at the end of a container, extending the container to make room for
them (§33
...
2)
...
5
...
The standard-library list has
a move constructor (§3
...
2, §17
...
2) that makes returning res by value efﬁcient (even for lists of
thousands of elements)
...
begin(),vec
...
5
...

4
...
1 Use of Iterators
When you ﬁrst encounter a container, a few iterators referring to useful elements can be obtained;
begin() and end() are the best examples of this
...
For
example, the standard algorithm ﬁnd looks for a value in a sequence and returns an iterator to the
element found:
bool has_c(const string& s, char c)
{
auto p = ﬁnd(s
...
end(),c);
if (p!=s
...
’’ An equivalent, shorter, deﬁnition of has_c() is:

104

A Tour of C++: Containers and Algorithms

Chapter 4

bool has_c(const string& s, char c)
// does s contain the character c?
{
return ﬁnd(s
...
end(),c)!=s
...
We can return the set of occurrences as a vector of string iterators
...
3
...
Assuming that we would like to
modify the locations found, we pass a non-const string:
vector ﬁnd_all(string& s, char c)
{
vector res;
for (auto p = s
...
end(); ++p)
if (∗p==c)
res
...
We could test
ﬁnd_all() like this:
void test()
{
string m {"Mary had a little lamb"};
for (auto p : ﬁnd_all(m,'a'))
if (∗p!='a')
cerr << "a bug!\n";
}

That call of ﬁnd_all() could be graphically represented like this:
ﬁnd_all(m,’a’):

m:

M a

r

y

h a d

a

l

i

t

t

l

e

l

a m b

Iterators and standard algorithms work equivalently on every standard container for which their use
makes sense
...
begin(); p!=c
...
push_back(p);
return res;
}

// ﬁnd all occurrences of v in c

Section 4
...
1

Use of Iterators

105

The typename is needed to inform the compiler that C’s iterator is supposed to be a type and not a
value of some type, say, the integer 7
...
4
...
begin(); p!=c
...
push_back(p);
return res;
}

We can now write:
void test()
{
string m {"Mary had a little lamb"};
for (auto p : ﬁnd_all(m,'a'))
if (∗p!='a')
cerr << "string bug!\n";

// p is a string::iterator

list ld {1
...
2, 3
...
1};
for (auto p : ﬁnd_all(ld,1
...
1)
cerr << "list bug!\n";
vector vs { "red", "blue", "green", "green", "orange", "green" };
for (auto p : ﬁnd_all(vs,"green"))
if (∗p!="green")
cerr << "vector bug!\n";
for (auto p : ﬁnd_all(vs,"green"))
∗p = "vert";
}

Iterators are used to separate algorithms and containers
...
Conversely, a
container knows nothing about the algorithms operating on its elements; all it does is to supply iterators upon request (e
...
, begin() and end())
...

4
...
2 Iterator Types
What are iterators really? Any particular iterator is an object of some type
...
These iterator types can be as different as the containers and
the specialized needs they serve
...

A list iterator must be something more complicated than a simple pointer to an element because
an element of a list in general does not know where the next element of that list is
...

What is common for all iterators is their semantics and the naming of their operations
...
Similarly, ∗ yields
the element to which the iterator refers
...
1
...
Furthermore, users rarely need to know the type of a speciﬁc iterator; each
container ‘‘knows’’ its iterator types and makes them available under the conventional names iterator and const_iterator
...

We rarely have to worry about the details of how that type is deﬁned
...
5
...

However, containers are not the only place where we ﬁnd sequences of elements
...

Consequently, the notion of iterators can be usefully applied to input and output
...
For example:

Section 4
...
3

Stream Iterators

ostream_iterator oo {cout};

107

// write strings to cout

The effect of assigning to ∗oo is to write the assigned value to cout
...
The ++oo is done to
mimic writing into an array through a pointer
...
Again, we must specify the stream to be used and the type of values expected:
istream_iterator ii {cin};

Input iterators are used in pairs representing a sequence, so we must provide an
indicate the end of input
...
Instead, they are provided as
arguments to algorithms
...
begin(),b
...
begin(),b
...
eof() || !os;

// return error state (§2
...
1, §38
...
The ostream_iterator’s second argument is used to delimit output values
...
We read the strings into a vector, then we
sort() them, and then we write them out, eliminating duplicates
...
This can be done by keeping the strings in a set, which does not keep duplicates and keeps its elements in order (§31
...
3)
...
begin(),b
...
begin(),b
...
eof() || !os;

// return error state (§2
...
1, §38
...

4
...
4 Predicates
In the examples above, the algorithms have simply ‘‘built in’’ the action to be done for each element of a sequence
...
For
example, the ﬁnd algorithm (§32
...
A
more general variant looks for an element that fulﬁlls a speciﬁed requirement, a predicate (§3
...
2)
...
A map allows us to
access its elements as a sequence of (key,value) pairs, so we can search a map’s sequence
for a pair where the int is greater than 42:
void f(map& m)
{
auto p = ﬁnd_if(m
...
end(),Greater_than{42});
//
...
4
...
second>val; }
};

Alternatively, we could use a lambda expression (§3
...
3):
int cxx = count_if(m
...
end(), [](const pair& r) { return r
...
5
...
5
...
Deﬁniteness
...
Output
...
1]
...

The standard library provides dozens of algorithms
...
These standard-library algorithms all take sequences
as inputs (§4
...
A half-open sequence from b to e is referred to as [b:e)
...

4
...
6 Container Algorithms
A sequence is deﬁned by a pair of iterators [begin:end)
...
For example:
sort(v
...
end());

Why don’t we just say sort(v)? We can easily provide that shorthand:
namespace Estd {
using namespace std;
template
void sort(C& c)
{
sort(c
...
end());
}

110

A Tour of C++: Containers and Algorithms

Chapter 4

template
void sort(C& c, Pred p)
{
sort(c
...
end(),p);
}
//
...

4
...
1
...
1
...
1
...
1
...

Remember that standard-library facilities are deﬁned in namespace std; §4
...
2
...
2
...
2, §4
...
2
...
3
...
4
...
4
...
4
...

Prefer compact data structures; §4
...
1
...

If in doubt, use a range-checked vector (such as Vec); §4
...
1
...

Use push_back() or back_inserter() to add elements to a container; §4
...
1, §4
...

Use push_back() on a vector rather than realloc() on an array; §4
...

Catch common exceptions in main(); §4
...
1
...

Know your standard algorithms and prefer them over handwritten loops; §4
...
5
...
5
...

Estd

5
A Tour of C++: Concurrency and Utilities
When you wish to instruct,
be brief
...
1 Introduction
From an end-user’s perspective, the ideal standard library would provide components directly supporting essentially every need
...
However, that is not what the C++ standard library is trying to do
...
Instead, the C++
standard library aims to provide components that are useful to most people in most application
areas
...
In addition, support for a few widely important application areas, such as mathematical computation and text
manipulation, have crept in
...
2 Resource Management
One of the key tasks of any nontrivial program is to manage resources
...
Examples are memory, locks,
sockets, thread handles, and ﬁle handles
...
Even for short programs, a leak can become an embarrassment, say by a resource
shortage increasing the run time by orders of magnitude
...
To do this, they rely on the
basic language support for resource management using constructor/destructor pairs to ensure that a
resource doesn’t outlive an object responsible for it
...
2
...
2) and all standard-library containers are implemented in similar ways
...
For example, the technique is used for the standard-library lock classes:
mutex m; // used to protect access to shared data
//
...
manipulate shared data
...
3
...
The corresponding destructor releases the resource
...

This is an application of the ‘‘Resource Acquisition Is Initialization’’ technique (RAII; §3
...
1
...
3)
...
Containers
(such as vector and map), string, and iostream manage their resources (such as ﬁle handles and buffers) similarly
...
2
...
3
...
3
...
For example:
void f(int i, int j)
// X* vs
...

Section 5
...
1

unique_ptr

if (i<99) throw Z{};
if (j<77) return;
p−>do_something();
sp−>do_something();
//
...
On the other hand, unique_ptr ensures that its object
is properly destroyed whichever way we exit f() (by throwing an exception, by executing return, or
by ‘‘falling off the end’’)
...

}

// use a local variable

Unfortunately, overuse of new (and of pointers and references) seems to be an increasing problem
...
Its further
uses include passing free-store allocated objects in and out of functions:
unique_ptr make_X(int i)
// make an X and immediately give it to a unique_ptr
{
//
...

return unique_ptr{new X{i}};
}

A unique_ptr is a handle to an individual object (or an array) in much the same way that a vector is
a handle to a sequence of objects
...

The shared_ptr is similar to unique_ptr except that shared_ptrs are copied rather than moved
...
For example:
void f(shared_ptr);
void g(shared_ptr);
void user(const string& name, ios_base::openmode mode)
{
shared_ptr fp {new fstream(name,mode)};
if (!∗fp) throw No_ﬁle{}; // make sure the ﬁle was properly opened
f(fp);
g(fp);
//
...
Note that f() or g() may spawn a task holding a copy of fp or in some
other way store a copy that outlives user()
...
This is
neither cost free nor exorbitantly expensive, but does make the lifetime of the shared object hard to
predict
...

Given unique_ptr and shared_ptr, we can implement a complete ‘‘no naked new’’ policy
(§3
...
1
...
However, these ‘‘smart pointers’’ are still conceptually pointers and
therefore only my second choice for resource management – after containers and other types that
manage their resources at a higher conceptual level
...
Data races
(§41
...
4) and other forms of confusion are not addressed simply by eliminating the resource management issues
...
’’
• When we share an object, we need pointers (or references) to refer to the shared object, so a
shared_ptr becomes the obvious choice (unless there is an obvious single owner)
...

• A shared polymorphic object typically requires shared_ptrs
...
3
...

5
...
All modern programming languages provide support for this
...
The standard-library support is primarily aimed at supporting systems-level concurrency rather than directly providing sophisticated higher-level concurrency models; those can be supplied as libraries built using the standard-library facilities
...
To allow that, C++ provides a suitable memory model (§41
...
3)
...
This section brieﬂy gives examples of the main standard-library
concurrency support facilities: threads, mutexes, lock() operations, packaged_tasks, and futures
...

Section 5
...
1

Tasks and threads

115

5
...
1 Tasks and threads
We call a computation that can potentially be executed concurrently with other computations a task
...
A task to be executed concurrently with other tasks is launched by constructing a std::thread (found in ) with the task as
its argument
...
4
...
join();
t2
...
To ‘‘join’’ means to
‘‘wait for the thread to terminate
...
In this, threads differ from processes, which
generally do not directly share data
...
3
...
Such communication is typically controlled by locks or other
mechanisms to prevent data races (uncontrolled concurrent access to a variable)
...
Consider possible implementations of the
tasks f (a function) and F (a function object):
void f() { cout << "Hello "; }
struct F {
void operator()() { cout << "Parallel World!\n"; }
};

This is an example of a bad error: Here, f and F() each use the object cout without any form of synchronization
...
The program may produce ‘‘odd’’ output, such as
PaHerallllel o World!

When deﬁning tasks of a concurrent program, our aim is to keep tasks completely separate except
where they communicate in simple and obvious ways
...
For that to work, we just
have to pass arguments, get a result back, and make sure that there is no use of shared data in
between (no data races)
...
3
...
We can easily pass data (or pointers or references to the
data) as arguments
...
4
...
join();
t2
...
F can now use that array and
hopefully no other task accesses vec2 while F is executing
...

The initialization with {f,some_vec} uses a thread variadic template constructor that can accept
an arbitrary sequence of arguments (§28
...
The compiler checks that the ﬁrst argument can be
invoked given the following arguments and builds the necessary function object to pass to the
thread
...

5
...
3 Returning Results
In the example in §5
...
2, I pass the arguments by non-const reference
...
7)
...
A less obscure technique is to pass the input data by const reference and to pass the location of a place to deposit the result as a separate argument:
void f(const vector& v, double∗ res);// take input from v; place result in *res
class F {
public:
F(const vector& vv, double∗ p) :v{vv}, res{p} { }
void operator()();
// place result in *res

Section 5
...
3

private:
const vector& v;
double∗ res;
};

Returning Results

117

// source of input
// target for output

int main()
{
vector some_vec;
vector vec2;
//
...
join();
t2
...
3
...
1
...
3
...
In that case, the access has to be synchronized so that at most
one task at a time has access
...
g
...

The fundamental element of the solution is a mutex, a ‘‘mutual exclusion object
...
lock())
...

Once a thread has completed its access to the shared data, the unique_lock releases the mutex (with
a call m
...
The mutual exclusion and locking facilities are found in
...
Obviously, this is error-prone,
and equally obviously we try to make the correspondence clear through various language means
...

};

It doesn’t take a genius to guess that for a Record called rec, rec
...

It is not uncommon to need to simultaneously access several resources to perform some action
...
For example, if thread1 acquires mutex1 and then tries to acquire mutex2
while thread2 acquires mutex2 and then tries to acquire mutex1, then neither task will ever proceed
further
...

unique_lock lck1 {m1,defer_lock};
unique_lock lck2 {m2,defer_lock};
unique_lock lck3 {m3,defer_lock};
//
...
manipulate shared data
...
The destructors for the individual unique_locks ensure that the
mutexes are released when a thread leaves the scope
...
In particular, the programmer has to
devise ways of knowing what work has and has not been done by various tasks
...
On the other hand, some people are convinced that sharing must be more efﬁcient than copying arguments and returns
...
On the other hand, modern machines are very good at copying data, especially compact
data, such as vector elements
...

5
...
4
...
The simplest ‘‘event’’ is simply time passing
...
3
...
1

Waiting for Events

using namespace std::chrono;

119

// see §35
...
count() << " nanoseconds passed\n";

Note that I didn’t even have to launch a thread; by default, this_thread refers to the one and only
thread (§42
...
6)
...
See §5
...
1 and
§35
...
The time facilities are found in

...
3
...
A condition_variable is a mechanism allowing one thread to
wait for another
...

Consider the classical example of two threads communicating by passing messages through a
queue
...

};

// object to be communicated

queue mqueue;
condition_variable mcond;
mutex mmutex;

// the queue of messages
// the variable communicating events
// the locking mechanism

The types queue, condition_variable, and mutex are provided by the standard library
...
wait(lck)) /* do nothing */;
auto m = mqueue
...
pop();
lck
...
process m
...
Waiting on condition_variable releases its lock argument until the wait is
over (so that the queue is non-empty) and then reacquires it
...
ﬁll the message
...
push(m);
mcond
...
3
...

5
...
5 Communicating Tasks
The standard library provides a few facilities to allow programmers to operate at the conceptual
level of tasks (work to potentially be done concurrently) rather than directly at the lower level of
threads and locks:
[1] future and promise for returning a value from a task spawned on a separate thread
[2] packaged_task to help launch tasks and connect up the mechanisms for returning a result
[3] async() for launching of a task in a manner very similar to calling a function
...

5
...
5
...
The basic
idea is simple: When a task wants to pass a value to another, it puts the value into a promise
...
We can represent this graphically:
task1:

task2:
set_value()

get()

future

promise
set_exception()

value
If we have a future called fx, we can get() a value of type X from it:
X v = fx
...
If the value couldn’t be computed,
get() might throw an exception (from the system or transmitted from the task from which we were
trying to get() the value)
...
3
...
1

future

and promise

121

The main purpose of a promise is to provide simple ‘‘put’’ operations (called set_value() and
set_exception()) to match future’s get()
...
They are yet another fertile source of puns
...
For example:
void f(promise& px) // a task: place the result in px
{
//
...
compute a value for res
...
set_value(res);
}
catch (
...
set_exception(current_exception());
}
}

The current_exception() refers to the caught exception (§30
...
1
...

To deal with an exception transmitted through a future, the caller of
catch it somewhere
...

try {
X v = fx
...
use v
...
) {
// oops: someone couldn’t compute v
//
...

}
}

5
...
5
...
A packaged_task provides wrapper
code to put the return value or exception from the task into a promise (like the code shown in
§5
...
5
...
If you ask it by calling get_future, a packaged_task will give you the future corresponding
to its promise
...
4
...
6):

122

A Tour of C++: Concurrency and Utilities

Chapter 5

double accum(double∗ beg, double ∗ end, double init)
// compute the sum of [beg:end) starting with the initial value init
{
return accumulate(beg,end,init);
}
double comp2(vector& v)
{
using Task_type = double(double∗,double∗,double);

// type of task

packaged_task pt0 {accum};
packaged_task pt1 {accum};

// package the task (i
...
, accum)

future f0 {pt0
...
get_future()};

// get hold of pt0’s future
// get hold of pt1’s future

double∗ ﬁrst = &v[0];
thread t1 {move(pt0),ﬁrst,ﬁrst+v
...
size()/2,ﬁrst+v
...

return f0
...
get();

// get the results

}

The packaged_task template takes the type of the task as its template argument (here Task_type, an
alias for double(double∗,double∗,double)) and the task as its constructor argument (here, accum)
...

Please note the absence of explicit mention of locks in this code: we are able to concentrate on
tasks to be done, rather than on the mechanisms used to manage their communication
...

5
...
5
...
It is far from the only model supported by the C++ standard library, but it serves well for a
wide range of needs
...
g
...

To launch tasks to potentially run asynchronously, we can use async():
double comp4(vector& v)
// spawn many tasks if v is large enough
{
if (v
...
begin(),v
...
0);
auto v0 = &v[0];
auto sz = v
...
3
...
3

async()

auto f0 = async(accum,v0,v0+sz/4,0
...
0);
auto f2 = async(accum,v0+sz/2,v0+sz∗3/4,0
...
0);

123

// ﬁrst quarter
// second quarter
// third quarter
// four th quar ter

return f0
...
get()+f2
...
get(); // collect and combine the results
}

Basically, async() separates the ‘‘call part’’ of a function call from the ‘‘get the result part,’’ and separates both from the actual execution of the task
...
Instead, you think just in terms of tasks that potentially compute their results
asynchronously
...
For example, async() may check whether any idle cores (processors) are available before deciding how many threads to use
...
For example, it can also be used to spawn a task for getting information
from a user, leaving the ‘‘main program’’ active with something else (§42
...
6)
...
4 Small Utility Components
Not all standard-library components come as part of obviously labeled facilities, such as ‘‘containers’’ or ‘‘I/O
...

• Type functions, such as iterator_traits and is_arithmetic, for gaining information about types
...

The point here is that a function or a type need not be complicated or closely tied to a mass of other
functions and types to be useful
...

5
...
1 Time
The standard library provides facilities for dealing with time
...
2

auto t0 = high_resolution_clock::now();
do_work();
auto t1 = high_resolution_clock::now();
cout << duration_cast(t1−t0)
...
Subtracting two time_points gives a duration (a
period of time)
...
That’s what duration_cast does
...
2)
...

Guesses about performance are most unreliable
...
4
...
The standard library provides a variety of type functions to help library implementers and programmers in general to write code that take advantage of aspects of the language,
the standard library, and code in general
...
6
...
For example:
constexpr ﬂoat min = numeric_limits<ﬂoat>::min();

// smallest positive ﬂoat (§40
...
2
...
For example:
constexpr int szi = sizeof(int); // the number of bytes in an int

Such type functions are part of C++’s mechanisms for compile-time computation that allow tighter
type checking and better performance than would otherwise have been possible
...
Here, I just present two facilities provided by the standard library: iterator_traits
(§5
...
2
...
4
...
2)
...
4
...
1 iterator_traits
The standard-library sort() takes a pair of iterators supposed to deﬁne a sequence (§4
...
Furthermore, those iterators must offer random access to that sequence, that is, they must be randomaccess iterators
...
In particular, a forward_list is a singly-linked list so subscripting would be expensive and there is no reasonable way
to refer back to a previous element
...
1
...

The standard library provides a mechanism, iterator_traits that allows us to check which kind of
iterator is supported
...
5
...
For example:
void test(vector& v, forward_list& lst)
{
sort(v); // sor t the vector
sort(lst); // sor t the singly-linked list
}

The techniques needed to make that work are generally useful
...
The version taking random-access iterator
arguments is trivial:

Section 5
...
2
...
begin(),v
...
begin(),v
...
3
...
3)
...

The real ‘‘type magic’’ is in the selection of helper functions:
template
void sort(C& c)
{
using Iter = Iterator_type;
sort_helper(c
...
end(),Iterator_category{});
}

Here, I use two type functions: Iterator_type returns the iterator type of C (that is, C::iterator) and
then Iterator_category{} constructs a ‘‘tag’’ value indicating the kind of iterator provided:
• std::random_access_iterator_tag if C’s iterator supports random access
...

Given that, we can select between the two sorting algorithms at compile time
...

The standard-library support for techniques for using iterators, such as tag dispatch, comes in
the form of a simple class template iterator_traits from (§33
...
3)
...
But then you can’t use the
techniques they support to improve your own code
...
4
...
2 Type Predicates
A standard-library type predicate is a simple type function that answers a fundamental question
about types
...
4
...
Other examples are is_class,
is_pod, is_literal_type, has_virtual_destructor, and is_base_of
...
For example:
template
class complex {
Scalar re, im;
public:
static_assert(Is_arithmetic(), "Sorry, I only support complex of arithmetic types");
//
...

5
...
3 pair and tuple
Often, we need some data that is just data; that is, a collection of values, rather than an object of a
class with a well-deﬁned semantics and an invariant for its value (§2
...
3
...
4)
...
Alternatively, we could let the standard library write the deﬁnition for us
...
6
...
We can use that to search in a sorted sequence of Records:
auto rec_eq = [](const Record& r1, const Record& r2) { return r1
...
name;};// compare names
void f(const vector& v)
// assume that v is sorted on its "name" ﬁeld
{
auto er = equal_range(v
...
end(),Record{"Reg"},rec_eq);

Section 5
...
3

pair

for (auto p = er
...
second; ++p)
cout << ∗p;

and tuple

127

// print all equal records
// assume that << is deﬁned for Record

}

The ﬁrst member of a pair is called ﬁrst and the second member is called second
...

The standard-library pair (from ) is quite frequently used in the standard library and
elsewhere
...
The make_pair() function makes it easy to create a pair without explicitly mentioning its type (§34
...
4
...
For example:
void f(vector& v)
{
auto pp = make_pair(v
...

}

// pp is a pair::iterator,int>

If you need more than two elements (or less), you can use tuple (from ; §34
...
4
...
A tuple
is a heterogeneous sequence of elements; for example:
tuple t2("Sild",123, 3
...
23);

// the type is deduced
// t is a tuple

string s = get<0>(t); // get ﬁrst element of tuple
int x = get<1>(t);
double d = get<2>(t);

The elements of a tuple are numbered (starting with zero), rather than named the way elements of
pairs are (ﬁrst and second)
...
5
...

Like pairs, tuples can be assigned and compared if their elements can be
...
It is less common to need three or more parts to
a result, so tuples are more often found in the implementations of generic algorithms
...
5 Regular Expressions
Regular expressions are a powerful tool for text processing
...
g
...
S
...
In , the standard library provides
support for regular expressions in the form of the std::regex class and its supporting functions
...
It speciﬁes a pattern starting with two letters \w{2} optionally followed by some space \s∗
followed by ﬁve digits \d{5} and optionally followed by a dash and four digits −\d{4}
...
Regular expressions are summarized in §37
...
1
...
3
...
1) starting with R"( and terminated by )"
...

The simplest way of using a pattern is to search for it in a stream:
int lineno = 0;
for (string line; getline(cin,line);) {
// read into line buffer
++lineno;
smatch matches;
// matched strings go here
if (regex_search(line,matches,pat))
// search for pat in line
cout << lineno << ": " << matches[0] << '\n';
}

The regex_search(line,matches,pat) searches the line for anything that matches the regular expression
stored in pat and if it ﬁnds any matches, it stores them in matches
...
The matches variable is of type smatch
...
The ﬁrst element, here matches[0], is
the complete match
...

5
...
However, C++ is heavily
used for numerical computation and the standard library reﬂects that
...
6
...
3)
...
4)
...
For
example:
void f()
{
list lst {1, 2, 3, 4, 5, 9999
...
begin(),lst
...
0); // calculate the sum
cout << s << '\n';
// print 10014
...
6)
...
6
...
6
...
3
...
, the standard
library complex is a template:
template
class complex {
public:
complex(const Scalar& re ={}, const Scalar& im ={});
//
...
For example:
void f(complex<ﬂoat> ﬂ, complex db)
{
complex ld {ﬂ+sqrt(db)};
db += ﬂ∗3;
ﬂ = pow(1/ﬂ,2);
//
...
For more details, see §40
...

5
...
3 Random Numbers
Random numbers are useful in many contexts, such as testing, games, simulation, and security
...
A random number generator consists of two parts:
[1] an engine that produces a sequence of random or pseudo-random values
...

Examples of distributions are uniform_int_distribution (where all integers produced are equally
likely), normal_distribution (‘‘the bell curve’’), and exponential_distribution (exponential growth);
each for some speciﬁed range
...
6
// make a generator

int x = die();

// roll the die: x becomes a value in [1:6]

The standard-library function bind() makes a function object that will invoke its ﬁrst argument
(here, one_to_six) given its second argument (here, re) as its argument (§33
...
1)
...

130

A Tour of C++: Concurrency and Utilities

Chapter 5

Thanks to its uncompromising attention to generality and performance one expert has deemed the
standard-library random number component ‘‘what every random number library wants to be when
it grows up
...
’’ The using statements makes
what is being done a bit more obvious
...

For novices (of any background) the fully general interface to the random number library can be
a serious obstacle
...

For example:
Rand_int rnd {1,10};
int x = rnd();

// make a random number generator for [1:10]
// x is a number in [1:10]

So, how could we get that? We have to get something like die() inside a class Rand_int:
class Rand_int {
public:
Rand_int(int low, int high) :dist{low,high} { }
int operator()() { return dist(re); }
// draw an int
private:
default_random_engine re;
uniform_int_distribution<> dist;
};

That deﬁnition is still ‘‘expert level,’’ but the use of Rand_int() is manageable in the ﬁrst week of a
C++ course for novices
...
size(); ++i) {
// write out a bar graph
cout << i << '\t';
for (int j=0; j!=mn[i]; ++j) cout << '∗';
cout << endl;
}
}

The output is a (reassuringly boring) uniform distribution (with reasonable statistical variation):
0
1
2
3
4

∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗
∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗
∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗
∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗
∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗∗

Section 5
...
3

Random Numbers

131

There is no standard graphics library for C++, so I use ‘‘ASCII graphics
...

For more information about random numbers, see §40
...

5
...
4 Vector Arithmetic
The vector described in §4
...
1 was designed to be a general mechanism for holding values, to be
ﬂexible, and to ﬁt into the architecture of containers, iterators, and algorithms
...
Adding such operations to vector would be easy, but its
generality and ﬂexibility precludes optimizations that are often considered essential for serious
numerical work
...

};

The usual arithmetic operations and the most common mathematical functions are supported for
valarrays
...
14+a2/a1;
// numeric array operators *, +, /, and =
a2 += a1∗3
...

}

For more details, see §40
...
In particular,
mensional computations
...
6
...
2
...
2
...
4)
...
7 Advice
[1]
[2]
[3]
[4]
[5]
[6]
[7]
[8]
[9]
[10]
[11]
[12]
[13]

Use resource handles to manage resources (RAII); §5
...

Use unique_ptr to refer to objects of polymorphic type; §5
...
1
...
2
...

Use type-safe mechanisms for concurrency; §5
...

Minimize the use of shared data; §5
...
4
...
3
...

Think in terms of concurrent tasks, rather than threads; §5
...
5
...
4
...
4
...

You can write code to explicitly depend on properties of types; §5
...
2
...
5
...
6
...
6
...

Part II
Basic Facilities
This part describes C++’s built-in types and the basic facilities for constructing programs out of them
...
It also discusses the basic facilities for
composing a C++ program out of logical and physical parts
...
I have long entertained a suspicion, with regard to the decisions of philosophers
upon all subjects, and found in myself a greater inclination to dispute, than assent to
their conclusions
...
When a philosopher
has once laid hold of a favourite principle, which perhaps accounts for many natural
effects, he extends the same principle over the whole creation, and reduces to it every
phænomenon, though by the most violent and absurd reasoning
...
PART I
...

– C
...
Parkinson

•
•

•

•
•
•

The ISO C++ Standard
Implementations; The Basic Source Character Set
Types
Fundamental Types; Booleans; Character Types; Integer Types; Floating-Point Types; Preﬁxes and Sufﬁxes; void; Sizes; Alignment
Declarations
The Structure of Declarations; Declaring Multiple Names; Names; Scope; Initialization;
Deducing a Type: auto and decltype()
Objects and Values
Lvalues and Rvalues; Lifetimes of Objects
Type Aliases
Advice

6
...
In
this book, references to the standard are of the form §iso
...
3
...
1
...
But don’t
expect the standard to be a tutorial or to be easily accessible by non-experts
...
The standard doesn’t say whether a piece of code is good or bad; it
simply says what a programmer can and cannot rely on from an implementation
...
They do so to access system interfaces and

136

Types and Declarations

Chapter 6

hardware features that cannot be expressed directly in C++ or require reliance on speciﬁc implementation details
...
This means that
each implementation must provide a speciﬁc, well-deﬁned behavior for a construct and that behavior must be documented
...
However, the behavior
of the initialization of c2 is implementation-deﬁned because the number of bits in a char is implementation-deﬁned
...
5
...
1)
...

Other behaviors are unspeciﬁed; that is, a range of possible behaviors are acceptable, but the
implementer is not obliged to specify which actually occur
...
For example,
the exact value returned by new is unspeciﬁed
...
2)
...
Such behavior is the price we pay for the ability to operate effectively on a large range of
systems
...
However, 16-bit and 32-bit character sets are not uncommon, and machines with
16-bit and 64-bit pointers are in wide use
...
A typical
example of this practice is to present all dependencies on hardware sizes in the form of constants
and type deﬁnitions in some header ﬁle
...
2)
...
4
...
3)
...
A construct is deemed undeﬁned by the standard if no reasonable
behavior is required by an implementation
...
For example:
const int size = 4∗1024;
char page[size];
void f()
{
page[size+size] = 7; // undeﬁned
}

Plausible outcomes of this code fragment include overwriting unrelated data and triggering a hardware error/exception
...

Where powerful optimizers are used, the actual effects of undeﬁned behavior can become quite
unpredictable
...
1

The ISO C++ Standard

137

unspeciﬁed or implementation-deﬁned rather than undeﬁned
...
In many cases, tools exist to help do this
...
1
...
17
...
1
...
A hosted implementation includes all the standard-library facilities as described in the standard (§30
...

A freestanding implementation may provide fewer standard-library facilities, as long as the following are provided:
Freestanding Implementation Headers
Types
Implementation properties
Integer types
Start and termination
Dynamic memory management
Type identiﬁcation
Exception handling
Initializer lists
Other run-time support
Type traits
Atomics

§10
...
1
§40
...
7
§43
...
2
...
5
§30
...
1
...
3
...
2
...
3
...
4
...
3

Freestanding implementations are meant for code running with only the most minimal operating
system support
...

6
...
2 The Basic Source Character Set
The C++ standard and the examples in this book are written using the basic source character set
consisting of the letters, digits, graphical characters, and whitespace characters from the U
...
variant of the international 7-bit character set ISO 646-1983 called ASCII (ANSI3
...
This can
cause problems for people who use C++ in an environment with a different character set:
• ASCII contains punctuation characters and operator symbols (such as ], {, and !) that are not
available in some character sets
...

• ASCII doesn’t contain characters (such as ñ, Þ, and Æ) that are used for writing languages
other than English
...
2
...
2)
...
2 Types
Consider:
x = y+f(2);

For this to make sense in a C++ program, the names x, y, and f must be suitably declared
...

Every name (identiﬁer) in a C++ program has a type associated with it
...
For example:
ﬂoat x;
int y = 7;
ﬂoat f(int);

// x is a ﬂoating-point variable
// y is an integer variable with the initial value 7
// f is a function taking an argument of type int and returning a ﬂoating-point number

These declarations would make the example meaningful
...
On the other hand, f is declared to be a function that
takes an int as its argument, so it can be called given the interger 2
...
2
...
3)
...
More extensive and
realistic examples are saved for later chapters
...
You must know these elements, plus the terminology and simple syntax that go with them, in order to complete a real project in C++ and especially to read code written by others
...
Consequently, you
may prefer to skim through this chapter, observing the major concepts, and return later as the need
for understanding more details arises
...
2
...
2
...
2
...
2
...
2
...
2
...
2 Pointer types (such as int∗)
§7
...
7 Reference types (such as double& and vector&&)
In addition, a user can deﬁne additional types:
§8
...
4 Enumeration types for representing speciﬁc sets of values (enum and enum class)

Section 6
...
1

Fundamental Types

139

The Boolean, character, and integer types are collectively called integral types
...
Enumerations and classes (Chapter 16)
are called user-deﬁned types because they must be deﬁned by users rather than being available for
use without previous declaration, the way fundamental types are
...
The standard library provides
many user-deﬁned types (Chapter 4, Chapter 5)
...
2
...
The assumption is that a computer provides bytes for holding characters, words for holding and computing integer values, some entity most suitable for ﬂoating-point computation, and
addresses for referring to those entities
...

For most applications, we could use bool for logical values, char for characters, int for integer
values, and double for ﬂoating-point values
...

6
...
2 Booleans
A Boolean, bool, can have one of the two values
results of logical operations
...

A Boolean is used to express the

void f(int a, int b)
{
bool b1 {a==b};
//
...

A common use of bool is as the type of the result of a function that tests some condition (a predicate)
...
Conversely, integers can be implicitly converted to bool values: nonzero integers convert to true and 0
converts to false
...
2
...
5)

int i1 = true;
int i2 {true};

// i1 becomes 1
// i2 becomes 1

If you prefer to use the {}-initializer syntax to prevent narrowing, yet still want to convert an int to a
bool, you can be explicit:

140

Types and Declarations

Chapter 6

void f(int i)
{
bool b {i!=0};
//
...
If the result needs to be converted back to bool,
a 0 is converted to false and a nonzero value is converted to true
...
5
...
5)
...
For example:
void g(int∗ p)
{
bool b = p;
bool b2 {p!=nullptr};

true;

// narrows to true or false
// explicit test against nullptr

if (p) {
// equivalent to p!=nullptr
//
...
The shorter form leaves fewer opportunities for mistakes
...
2
...
C++ provides a variety of character types that reﬂect that – often bewildering – variety:
• char: The default character type, used for program text
...

• signed char: Like char, but guaranteed to be signed, that is, capable of holding both positive
and negative values
...

• wchar_t: Provided to hold characters of a larger character set such as Unicode (see §7
...
2
...

The size of wchar_t is implementation-deﬁned and large enough to hold the largest character
set supported by the implementation’s locale (Chapter 39)
...

• char32_t: A type for holding 32-bit character sets, such as UTF-32
...
5)
...
2
...

A char variable can hold a character of the implementation’s character set
...
Typically, the
character set is a variant of ISO-646, for example ASCII, thus providing the characters appearing
on your keyboard
...

Serious variations occur between character sets supporting different natural languages and
between character sets supporting the same natural language in different ways
...
The larger and more interesting issue of
how to program in a multilingual, multi-character-set environment is beyond the scope of this book,
although it is alluded to in several places (§6
...
3, §36
...
1, Chapter 39)
...
It is not safe to
assume that:
• There are no more than 127 characters in an 8-bit character set (e
...
, some sets provide 255
characters)
...
g
...

• The alphabetic characters are contiguous (EBCDIC leaves a gap between 'i' and 'j')
...
g
...

• A char ﬁts in 1 byte
...
Also, one could reasonably use a 16-bit Unicode encoding for the
basic chars
...
This
general rule applies even to characters
...
For example, the value of 'b' is 98 in the ASCII character set
...

The possibility of converting a char to an integer raises the question: is a char signed or unsigned?
The 256 values represented by an 8-bit byte can be interpreted as the values 0 to 255 or as the values −127 to 127
...
Unfortunately, the choice of signed or unsigned for a plain char is implementationdeﬁned
...

142

Types and Declarations

Chapter 6

Fortunately, the difference matters only for values outside the 0 to 127 range, and the most common
characters are within that range
...
See
§6
...
3
...

Note that the character types are integral types (§6
...
1) so that arithmetic and bitwise logical
operations (§10
...
For example:
void digits()
{
for (int i=0; i!=10; ++i)
cout << static_cast('0'+i);
}

This is a way of writing the ten digits to cout
...
The resulting int is then converted to a char and written to cout
...

6
...
3
...
This opens the
possibility for some nasty surprises and implementation dependencies
...
On an implementation with
8-bit bytes, the answer depends on the meaning of the ‘‘all ones’’ char bit pattern when extended
into an int
...
On a machine where a char
is signed, the answer is −1
...
However, C++ does not offer a general mechanism for detecting this kind
of problem
...
Unfortunately,
some standard-library functions, such as strcmp(), take plain chars only (§43
...

A char must behave identically to either a signed char or an unsigned char
...
For example:
void f(char c, signed char sc, unsigned char uc)
{
char∗ pc = &uc;
// error : no pointer conversion
signed char∗ psc = pc;
// error : no pointer conversion
unsigned char∗ puc = pc;
// error : no pointer conversion
psc = puc;
// error : no pointer conversion
}

Variables of the three char types can be freely assigned to each other
...
5
...
1) is still undeﬁned
...
2
...
1

c = sc;
c = uc;
sc = uc;
uc = sc;
sc = c;
uc = c;

Signed and Unsigned Characters

143

// OK
// implementation-deﬁned if plain chars are signed and if uc’s value is too large
// implementation deﬁned if uc’s value is too large
// OK: conversion to unsigned
// implementation-deﬁned if plain chars are unsigned and if c’s value is too large
// OK: conversion to unsigned

}

To be concrete, assume that a char is 8 bits:
signed char sc = −160;
unsigned char uc = sc;
cout << uc;

// uc == 116 (because 256-160==116)
// print 't'

char count[256];
++count[sc];
++count[uc];

// assume 8-bit chars
// likely disaster: out-of-range access
// OK

None of these potential problems and confusions occur if you use plain
negative character values
...
2
...
2 Character Literals
A character literal is a single character enclosed in single quotes, for example, 'a' and '0'
...
A character literal can be implicitly converted to its integer value in
the character set of the machine on which the C++ program is to run
...
The use of character literals
rather than decimal notation makes programs more portable
...

Despite their appearance, these are single characters
...
There is no limit to the number of hexadecimal digits in the sequence
...
For example:
Octal

Hexadecimal

Decimal

ASCII

'\6'
'\60'
'\137'

'\x6'
'\x30'
'\x05f'

6
48
95

ACK
'0'
'_'

This makes it possible to represent every character in the machine’s character set and, in particular,
to embed such characters in character strings (see §7
...
2)
...

It is possible to enclose more than one character in a character literal, for example, 'ab'
...
The type of such a multicharacter
literal is int
...
The notation is hard enough to read without having to worry about
whether or not the character after a constant is a digit
...

Consider these examples:
char v1[] = "a\xah\129";
char v2[] = "a\xah\127";
char v3[] = "a\xad\127";
char v4[] = "a\xad\0127";

// 6 chars: 'a' '\xa' 'h' '\12' '9' '\0'
// 5 chars: 'a' '\xa' 'h' '\127' '\0'
// 4 chars: 'a' '\xad' '\127' '\0'
// 5 chars: 'a' '\xad' '\012' '7' '\0'

Wide character literals are of the form L'ab' and are of type wchar_t
...

A C++ program can manipulate character sets that are much richer than the 127-character
ASCII set, such as Unicode
...
For example:
U'\UFADEBEEF'
u'\uDEAD'
u'\xDEAD'

The shorter notation u'\uXXXX' is equivalent to U'\U0000XXXX' for any hexadecimal digit X
...
The meaning of the hexadecimal number is deﬁned by the ISO/IEC 10646 standard and such values are called universal
character names
...
2
...
2
...
2
...
3, §iso
...
14
...
E
...
2
...
In addition, integers come in four sizes: short int, ‘‘plain’’ int, long int, and long long int
...
Similarly, short is
a synonym for short int, unsigned for unsigned int, and signed for signed int
...

Section 6
...
4

Integer Types

145

The unsigned integer types are ideal for uses that treat storage as a bit array
...

Attempts to ensure that some values are positive by declaring variables unsigned will typically be
defeated by the implicit conversion rules (§10
...
1, §10
...
2
...

Unlike plain chars, plain ints are always signed
...

If you need more detailed control over integer sizes, you can use aliases from (§43
...
The plain integer types have well-deﬁned minimal sizes (§6
...
8), so the
are sometimes redundant and can be overused
...
These types must behave like integers and are considered integer types
when considering conversions and integer literal values, but they usually have greater range
(occupy more space)
...
2
...
1 Integer Literals
Integer literals come in three guises: decimal, octal, and hexadecimal
...
3
...

A literal starting with zero followed by x or X (0x or 0X) is a hexadecimal (base 16) number
...
For example:
Decimal

Octal

2
63
83

0
02
077
0123

Hexadecimal
0x0
0x2
0x3f
0x63

The letters a, b, c, d, e, and f, or their uppercase equivalents, are used to represent 10, 11, 12, 13, 14,
and 15, respectively
...

Using these notations to express genuine numbers can lead to surprises
...
Had more bits been used to represent an integer, it would have been the positive decimal number 65535
...
Similarly, the sufﬁx L can be used
to write explicitly long literals
...

Combinations of sufﬁxes are allowed
...
2
...
2)
...
5),
constexpr (§10
...
4) initializers
...
2
...
2 Types of Integer Literals
In general, the type of an integer literal depends on its form, value, and sufﬁx:
• If it is decimal and has no sufﬁx, it has the ﬁrst of these types in which its value can be represented: int, long int, long long int
...

• If it is sufﬁxed by u or U, its type is the ﬁrst of these types in which its value can be represented: unsigned int, unsigned long int, unsigned long long int
...

• If it is octal or hexadecimal and sufﬁxed by l or L, its type is the ﬁrst of these types in which
its value can be represented: long int, unsigned long int, long long int, unsigned long long int
...

• If it is decimal and is sufﬁxed by ll or LL, its type is long long int
...

• If it is sufﬁxed by llu, llU, ull, Ull, LLu, LLU, uLL, or ULL, its type is unsigned long long int
...
Similarly, 0XA000 is of type int on a machine with 32-bit ints but
of type unsigned int on a machine with 16-bit ints
...

6
...
5 Floating-Point Types
The ﬂoating-point types represent ﬂoating-point numbers
...
There are three ﬂoating-point
types: ﬂoat (single-precision), double (double-precision), and long double (extended-precision)
...

Choosing the right precision for a problem where the choice matters requires signiﬁcant understanding of ﬂoating-point computation
...

6
...
5
...
Again, a compiler ought to warn about ﬂoating-point literals that are too large to be represented
...
2
...
1

1
...
23

Floating-Point Literals

0
...

1
...
2e10

1
...
For example, 65
...
43

e

−

147

e−21

is

21

If you want a ﬂoating-point literal of type ﬂoat, you can deﬁne one using the sufﬁx f or F:
3
...
0f

2
...
9e−3f

If you want a ﬂoating-point literal of type long double, you can deﬁne one using the sufﬁx l or L:
3
...
0L

2
...
9e−3L

6
...
6 Preﬁxes and Sufﬁxes
There is a minor zoo of sufﬁxes indicating types of literals and also a few preﬁxes:
Arithmetic Literal Preﬁxes and Sufﬁxes
∗ﬁx
Meaning
Example
Reference

Notation
0
0x
u
l
ll

0X
U
L
LL

f
e

...
2
...
2
§iso
...
14
...
2
...
2
§iso
...
14
...
2
...
2
§iso
...
14
...
2
...
4
§iso
...
14
...
2
...
3
§iso
...
14
...
2
...
3
§iso
...
14
...
2
...
5
§iso
...
14
...
2
...
5
§iso
...
14
...
2
...
5
§iso
...
14
...
3

char
char16_t
char32_t
wchar_t

'c'
u'c'
U'c'
L'c'

string
raw string
UTF-8 string
UTF-16 string
UTF-32 string
wchar_t string

"mess"
R"(\b)"
u8"foo"
u"foo"
U"foo"
L"foo"

§6
...
4
...
2
...
1
§6
...
4
...
2
...
1
§6
...
4
...
2
...
1
§6
...
5
...
2
...
1
§6
...
3
...
2
...
2
§6
...
3
...
2
...
2
§7
...
2
§7
...
2
...
3
...
2
§7
...
2
...
3
...
2
§7
...
2
...
3
...
’’
Obviously, we could also consider
...
However, I consider the nomenclature less important than giving an overview of the
bewildering variety of literals
...

For example:

148

Types and Declarations

1LU
2UL
3ULL
4LLU
5LUL

Chapter 6

// unsigned long
// unsigned long
// unsigned long long
// unsigned long long
// error

The sufﬁxes l and L can be used for ﬂoating-point literals to express long double
...
0L

// long int
// long double

Combinations of R, L, and u preﬁxes are allowed, for example, uR"∗∗(foo\(bar))∗∗"
...
3
...
2)
...
For example, by deﬁning a
user-deﬁned literal operator (§19
...
6), we can get
"foo bar"s
123_km

// a literal of type std::string
// a literal of type Distance

Sufﬁxes not starting with _ are reserved for the standard library
...
2
...
It can, however, be used only as part of a more
complicated type; there are no objects of type void
...
For example:
void x;
void& r;
void f();
void∗ pv;

// error: there are no void objects
// error: there are no references to void
// function f does not return a value (§12
...
4)
// pointer to object of unknown type (§7
...
1)

When declaring a function, you must specify the type of the value returned
...
However, that would make a mess of the grammar (§iso
...
Consequently, void is used as a ‘‘pseudo
return type’’ to indicate that a function doesn’t return a value
...
2
...
1)
...
Why should you bother? People who program on a variety of systems or
use a variety of compilers care a lot because if they don’t, they are forced to waste time ﬁnding and
ﬁxing obscure bugs
...
’’ This is a narrow and shortsighted view
...
In addition, programs often need to be compiled with other compilers for the same system,
and even a future release of your favorite compiler may do some things differently from the current

Section 6
...
8

Sizes

149

one
...

It is relatively easy to limit the impact of implementation-dependent language features
...
Using standard-library facilities
wherever feasible is one approach
...
On many machines, there are signiﬁcant differences in memory requirements, memory access
times, and computation speed among the different varieties of fundamental types
...
Writing truly portable low-level code is harder
...
3
...
2 inch to a byte), a megabyte of memory would stretch about 3 miles (5 km) to
the right
...
The size of an object or type can be obtained using the sizeof operator
(§10
...
This is what is guaranteed about sizes of fundamental types:
•
•
•
•
•

1 ≡ sizeof(char) ≤ sizeof(short) ≤ sizeof(int) ≤ sizeof(long)
1 ≤ sizeof(bool) ≤ sizeof(long)
sizeof(char) ≤ sizeof(wchar_t) ≤ sizeof(long)
sizeof(ﬂoat) ≤ sizeof(double) ≤ sizeof(long double)
sizeof(N) ≡ sizeof(signed N) ≡ sizeof(unsigned N)

≤ sizeof(long long)

150

Types and Declarations

Chapter 6

In that last line, N can be char, short, int, long, or long long
...
A char can hold a character of
the machine’s character set
...
Similarly, the int type is supposed to be chosen to be the most suitable for holding
and manipulating integers on a given computer; it is typically a 4-byte (32-bit) word
...
For example, there are machines with 32-bit chars
...
Note that it is not guaranteed that
sizeof(long) ...
For example:
#include
// §40
...
2) are constexpr (§10
...

The fundamental types can be mixed freely in assignments and expressions
...
5)
...
Conversions that are not value-preserving are best avoided (§2
...
2, §10
...
2
...

If you need a speciﬁc size of integer, say, a 16-bit integer, you can #include the standard header
that deﬁnes a variety of types (or rather type aliases; §6
...
For example:
int16_t x {0xaabb};
int64_t xxxx {0xaaaabbbbccccdddd};
int_least16_t y;
int_least32_t yy
int_fast32_t z;

// 2 bytes
// 8 bytes
// at least 2 bytes (just like int)
// at least 4 bytes (just like long)
// the fastest int type with at least 4 bytes

The standard header deﬁnes an alias that is very widely used in both standard-library declarations and user code: size_t is an implementation-deﬁned unsigned integer type that can hold the
size in bytes of every object
...
For
example:
void∗ allocate(size_t n); // get n bytes

Similarly, deﬁnes the signed integer type ptrdiff_t for holding the result of subtracting two
pointers to get a number of elements
...
2
...
2
...
In addition, on some
machine architectures, the bytes used to hold it must have proper alignment for the hardware to
access it efﬁciently (or in extreme cases to access it at all)
...
Of course, this is all very implementation speciﬁc, and for most programmers
completely implicit
...
Where alignment most often becomes visible is in object layouts: sometimes
structs contain ‘‘holes’’ to improve alignment (§8
...
1)
...
For example:
auto ac = alignof('c');
auto ai = alignof(1);
auto ad = alignof(2
...
Instead, we can use the type speciﬁer alignas: alignas(T) means ‘‘align just like a T
...
size(),bufmax/sizeof(X));
uninitialized_copy(vx
...
begin()+max,buffer);
//
...
3 Declarations
Before a name (identiﬁer) can be used in a C++ program, it must be declared
...
For example:
char ch;
string s;
auto count = 1;
const double pi {3
...
2
...
4
...
5)
// type name

As can be seen from these examples, a declaration can do more than simply associate a type with a
name
...
A deﬁnition is a declaration that supplies all
that is needed in a program for the use of an entity
...
A different terminology deems declarations
parts of an interface and deﬁnitions parts of an implementation
...
2
...

Assuming that these declarations are in the global scope (§6
...
4), we have:
char ch;
auto count = 1;
const char∗ name = "Njal";

// set aside memory for a char and initialize it to 0
// set aside memory for an int initialized to 1
// set aside memory for a pointer to char
// set aside memory for a string literal "Njal"
// initialize the pointer with the address of that string literal

struct Date { int d, m, y; };
int day(Date∗ p) { return p−>d; }

// Date is a struct with three members
// day is a function that executes the speciﬁed code

using Point = std::complex;// Point is a name for std::complex

Of the declarations above, only three are not also deﬁnitions:
double sqrt(double);
extern int error_number;
struct User;

// function declaration
// variable declaration
// type name declaration

That is, if used, the entity they refer to must be deﬁned elsewhere
...
*/ }
int error_number = 1;
struct User { /*
...
2
...
However, there can be many declarations
...
So, this fragment has two errors:
int count;
int count;

// error : redeﬁnition

Section 6
...
2):
extern int error_number;
extern int error_number; // OK: redeclaration

Some deﬁnitions explicitly specify a ‘‘value’’ for the entities they deﬁne
...
1415926535897};

// Point is a name for std::complex

For types, aliases, templates, functions, and constants, the ‘‘value’’ is permanent
...
For example:
void f()
{
int count {1};
// initialize count to 1
const char∗ name {"Bjarne"}; // name is a variable that points to a constant (§7
...
3
...
3
...

Any declaration that speciﬁes a value is a deﬁnition
...
3
...
A)
...
However, without too
many radical simpliﬁcations, we can consider a declaration as having ﬁve parts (in order):
• Optional preﬁx speciﬁers (e
...
, static or virtual)
• A base type (e
...
, vector or const int)
• A declarator optionally including a name (e
...
, p[7], n, or ∗(∗)[])
• Optional sufﬁx function speciﬁers (e
...
, const or noexcept)
• An optional initializer or function body (e
...
, ={7,5,3} or {return x;})
Except for function and namespace deﬁnitions, a declaration is terminated by a semicolon
...

A speciﬁer is an initial keyword, such as virtual (§3
...
3, §20
...
2), extern (§15
...
2
...

154

Types and Declarations

Chapter 6

A declarator is composed of a name and optionally some declarator operators
...
7
...
7
...
However, ∗, [], and () were
designed to mirror their use in expressions (§10
...
Thus, ∗ is preﬁx and [] and () are postﬁx
...
Consequently, char∗kings[] is an array
of pointers to char, whereas char(∗kings)[] is a pointer to an array of char
...
2
...
For example:
const c = 7;

// error : no type

gt(int a, int b) // error : no return type
{
return (a>b) ? a : b;
}
unsigned ui;
long li;

// OK: ‘‘unsigned’’means ‘‘unsigned int’’
// OK: ‘‘long’’ means ‘‘long int’’

In this, standard C++ differs from early versions of C and C++ that allowed the ﬁrst two examples
by considering int to be the type when none was speciﬁed (§44
...
This ‘‘implicit int’’ rule was a
source of subtle errors and much confusion
...

Some type names don’t even look much like names, such as decltype(f(x)) (the return type of a call
f(x); §6
...
6
...

The volatile speciﬁer is described in §41
...

The alignas() speciﬁer is described in §6
...
9
...
3
...
The declaration simply contains a list
of comma-separated declarators
...
3
...
For example:
int∗ p, y;
// int* p; int y; NOT int* y;
int x, ∗q;
// int x; int* q;
int v[10], ∗pv; // int v[10]; int* pv;

Such declarations with multiple names and nontrivial declarators make a program harder to read
and should be avoided
...
3
...
The ﬁrst character must be a letter
...
C++ imposes no limit on the number of characters in a name
...
Some
run-time environments also make it necessary to extend or restrict the set of characters accepted in
an identiﬁer
...
g
...
A
C++ keyword (§6
...
3
...

Examples of names are:
hello
DEFINED
var0

this_is_a_most_unusually_long_identiﬁer_that_is_better_avoided
foO
bAr
u_name
HorseSense
var1
CLASS
_class
___

Examples of character sequences that cannot be used as identiﬁers are:
012
pay
...
name

class
if

3var

Nonlocal names starting with an underscore are reserved for special facilities in the implementation
and the run-time environment, so such names should not be used in application programs
...
g
...
17
...
4
...

When reading a program, the compiler always looks for the longest string of characters that
could make up a name
...

Also, elseif is a single name, not the keyword else followed by the keyword if
...
In general, it is best to avoid
names that differ only in subtle ways
...
Consequently, l0, lO, l1, ll, and I1l are poor choices for identiﬁer names
...

Names from a large scope ought to have relatively long and reasonably obvious names, such as
vector, Window_with_border, and Department_number
...
Functions (Chapter 12), classes
(Chapter 16), and namespaces (§14
...
1) can be used to keep scopes small
...

156

Types and Declarations

Chapter 6

Choose names to reﬂect the meaning of an entity rather than its implementation
...
4)
...
g
...

• The compiler is better at keeping track of types than you are
...
g
...

• Any system of type abbreviations you can come up with will become overelaborate and
cryptic as the variety of types you use increases
...

Try to maintain a consistent naming style
...

Also, use all capitals for macros (if you must use macros (§12
...
Use underscores to separate words in an identiﬁer;
number_of_elements is more readable than numberOfElements
...
Be consistent in your use of abbreviations and acronyms
...

phone_book

6
...
3
...

and_eq
break
class
decltype
else
for
long
not_eq
protected
signed
switch
try
using
xor

asm
case
compl
default
enum
friend
mutable
nullptr
public
sizeof
template
typedef
virtual
xor_eq

auto
catch
const
delete
explicit
goto
namespace
operator
register
static
this
typeid
void

Section 6
...
4

Scope

157

6
...
4 Scope
A declaration introduces a name into a scope; that is, a name can be used only in a speciﬁc part of
the program text
...
4) is called a local
name
...
A block is a section of code delimited by a {} pair
...

• Class scope: A name is called a member name (or a class member name) if it is deﬁned in a
class outside any function, class (Chapter 16), enum class (§8
...
1), or other namespace
...

• Namespace scope: A name is called a namespace member name if it is deﬁned in a namespace (§14
...
1) outside any function, lambda (§11
...
4
...
Its scope extends from the point of declaration to the end of
its namespace
...
2)
...
4
...
3
...
The scope of a global name
extends from the point of declaration to the end of the ﬁle in which its declaration occurs
...
2)
...

• Statement scope: A name is in a statement scope if it is deﬁned within the () part of a for-,
while-, if-, or switch-statement
...
All names in statement scope are local names
...
6) is in scope from its point of declaration until the end of the
function
...

That is, a name can be redeﬁned to refer to a different entity within a block
...
For example:
int x;
void f()
{
int x;
x = 1;
{
int x;
x = 2;
}
x = 3;
}
int∗ p = &x;

// global x

// local x hides global x
// assign to local x
// hides ﬁrst local x
// assign to second local x
// assign to ﬁrst local x

// take address of global x

158

Types and Declarations

Chapter 6

Hiding names is unavoidable when writing large programs
...
Because such errors are relatively rare, they can be very difﬁcult to ﬁnd
...

Using names such as i and x for global variables or for local variables in a large function is asking
for trouble
...
For example:
int x;
void f2()
{
int x = 1; // hide global x
::x = 2;
// assign to global x
x = 2;
// assign to local x
//
...

The scope of a name that is not a class member starts at its point of declaration, that is, after the
complete declarator and before the initializer
...
For example:
int x = 97;
void f3()
{
int x = x;
}

// perverse: initialize x with its own (uninitialized) value

A good compiler warns if a variable is used before it has been initialized
...
For example:
int x = 11;
void f4()
{
int y = x;
int x = 22;
y = x;
}

// perverse: use of two different objects both called x in a single scope
// use global x: y = 11
// use local x: y = 22

Again, such subtleties are best avoided
...

For example:
void f5(int x)
{
int x;
}

// error

Section 6
...
4

Scope

159

This is an error because x is deﬁned twice in the same scope
...
This
allows us to use conventional names for loop variables repeatedly in a function
...
size(), ++i) cout << v[i] << '\n';
for (auto i : {1, 2, 3, 4, 5, 6, 7}) cout << i << '\n';
}

This contains no name clashes
...
4
...

6
...
5 Initialization
If an initializer is speciﬁed for an object, that initializer determines the initial value of an object
...
It is clearer
and less error-prone than the alternatives
...
The two forms using = are what you use in
C
...
For example:
int x1 = 0;
char c1 = 'z';

However, anything much more complicated than that is better done using {}
...
8
...
4)
...
For example,
char to int is allowed, but not int to char
...
For example, ﬂoat to double is allowed, but not double to ﬂoat
...

• An integer value cannot be converted to a ﬂoating-point type
...
9, x2 becomes 7
char c2 = val2;
// if val2==1025, c2 becomes 1

160

Types and Declarations

Chapter 6

int x3 {val};
char c3 {val2};

// error : possible truncation
// error : possible narrowing

char c4 {24};
char c5 {264};

// OK: 24 can be represented exactly as a char
// error (assuming 8-bit chars): 264 cannot be represented as a char

int x4 {2
...

}

See §10
...

There is no advantage to using {} initialization, and one trap, when using auto to get the type
determined by the initializer
...
3
...
2)
...

It is possible to deﬁne a class so that an object can be initialized by a list of values and alternatively be constructed given a couple of arguments that are not simply values to be stored
...
Most types do not
offer such confusing alternatives – even most vectors do not; for example:
vector v1{"hello!"};
vector v2("hello!");

// v1 is a vector of 1 element with the value "hello!"
// error : no vector constructor takes a string literal

So, prefer {} initialization over alternatives unless you have a strong reason not to
...
For example:
int x4 {};
double d4 {};
char∗ p {};
vector v4{};
string s4 {};

// x4 becomes 0
// d4 becomes 0
...
For integral types, the default value is a suitable representation of
zero
...
2
...
For user-deﬁned types, the default value (if
any) is determined by the type’s constructors (§17
...
3)
...
2
...

Initialization of particular kinds of objects is discussed where appropriate:
• Pointers: §7
...
2, §7
...
2, §7
...
7
...
7
...
3
...
3
...
3
...
4
Classes: §17
...
1 (not using constructors), §17
...
2 (using constructors), §17
...
3 (default),
§17
...
5 (copy and move)
User-deﬁned containers: §17
...
4

6
...
5
...
If you do that
– and that has unfortunately been common – the situation is more complicated
...
The only really good case for an uninitialized variable is a large input buffer
...
get(buf,max); // read at most max characters into buf

We could easily have initialized buf:
char buf[max] {};

// initialize every char to 0

By redundantly initializing, we would have suffered a performance hit which just might have been
signiﬁcant
...
g
...

If no initializer is speciﬁed, a global (§6
...
4), namespace (§14
...
1), local static (§12
...
8), or
static member (§16
...
12) (collectively called static objects) is initialized to {} of the appropriate
type
...
0

Local variables and objects created on the free store (sometimes called dynamic objects or heap
objects; §11
...
3
...
For example:
void f()
{
int x;
char buf[1024];

// x does not have a well-deﬁned value
// buf[i] does not have a well-deﬁned value

int∗ p {new int};
char∗ q {new char[1024]};
string s;
vector v;

// s=="" because of string’s default constructor
// v=={} because of vector’s default constructor

string∗ ps {new string};
//
...
For example:
void ff()
{
int x {};
char buf[1024]{};

// x becomes 0
// buf[i] becomes 0 for all i

int∗ p {new int{10}};
char∗ q {new char[1024]{}};

// *p becomes 10
// q[i] becomes 0 for all i

//
...

6
...
5
...
More complicated
objects can require more than one value as an initializer
...
For example:
int a[] = { 1, 2 };
struct S { int x, string s };
S s = { 1, "Helios" };
complex z = { 0, pi };
vector v = { 0
...
1, 2
...
3 };

// array initializer
// struct initializer
// use constructor
// use list constructor

For C-style initialization of arrays, see §7
...
1
...
2
...
3
...
2
...
For initializer-list constructors, see §17
...
4
...
However, some prefer to add it to emphasize that a set of
values are used to initialize a set of member variables
...
3, §16
...
5)
...
3);

// use constructor
// use constructor : v gets 10 elements initialized to 3
...
1)
...
For example:
complex z1(1,2);
complex f1();

// function-style initializer (initialization by constructor)
// function declaration

complex z2 {1,2};
complex f2 {};

// initialization by constructor to {1,2}
// initialization by constructor to the default value {0,0}

Note that initialization using the {} notation does not narrow (§6
...
5)
...
For example:
auto x1 {1,2,3,4};
// x1 is an initializer_list
auto x2 {1
...
25, 3
...
0,2};
// error: cannot deduce the type of {1
...
3
...
2)

Section 6
...
5
...
3
...

• decltype(expr) for deducing the type of something that is not a simple initializer, such as the
return type for a function or the type of a class member
...

6
...
6
...

Instead, we can let the variable have the type of its initializer
...
That is, auto is a placeholder for the type of
the initializer
...
The
harder the type is to write and the harder the type is to know, the more useful auto becomes
...
begin(); p!=arg
...
begin(); p!=arg
...
Also, it is more resilient
to code changes
...
So, unless there is a good reason not to,
use auto in small scopes
...
That is, compared to
using a speciﬁc type, using auto can delay the detection of type errors
...

}

If auto causes surprises, the best cure is typically to make functions smaller, which most often is a
good idea anyway (§12
...

164

Types and Declarations

Chapter 6

We can decorate a deduced type with speciﬁers and modiﬁers (§6
...
1), such as const and & (reference; §7
...
For example:
void f(vector& v)
{
for (const auto& x : v) {
//
...

Note that the type of an expression is never a reference because references are implicitly dereferenced in expressions (§7
...
For example:
void g(int& v)
{
auto x = v;
auto& y = v;
}

// x is an int (not an int&)
// y is an int&

6
...
6
...
For example:
char v1 = 12345;
int v2 = 'c';
T v3 = f();

// 12345 is an int
// 'c' is a char

By using the {}-initializer syntax for such deﬁnitions, we minimize the chances for unfortunate conversions:
char v1 {12345};
int v2 {'c'};
T v3 {f()};

// error : narrowing
// ﬁne: implicit char->int conversion
// works if and only if the type of f() can be implicitly converted to a T

When we use auto, there is only one type involved, the type of the initializer, and we can safely use
the = syntax:
auto v1 = 12345;
auto v2 = 'c';
auto v3 = f();

// v1 is an int
// v2 is a char
// v3 is of some appropriate type

In fact, it can be an advantage to use the
prise someone:
auto v1 {12345};
auto v2 {'c'};
auto v3 {f()};

=

syntax with

// v1 is a list of int
// v2 is a list of char
// v3 is a list of some appropriate type

This is logical
...
3
...
2

auto x0 {};
auto x1 {1};
auto x2 {1,2};
auto x3 {1,2,3};

auto

and {}-lists

165

// error: cannot deduce a type
// list of int with one element
// list of int with two elements
// list of int with three elements

The type of a homogeneous list of elements of type T is taken to be of type initializer_list
(§3
...
1
...
3
...
In particular, the type of x1 is not deduced to be int
...
’’

6
...
6
...
But sometimes, we want to have a type
deduced without deﬁning an initialized variable
...
This is mostly useful in generic programming
...
What should be
the type of the result of the addition? A matrix, of course, but what might its element type be? The
obvious answer is that the element type of the sum is the type of the sum of the elements
...
1) to be able to express the return type in terms of the arguments: Matrix
...

In the deﬁnition, I again need decltype() to express Matrix’s element type:
template
auto operator+(const Matrix& a, const Matrix& b) −> Matrix
{
Matrix res;
for (int i=0; i!=a
...
cols(); ++j)
res(i,j) += a(i,j) + b(i,j);
return res;
}

6
...
g
...
g
...
Consequently, we need a name for
‘‘something in memory
...
That is,
an object is a contiguous region of storage; an lvalue is an expression that refers to an object
...
’’ However, not every lvalue may be used on the left-hand side of an assignment; an

166

Types and Declarations

Chapter 6

lvalue can refer to a constant (§7
...
An lvalue that has not been declared const is often called a
modiﬁable lvalue
...
2
...
3
...

6
...
1 Lvalues and Rvalues
To complement the notion of an lvalue, we have the notion of an rvalue
...
g
...

If you need to be more technical (say, because you want to read the ISO C++ standard), you
need a more reﬁned view of lvalue and rvalue
...

• Movable: The object may be moved from (i
...
, we are allowed to move its value to another
location and leave the object in a valid but unspeciﬁed state, rather than copying; §17
...

It turns out that three of the four possible combinations of those two properties are needed to precisely describe the C++ language rules (we have no need for objects that do not have identity and
cannot be moved)
...
The other
alternatives are prvalue (‘‘pure rvalue’’), glvalue (‘‘generalized lvalue’’), and xvalue (‘‘x’’ for ‘‘extraordinary’’ or ‘‘expert only’’; the suggestions for the meaning of this ‘‘x’’ have been quite imaginative)
...

}

// move vs to v2

Here, std::move(vs) is an xvalue: it clearly has identity (we can refer to it as vs), but we have explicitly given permission for it to be moved from by calling std::move() (§3
...
2, §35
...
1)
...
Note
that every expression is either an lvalue or an rvalue, but not both
...
4
...
Objects of types without a declared constructor, such as an int, can be considered to
have default constructors and destructors that do nothing
...
4
...
1
...
2
...
Such objects are sometimes called automatic objects
...

• Static: Objects declared in global or namespace scope (§6
...
4) and statics declared in functions (§12
...
8) or classes (§16
...
12) are created and initialized once (only) and ‘‘live’’ until
the program terminates (§15
...
3)
...
A static object has
the same address throughout the life of a program execution
...
3
...
3)
...
2)
...
g
...
If they
are bound to a reference, their lifetime is that of the reference; otherwise, they ‘‘live’’ until
the end of the full expression of which they are part
...
Typically, temporary objects are automatic
...
2
...

Static and automatic are traditionally referred to as storage classes
...

6
...
Possible reasons include:
• The original name is too long, complicated, or ugly (in some programmer’s eyes)
...

• A speciﬁc type is mentioned in one place only to simplify maintenance
...

};

// every container has a value_type

168

Types and Declarations

template
class list {
using value_type = T;
//
...
That is, an
alias refers to the type for which it is an alias
...
4) and classes (Chapter 16)
...
For example:
typedef int int32_t;
typedef short int16_t;
typedef void(∗PtoF)(int);

// equivalent to ‘‘using int32_t = int;’’
// equivalent to ‘‘using int16_t = short;’’
// equivalent to ‘‘using PtoF = void(*)(int);’’

Aliases are used when we want to insulate our code from details of the underlying machine
...
Having written our code in
terms of int32_t, rather than ‘‘plain int,’’ we can port our code to a machine with sizeof(int)==2 by
redeﬁning the single occurrence of int32_t in our code to use a longer integer:
using int32_t = long;

The _t sufﬁx is conventional for aliases (‘‘typedefs’’)
...
7)
...
3
...

The using keyword can also be used to introduce a template alias (§23
...
For example:
template
using Vector = std::vector>;

We cannot apply type speciﬁers, such as unsigned, to an alias
...
6 Advice
[1]
[2]
[3]
[4]
[5]

For the ﬁnal word on language deﬁnition issues, see the ISO C++ standard; §6
...

Avoid unspeciﬁed and undeﬁned behavior; §6
...

Isolate code that must depend on implementation-deﬁned behavior; §6
...

Avoid unnecessary assumptions about the numeric value of characters; §6
...
3
...
5
...
1
...
2
...
1
...
6

[6]
[7]
[8]
[9]
[10]
[11]
[12]
[13]
[14]
[15]
[16]
[17]
[18]
[19]
[20]
[21]
[22]
[23]

Advice

169

Avoid ‘‘magic constants’’; §6
...
4
...

Avoid unnecessary assumptions about the size of integers; §6
...
8
...
2
...

Prefer plain char over signed char and unsigned char; §6
...
3
...

Beware of conversions between signed and unsigned types; §6
...
3
...

Declare one name (only) per declaration; §6
...
2
...
3
...

Avoid similar-looking names; §6
...
3
...
3
...

Maintain a consistent naming style; §6
...
3
...
3
...

Keep scopes small; §6
...
4
...
3
...

Prefer the {}-initializer syntax for declarations with a named type; §6
...
5
...
3
...

Avoid uninitialized variables; §6
...
5
...

Use an alias to deﬁne a meaningful name for a built-in type in cases in which the built-in type
used to represent a value might change; §6
...

Use an alias to deﬁne synonyms for types; use enumerations and classes to deﬁne new types;
§6
...

This page intentionally left blank

7
Pointers, Arrays, and References
The sublime and the ridiculous
are often so nearly related that
it is difﬁcult to class them separately
...
1 Introduction
This chapter deals with the basic language mechanisms for referring to memory
...
’’ That is, they reside at a
speciﬁc address in memory, and an object can be accessed if you know its address and its type
...

172

Pointers, Arrays, and References

Chapter 7

7
...
’’ That is, a variable of type T∗ can hold the address of an
object of type T
...
This operation is also called indirection
...
For example:
char c = 'a';
char∗ p = &c; // p holds the address of c; & is the address-of operator
char c2 = ∗p; // c2 == ’a’; * is the dereference operator

The object pointed to by p is c, and the value stored in c is 'a', so the value of ∗p assigned to c2 is 'a'
...
4)
...
Most machines can address a byte
...
On the other hand, few machines can directly address
an individual bit
...
Note that a bool occupies at least as much space as
a char (§6
...
8)
...
1
...
2
...
2
...

The ∗, meaning ‘‘pointer to,’’ is used as a sufﬁx for a type name
...
3
...
A for the complete grammar
...
5
...
6
...
2
...
A void∗ is used for that
...
’’

Section 7
...
1

void∗

173

A pointer to any type of object can be assigned to a variable of type void∗, but a pointer to function (§12
...
6) cannot
...
Other operations would be unsafe because the compiler cannot know what kind of
object is really pointed to
...
To use a
void∗, we must explicitly convert it to a pointer to a speciﬁc type
...
5
...
For example, a machine may assume that every double is allocated on an 8-byte boundary
...
This form of explicit type conversion is inherently unsafe and ugly
...
5
...

The primary use for void∗ is for passing pointers to functions that are not allowed to make
assumptions about the type of the object and for returning untyped objects from functions
...

Functions using void∗ pointers typically exist at the very lowest level of the system, where real
hardware resources are manipulated
...
Where used for optimization, void∗ can be hidden behind
a type-safe interface (§27
...
1)
...
5) and pointers to members (§20
...

7
...
2 nullptr
The literal nullptr represents the null pointer, that is, a pointer that does not point to an object
...

174

Pointers, Arrays, and References

Chapter 7

Before nullptr was introduced, zero (0) was used as a notation for the null pointer
...
Zero (0) is an int
...
5
...
3) allow 0 to be
used as a constant of pointer or pointer-to-member type
...
For example:
int∗ p = NULL; // using the macro NULL

However, there are differences in the deﬁnition of NULL in different implementations; for example,
NULL might be 0 or 0L
...
2
...
3
...

7
...
’’ The elements are indexed from 0
to size−1
...
a[31]

You can access an array using the subscript operator, [], or through a pointer (using operator
operator []; §7
...
For example:

∗

or

void f()
{
int aa[10];
aa[6] = 9;
// assign to aa’s 7th element
int x = aa[99]; // undeﬁned behavior
}

Access out of the range of an array is undeﬁned and usually disastrous
...

The number of elements of the array, the array bound, must be a constant expression (§10
...
If
you need variable bounds, use a vector (§4
...
1, §31
...
For example:
void f(int n)
{
int v1[n];
vector v2(n);
}

// error: array size not a constant expression
// OK: vector with n int elements

Multidimensional arrays are represented as arrays of arrays (§7
...
2)
...
If what
you want is a simple ﬁxed-length sequence of objects of a given type in memory, an array is the
ideal solution
...

Section 7
...
4
...
For example:
int a1[10];

// 10 ints in static storage

void f()
{
int a2 [20];
int∗p = new int[40];
//
...
There is no array assignment, and the name of an array implicitly converts to a pointer to
its ﬁrst element at the slightest provocation (§7
...
In particular, avoid arrays in interfaces (e
...
, as
function arguments; §7
...
3, §12
...
2) because the implicit conversion to pointer is the root cause of
many common errors in C code and C-style C++ code
...
2
...
That’s most easily and
most reliably done by having the lifetime of the free-store array controlled by a resource handle
(e
...
, string (§19
...
3), vector (§13
...
2), or unique_ptr (§34
...
1))
...
Obviously, C programmers cannot follow
these pieces of advice because C lacks the ability to encapsulate arrays, but that doesn’t make the
advice bad in the context of C++
...
That’s the way
C stores strings, so a zero-terminated array of char is often called a C-style string
...
3
...
g
...
4) rely on it
...

7
...
1 Array Initializers
An array can be initialized by a list of values
...
Consequently, v1 and v2 are of type int[4] and
char[4], respectively
...
For example:
char v3[2] = { 'a', 'b', 0 };
char v4[3] = { 'a', 'b', 0 };

// error : too many initializers
// OK

If the initializer supplies too few elements for an array, 0 is used for the rest
...
You cannot initialize one array with another (not
even of exactly the same type), and there is no array assignment:
int v6[8] = v5; // error: can’t copy an array (cannot assign an int* to an array)
v6 = v5;
// error : no array assignment

Similarly, you can’t pass arrays by value
...
4
...
4
...
6, §34
...
2
...
5) instead
...
3
...

7
...
2 String Literals
A string literal is a character sequence enclosed within double quotes:
"this is a string"

A string literal contains one more character than it appears to have; it is terminated by the null character, '\0', with the value 0
...

In C and in older C++ code, you could assign a string literal to a non-const char∗:
void f()
{
char∗ p = "Plato";
p[4] = 'e';
}

// error, but accepted in pre-C++11-standard code
// error: assignment to const

It would obviously be unsafe to accept that assignment
...
Having string literals immutable is not only obvious but also allows implementations to do signiﬁcant optimizations
in the way string literals are stored and accessed
...
For example:
const char∗ error_message(int i)
{
//
...
3
...

Whether two identical string literals are allocated as one array or as two is implementationdeﬁned (§6
...
For example:
const char∗ p = "Heraclitus";
const char∗ q = "Heraclitus";
void g()
{
if (p == q) cout << "one!\n";
//
...

The empty string is written as a pair of adjacent double quotes, "", and has the type const
char[1]
...

The backslash convention for representing nongraphic characters (§6
...
3
...
This makes it possible to represent the double quote (") and the escape character
backslash (\) within a string
...

For example:
cout<<"beep at end of message\a\n";

The escape character, '\a', is the ASCII character BEL (also known as alert), which causes a sound
to be emitted
...
For example:
char alpha[] = "abcdefghijklmnopqrstuvwxyz"
"ABCDEFGHIJKLMNOPQRSTUVWXYZ";

The compiler will concatenate adjacent strings, so alpha could equivalently have been initialized by
the single string
"abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ";

It is possible to have the null character in a string, but most programs will not suspect that there are
characters after it
...
4
...
3
...
1 Raw Character Strings
To represent a backslash (\) or a double quote (") in a string literal, we have to precede it with a
backslash
...
However, if we need a lot of backslashes
and a lot of quotes in string literals, this simple technique becomes unmanageable
...
1
...
This is a convention shared by many programming languages, so we can’t just change it
...
Consider how to write the pattern representing two words separated by a
backslash (\):
string s = "\\w\\\\w";

// I hope I got that right

To prevent the frustration and errors caused by this clash of conventions, C++ provides raw string
literals
...
The initial R is there
to distinguish raw string literals from ordinary string literals
...
For example:
R"("quoted string")"

// the string is "quoted string"

So, how do we get the character sequence )" into a raw string literal? Fortunately, that’s a rare
problem, but "( and )" is only the default delimiter pair
...
For example:
R"∗∗∗("quoted string containing the usual terminator ("))")∗∗∗"
// "quoted string containing the usual terminator ("))"

The character sequence after the ) must be identical to the sequence before the (
...

Unless you work with regular expressions, raw string literals are probably just a curiosity (and
one more thing to learn), but regular expressions are useful and widely used
...
)∗\")|"

// Are the ﬁve backslashes correct or not?

With examples like that, even experts easily become confused, and raw string literals provide a signiﬁcant service
...
For example:
string counts {R"(1
22
333)"};

is equivalent to
string x {"1\n22\n333"};

7
...
2
...
2
...
Its type is const
wchar_t[]
...
3
...
1) of
wide characters of type const wchar_t[]
...

Section 7
...
2
...
This sounds
excessive, but there are three major encodings of Unicode: UTF-8, UTF-16, and UTF-32
...
All three UTF encodings support all Unicode characters, so which you use depends on the system you need to ﬁt into
...
g
...

UTF-8 is a variable-width encoding: common characters ﬁt into 1 byte, less frequently used
characters (by some estimate of use) into 2 bytes, and rarer characters into 3 or 4 bytes
...
The various Latin alphabets, Greek, Cyrillic, Hebrew, Arabic, and more ﬁt into 2 bytes
...

We can represent an ordinary English character string in a variety of ways
...

Obviously, the real purpose of Unicode strings is to be able to put Unicode characters into them
...
"

Printing that string appropriately gives you
The ofﬁcial vowels in Danish are: a, e, i, o, u, æ, ø, å and y
...
2
...
3) [Unicode,1996]
...
For example, u'0430' (Cyrillic lowercase letter ‘‘a’’) is the
2-byte hexadecimal value D0B0 in UTF-8, the 2-byte hexadecimal value 0403 in UTF-16, and the
4-byte hexadecimal value 00000403 in UTF-32
...

The order of the us and Rs and their cases are signiﬁcant: RU and Ur are not valid string preﬁxes
...
4 Pointers into Arrays
In C++, pointers and arrays are closely related
...
For example:

180

Pointers, Arrays, and References

Chapter 7

int v[] = { 1, 2, 3, 4 };
int∗ p1 = v;
// pointer to initial element (implicit conversion)
int∗ p2 = &v[0];
// pointer to initial element
int∗ p3 = v+4;
// pointer to one-beyond-last element

or graphically:
p1

v:

p2

1

2

3

p3

4

Taking a pointer to the element one beyond the end of an array is guaranteed to work
...
5, §33
...
However, since such a pointer does not in fact point
to an element of the array, it may not be used for reading or writing
...
For example:
int∗ p4 = v−1; // before the beginning, undeﬁned: don’t do it
int∗ p5 = v+7; // beyond the end, undeﬁned: don’t do it

The implicit conversion of an array name to a pointer to the initial element of the array is extensively used in function calls in C-style code
...
h>

void f()
{
char v[] = "Annemarie";
char∗ p = v;
// implicit conversion of char[] to char*
strlen(p);
strlen(v);
// implicit conversion of char[] to char*
v = p;
// error: cannot assign to array
}

The same value is passed to the standard-library function strlen() in both calls
...
In other words, there is no way of declaring a function
so that the array v is copied when the function is called
...

The implicit conversion of the array argument to a pointer means that the size of the array is lost
to the called function
...
Like other C standard-library functions taking pointers to characters, strlen()
relies on zero to indicate end-of-string; strlen(p) returns the number of characters up to and not
including the terminating 0
...
The standard-library vector (§4
...
1, §13
...
4), array (§8
...
4, §34
...
1), and string (§4
...
These library types
give their number of elements as their size() without having to count elements each time
...
4
...
4
...
5, Chapter 32)
...
For example:
void ﬁ(char v[])
{
for (int i = 0; v[i]!=0; ++i)
use(v[i]);
}
void fp(char v[])
{
for (char∗ p = v; ∗p!=0; ++p)
use(∗p);
}

The preﬁx ∗ operator dereferences a pointer so that ∗p is the character pointed to by p, and ++ increments the pointer so that it refers to the next element of the array
...
With modern compilers, identical code should be (and usually is) generated for both examples
...

Subscripting a built-in array is deﬁned in terms of the pointer operations + and ∗
...
For example, 3["Texas"]=="Texas"[3]=='a'
...
These equivalences are pretty low-level and do not
hold for standard-library containers, such as array and vector
...
When an arithmetic operator is applied to a pointer p of type T∗, p is assumed
to point to an element of an array of objects of type T; p+1 points to the next element of that array,
and p−1 points to the previous element
...
For example:
template
int byte_diff(T∗ p, T∗ q)
{
return reinterpret_cast(q)−reinterpret_cast(p);
}
void diff_test()
{
int vi[10];
short vs[10];

182

Pointers, Arrays, and References

Chapter 7

cout << vi << ' ' << &vi[1] << ' ' << &vi[1]−&vi[0] << ' ' << byte_diff(&vi[0],&vi[1]) << '\n';
cout << vs << ' ' << &vs[1] << ' ' << &vs[1]−&vs[0] << ' ' << byte_diff(&vs[0],&vs[1]) << '\n';
}

This produced:
0x7fffaef0 0x7fffaef4 1 4
0x7fffaedc 0x7fffaede 1 2

The pointer values were printed using the default hexadecimal notation
...

Subtraction of pointers is deﬁned only when both pointers point to elements of the same array
(although the language has no fast way of ensuring that is the case)
...
One can add an integer to a pointer or subtract an integer from a pointer; in both cases, the
result is a pointer value
...
For example:
void f()
{
int v1[10];
int v2[10];
int i1 = &v1[5]−&v1[3];
int i2 = &v1[5]−&v2[3];

// i1 = 2
// result undeﬁned

int∗ p1 = v2+2;
int∗ p2 = v2−2;

// p1 = &v2[2]
// *p2 undeﬁned

}

Complicated pointer arithmetic is usually unnecessary and best avoided
...

Arrays are not self-describing because the number of elements of an array is not guaranteed to
be stored with the array
...
For example:
void fp(char v[], int size)
{
for (int i=0; i!=size; ++i)
use(v[i]);
for (int x : v)
use(x);
const int N = 7;
char v2[N];
for (int i=0; i!=N; ++i)
use(v2[i]);
for (int x : v2)
use(x);
}

// hope that v has at least size elements
// error : range-for does not work for pointers

// range-for works for arrays of known size

Section 7
...
1

Navigating Arrays

183

This array concept is inherently low-level
...
2
...
2
...

Some C++ implementations offer optional range checking for arrays
...
If you are not using range checking for individual accesses, try to maintain a consistent policy of accessing elements only in well-deﬁned ranges
...

7
...
2 Multidimensional Arrays
Multidimensional arrays are represented as arrays of arrays; a 3-by-5 array is declared like this:
int ma[3][5];

// 3 arrays with 5 ints each

We can initialize ma like this:
void init_ma()
{
for (int i = 0; i!=3; i++)
for (int j = 0; j!=5; j++)
ma[i][j] = 10∗i+j;
}

or graphically:
ma:

00 01 02 03 04 10 11 12 13 14 20 21 22 23 24

The array ma is simply 15 ints that we access as if it were 3 arrays of 5 ints
...
The dimensions 3
and 5 exist in the compiler source only
...
For example, we might print ma like this:
void print_ma()
{
for (int i = 0; i!=3; i++) {
for (int j = 0; j!=5; j++)
cout << ma[i][j] << '\t';
cout << '\n';
}
}

The comma notation used for array bounds in some languages cannot be used in C++ because the
comma (,) is a sequencing operator (§10
...
2)
...
For example:
int bad[3,5];
int good[3][5];
int ouch = good[1,4];
int nice = good[1][4];

// error: comma not allowed in constant expression
// 3 arrays with 5 ints each
// error: int initialized by int* (good[1,4] means good[4], which is an int*)

184

Pointers, Arrays, and References

Chapter 7

7
...
3 Passing Arrays
Arrays cannot directly be passed by value
...
For example:
void comp(double arg[10])
{
for (int i=0; i!=10; ++i)
arg[i]+=99;
}

// arg is a double*

void f()
{
double a1[10];
double a2[5];
double a3[100];
comp(a1);
comp(a2);
comp(a3);

// disaster!
// uses only the ﬁrst 10 elements

};

This code looks sane, but it is not
...
Also, anyone who guessed that the array was passed by value will be disappointed:
the writes to arg[i] are writes directly to the elements of comp()’s argument, rather than to a copy
...
When used as a function argument, the ﬁrst dimension of
an array is simply treated as a pointer
...
This implies
that if you want to pass a sequence of elements without losing size information, you should not
pass a built-in array
...

If you insist on using arrays directly, you will have to deal with bugs and confusion without getting noticeable advantages in return
...
If the dimensions are known at compile time, there is no problem:
void print_m35(int m[3][5])
{
for (int i = 0; i!=3; i++) {
for (int j = 0; j!=5; j++)
cout << m[i][j] << '\t';
cout << '\n';
}
}

Section 7
...
3

Passing Arrays

185

A matrix represented as a multidimensional array is passed as a pointer (rather than copied; §7
...

The ﬁrst dimension of an array is irrelevant to ﬁnding the location of an element; it simply states
how many elements (here, 3) of the appropriate type (here, int[5]) are present
...
The ﬁrst dimension can therefore be passed as an argument:
void print_mi5(int m[][5], int dim1)
{
for (int i = 0; i!=dim1; i++) {
for (int j = 0; j!=5; j++)
cout << m[i][j] << '\t';
cout << '\n';
}
}

When both dimensions need to be passed, the ‘‘obvious solution’’ does not work:
void print_mij(int m[][], int dim1, int dim2)
// doesn’t behave as most people would think
{
for (int i = 0; i!=dim1; i++) {
for (int j = 0; j!=dim2; j++)
cout << m[i][j] << '\t';
// sur prise!
cout << '\n';
}
}

Fortunately, the argument declaration m[][] is illegal because the second dimension of a multidimensional array must be known in order to ﬁnd the location of an element
...
A correct solution is:
void print_mij(int∗ m, int dim1, int dim2)
{
for (int i = 0; i!=dim1; i++) {
for (int j = 0; j!=dim2; j++)
cout << m[i∗dim2+j] << '\t'; // obscure
cout << '\n';
}
}

The expression used for accessing the members in print_mij() is equivalent to the one the compiler
generates when it knows the last dimension
...
This kind of subtle and messy code is best hidden
...
In that way, you might ease the
task of the next programmer to touch the code
...
2
...
5
...

The standard vector (§31
...

7
...
2
...
4)
...
2
...

Basically, constexpr’s role is to enable and ensure compile-time evaluation, whereas const’s primary role is to specify immutability in interfaces
...

Many objects don’t have their values changed after initialization:
• Symbolic constants lead to more maintainable code than using literals directly in code
...

• Most function parameters are read but not written to
...
For example:
const int model = 90;
const int v[] = { 1, 2, 3, 4 };
const int x;

// model is a const
// v[i] is a const
// error : no initializer

Because an object declared const cannot be assigned to, it must be initialized
...
For example:
void g(const X∗ p)
{
// can’t modify *p here
}

Section 7
...

}

Pointers and const

187

// val can be modiﬁed here

When using a pointer, two objects are involved: the pointer itself and the object pointed to
...
To
declare a pointer itself, rather than the object pointed to, to be a constant, we use the declarator
operator ∗const instead of plain ∗
...
There is no const∗ declarator operator, so a const appearing before the ∗ is taken to be part of the base type
...
’’
An object that is a constant when accessed through one pointer may be variable when accessed
in other ways
...
By declaring a pointer argument
const, the function is prohibited from modifying the object pointed to
...
The second version is used for mutable strings
...
However, the address of a constant cannot be assigned to an unrestricted pointer
because this would allow the object’s value to be changed
...
2
...
5)
...
6 Pointers and Ownership
A resource is something that has to be acquired and later released (§5
...
Memory acquired by new
and released by delete (§11
...
2) are examples of resources where the most direct handle to the resource is a pointer
...
Consider:
void confused(int∗ p)
{
// delete p?
}
int global {7};
void f()
{
X∗ pn = new int{7};
int i {7};
int q = &i;
confused(pn);
confused(q);
confused(&global);
}

If confused() deletes p the program will seriously misbehave for the second two calls because we
may not delete objects not allocated by new (§11
...
If confused() does not delete p the program
leaks (§11
...
1)
...

It is usually a good idea to immediately place a pointer that represents ownership in a resource
handle class, such as vector, string, and unique_ptr
...
Chapter 13 discusses
resource management in greater detail
...
7

References

189

7
...
The type of the pointer determines what can
be done to the data through the pointer
...
m
...

• We must be more careful when using pointers than when using an object directly: a pointer
may be a nullptr or point to an object that wasn’t the one we expected
...
Worse, managing pointer variables with varying values and protecting code against the possibility of nullptr can be a signiﬁcant burden
...
The language mechanism addressing these problems is
called a reference
...

• A reference always refers to the object to which it was initialized
...
7
...

A reference is an alternative name for an object, an alias
...
For example:
template
class vector {
T∗ elem;
//
...

// pass element to be added by reference

};
void f(const vector& v)
{
double d1 = v[1];
// copy the value of the double referred to by v
...
operator[](2)
v
...

190

Pointers, Arrays, and References

Chapter 7

To reﬂect the lvalue/rvalue and const/non-const distinctions, there are three kinds of references:
• lvalue references: to refer to objects whose value we want to change
• const references: to refer to objects whose value we do not want to change (e
...
, a constant)
• rvalue references: to refer to objects whose value we do not need to preserve after we have
used it (e
...
, a temporary)
Collectively, they are called references
...

7
...
1 Lvalue References
In a type name, the notation X& means ‘‘reference to X
...
For example:
void f()
{
int var = 1;
int& r {var};
int x = r;
r = 2;

// r and var now refer to the same int
// x becomes 1
// var becomes 2

}

To ensure that a reference is a name for something (that is, that it is bound to an object), we must
initialize the reference
...
Despite appearances, no operator operates on a reference
...
Consequently, the value of a reference cannot be changed after initialization; it always
refers to the object it was initialized to denote
...
Thus, we cannot have a pointer to a reference
...
In that sense, a reference is not an object
...
It doesn’t do much harm to think about references that way, as long as one remembers that a reference isn’t an object that can be manipulated the way a pointer is:

Section 7
...
1

Lvalue References

pp:

191

&ii
rr:

ii:

1

In some cases, the compiler can optimize away a reference so that there is no object representing
that reference at run time
...
4)
...

The initializer for a const T& need not be an lvalue or even of type T
...
5)
...

[3] Finally, this temporary variable is used as the value of the initializer
...

References to variables and references to constants are distinguished because introducing a temporary for a variable would have been highly error-prone; an assignment to the variable would
become an assignment to the – soon-to-disappear – temporary
...
2
...

A reference can be used to specify a function argument so that the function can change the
value of an object passed to it
...
To keep a program readable, it is often best to
avoid functions that modify their arguments
...
Consequently, ‘‘plain’’ reference arguments should be used only where the name of
the function gives a strong hint that the reference argument is modiﬁed
...
This is mostly used to deﬁne functions that can be
used on both the left-hand and right-hand sides of an assignment
...
For
example:
template
class Map {
// a simple map class
public:
V& operator[](const K& v);
// return the value corresponding to the key v
pair∗ begin() { return &elem[0]; }
pair∗ end() { return &elem[0]+elem
...
4
...
4
...
ﬁrst)
return x
...
push_back({k,V{}});
return elem
...
second;

// add pair at end (§4
...
2)
// return the (default) value of the new element

}

I pass the key argument, k, by reference because it might be of a type that is expensive to copy
...

I use a const reference for k because I don’t want to modify it and because I might want to use a literal or a temporary object as an argument
...
For example:

Section 7
...
1

Lvalue References

193

int main() // count the number of occurrences of each word on input
{
Map buf;
for (string s; cin>>s;) ++buf[s];
for (const auto& x : buf)
cout << x
...
second << '\n';
}

Each time around, the input loop reads one word from the standard input stream cin into the string s
(§4
...
2) and then updates the counter associated with it
...
For example, given the input
aa bb bb aa aa bb aa aa

this program will produce
aa: 5
bb: 3

The range- for loop works for this because
standard-library map
...
7
...

• A const lvalue reference refers to a constant, which is immutable from the point of view of
the user of the reference
...

We want to know if a reference refers to a temporary, because if it does, we can sometimes turn an
expensive copy operation into a cheap move operation (§3
...
2, §17
...
5
...
An object (such as
a string or a list) that is represented by a small descriptor pointing to a potentially huge amount of
information can be simply and cheaply moved if we know that the source isn’t going to be used
again
...
3
...

An rvalue reference can bind to an rvalue, but not to an lvalue
...
For example:
string var {"Cambridge"};
string f();
string& r1 {var};
string& r2 {f()};
string& r3 {"Princeton"};

// lvalue reference, bind r1 to var (an lvalue)
// lvalue reference, error : f() is an rvalue
// lvalue reference, error : cannot bind to temporar y

194

Pointers, Arrays, and References

string&& rr1 {f()};
string&& rr2 {var};
string&& rr3 {"Oxford"};

Chapter 7

// rvalue reference, ﬁne: bind rr1 to rvalue (a temporar y)
// rvalue reference, error : var is an lvalue
// rr3 refers to a temporar y holding "Oxford"

const string cr1& {"Harvard"}; // OK: make temporar y and bind to cr1

The && declarator operator means ‘‘rvalue reference
...
Both a
const lvalue reference and an rvalue reference can bind to an rvalue
...

• We use a const lvalue reference to prevent modiﬁcation of an argument
...
For example:
string f(string&& s)
{
if (s
...
Consider:
template
swap(T& a, T& b)
// "old-style swap"
{
T tmp {a}; // now we have two copies of a
a = b;
// now we have two copies of b
b = tmp; // now we have two copies of tmp (aka a)
}

If T is a type for which it can be expensive to copy elements, such as string and vector, this swap()
becomes an expensive operation
...
We can tell that to the compiler:
template
void swap(T& a, T& b)
// "perfect swap" (almost)
{
T tmp {static_cast(a)}; // the initialization may write to a
a = static_cast(b);
// the assignment may write to b
b = static_cast(tmp); // the assignment may write to tmp
}

The result value of static_cast(x) is an rvalue of type T&& for x
...
In particular, if a type T has a move constructor (§3
...
2, §17
...
2) or a move assignment, it will be used
...
7
...

vector(const vector& r); // copy constructor (copy r’s representation)
vector(vector&& r);
// move constructor ("steal" representation from r)
};
vector s;
vector s2 {s};
vector s3 {s+"tail");

// s is an lvalue, so use copy constructor
// s+"tail" is an rvalue so pick move constructor

The use of static_cast in swap() is a bit verbose and slightly prone to mistyping, so the standard
library provides a move() function: move(x) means static_cast(x) where X is the type of x
...

Since move(x) does not move x (it simply produces an rvalue reference to x), it would have been
better if move() had been called rval(), but by now move() has been used for years
...
Consider:
void f(vector& v)
{
swap(v,vector{1,2,3});
//
...
A solution is to augment it by two overloads:
template void swap(T&& a, T& b);
template void swap(T& a, T&& b)

Our example will be handled by that last version of swap()
...
(§31
...
3) to handle the most
common cases of rvalue arguments to swap():
void f(string& s, vector& v)
{
s
...
capacity()==s
...
capacity()==s
...
clear();
swap(v
...
5
...
1, §35
...
1)
...
3
...

Also, their operations that insert new elements, such as insert() and push_back(), have versions that
take rvalue references
...
7
...
But what kind of reference? Lvalue reference or rvalue
reference? Consider:
using rr_i = int&&;
using lr_i = int&;
using rr_rr_i = rr_i&&;
using lr_rr_i = rr_i&;
using rr_lr_i = lr_i&&;
using lr_lr_i = lr_i&;

// ‘‘int && &&’’ is an int&&
// ‘‘int && &’’ is an int&
// ‘‘int & &&’’ is an int&
// ‘‘int & &’’ is an int&

In other words, lvalue reference always wins
...
This is sometimes known as reference
collapse
...
4
...
5) or a template type
argument (§23
...
2
...

7
...
4 Pointers and References
Pointers and references are two mechanisms for referring to an object from different places in a
program without copying
...

If you need to change which object to refer to, use a pointer
...
1
...
For example:

Section 7
...
4

Pointers and References

197

void fp(char∗ p)
{
while (∗p)
cout << ++∗p;
}
void fr(char& r)
{
while (r)
cout << ++r;

// oops: increments the char referred to, not the reference
// near-inﬁnite loop!

}
void fr2(char& r)
{
char∗ p = &r;
// get a pointer to the object referred to
while (∗p)
cout << ++∗p;
}

Conversely, if you want to be sure that a name always refers to the same object, use a reference
...

};

// Proxy refers to the object with which it is initialized

template class Handle { // Handle refers to its current object
T∗ m;
public:
Proxy(T∗ mm) :m{mm} {}
void rebind(T∗ mm) { m = mm; }
//
...
1) on something that refers to an
object, use a reference:
Matrix operator+(const Matrix&, const Matrix&);
Matrix operator−(const Matrix∗, const Matrix∗);

// OK
// error : no user-deﬁned type argument

Matrix y, z;
//
...
2
...

198

Pointers, Arrays, and References

Chapter 7

If you want a collection of something that refers to an object, you must use a pointer:
int x, y;
string& a1[] = {x, y};
string∗ a2[] = {&x, &y};
vector s1 = {x , y};
vector s2 = {&x, &y};

// error : array of references
// OK
// error : vector of references
// OK

Once we leave the cases where C++ leaves no choice for the programmer, we enter the domain of
aesthetics
...

If you need a notion of ‘‘no value,’’ pointers offer nullptr
...
For example:
void fp(X∗ p)
{
if (p == nullptr) {
// no value
}
else {
// use *p
}
}
void fr(X& r) // common style
{
// assume that r is valid and use it
}

If you really want to, you can construct and check for a ‘‘null reference’’ for a particular type:
void fr2(X& r)
{
if (&r == &nullX) {
// no value
}
else {
// use r
}
}

// or maybe r==nullX

Obviously, you need to have suitably deﬁned nullX
...
A programmer is allowed to assume that a reference is valid
...
For example:
char∗ ident(char ∗ p) { return p; }
char& r {∗ident(nullptr)}; // invalid code

This code is not valid C++ code
...

Section 7
...
8 Advice
[1]
[2]
[3]
[4]
[5]
[6]
[7]
[8]
[9]
[10]
[11]
[12]
[13]
[14]

Keep use of pointers simple and straightforward; §7
...
1
...
4
...
4
...

Avoid multidimensional arrays; deﬁne suitable containers instead; §7
...
2
...
2
...

Use containers (e
...
, vector, array, and valarray) rather than built-in (C-style) arrays; §7
...
1
...
4
...
3
...
1
...
7
...

Use rvalue references (only) for forwarding and move semantics; §7
...
2
...
6
...
2
...

Use const pointers and const references to express immutability in interfaces; §7
...

Prefer references to pointers as arguments, except where ‘‘no object’’ is a reasonable option;
§7
...
4
...

– The people

•
•

•
•
•

Introduction
Structures
struct Layout; struct Names; Structures and Classes; Structures and Arrays; Type Equivalence; Plain Old Data; Fields
Unions
Unions and Classes; Anonymous unions
Enumerations
enum classes; Plain enums; Unnamed enums
Advice

8
...
This chapter introduces the three most primitive variants of the notion of a user-deﬁned type:
• A struct (a structure) is a sequence of elements (called members) of arbitrary types
...

• An enum (an enumeration) is a type with a set of named constants (called enumerators)
...

Variants of these kinds of simple types have existed since the earliest days of C++
...

The notion of a struct as described here is a simple form of a class (§3
...

202

Structures, Unions, and Enumerations

Chapter 8

8
...
In its simplest form, a struct is an aggregate
of elements of arbitrary types
...
Note the terminating semicolon
...
(dot) operator
...
name = "Jim Dandy";
jd
...
3
...
For example:
Address jd = {
"Jim Dandy",
61, "South St",
"New Providence",
{'N','J'}, "07974"
};

Note that jd
...
Strings are terminated by a zero character, '\0', so "NJ" has three characters – one more than will ﬁt into jd
...
I deliberately use rather
low-level types for the members to illustrate how that can be done and what kinds of problems it
can cause
...

For example:
void print_addr(Address∗ p)
{
cout << p−>name << '\n'
<< p−>number << ' ' << p−>street << '\n'
<< p−>town << '\n'
<< p−>state[0] << p−>state[1] << ' ' << p−>zip << '\n';
}

When p is a pointer, p−>m is equivalent to (∗p)
...

Section 8
...

203

(struct member

void print_addr2(const Address& r)
{
cout << r
...
number << ' ' << r
...
town << '\n'
<< r
...
state[1] << ' ' << r
...
2
...
For example:
Address current;
Address set_current(Address next)
{
address prev = current;
current = next;
return prev;
}

Other plausible operations, such as comparison (== and !=), are not available by default
...
2
...
1, Chapter 18)
...
2
...
For example, we might store
primitive equipment readout in a structure like this:
struct Readout {
char hour;
int value;
char seq;
};

// [0:23]
// sequence mark ['a':'z']

You could imagine the members of a Readout object laid out in memory like this:
hour: value:

seq:

Members are allocated in memory in declaration order, so the address of hour must be less than the
address of value
...
2
...

However, the size of an object of a struct is not necessarily the sum of the sizes of its members
...
For example, integers are often allocated on word boundaries
...
2
...
This leads to ‘‘holes’’ in the structures
...

You can minimize wasted space by simply ordering members by size (largest member ﬁrst)
...
The
reason is that we need to maintain alignment when we put two objects next to each other, say, in an
array of Readouts
...

It is usually best to order members for readability and sort them by size only if there is a
demonstrated need to optimize
...
e
...
5)
...
2
...
For example:
struct Link {
Link∗ previous;
Link∗ successor;
};

However, it is not possible to declare new objects of a struct until its complete declaration has been
seen
...
To allow two (or

Section 8
...
2

struct

Names

205

more) structs to refer to each other, we can declare a name to be the name of a struct
...

The name of a struct can be used before the type is deﬁned as long as that use does not require
the name of a member or the size of the structure to be known
...
For example:
struct S; // ‘‘S’’ is the name of some type
extern S a;
S f();
void g(S);
S∗ h(S∗);

However, many such declarations cannot be used unless the type S is deﬁned:
void k(S∗ p)
{
S a;

// error: S not deﬁned; size needed to allocate

f();
g(a);
p−>m = 7;

// error: S not deﬁned; size needed to return value
// error: S not deﬁned; size needed to pass argument
// error: S not deﬁned; member name not known

S∗ q = h(p);
q−>m = 7;

// ok: pointers can be allocated and passed
// error: S not deﬁned; member name not known

}

For reasons that reach into the prehistory of C, it is possible to declare a struct and a non-struct with
the same name in the same scope
...
*/ };
int stat(char∗ name, struct stat∗ buf);

In that case, the plain name (stat) is the name of the non-struct, and the struct must be referred to
with the preﬁx struct
...
3), and enum (§8
...
However, it is best not to overload names to make such explicit disambiguation necessary
...
2
...
So, a struct can have member
functions (§2
...
2, Chapter 16)
...
For example:
struct Points {
vector elem;// must contain at least one Point
Points(Point p0) { elem
...
push_back(p0); elem
...

};
Points x0;
Points x1{ {100,200} };
Points x1{ {100,200}, {300,400} };

// error : no default constructor
// one Point
// two Points

You do not need to deﬁne a constructor simply to initialize members in order
...
3
...
1)
// default construction: {{},{}}; that is {0
...
4
...
2, §13
...
For example:
struct Address {
string name;
int number;
string street;
string town;
char state[2];
char zip[5];

// "Jim Dandy"
// 61
// "South St"
// "New Providence"
// ’N’ ’J’
// 07974

Address(const string n, int nu, const string& s, const string& t, const string& st, int z);
};

Here, I added a constructor to ensure that every member was initialized and to allow me to use a
string and an int for the postal code, rather than ﬁddling with individual characters
...
2
...
1)

The Address constructor might be deﬁned like this:

Section 8
...
3

Structures and Classes

207

Address::Address(const string& n, int nu, const string& s, const string& t, const string& st, int z)
// validate postal code
:name{n},
number{nu},
street{s},
town{t}
{
if (st
...
4
...
str()};
switch (zi
...
check that the code makes sense
...
2
...
For example:
struct Point {
int x,y
};
Point points[3] {{1,2},{3,4},{5,6}};
int x2 = points[2]
...
elem[2]
...
For
example:

208

Structures, Unions, and Enumerations

Chapter 8

Array shift(Array a, Point p)
{
for (int i=0; i!=3; ++i) {
a
...
x += p
...
elem[i]
...
y;
}
return a;
}
Array ax = shift(points2,{10,20});

The notation for Array is a bit primitive: Why i!=3? Why keep repeating
...
2
...
2
...

};

This array is a template to allow arbitrary numbers of elements of arbitrary types
...
5
...
1) and const objects (§16
...
9
...
Using array,
we can now write:
struct Point {
int x,y
};
using Array = array; // array of 3 Points
Array points {{1,2},{3,4},{5,6}};
int x2 = points[2]
...
y;

Section 8
...
4

Structures and Arrays

209

Array shift(Array a, Point p)
{
for (int i=0; i!=a
...
x += p
...
y += p
...
) and does not implicitly convert to a pointer to an individual element:
ostream& operator<<(ostream& os, Point p)
{
cout << '{' << p[i]
...
y << '}';
}
void print(Point a[],int s) // must specify number of elements
{
for (int i=0; i!=s; ++i)
cout << a[i] << '\n';
}
template
void print(array& a)
{
for (int i=0; i!=a
...
2
...
For example:
struct S1 { int a; };
struct S2 { int a; };
S1

and S2 are two different types, so:
S1 x;
S2 y = x; // error : type mismatch

A struct is also a different type from a type used as a member
...
2
...

8
...
6 Plain Old Data
Sometimes, we want to treat an object as just ‘‘plain old data’’ (a contiguous sequence of bytes in
memory) and not worry about more advanced semantic notions, such as run-time polymorphism
(§3
...
3, §20
...
2), user-deﬁned copy semantics (§3
...
5), etc
...
For example, copying a 100-element array using 100 calls of a copy constructor is unlikely to be as fast as
calling std::memcpy(), which typically simply uses a block-move machine instruction
...
Such ‘‘tricks’’
are not uncommon, and are important, in implementations of containers, such as vector, and in lowlevel I/O routines
...

So, a POD (‘‘Plain Old Data’’) is an object that can be manipulated as ‘‘just data’’ without worrying about complications of class layouts or user-deﬁned semantics for construction, copy, and
move
...
*/ };
struct S6 : S1 { };
struct S7 : S0 { int b; };
struct S8 : S1 { int b; };
struct S9 : S0, S1 {};

// a POD
// a POD
// not a POD (no default constructor)
// a POD (user-deﬁned default constructor)
// a POD
// not a POD (has a virtual function)

// a POD
// a POD
// not a POD (data in both S1 and S8)
// a POD

For us to manipulate an object as ‘‘just data’’ (as a POD), the object must
• not have a complicated layout (e
...
, with a vptr; (§3
...
3, §20
...
2),
• not have nonstandard (user-deﬁned) copy semantics, and
• have a trivial default constructor
...
2
...
Formally (§iso
...
9, §iso
...

A related concept is a trivial type, which is a type with
• a trivial default constructor and
• trivial copy and move operations
Informally, a default constructor is trivial if it does not need to do any work (use =default if you
need to deﬁne one §17
...
1)
...
2
...
3
...
3
...
7),
• has multiple access speciﬁers for non-static data members (§20
...

Basically, a standard layout type is one that has a layout with an obvious equivalent in C and is in
the union of what common C++ Application Binary Interfaces (ABIs) can handle
...
2
...
2, §17
...
Informally, a copy operation is trivial if it can be implemented as a bitwise copy
...

• Its class has a virtual function
...

• Its class has a base or a member that is not trivial
...
Also, an array of trivially
copyable objects is trivially copyable and an array of standard layout objects has standard layout
...
I could do that by only calling mycopy() for
PODs, but that’s error-prone: if I use mycopy() can I rely on a maintainer of the code to remember
never to call mycopy() for non-PODs? Realistically, I cannot
...
Anyway, here is the general
and optimized code:

212

Structures, Unions, and Enumerations

Chapter 8

template
void mycopy(T∗ to, const T∗ from, int count)
{
if (is_pod::value)
memcpy(to,from,count∗sizeof(T));
else
for (int i=0; i!=count; ++i)
to[i]=from[i];
}

The is_pod is a standard-library type property predicate (§35
...
1) deﬁned in allowing
us to ask the question ‘‘Is T a POD?’’ in our code
...

Note that adding or subtracting non-default constructors does not affect layout or performance
(that was not true in C++98)
...
3
...
9) and try to think about their implications to programmers and compiler
writers
...

8
...
7 Fields
It seems extravagant to use a whole byte (a char or a bool) to represent a binary variable – for example, an on/off switch – but a char is the smallest object that can be independently allocated and
addressed in C++ (§7
...
It is possible, however, to bundle several such tiny variables together as
ﬁelds in a struct
...
A member is deﬁned to be a ﬁeld by specifying
the number of bits it is to occupy
...
They do not affect the meaning of
the named ﬁelds, but they can be used to make the layout better in some machine-dependent way:
struct PPN {
// R6000 Physical Page Number
unsigned int PFN : 22; // Page Frame Number
int : 3;
// unused
unsigned int CCA : 3;
// Cache Coherency Algorithm
bool nonreachable : 1;
bool dirty : 1;
bool valid : 1;
bool global : 1;
};

This example also illustrates the other main use of ﬁelds: to name parts of an externally imposed
layout
...
2
...
It is not possible to take the
address of a ﬁeld
...
Note that a
bool ﬁeld really can be represented by a single bit
...

Section 8
...
7

Fields

213

if (p−>dirty) { // contents changed
// copy to disk
p−>dirty = 0;
}
}

Surprisingly, using ﬁelds to pack several variables into a single byte does not necessarily save
space
...
Programs have been known to shrink signiﬁcantly when binary variables were
converted from bit-ﬁelds to characters! Furthermore, it is typically much faster to access a char or
an int than to access a ﬁeld
...
1
...

8
...
Naturally, a union can hold a value for only one
member at a time
...

}

The members s and i can never be used at the same time, so space is wasted
...
s if t==str; use v
...
s;
//
...
3
...

Unions are sometimes misused for ‘‘type conversion
...
For example, the following ‘‘converts’’ an int to an int∗ simply by assuming bitwise
equivalence:
union Fudge {
int i;
int∗ p;
};
int∗ cheat(int i)
{
Fudge a;
a
...
p;
}

// bad use

This is not really a conversion at all
...
Such use of a union is dangerous and nonportable
...
5
...
For example:
int∗ cheat2(int i)
{
return reinterpret_cast(i);
}

// obviously ugly and dangerous

Here, at least the compiler has a chance to warn you if the sizes of objects are different and such
code stands out like the sore thumb it is
...
However, most programs don’t improve much from the use of unions and unions are rather error-prone
...

Section 8
...
1

Unions and Classes

215

8
...
1 Unions and Classes
Many nontrivial unions have a member that is much larger than the most frequently used members
...
This waste
can often be eliminated by using a set of derived classes (§3
...
2, Chapter 20) instead of a union
...
2) which in turn is a kind of a class (Chapter 16)
...

[2] A union cannot have members of reference type
...

[4] If a union has a member with a user-deﬁned constructor, a copy operation, a move operation, or a destructor, then that special function is deleted (§3
...
4, §17
...
4) for that union;
that is, it cannot be used for an object of the union type
...
4
...

[6] A union cannot be used as a base class
...
The latter
is important because the use of unions is often an optimization and we won’t want ‘‘hidden costs’’
imposed to compromise that
...
) from a union with a member that has a constructor (etc
...
For example, since Entry has no member with constructors, destructors, or assignments, we can create and copy Entrys freely
...
For example:
void f2(U x)
{
U u;
U u2 = x;
u
...
m3;
return;
}

// error : which default constructor?
// error : which copy constructor?
// assign to int member
// disaster : read from string member
// error : which destructors are called for x, u, and u2?

It’s illegal to write one member and then read another, but people do that nevertheless (usually by
mistake)
...
It is

216

Structures, Unions, and Enumerations

Chapter 8

fortunate that U won’t compile
...
3
...
If
desired, such a class can also prevent the error of writing one member and then reading another
...
If so, this initializer will
be used for default initialization
...
p == ""
// x2
...
3
...
3):

union,

consider a

class Entry2 { // two alternative representations represented as a union
private:
enum class Tag { number, text };
Tag type; // discriminant
union { // representation
int i;
string s; // string has default constructor, copy operations, and destructor
};
public:
struct Bad_entry { };
// used for exceptions
string name;
˜Entry2();
Entry2& operator=(const Entry2&);
Entry2(const Entry2&);
//
...

};

I’m not a fan of get/set functions, but in this case we really need to perform a nontrivial user-speciﬁed action on each access
...
That happens to be my favorite among the many naming conventions
...
3
...
Such a union
is often called a tagged union or a discriminated union
...
˜string();
type = Tag::number;
}
i = n;
}

// explicitly destroy string (§11
...
4)

void Entry2::set_text(const string& ss)
{
if (type==Tag::text)
s = ss;
else {
new(&s) string{ss};
// placement new: explicitly construct string (§11
...
4)
type = Tag::text;
}
}

The use of a union forces us to use otherwise obscure and low-level language facilities (explicit
construction and destruction) to manage the lifetime of the union elements
...

Note that the union in the declaration of Entry2 is not named
...
An anonymous union is an object, not a type, and its members can be accessed without
mentioning an object name
...

Entry2 has a member of a type with a user-deﬁned assignment operator, string, so Entry2’s
assignment operator is deleted (§3
...
4, §17
...
4)
...

Assignment combines the complexities of reading and writing but is otherwise
logically similar to the access functions:
Entry2& Entry2::operator=(const Entry2& e) // necessar y because of the string variant
{
if (type==Tag::text && e
...
s;
// usual string assignment
return ∗this;
}
if (type==Tag::text) s
...
2
...
type) {
case Tag::number:
i = e
...
s); // placement new: explicit construct (§11
...
4)
type = e
...
We need at least a constructor or two to establish the correspondence between the type tag and a value
...
˜string(); // explicit destroy (§11
...
4)
}

8
...
7
...

Some of an enumeration’s possible values are named and called enumerators
...
‘‘An enumeration’’ is colloquially shortened to ‘‘an enum
...
g
...

Section 8
...
1

enum classes

219

8
...
1 enum classes
An enum class is a scoped and strongly typed enumeration
...
*/ }
if (x == red) { /*
...
*/ }
if (x == Trafﬁc_light::red) { /*
...

An enumeration is represented by some integer type and each enumerator by some integer
value
...
The underlying type
must be one of the signed or unsigned integer types (§6
...
4); the default is int
...
Here, we get:
static_cast(Warning::green)==0
static_cast(Warning::yellow)==1
static_cast(Warning::orange)==2
static_cast(Warning::red)==3

Declaring a variable Warning instead of plain
as to the intended use
...

An enumerator can be initialized by a constant expression (§10
...
2
...
For
example:
enum class Printer_ﬂags {
acknowledge=1,
paper_empty=2,
busy=4,
out_of_black=8,
out_of_color=16,
//
};

The values for the Printer_ﬂags enumerators are chosen so that they can be combined by bitwise
operations
...
2
...
1,
Chapter 18)
...

Given these deﬁnitions of | and & for Printer_ﬂags, we can write:
void try_to_print(Printer_ﬂags x)
{
if (x&Printer_ﬂags::acknowledge) {
//
...

}
else if (x&(Printer_ﬂags::out_of_black|Printer_ﬂags::out_of_color)) {
// either we are out of black or we are out of color
//
...

}

Section 8
...
1

enum classes

221

I deﬁned operator|() and operator&() to be constexpr functions (§10
...
1
...
For example:
void g(Printer_ﬂags x)
{
switch (x) {
case Printer_ﬂags::acknowledge:
//
...

break;
case Printer_ﬂags::out_of_black:
//
...

break;
case Printer_ﬂags::out_of_black&Printer_ﬂags::out_of_color:
// we are out of black *and* out of color
//
...

}

It is possible to declare an enum class without deﬁning it (§6
...
For example:
enum class Color_code : char;
void foobar(Color_code∗ p);
//
...
The result of such a
conversion is undeﬁned unless the value is within the range of the enumeration’s underlying type
...

222

Structures, Unions, and Enumerations

Chapter 8

Each enumerator has an integer value
...
For example:
int i = static_cast(Flag::y);
char c = static_cast(Flag::e);

// i becomes 2
// c becomes 8

The notion of a range of values for an enumeration differs from the enumeration notion in the Pascal family of languages
...
g
...

The sizeof an enum class is the sizeof of its underlying type
...

8
...
2 Plain enums
A ‘‘plain enum’’ is roughly what C++ offered before the enum classes were introduced, so you’ll
ﬁnd them in lots of C and C++98-style code
...
Consider the examples from §8
...
1 with the ‘‘class’’ removed:
enum Trafﬁc_light { red, yellow, green };
enum Warning { green, yellow, orange, red }; // ﬁre alert levels
// error: two deﬁnitions of yellow (to the same value)
// error: two deﬁnitions of red (to different values)
Warning a1 = 7;
int a2 = green;
int a3 = Warning::green;
Warning a4 = Warning::green;

// error : no int->Warning conversion
// OK: green is in scope and converts to int
// OK: Warning->int conversion
// OK

void f(Trafﬁc_light x)
{
if (x == 9) { /*
...
*/ }
if (x == Warning::red) { /*
...
*/ }
}

// OK (but Trafﬁc_light doesn’t have a 9)
// error : two reds in scope
// OK (Ouch!)
// OK

We were ‘‘lucky’’ that deﬁning red in two plain enumerations in a single scope saved us from hardto-spot errors
...
4
...
*/ }
if (x == Warning::red) { /*
...
*/ }
}

223

// OK (ouch!)
// OK (ouch!)
// error : red is not a Trafﬁc_light value

The compiler accepts the x==red, which is almost certainly a bug
...

You can specify the underlying type of a plain enumeration, just as you can for enum classes
...
For example:
enum Trafﬁc_light : char { tl_red, tl_yellow, tl_green };

// underlying type is char

enum Color_code : char;
// declaration
void foobar(Color_code∗ p); // use of declaration
//
...
If there are negative enumerators, the range is [-2 :2 -1]
...
For example:
enum E1 { dark, light };
// range 0:1
enum E2 { a = 3, b = 9 };
// range 0:15
enum E3 { min = −10, max = 1000000 }; // range -1048576:1048575

The rule for explicit conversion of an integer to a plain enum is the same as for the class enum
except that when there is no explicit underlying type, the result of such a conversion is undeﬁned
unless the value is within the range of the enumeration
...
The
sizeof an enumeration is the sizeof its underlying type
...
For example, sizeof(e1) could be 1 or
maybe 4 but not 8 on a machine where sizeof(int)==4
...
4
...
For example:
enum { arrow_up=1, arrow_down, arrow_sideways };

We use that when all we need is a set of integer constants, rather than a type to use for variables
...
5 Advice
[1]
[2]
[3]
[4]
[5]
[6]
[7]

When compactness of data is important, lay out structure data members with larger members
before smaller ones; §8
...
1
...
2
...

Don’t naively try to optimize memory consumption by packing several values into a single
byte; §8
...
7
...
3
...
4
...
4
...
4
...

9
Statements
A programmer is a machine
for turning caffeine into code
...
1 Introduction
C++ offers a conventional and ﬂexible set of statements
...
Note that a declaration is a statement and
that an expression becomes a statement when you add a semicolon at its end
...
Instead, statements are used to specify
the order of execution
...
A compiler may reorder code
to improve performance as long as the result is identical to that of the simple order of execution
...
2 Statement Summary
Here is a summary of C++ statements:
statement:
declaration
expressionopt ;
{ statement-listopt }
try { statement-listopt } handler-list
case constant-expression :
default : statement
break ;
continue ;
return

statement

expressionopt ;

identiﬁer ;
identiﬁer : statement

goto

selection-statement
iteration-statement
selection-statement:
if ( condition ) statement
if ( condition ) statement else statement
switch ( condition ) statement
iteration-statement:
while ( condition ) statement
do statement while ( expression ) ;
for ( for-init-statement conditionopt ; expressionopt ) statement
for ( for-init-declaration : expression ) statement
statement-list:
statement statement-listopt
condition:
expression
type-speciﬁer declarator = expression
type-speciﬁer declarator { expression }
handler-list:
handler handler-listopt
handler:
catch (

exception-declaration ) { statement-listopt }

A semicolon is by itself a statement, the empty statement
...
2

Statement Summary

227

A (possibly empty) sequence of statements within ‘‘curly braces’’ (i
...
, { and }) is called a block
or a compound statement
...
3
...

A declaration is a statement and there is no assignment statement or procedure-call statement;
assignments and function calls are expressions
...
Note that both end
with a semicolon
...

The statements for handling exceptions, try-blocks, are described in §13
...

9
...
Unless a variable is declared static, its initializer is executed whenever
the thread of control passes through the declaration (see also §6
...
2)
...
4
...
5
...
There is rarely a reason to introduce a variable before there is a value for it to hold
...
size()<=i)
error("bad index");
string s = v[i];
if (s == p) {
//
...

}

The ability to place declarations after executable code is essential for many constants and for single-assignment styles of programming where a value of an object is not changed after initialization
...
For example:
void use()
{
string s1;
s1 = "The best is the enemy of the good
...

}

This requests a default initialization (to the empty string) followed by an assignment
...
Input variables are among the few reasonable examples of that:
void input()
{
int buf[max];
int count = 0;
for (int i; cin>>i;) {
if (i<0) error("unexpected negative value");
if (count==max) error("buffer overﬂow");
buf[count++] = i;
}
//
...
Often,
push_back() (§3
...
1
...
6, §31
...
6) provides a better solution to such examples
...
4 Selection Statements
A value can be tested by either an if-statement or a switch-statement:
condition ) statement
condition ) statement else statement
switch ( condition ) statement
if (
if (

A condition is either an expression or a declaration (§9
...
3)
...
4
...
If a condition evaluates to something different
from a Boolean, it is – if possible – implicitly converted to a bool
...
For example, if x is an integer, then
if (x) //
...

For a pointer p,
if (p) //
...

Note that a ‘‘plain’’ enum can be implicitly converted to an integer and then to a
enum class cannot (§8
...
1)
...
4
...

if (y)
//
...

}

// OK
// error: no conversion to bool
// OK

The logical operators
&& ||

!

are most commonly used in conditions
...
For example,
if (p && 1count) //
...

For choosing between two alternatives each of which produces a value, a conditional expression
(§11
...
3) is a more direct expression of intent than an if-statement
...
In particular, it cannot be used
on another branch of an if-statement
...

}
else {
++x; // error: x is not in scope
}
++x;
// error: x is not in scope
}

A branch of an if-statement cannot be just a declaration
...
2)
...
4
...
The expression in the case
labels must be a constant expression of integral or enumeration type
...
For example:
void f(int i)
{
switch (i) {
case 2
...

case 2:
//
...

};

A switch-statement can alternatively be written as a set of if-statements
...
This makes the switch-statement
easier to read for nontrivial examples
...
Instead, a jump table can be used
...
4
...
Consider:
switch (val) {
// beware
case 1:
cout << "case 1\n";
case 2:
cout << "case 2\n";
default:
cout << "default: case not found\n";
}

Invoked with val==1, the output will greatly surprise the uninitiated:
case 1
case 2
default: case not found

It is a good idea to comment the (rare) cases in which a fall-through is intentional so that an uncommented fall-through can be assumed to be an error
...

}

A break is the most common way of terminating a case, but a return is often useful (§10
...
1)
...
One use is for the default to handle the most common case
...
However, there is one case where a default should not be used: if a switch is intended
to have one case for each enumerator of an enumeration
...
For example, this is almost certainly an error:
enum class Vessel { cup, glass, goblet, chalice };
void problematic(Vessel v)
{
switch (v) {
case Vessel::cup:
case Vessel::glass:
case Vessel::goblet:
}
}

/*
...
*/
/*
...

Testing for an ‘‘impossible’’ enumerator value is best done separately
...
4
...
1 Declarations in Cases
It is possible, and common, to declare variables within the block of a switch-statement
...
For example:
void f(int i)
{
switch (i) {
case 0:
int x;
int y = 3;
string s;
case 1:
++x;
++y;
s = "nasty!";
}
}

// uninitialized
// error: declaration can be bypassed (explicitly initialized)
// error: declaration can be bypassed (implicitly initialized)
// error: use of uninitialized object

Here, if i==1, the thread of execution would bypass the initializations of y and s, so f() will not compile
...
However, its use is an error: we read an uninitialized variable
...
As
usual, avoid uninitialized variables (§6
...
5
...

If we need a variable within a switch-statement, we can limit its scope by enclosing its declaration and its use in a block
...
2
...

9
...
3 Declarations in Conditions
To avoid accidental misuse of a variable, it is usually a good idea to introduce the variable into the
smallest scope possible
...
That way, one cannot get into trouble by using the variable
before its initial value is assigned
...
Consider:
if (double d = prim(true)) {
left /= d;
break;
}

Here, d is declared and initialized and the value of d after initialization is tested as the value of the
condition
...
For example, had there been an else-branch to the if-statement, d would be in
scope on both branches
...
4
...
However, this opens
the scope (literally) for the use of d before its initialization or after its intended useful life:
double d;
//
...

if (d = prim(true)) {
left /= d;
break;
}
//
...
0; // two unrelated uses of d

In addition to the logical beneﬁts of declaring variables in conditions, doing so also yields the most
compact source code
...

9
...
Note that both end
with a semicolon
...

More complicated loops can be expressed as an algorithm plus a lambda expression (§11
...
2)
...
5
...
For example:
int sum(vector& v)
{
int s = 0;
for (int x : v)
s+=x;
return s;
}

234

Statements

Chapter 9

The for (int x : v) can be read as ‘‘for each element x in the range v’’ or just ‘‘for each x in v
...

The scope of the variable naming the element (here, x) is the for-statement
...
begin() and v
...
5):
[1] the compiler ﬁrst looks for members begin and end and tries to use those
...
g
...

[2] Otherwise, the compiler looks for a begin/end member pair in the enclosing scope
...
g
...

The compiler uses v and v+N as begin(v) and end(v) for a built-in array T v[N]
...
For
sequences of our own design, we can deﬁne begin() and end() in the same way as it is done for standard-library containers (§4
...
5)
...

For example, we can increment each element of a vector like this:
void incr(vector& v)
{
for (int& x : v)
++x;
}

References are also appropriate for elements that might be large, so that copying them to the element value could be costly
...
For example, using it you can’t touch
two elements at the same time and can’t effectively traverse two ranges simultaneously
...

Section 9
...
2

for

Statements

235

9
...
2 for Statements
There is also a more general for-statement allowing greater control of the iteration
...
For example:
void f(int v[], int max)
{
for (int i = 0; i!=max; ++i)
v[i] = i∗i;
}

This is equivalent to
void f(int v[], int max)
{
int i = 0;
// introduce loop variable
while (i!=max) {
// test termination condition
v[i] = i∗i; // execute the loop body
++i;
// increment loop variable
}
}

A variable can be declared in the initializer part of a for-statement
...

It is not always obvious what is the right type to use for a controlled variable in a for loop, so
auto often comes in handy:
for (auto p = begin(c); c!=end(c); ++p) {
//
...

}

If the ﬁnal value of an index needs to be known after exit from a for-loop, the index variable must
be declared outside the for-loop (e
...
, see §9
...

If no initialization is needed, the initializing statement can be empty
...
If the loop isn’t of the simple ‘‘introduce a loop variable, test the condition, update the loop variable’’ variety, it is often better
expressed as a while-statement
...
push_back(s);

Here, the reading and testing for termination and combined in cin>>s, so we don’t need an explicit
loop variable
...

A for-statement is also useful for expressing a loop without an explicit termination condition:
for (;;) { // ‘‘forever’’
//
...

}

// ‘‘forever’’

9
...
3 while Statements
A while-statement executes its controlled statement until its condition becomes false
...

A for-statement (§9
...
2) is easily rewritten into an equivalent while-statement and vice versa
...
5
...
For

// i must be positive

This might be called like this: print_backwards(s,strlen(s)); but it is all too easy to make a horrible
mistake
...
The reason is that its
body is always executed once before the condition is evaluated
...
More often
than I would have guessed, I have found that condition not to hold as expected either when the program was ﬁrst written and tested or later after the code preceding it has been modiﬁed
...
’’ Consequently, I recommend avoiding do-statements
...
5
...
1
...
6), throw
(§13
...
4
...
A break ‘‘breaks out of’’ the

Section 9
...
5

Loop Exit

237

nearest enclosing switch-statement (§9
...
2) or iteration-statement
...

if (c == '\n') break;
//
...
’’ Unless it warps the logic of
a loop (e
...
, requires the introduction of an extra varible), it is usually better to have the complete
exit condition as the condition of a while-statement or a for-statement
...
A continue skips the rest of the body of an iteration-statement
...
size(); ++i) {
if (!prime(v[i]) continue;
return v[i];
}
}

After a continue, the increment part of the loop (if any) is executed, followed by the loop condition
(if any)
...
size(); ++i) {
if (!prime(v[i]) {
return v[i];
}
}
}

9
...

238

Statements

Chapter 9

The scope of a label is the function it is in (§6
...
4)
...
The only restriction is that you cannot jump past an initializer or into
an exception handler (§13
...

One of the few sensible uses of goto in ordinary code is to break out from a nested loop or
switch-statement (a break breaks out of only the innermost enclosing loop or switch-statement)
...

found:
// nm[i][j] == a
}

Note that this goto just jumps forward to exit its loop
...
That makes it the least troublesome and least confusing use of a goto
...
7 Comments and Indentation
Judicious use of comments and consistent use of indentation can make the task of reading and
understanding a program much more pleasant
...
I see no fundamental reason to prefer one over another (although, like most programmers, I
have my preferences, and this book reﬂects them)
...

Comments can be misused in ways that seriously affect the readability of a program
...

Most programs contain comments that are incomprehensible, ambiguous, and just plain wrong
...

If something can be stated in the language itself, it should be, and not just mentioned in a comment
...
7

Comments and Indentation

239

// don’t use function "weird()"
// function "f(int
...

Once something has been stated clearly in the language, it should not be mentioned a second
time in a comment
...
They increase the amount of text the reader has
to look at, they often obscure the structure of the program, and they may be wrong
...
This is one of the many ways a program in a textbook differs from a real program
...
Preferably, a comment is expressed
at a suitably high level of abstraction so that it is easy for a human to understand without delving
into minute details
...

• A comment for each class, template, and namespace
• A comment for each nontrivial function stating its purpose, the algorithm used (unless it is
obvious), and maybe something about the assumptions it makes about its environment
• A comment for each global and namespace variable and constant
• A few comments where the code is nonobvious and/or nonportable
• Very little else
For example:
//

tbl
...

/*

Gaussian elimination with partial pivoting
...
" pg 411
...

// Revised to handle invalid dates
...
Writing
good comments can be as difﬁcult as writing the program itself
...

240

Statements

Chapter 9

Note that /∗ ∗/ style comments do not nest
...

9
...
3, §9
...
3, §9
...
2
...
4
...

Prefer a range-for-statement to a for-statement when there is a choice; §9
...
1
...
5
...

Prefer a while-statement to a for-statement when there is no obvious loop variable; §9
...
3
...
5
...
6
...
7
...
7
...
7
...
7
...

– apologies to Richard Feynman

•
•

•
•

•
•

Introduction
A Desk Calculator
The Parser; Input; Low-Level Input; Error Handling; The Driver; Headers; Command-Line
Arguments; A Note on Style
Operator Summary
Results; Order of Evaluation; Operator Precedence; Temporary Objects
Constant Expressions
Symbolic Constants; consts in Constant Expressions; Literal Types; Reference Arguments;
Address Constant Expressions
Implicit Type Conversion
Promotions; Conversions; Usual Arithmetic Conversions
Advice

10
...
In C++, an assignment is an expression, a function call is an expression, the construction of an object is an expression, and so are many other
operations that go beyond conventional arithmetic expression evaluation
...
’’ Next, the complete set of operators is listed and their meaning for builtin types is brieﬂy outlined
...

242

Expressions

Chapter 10

10
...
The user can also deﬁne variables
...
5
area = pi ∗ r ∗ r

(pi is predeﬁned) the calculator program will write
2
...
635

where 2
...
635 is the result of the second
...
Actually, it is a miniature compiler in which the parser does the syntactic analysis, the input
function handles input and lexical analysis, the symbol table holds permanent information, and the
driver handles initialization, output, and errors
...

10
...
1 The Parser
Here is a grammar for the language accepted by the calculator:
program:
end
expr_list end

// end is end-of-input

expr_list:
expression print
expression print expr_list

// print is newline or semicolon

expression:
expression + term
expression − term
term
term:
term / primary
term ∗ primary
primary
primary:
number
name
name = expression
− primary
( expression )

// number is a ﬂoating-point literal
// name is an identiﬁer

Section 10
...
1

The Parser

243

In other words, a program is a sequence of expressions separated by semicolons
...
Names need not be declared before use
...
In a language such as C++, in which function calls are relatively cheap, it is also
efﬁcient
...
Terminal symbols (for example, end, number, +, and −) are recognized by a lexical analyzer and nonterminal symbols are recognized by the syntax analyzer functions, expr(), term(), and prim()
...

For input, the parser uses a Token_stream that encapsulates the reading of characters and their
composition into Tokens
...
45, into Tokens
...
45}, where the
123
...
The main parts of the parser need only to know
the name of the Token_stream, ts, and how to get Tokens from it
...
get()
...
current()
...
We’ll see that
they can come directly from a user typing to cin, from a program command line, or from any other
input stream (§10
...
7)
...
This works as long as no character used as input has a value used
as an enumerator – and no current character set I know of has a printing character with a singledigit integer value
...

};

The implementation is presented in §10
...
2
...
2
...
Each parser function evaluates ‘‘its’’

244

Expressions

Chapter 10

expression and returns the value
...
It consists
of a single loop that looks for terms to add or subtract:
double expr(bool get)
{
double left = term(get);

// add and subtract

for (;;) {
// ‘‘forever’’
switch (ts
...
kind) {
case Kind::plus:
left += term(true);
break;
case Kind::minus:
left −= term(true);
break;
default:
return left;
}
}
}

This function really does not do much itself
...

The switch-statement (§2
...
4, §9
...
2) tests the value of its condition, which is supplied in parentheses after the switch keyword, against a set of constants
...
If the value tested does not match any case label, the default is chosen
...

Note that an expression such as 2−3+4 is evaluated as (2−3)+4, as speciﬁed in the grammar
...
5); while(true) is an alternative
...

The operators += and −= are used to handle the addition and subtraction; left=left+term(true) and
left=left−term(true) could have been used without changing the meaning of the program
...
Each assignment operator is a separate lexical token, so a + = 1; is a syntax error because
of the space between the + and the =
...
3 summarizes the operators
and their meanings
...

Section 10
...
1

The Parser

245

The function term() handles multiplication and division in the same way expr() handles addition
and subtraction:
double term(bool get)
{
double left = prim(get);

// multiply and divide

for (;;) {
switch (ts
...
kind) {
case Kind::mul:
left ∗= prim(true);
break;
case Kind::div:
if (auto d = prim(true)) {
left /= d;
break;
}
return error("divide by 0");
default:
return left;
}
}
}

The result of dividing by zero is undeﬁned and usually disastrous
...
The function error() is described in §10
...
4
...
The scope of a name introduced in a condition is the statement controlled by that condition,
and the resulting value is the value of the condition (§9
...
3)
...

The function prim() handling a primary is much like expr() and term(), except that because we are
getting lower in the call hierarchy a bit of real work is being done and no loop is necessary:
double prim(bool get)
// handle primaries
{
if (get) ts
...
current()
...
current()
...
get();
return v;
}
case Kind::name:
{
double& v = table[ts
...
string_value];
if (ts
...
kind == Kind::assign) v = expr(true);
return v;
}

// ﬁnd the corresponding
// ’=’ seen: assignment

246

Expressions

Chapter 10

case Kind::minus:
// unar y minus
return −prim(true);
case Kind::lp:
{
auto e = expr(true);
if (ts
...
kind != Kind::rp) return error("')' expected");
ts
...
Similarly, when a Token that is a name (however deﬁned; see §10
...
2 and
§10
...
3) is seen, its value is placed in its string_value
...

The reason is that it must do that in some cases (e
...
, to see if a name is assigned to), so for consistency it must do it in all cases
...
get()
...
current()
...
get())
...
In both cases, the symbol table is consulted
...
4
...
4
...
For example, if the user enters

double

corresponding to the

radius = 6378
...
expr() calculates the value to be assigned
...
388;

The reference v is used to hold on to the double associated with radius while expr() calculates the
value 6378
...

Chapter 14 and Chapter 15 discuss how to organize a program as a set of modules
...
The exception is expr(), which calls term(), which calls
prim(), which in turn calls expr()
...
A declaration
double expr(bool);

before the deﬁnition of prim() will do nicely
...
2
...
2
...
To communicate with a person, the program
must cope with that person’s whims, conventions, and seemingly random errors
...
The task of a low-level input routine is to read characters and compose higher-level tokens
from them
...
Here, low-level input
is done by ts
...
Writing a low-level input routine need not be an everyday task
...

First we need to see the complete deﬁnition of Token_stream:
class Token_stream {
public:
Token_stream(istream& s) : ip{&s}, owns{false} { }
Token_stream(istream∗ p) : ip{p}, owns{true} { }
˜Token_stream() { close(); }
Token get();
Token& current();

// read and return next token
// most recently read token

void set_input(istream& s) { close(); ip = &s; owns=false; }
void set_input(istream∗ p) { close(); ip = p; owns = true; }
private:
void close() { if (owns) delete ip; }
istream∗ ip;
bool owns;
Token ct {Kind::end} ;

// pointer to an input stream
// does the Token_stream own the istream?
// current token

};

We initialize a Token_stream with an input stream (§4
...
2, Chapter 38) from which it gets its characters
...
2
...
2,
§11
...
This may be a bit
elaborate for this simple program, but it is a useful and general technique for classes that hold a
pointer to a resource requiring destruction
...

I gave ct a default value because it seemed sloppy not to
...
I chose Kind::end as the initial value for ct so
that a program that misuses current() will not get a value that wasn’t on the input stream
...
First, I provide a deceptively simple version that
imposes a burden on the user
...
The idea for get() is to read a character, use that character to decide what kind of token
needs to be composed, read more characters when needed, and then return a Token representing the
characters read
...
) and leaves the value
of ch unchanged if the input operation failed
...

Assignment is an operator, and the result of the assignment is the value of the variable assigned
to
...

Having a single statement rather than two is useful in maintenance
...

Note also how the {}-list notation (§3
...
1
...
3) is used on the right-hand side of an assignment
...
I could have written that return-statement as:
ct
...
The {Kind::end} is equivalent to {Kind::end,0,0}
...
Neither is the
case here, but in general dealing with complete objects is clearer and less error-prone than manipulating data members individually
...

Consider some of the cases separately before considering the complete function
...
5
...
4
...

Numbers are handled like this:
case '0': case '1': case '2': case '3': case '4': case '5': case '6': case '7': case '8': case '9':
case '
...
2
...
) back into the input stream
∗ip >> ct
...
kind=Kind::number;
return ct;

Stacking case labels horizontally rather than vertically is generally not a good idea because this
arrangement is harder to read
...
Because operator >> is already deﬁned for reading ﬂoating-point values into a double, the code is trivial
...
Then, the ﬂoating-point value can be read
into ct
...

If the token is not the end of input, an operator, a punctuation character, or a number, it must be
a name
...
string_value;
// read the string into ct
ct
...
The simple-minded, but reasonably effective way to deal
with an error is the write call an error() function and then return a print token if error() returns:
error("bad token");
return ct={Kind::print};

The standard-library function isalpha() (§36
...
1) is used to avoid listing every character as a separate case label
...
Consequently, a user must terminate a name by a space before an operator using the name
as an operand
...
2
...

Here, ﬁnally, is the complete input function:
Token Token_stream::get()
{
char ch = 0;
∗ip>>ch;
switch (ch) {
case 0:
return ct={Kind::end};
// assign and return
case ';': // end of expression; print
case '∗':
case '/':
case '+':
case '−':
case '(':
case ')':
case '=':
return ct=={static_cast(ch)};

250

Expressions

Chapter 10

case '0': case '1': case '2': case '3': case '4': case '5': case '6': case '7': case '8': case '9':
case '
...
) back into the input stream
∗ip >> ct
...
kind=Kind::number;
return ct;
default:
// name, name =, or error
if (isalpha(ch)) {
ip−>putback(ch);
// put the ﬁrst character back into the input stream
∗ip>>ct
...
kind=Kind::name;
return ct;
}
error("bad token");
return ct={Kind::print};
}
}

The conversion of an operator to its Token value is trivial because the
deﬁned as the integer value of the operator (§10
...
1)
...
2
...
It is tedious to remember to
add a semicolon after an expression in order to get its value printed, and having a name terminated
by whitespace only is a real nuisance
...
To get what we (usually) want, we would have to add
whitespace after x: x =7
...

First, we’ll make a newline equivalent to the semicolon used to mark the end-of-expression:
Token Token_stream::get()
{
char ch;
do { // skip whitespace except ’\n’
if (!ip−>get(ch)) return ct={Kind::end};
} while (ch!='\n' && isspace(ch));
switch (ch) {
case ';':
case '\n':
return ct={Kind::print};

Here, I use a do-statement; it is equivalent to a while-statement except that the controlled statement
is always executed at least once
...
By default, get() does not skip whitespace the way >> does
...
The operator ! (not) is used because get() returns true in case of success
...
2
...
2
...
The test is
implemented as a table lookup, so using isspace() is much faster than testing for the individual
whitespace characters
...

After whitespace has been skipped, the next character is used to determine what kind of lexical
token is coming
...
Constructing programs so that improvements can be implemented through local modiﬁcations only is an important design aim
...
It
would be for very long strings, but all modern string implementations provide the ‘‘small string
optimization’’ (§19
...
3)
...
In particular,
using a short string doesn’t require any use of free store
...

10
...
4 Error Handling
It is always important to detect and report errors
...
The error() function simply counts the errors, writes out an error message,
and returns:
int no_of_errors;
double error(const string& s)
{
no_of_errors++;
cerr << "error: " << s << '\n';
return 1;
}

The stream cerr is an unbuffered output stream usually used to report errors (§38
...

The reason for returning a value is that errors typically occur in the middle of the evaluation of
an expression, so we should either abort that evaluation entirely or return a value that is unlikely to
cause subsequent errors
...
Had Token_stream::get()

252

Expressions

Chapter 10

kept track of the line numbers, error() could have informed the user approximately where the error
occurred
...

A more stylized and general error-handling strategy would separate error detection from error
recovery
...
4
...
1, Chapter 13), but what we have
here is quite suitable for a 180-line calculator
...
2
...
I decided on two
functions: main() to do setup and error reporting and calculate() to handle the actual calculation:
Token_stream ts {cin};

// use input from cin

void calculate()
{
for (;;) {
ts
...
current()
...
current()
...
1415926535897932385;
table["e"] = 2
...
2
...
Returning the number of errors accomplishes this nicely
...

The primary task of the main loop (in calculate()) is to read expressions and write out the
answer
...
get() to read a token on which to work
...
get() encounters an input
error or an end-of-ﬁle
...
5)
...
A continue-statement is equivalent to going to the very end of a loop
...
2
...
2
...
Therefore, appropriate headers must be #included to
complete the program:
#include // I/O
#include
// strings
#include
// map
#include // isalpha(), etc
...
Chapter
14 and Chapter 15 discuss ways of organizing this calculator into modules using namespaces and
how to organize it into source ﬁles
...
2
...
My most common use was to evaluate a single expression
...

A program starts by calling main() (§2
...
1, §15
...
When this is done, main() is given two arguments specifying the number of arguments, conventionally called argc, and an array of arguments,
conventionally called argv
...
2
...
3), so the type
of argv is char∗[argc+1]
...
The list of arguments is zero-terminated; that is, argv[argc]==0
...
1934

the arguments have these values:
argc:

2

argv:

0

"dc"
"150/1
...

The idea is to read from the command string in the same way that we read from the input
stream
...
2
...
So to
calculate expressions presented on the command line, we simply have to get our Token_stream to
read from an appropriate istringstream:

254

Expressions

Chapter 10

Token_stream ts {cin};
int main(int argc, char∗ argv[])
{
switch (argc) {
case 1:
// read from standard input
break;
case 2:
// read from argument string
ts
...
1415926535897932385;
table["e"] = 2
...

It would be easy to modify main() to accept several command-line arguments, but this does not
appear to be necessary, especially as several expressions can be passed as a single argument:
dc "rate=1
...
75/rate;217/rate"

I use quotes because ; is the command separator on my UNIX systems
...

Simple as they are, argc and argv are still a source of minor, yet annoying, bugs
...
push_back(argv[i]);
return res;
}

More elaborate argument parsing functions are not uncommon
...
2
...
It is not
...
Often, a library has received more care in its design and implementation than a

Section 10
...
8

A Note on Style

255

programmer could afford for a handcrafted piece of code to be used in just one program
...
Many of the traditional tricky details have been
replaced by uses of standard-library classes such as ostream, string, and map (§4
...
1, §4
...
4
...
4, Chapter 36, Chapter 38)
...
This is the way things ought to
be in code that doesn’t manipulate hardware directly or implement low-level abstractions
...
3 Operator Summary
This section presents a summary of expressions and some examples
...
In these tables:
• A name is an identiﬁer (e
...
, sum and map), an operator name (e
...
, operator int, operator+,
and operator"" km), or the name of a template specialization (e
...
, sort and
array), possibly qualiﬁed using :: (e
...
, std::vector and vector::operator[])
...

• A member is a member name (including the name of a destructor or a member template)
...

• A pointer is an expression yielding a pointer (including this and an object of that type that
supports the pointer operation)
...
g
...

• An lvalue is an expression denoting a modiﬁable object (§6
...
1)
...
) only when it appears in parentheses; elsewhere, there are restrictions (§iso
...

• A lambda-declarator is a (possibly empty, comma-separated) list of parameters optionally
followed by the mutable speciﬁer, optionally followed by a noexcept speciﬁer, optionally followed by a return type (§11
...

• A capture-list is a (possibly empty) list specifying context dependencies (§11
...

• A stmt-list is a (possibly empty) list of statements (§2
...
4, Chapter 9)
...
The meanings presented here apply
when the operands are of built-in types (§6
...
1)
...
3, Chapter 18)
...
For details, see §iso
...
A
...
5
...
4
§16
...
3
§14
...
1
§14
...
1

Each box holds operators with the same precedence
...
For example, N::x
...
m rather than the illegal N::(x
...

256

Expressions

Chapter 10

Operator Summary (continued, continues)
Member selection
Member selection
Subscripting
Function call
Value construction
Function-style type conversion
Post increment
Post decrement
Type identiﬁcation
Run-time type identiﬁcation
Run-time checked conversion
Compile-time checked conversion
Unchecked conversion
const conversion
Size of object
Size of type
Size of parameter pack
Alignment of type
Pre increment
Pre decrement
Complement
Not
Unary minus
Unary plus
Address of
Dereference
Create (allocate)
Create (allocate and initialize)
Create (allocate and initialize)
Create (place)
Create (place and initialize)
Create (place and initialize)
Destroy (deallocate)
Destroy array
Can expression throw?
Cast (type conversion)
Member selection
Member selection

object
...
name
alignof ( type )
++ lvalue
−− lvalue
˜ expr
! expr
− expr
+ expr
& lvalue
∗ expr
new type
new type ( expr-list )
new type { expr-list }
new ( expr-list ) type
new ( expr-list ) type ( expr-list )
new ( expr-list ) type { expr-list }
delete pointer
delete [] pointer
noexcept ( expr )
( type ) expr
object
...
2
...
2
...
3
§12
...
3
...
5
...
1
...
1
...
5
§22
...
2
...
5
...
5
...
5
...
2
...
2
...
6
...
2
...
1
...
1
...
1
...
1
...
2
...
2
...
2
§7
...
2
§11
...
2
§11
...
4
§11
...
4
§11
...
4
§11
...
2
...
5
...
2
§11
...
3
§20
...
6

For example, postﬁx ++ has higher precedence than unary ∗, so ∗p++ means ∗(p++), not (∗p)++
...
3

Operator Summary

257

Operator Summary (continued)
Multiply
Divide
Modulo (remainder)
Add (plus)
Subtract (minus)
Shift left
Shift right
Less than
Less than or equal
Greater than
Greater than or equal
Equal
Not equal
Bitwise and
Bitwise exclusive-or
Bitwise inclusive-or
Logical and
Logical inclusive or
Conditional expression
List
Throw exception
Simple assignment
Multiply and assign
Divide and assign
Modulo and assign
Add and assign
Subtract and assign
Shift left and assign
Shift right and assign
Bitwise and and assign
Bitwise inclusive-or and assign
Bitwise exclusive-or and assign
comma (sequencing)

expr ∗ expr
expr / expr
expr % expr
expr + expr
expr − expr
expr << expr
expr >> expr
expr < expr
expr <= expr
expr > expr
expr >= expr
expr == expr
expr != expr
expr & expr
expr ˆ expr
expr | expr
expr && expr
expr || expr
expr ? expr : expr
{ expr-list }
throw expr
lvalue = expr
lvalue ∗= expr
lvalue /= expr
lvalue %= expr
lvalue += expr
lvalue −= expr
lvalue <<= expr
lvalue >>= expr
lvalue &= expr
lvalue |= expr
lvalue ˆ= expr
expr , expr

§10
...
1
§10
...
1
§10
...
1
§10
...
1
§10
...
1
§11
...
2
§11
...
2
§2
...
2
§2
...
2
§2
...
2
§2
...
2
§2
...
2
§2
...
2
§11
...
2
§11
...
2
§11
...
2
§11
...
1
§11
...
1
§11
...
3
§11
...
5
§10
...
1
§10
...
1
§10
...
1
§10
...
1
§10
...
1
§10
...
1
§10
...
1
§10
...
1
§10
...
1
§10
...
1
§10
...
1
§10
...
2

For example: a+b∗c means a+(b∗c) rather than (a+b)∗c because ∗ has higher precedence than +
...

For example, a=b=c means a=(b=c) whereas a+b+c means (a+b)+c
...
For example, a=blook at the grammar (§iso
...

258

Expressions

Chapter 10

Before applying the grammar rules, lexical tokens are composed from characters
...
For example, && is a single operator,
rather than two & operators, and a+++1 means (a ++) + 1
...

Token Summary (§iso
...
7)
Token Class
Identiﬁer
Keyword
Character literal
Integer literal
Floating-point literal
String literal
Operator
Punctuation
Preprocessor notation

Examples
vector, foo_bar, x3
int, for, virtual
’x’, \n’, ’U’\UFADEFADE’
12, 012, 0x12
1
...
2e−3, 1
...
3
...
3
...
1
§6
...
3
...
2
...
1
§6
...
5
...
3
...
3
§12
...
g
...
g
...

Some characters from the basic source character set (§6
...
2), such as |, are not convenient to
type on some keywords
...
Consequently, a set of alternative representation are provided as
keywords:
Alternative Representation (§iso
...
12)
and
&

and_eq
&=

bitand
&

bitor
|

compl
˜

not
!

not_eq
!=

or
|

or_eq
|=

xor
ˆ

xor_eq
ˆ=

For example
bool b = not (x or y) and z;
int x4 = ˜ (x1 bitor x2) bitand x3;

is equivalent to
bool b = !(x || y) && z;
int x4 = ˜(x1 | x2) & x3;

Note that and= is not equivalent to &=; if you prefer keywords, you must write and_eq
...
3
...
5
...
The overall aim is to produce a result of the ‘‘largest’’ operand type
...
Similarly, if it has a long operand, the
computation is done using long integer arithmetic, and the result is a long
...

Section 10
...
1

Results

259

The relational operators, ==, <=, etc
...
The meaning and result type of
user-deﬁned operators are determined by their declarations (§18
...

Where logically feasible, the result of an operator that takes an lvalue operand is an lvalue
denoting that lvalue operand
...
Preserving lvalues in this way allows greater ﬂexibility in using operators
...
g
...

The result of sizeof is of an unsigned integral type called size_t deﬁned in
...

Implementations do not have to check for arithmetic overﬂow and hardly any do
...
What happens then is undeﬁned, but
typically the value ‘‘wraps around’’ to a negative number (on my machine −2147483648)
...
In particular, underﬂow, overﬂow, and division by zero do not throw standard exceptions
(§30
...
1
...

10
...
2 Order of Evaluation
The order of evaluation of subexpressions within an expression is undeﬁned
...
For example:
int x = f(2)+g(3);

// undeﬁned whether f() or g() is called ﬁrst

Better code can be generated in the absence of restrictions on expression evaluation order
...
For example:
int i = 1;
v[i] = i++; // undeﬁned result

The assignment may be evaluated as either v[1]=1 or v[2]=1 or may cause some even stranger behavior
...
Unfortunately, most do not, so be careful not to
write an expression that reads or writes an object more than once, unless it does so using a single

260

Expressions

Chapter 10

operator that makes it well deﬁned, such as ++ and +=, or explicitly express sequencing using ,
(comma), &&, or ||
...
For example, b=(a=2,a+1) assigns 3 to b
...
3
...
For built-in types, the second operand of && is
evaluated only if its ﬁrst operand is true, and the second operand of || is evaluated only if its ﬁrst operand is false; this is sometimes called short-circuit evaluation
...
For
example:
f1(v[i],i++);
f2( (v[i],i++) );

// two arguments
// one argument

The call of f1 has two arguments, v[i] and i++, and the order of evaluation of the argument expressions is undeﬁned
...
Order dependence of argument expressions is very
poor style and has undeﬁned behavior
...
That is confusing, so that too should be avoided
...
For example, a∗b/c means (a∗b)/c, so parentheses
must be used to get a∗(b/c); a∗(b/c) may be evaluated as (a∗b)/c only if the user cannot tell the difference
...

10
...
3 Operator Precedence
Precedence levels and associativity rules reﬂect the most common usage
...

means ‘‘if i is less than or equal to 0 or if max is less than i
...

and not the legal but nonsensical
if (i <= (0||max) < i) //
...
Use of
parentheses becomes more common as the subexpressions become more complicated, but complicated subexpressions are a source of errors
...

There are cases when the operator precedence does not result in the ‘‘obvious’’ interpretation
...
Because == has higher precedence
than &, the expression is interpreted as i&(mask==0)
...
In this case, parentheses are important:
if ((i&mask) == 0) //
...
3
...

This is legal, but it is interpreted as (0<=x)<=99, where the result of the ﬁrst comparison is either true
or false
...
To test whether x is in the range 0
...

A common mistake for novices is to use = (assignment) instead of == (equals) in a condition:
if (a = 7) // oops! constant assignment in condition

This is natural because = means ‘‘equals’’ in many languages
...
I do not recommend warping your style to compensate for compilers with weak warnings
...
3
...
For
example, for v=x+y∗z the result of y∗z has to be put somewhere before it is added to x
...
However, for a user-deﬁned type that holds a resource knowing the lifetime of a
temporary can be important
...
A full
expression is an expression that is not a subexpression of some other expression
...
3) that returns a C-style pointer to a zeroterminated array of characters (§2
...
5, §43
...
Also, the operator + is deﬁned to mean string concatenation
...
However, in combination they can cause obscure
problems
...
c_str();
cout << cs;
if (strlen(cs=(s2+s3)
...
However, such code does get written, so it is worth knowing how it is interpreted
...
Next, a pointer to a C-style string is
extracted from that object
...

However, the C-style string returned by c_str() was allocated as part of the temporary object holding
s1+s2, and that storage is not guaranteed to exist after that temporary is destroyed
...
The output operation cout<would be sheer luck
...

262

Expressions

Chapter 10

The problem with the if-statement is a bit more subtle
...

However, that temporary is destroyed before the controlled statement is entered, so any use of cs
there is not guaranteed to work
...
A cleaner programming style yields a more understandable program fragment and avoids the problems with temporaries completely
...
length()<8 && s[0]=='a') {
// use s here
}
}

A temporary can be used as an initializer for a const reference or a named object
...
The temporary is destroyed when ‘‘its’’ reference or named object goes out of scope
...
1
...
7)
...
5
...
For example:
void f(Shape& s, int n, char ch)
{
s
...

}

Such temporaries are destroyed in exactly the same way as the implicitly generated temporaries
...
4 Constant Expressions
C++ offers two related meanings of ‘‘constant’’:
• constexpr: Evaluate at compile time (§2
...
3)
...
2
...
5)
...
4

Constant Expressions

263

primary role is to specify immutability in interfaces
...

A constant expression is an expression that a compiler can evaluate
...
Ultimately, a constant expression
must start out with an integral value (§6
...
1), a ﬂoating-point value (§6
...
5), or an enumerator
(§8
...
In addition, some addresses can be used in some forms of constant expressions
...
4
...

There are a variety of reasons why someone might want a named constant rather than a literal or
a value stored in a variable:
[1] Named constants make the code easier to understand and maintain
...

[3] The language requires constant expressions for array sizes, case labels, and template
value arguments
...
Also, data in read-only memory is immune to
most system crashes
...

[6] Sometimes, evaluating something once (at compile time) gives signiﬁcantly better performance than doing so a million times at run time
...
We don’t just use constant expressions
because of an obsession with performance
...

As part of the deﬁnition of a data item (here, I deliberately avoid the word ‘‘variable’’),
constexpr expresses the need for compile-time evaluation
...
For example:
int x1 = 7;
constexpr int x2 = 7;
constexpr int x3 = x1;
constexpr int x4 = x2;

// error : initializer is not a constant expression
// OK

void f()
{
constexpr int y3 = x1;
constexpr int y4 = x2;
//
...
However, we
prefer not to rely on degrees of cleverness in compilers
...

264

Expressions

Chapter 10

The expressive power of constant expressions is great
...
We can use any operator that doesn’t modify state (e
...
, +, ?:, and [], but not =
or ++)
...
1
...
4
...
It is almost unfair to compare this to what is commonly
done with macros (§12
...

The conditional-expression operator ?: is the means of selection in a constant expression
...
The alternative not
selected is not evaluated and might even not be a constant expression
...
This feature is primarily useful in
constexpr functions that are sometimes used as constant expressions and sometimes not
...
4
...
Symbolic names should be used systematically to avoid ‘‘magic numbers’’
in code
...

If a numeric constant, such as an array bound, is repeated in code, it becomes hard to revise that
code because every occurrence of that constant must be changed to update the code correctly
...
Usually, a numeric constant represents an
assumption about the program
...
24 the exchange factor between Danish
kroner and U
...
dollars
...
Also, many such values need to change over time
...
Representing assumptions as well-commented
named (symbolic) constants minimizes such maintenance problems
...
4
...
5)
...
For example:

const

can also be used to express

Section 10
...
2

consts

in Constant Expressions

265

const int x = 7;
const string s = "asdf";
const int y = sqrt(x);

A const initialized with a constant expression can be used in a constant expression
...
For example:
constexpr int xx = x;
constexpr string ss = s;
constexpr int yy = y;

// OK
// error : s is not a constant expression
// error : sqr t(x) is not a constant expression

The reasons for the errors are that string is not a literal type (§10
...
3) and sqrt() is not a constexpr
function (§12
...
6)
...
In many cases, enumerators (§8
...

10
...
3 Literal Types
A sufﬁciently simple user-deﬁned type can be used in a constant expression
...

};

A class with a constexpr constructor is called a literal type
...
For example:
constexpr Point origo {0,0};
constexpr int z = origo
...
move(3,3)
};
constexpr int x = a[1]
...

Naturally, we can deﬁne constexpr functions to take arguments of literal types
...
x)+square(p
...
z));
}
constexpr Point p1 {10,20,30};
// the default constructor is constexpr
constexpr p2 {p1
...

For a member function constexpr implies const, so I did not have to write:
constexpr Point move(int dx, int dy) const { return {x+dx,y+dy}; }

10
...
4 Reference Arguments
When working with constexpr, the key thing to remember is that constexpr is all about values
...
That said, you might guess that constexpr cannot
deal with references, but that’s only partially true because const references refer to values and can
therefore be used
...
0, double im = 0
...

};

Obviously, operations, such as = and +=, that modify an object cannot be constexpr
...
The interesting member is the template constructor from
another complex type
...
real();
constexpr double im = z1
...
4
...

Literal types allow for type-rich compile-time programming
...
This has resulted in
code that was unnecessarily complicated and error-prone, as people encoded every kind of information as integers
...

Other programmers have simply preferred run-time evaluation to avoid the difﬁculties of writing in
an impoverished language
...
4
...
4
...
However, its value is assigned by the linker, rather than the compiler, so the compiler cannot know the
value of such an address constant
...
For example:
constexpr const char∗ p1 = "asdf";
constexpr const char∗ p2 = p1;
constexpr const char∗ p2 = p1+2;
constexpr char c = p1[2];

// OK
// error : the compiler does not know the value of p1
// OK, c==’d’; the compiler knows the value pointed to by p1

10
...
2
...

Wherever possible, values are converted so as not to lose information
...
A conversion is value-preserving if you can convert a value and then convert the result back to its original type and get the
original value
...
5
...
6)
...

10
...
1 Promotions
The implicit conversions that preserve values are commonly referred to as promotions
...
Similarly, ﬂoating-point promotion is used to create doubles out of ﬂoats
...
This reﬂects the original purpose of
these promotions in C: to bring operands to the ‘‘natural’’ size for arithmetic operations
...

• A char16_t, char32_t, wchar_t (§6
...
3), or a plain enumeration type (§8
...
2) is converted to
the ﬁrst of the following types that can represent all the values of its underlying type: int,
unsigned int, long, unsigned long, or unsigned long long
...
2
...
Otherwise, no integral promotion applies to it
...

Promotions are used as part of the usual arithmetic conversions (§10
...
3)
...
5
...
4)
...
For example:
void f(double d)
{
char c = d;
}

// beware: double-precision ﬂoating-point to char conversion

When writing code, you should always aim to avoid undeﬁned behavior and conversions that quietly throw away information (‘‘narrowing conversions’’)
...
Fortunately, many compilers do
...
3
...
For example:
void f(double d)
{
char c {d};
}

// error: double-precision ﬂoating-point to char conversion

If potentially narrowing conversions are unavoidable, consider using some form of run-time
checked conversion function, such as narrow_cast<>() (§11
...

10
...
2
...
A plain enumeration value can be converted to
an integer type (§8
...
2)
...
More precisely, the result
is the least unsigned integer congruent to the source integer modulo 2 to the nth, where n is the
number of bits used to represent the unsigned type
...
2
...

A Boolean or plain enumeration value can be implicitly converted to its integer equivalent
(§6
...
2, §8
...

Section 10
...
2
...
5
...
2 Floating-Point Conversions
A ﬂoating-point value can be converted to another ﬂoating-point type
...
If the source
value is between two adjacent destination values, the result is one of those values
...
For example:
ﬂoat f = FLT_MAX;
double d = f;

// largest ﬂoat value
// OK: d == f

double d2 = DBL_MAX; // largest double value
ﬂoat f2 = d2;
// undeﬁned if FLT_MAXlong double ld = d2;
// OK: ld = d3
long double ld2 = numeric_limits::max();
double d3 = ld2;
// undeﬁned if sizeof(long double)>sizeof(double)
DBL_MAX

and FLT_MAX are deﬁned in ; numeric_limits is deﬁned in (§40
...

10
...
2
...
2
...
A pointer (reference)
to a derived class can be implicitly converted to a pointer (reference) to an accessible and unambiguous base (§20
...
Note that a pointer to function or a pointer to member cannot be implicitly
converted to a void∗
...
4) that evaluates to 0 can be implicitly converted to a null pointer of
any pointer type
...
6)
...
2
...

A T∗ can be implicitly converted to a
verted to a const T&
...
5)
...
5
...
4 Pointer-to-Member Conversions
Pointers and references to members can be implicitly converted as described in §20
...
3
...
5
...
5 Boolean Conversions
Pointer, integral, and ﬂoating-point values can be implicitly converted to
value converts to true; a zero value converts to false
...

}

// true if p!=0
// true if i!=0

bool

(§6
...
2)
...

ﬁ(p);
// error : no pointer to int conversion
fb(p);
// OK: pointer to bool conversion (surprise!?)
}

Hope for a compiler warning for fb(p)
...
5
...
6 Floating-Integral Conversions
When a ﬂoating-point value is converted to an integer value, the fractional part is discarded
...
For example, the
value of int(1
...
The behavior is undeﬁned if the truncated value cannot be represented in the
destination type
...
7;
char b = 2000
...

Loss of precision occurs if an integral value cannot be represented exactly as a value of the ﬂoating
type
...

Clearly, it is best to avoid potentially value-destroying implicit conversions
...
However, general compile-time detection is impractical, so the programmer must
be careful
...
For
example:
char checked_cast(int i)
{
char c = i;
// warning: not portable (§10
...
2
...

}

Section 10
...
2
...
2
...
1
...
2)
...
3
...

10
...
3 Usual Arithmetic Conversions
These conversions are performed on the operands of a binary operator to bring them to a common
type, which is then used as the type of the result:
[1] If either operand is of type long double, the other is converted to long double
...

• Otherwise, if either operand is ﬂoat, the other is converted to ﬂoat
...
5
...

[2] Otherwise, if either operand is unsigned long long, the other is converted to unsigned long
long
...
Otherwise, if either operand is unsigned long long, the other is converted
to unsigned long long
...

• Otherwise, if either operand is long, the other is converted to long
...

• Otherwise, both operands are int
...
That is yet another reason to avoid mixing unsigned and signed integers
...
6 Advice
[1]
[2]
[3]
[4]

Prefer the standard library to other libraries and to ‘‘handcrafted code’’; §10
...
8
...
2
...

When reading, always consider ill-formed input; §10
...
3
...
) to direct use of language features (e
...
,
ints, statements); §10
...
8
...
3
...

[6] If in doubt about operator precedence, parenthesize; §10
...
3
...
3
...

[8] Avoid narrowing conversions; §10
...
2
...
4
...

[10] Avoid narrowing conversions; §10
...
2
...

– Alan Perlis

•

•
•
•

•
•

Etc
...
1 Etc
...
They have little in common beyond their details not ﬁtting elsewhere in the discussions of operators
...
1
...
The && and || operators evaluate their second argument only if necessary, so they can be used to control evaluation order (§10
...
2)
...

11
...
2 Bitwise Logical Operators
The bitwise logical operators & (and), | (or), ˆ (exclusive or, xor), ˜ (complement), >> (right shift),
and << (left shift) are applied to objects of integral types – that is, char, short, int, long, long long
and their unsigned counterparts, and bool, wchar_t, char16_t, and char32_t
...
The usual arithmetic conversions (§10
...
3) determine the type of the result
...

In this case, each bit of an unsigned integer represents one member of the set, and the number of
bits limits the number of members
...
An enumeration can be used to name the members
of such a set
...

if (state&(badbit|failbit)) // stream not good

The extra parentheses are necessary because & has higher precedence than | (§10
...

A function that reaches the end-of-input might report it like this:
state |= eofbit;

The |= operator is used to add to the state
...

These stream state ﬂags are observable from outside the stream implementation
...
rdstate();
//
...

if (cin
...

}

// rdstate() returns the state
// has anything changed?

Computing differences of stream states is not common
...
For example, consider comparing a bit vector that represents the set of interrupts
being handled with another that represents the set of interrupts waiting to be handled
...
1
...
Convenient bit manipulation can be very important, but for reliability, maintainability, portability, etc
...
For more general notions of a
set, see the standard-library set (§31
...
3) and bitset (§34
...
2)
...
For example, one could
extract the middle 16 bits of a 32-bit int like this:
constexpr unsigned short middle(int a)
{
static_assert(sizeof(int)==4,"unexpected int size");
static_assert(sizeof(short)==2,"unexpected short size");
return (a>>8)&0xFFFF;
}
int x = 0xFF00FF00; // assume sizeof(int)==4
short y = middle(x); // y = 0x00FF

Using ﬁelds (§8
...
7) is a convenient shorthand for such shifting and masking
...
The latter
return true or false, and they are primarily useful for writing the test in an if-, while-, or for-statement
(§9
...
5)
...

11
...
3 Conditional Expressions
Some if-statements can conveniently be replaced by conditional-expressions
...

Conditional expressions are important in that they can be used in constant expressions (§10
...

A pair of expressions e1 and e2 can be used as alternatives in a conditional expression, c?e1:e2,
if they are of the same type or if there is a common type T, to which they can both be implicitly
converted
...
5
...
For other types, either e1 must be implicitly convertible to e2’s type or vice versa
...
5
...
For example:
void fct(int∗ p)
{
int i = (p) ? ∗p : std::runtime_error{"unexpected nullptr};
//
...
1
...
Provided lvalue has no side effects, ++lvalue means
lvalue+=1, which again means lvalue=lvalue+1
...
Decrementing is similarly expressed by the −− operator
...
The value of ++x is
the new (that is, incremented) value of x
...
The value
of x++, however, is the old value of x
...

Like adding an int to a pointer, or subtracting it, ++ and −− on a pointer operate in terms of elements of the array into which the pointer points; p++ makes p point to the next element (§7
...
1)
...
For example, one can copy a zero-terminated C-style string like this:
void cpy(char∗ p, const char∗ q)
{
while (∗p++ = ∗q++) ;
}

Like C, C++ is both loved and hated for enabling such terse, expression-oriented coding
...
Consider ﬁrst a more traditional way of copying
an array of characters:
int length = strlen(q);
for (int i = 0; i<=length; i++)
p[i] = q[i];

This is wasteful
...
Thus, we read the string twice: once to ﬁnd its length and once to copy it
...
1
...
We can therefore rewrite the example like this:
while ((∗p++ = ∗q++) != 0) { }

In this case, we don’t notice that ∗q is zero until we already have copied it into ∗p and incremented
p
...
Finally, we can
reduce the example further by observing that we don’t need the empty block and that the !=0 is
redundant because the result of an integral condition is always compared to zero anyway
...
Is this version more efﬁcient in time or space than the previous versions? Except for the ﬁrst
version that called strlen(), not really; the performance will be equivalent and often identical code
will be generated
...
h>

For more general copying, the standard copy algorithm (§4
...
5) can be used
...
Standardlibrary functions may be inlined (§12
...
3) or even implemented using specialized machine instructions
...
Even if it does, the advantage may not exist on some other
handware+compiler combination, and your alternative may give a maintainer a headache
...
2 Free Store
A named object has its lifetime determined by its scope (§6
...
4)
...
For example, it is common to create objects that can be used after returning from the function in which they were created
...
Objects
allocated by new are said to be ‘‘on the free store’’ (also, ‘‘on the heap’’ or ‘‘in dynamic memory’’)
...
2)
...

};

278

Select Operations

Chapter 11

Enode∗ expr(bool get)
{
Enode∗ left = term(get);
for (;;) {
switch (ts
...
kind) {
case Kind::plus:
case Kind::minus:
left = new Enode {ts
...
kind,left,term(true)};
break;
default:
return left;
// return node
}
}
}

In cases Kind::plus and Kind::minus, a new Enode is created on the free store and initialized by the
value {ts
...
kind,left,term(true)}
...

I used the {}-list notation for specifying arguments
...
However, trying the = notation for initializing an object
created using new results in an error:
int∗ p = new int = 7; // error

If a type has a default constructor, we can leave out the initializer, but built-in types are by default
uninitialized
...
To be sure to get default initialization, use {}
...
Then, the space it occupied can be reused by new
...
Consequently, I will assume that objects created by new are manually freed using delete
...
2

Free Store

279

The delete operator may be applied only to a pointer returned by new or to the nullptr
...

If the deleted object is of a class with a destructor (§3
...
1
...
2), that destructor is called by
delete before the object’s memory is released for reuse
...
2
...

• Premature deletion: People delete an object that they have some other pointer to and later
use that other pointer
...

Leaked objects are potentially a bad problem because they can cause a program to run out of space
...
Consider
this example of very bad code:
int∗ p1 = new int{99};
int∗ p2 = p1;
delete p1;
p1 = nullptr;
char∗ p3 = new char{'x'};
∗p2 = 999;
cout << ∗p3 << '\n';

// potential trouble
// now p2 doesn’t point to a valid object
// gives a false sense of safety
// p3 may now point to the memory pointed to by p2
// this may cause trouble
// may not print x

Double deletion is a problem because resource managers typically cannot track what code owns a
resource
...
use *p
...
wait a while
...
Replace int with string in that example, and we’ll see string’s
destructor trying to read memory that has been reallocated and maybe overwritten by other code,
and using what it read to try to delete memory
...

The reason people make these mistakes is typically not maliciousness and often not even simple
sloppiness; it is genuinely hard to consistently deallocate every allocated object in a large program
(once and at exactly the right point in a computation)
...

280

Select Operations

Chapter 11

As alternatives to using ‘‘naked’’ news and deletes, I can recommend two general approaches to
resource management that avoid such problems:
[1] Don’t put objects on the free store if you don’t have to; prefer scoped variables
...
Examples are string,
vector and all the other standard-library containers, unique_ptr (§5
...
1, §34
...
1), and
shared_ptr (§5
...
1, §34
...
2)
...
Many classical uses of free store can be eliminated by using move semantics
(§3
...
5
...

This rule [2] is often referred to as RAII (‘‘Resource Acquisition Is Initialization’’; §5
...
3) and
is the basic technique for avoiding resource leaks and making error handling using exceptions simple and safe
...
push_back(c);
//
...

In this example, push_back() does news to acquire space for its elements and deletes to free space
that it no longer needs
...

The Token_stream from the calculator example is an even simpler example (§10
...
2)
...
For example:
string reverse(const string& s)
{
string ss;
for (int i=s
...
push_back(s[i]);
return ss;
}

Like vector, a string is really a handle to its elements
...
3
...

The resource management ‘‘smart pointers’’ (e
...
, unique_ptr and smart_ptr) are a further example of these ideas (§5
...
1, §34
...
1)
...
2
...

if (n%2) throw runtime_error("odd");
delete[] p1;
// we may never get here
}

For f(3) the memory pointed to by p1 is leaked, but the memory pointed to by p2 is correctly and
implicitly deallocated
...
In addition, new is often used in arguments to resource handles
...
g
...
5)
...
2
...
For example:
char∗ save_string(const char∗ p)
{
char∗ s = new char[strlen(p)+1];
strcpy(s,p);
// copy from p to s
return s;
}
int main(int argc, char∗ argv[])
{
if (argc < 2) exit(1);
char∗ p = save_string(argv[1]);
//
...

Unless you really must use a char∗ directly, the standard-library string can be used to simplify
the save_string():
string save_string(const char∗ p)
{
return string{p};
}
int main(int argc, char∗ argv[])
{
if (argc < 2) exit(1);
string s = save_string(argv[1]);
//
...

282

Select Operations

Chapter 11

To deallocate space allocated by new, delete and delete[] must be able to determine the size of
the object allocated
...
At a minimum, space is needed to hold the
object’s size
...
Most
modern machines use 8-byte words
...
g
...

Note that a vector (§4
...
1, §31
...
For example:
void f(int n)
{
vector∗ p = new vector(n);
// individual object
int∗ q = new int[n];
// array
//
...
2
...
Applying delete[] to the null pointer has no effect
...
For example:
void f1()
{
X∗ p =new X;
//
...

delete p;
}

That’s verbose, inefﬁcient, and error-prone (§13
...
In particular, a return or an exception thrown
before the delete will cause a memory leak (unless even more code is added)
...
use x
...

11
...
3 Getting Memory Space
The free-store operators
in the header:

new, delete, new[],

void∗ operator new(size_t);
void operator delete(void∗ p);

and

delete[]

are implemented using functions presented

// allocate space for individual object
// if (p) deallocate space allocated using operator new()

Section 11
...
3

void∗ operator new[](size_t);
void operator delete[](void∗ p);

Getting Memory Space

283

// allocate space for array
// if (p) deallocate space allocated using operator new[]()

When operator new needs to allocate space for an object, it calls operator new() to allocate a suitable
number of bytes
...

The standard implementations of operator new() and operator new[]() do not initialize the memory returned
...
Consequently, they take arguments or return
values of type void∗
...

What happens when new can ﬁnd no store to allocate? By default, the allocator throws a standard-library bad_alloc exception (for an alternative, see §11
...
4
...
For example:
void f()
{
vector v;
try {
for (;;) {
char ∗ p = new char[10000];
v
...
Please
be careful: the new operator is not guaranteed to throw when you run out of physical main memory
...

We can specify what new should do upon memory exhaustion; see §30
...
1
...

In addition to the functions deﬁned in , a user can deﬁne operator new(), etc
...
2
...
Class members operator new(), etc
...

11
...
4 Overloading new
By default, operator new creates its object on the free store
...

};

284

Select Operations

Chapter 11

We can place objects anywhere by providing an allocator function (§11
...
3) with extra arguments
and then supplying such extra arguments when using new:
void∗ operator new(size_t, void∗ p) { return p; }

// explicit placement operator

void∗ buf = reinterpret_cast(0xF00F);
X∗ p2 = new(buf) X;

// signiﬁcant address
// construct an X at buf;
// invokes: operator new(sizeof(X),buf)

Because of this usage, the new(buf) X syntax for supplying extra arguments to operator new() is
known as the placement syntax
...
2
...
The operator new() used by the
new operator is chosen by the usual argument matching rules (§12
...

The ‘‘placement’’ operator new() is the simplest such allocator
...
5)
...

};
void∗ operator new(size_t sz, Arena∗ a)
{
return a−>alloc(sz);
}

Now objects of arbitrary types can be allocated from different Arenas as needed
...

}

// X in persistent storage
// X in shared memory

Section 11
...
4

Overloading new

285

Placing an object in an area that is not (directly) controlled by the standard free-store manager
implies that some care is required when destroying the object
...
Even most resource handles can be written using new and delete
...
4
...
3
...
A novice should think thrice before
calling a destructor explicitly and also should ask a more experienced colleague before doing so
...
6
...

There is no special syntax for placement of arrays
...
However, an operator delete() can be deﬁned for arrays (§11
...
3)
...
2
...
1 nothrow new
In programs where exceptions must be avoided (§13
...
5), we can use
delete
...
handle allocation error
...

operator delete(nothrow,p);
// deallocate *p
}

That nothrow is the name of an object of the standard-library type nothrow_t that is used for disambiguation; nothrow and nothrow_t are declared in
...

286

Select Operations

Chapter 11

11
...
3
...
2), {}-lists can be used as expressions
in many (but not all) places
...
}, meaning ‘‘create an object of type T initialized by T{
...
3
...
}, for which the the type must be determined from the context of use;
§11
...
3
For example:
struct S { int a, b; };
struct SS { double a, b; };
void f(S);

// f() takes an S

void g(S);
void g(SS);

// g() is overloaded

void h()
{
f({1,2});
g({1,2});
g(S{1,2});
g(SS{1,2});

// OK: call f(S{1,2})
// error : ambiguous
// OK: call g(S)
// OK: call g(SS)

}

As in their use for initializing named variables (§6
...
5), lists can have zero, one, or more elements
...

11
...
1 Implementation Model
The implementation model for {}-lists comes in three parts:
• If the {}-list is used as constructor arguments, the implementation is just as if you had used a
()-list
...

• If the {}-list is used to initialize the elements of an aggregate (an array or a class without a
constructor), each list element initializes an element of the aggregate
...

• If the {}-list is used to construct an initializer_list object each list element is used to initialize
an element of the underlying array of the initializer_list
...

Note that this is the general model that we can use to understand the semantics of a {}-list; a compiler may apply clever optimizations as long as the meaning is preserved
...
14};

The standard-library

vector

has an initializer-list constructor (§17
...
4), so the initializer list

Section 11
...
1

{1,2,3
...
14 } ;
const initializer_list tmp(temp,sizeof(temp)/sizeof(double));
vector v(tmp);

That is, the compiler constructs an array containing the initializers converted to the desired type
(here, double)
...
The
initializer-list constructor then copies the values from the array into its own data structure for elements
...

The underlying array is immutable, so there is no way (within the standard’s rules) that the
meaning of a {}-list can change between two uses
...
begin() << '\n';
∗lst
...
begin() << '\n';
}

In particular, having a {}-list be immutable implies that a container taking elements from it must use
a copy operation, rather than a move operation
...
4
...
When used to initialize a variable of type initializer_list, the list lives as long as the
variable
...

11
...
2 Qualiﬁed Lists
The basic idea of initializer lists as expressions is that if you can initialize a variable
notation

x

using the

T x {v};

then you can create an object with the same value as an expression using T{v} or new T{v}
...
4
...
For example:
struct S { int a, b; };
void f()
{
S v {7,8};
v = S{7,8};
S∗ p = new S{7,8};
}

// direct initialization of a variable
// assign using qualiﬁed list
// construct on free store using qualiﬁed list

The rules constructing an object using a qualiﬁed list are those of direct initialization (§16
...
6)
...
For example:
template
T square(T x)
{
return x∗x;
}
void f(int i)
{
double d = square(double{i});
complex z = square(complex{i});
}

That idea is explored further in §11
...
1
...
3
...
It can be used as an
expression only as:
• A function argument
• A return value
• The right-hand operand of an assignment operator (=, +=, ∗=, etc
...
0});
return {11};

// right-hand operand of assignment
// right-hand operand of assignment
// error: not left-hand operand of assignment
// error: not an operand of a non-assignment operator
// function argument
// return value

}

The reason that an unqualiﬁed list is not allowed on the left-hand side of assignments is primarily
that the C++ grammar allows { in that position for compound statements (blocks), so that readability would be a problem for humans and ambiguity resolution would be tricky for compilers
...

When used as the initializer for a named object without the use of a = (as for v above), an
unqualiﬁed {}-list performs direct initialization (§16
...
6)
...
2
...
In particular, the otherwise redundant = in an initializer restricts the set of
initializations that can be performed with a given {}-list
...
3
...
2
...

Its most obvious use is to allow initializer lists for user-deﬁned containers (§3
...
1
...
size()==0) return high;
for (auto x : val)
if (x>high) high = x;
return high;
}
int v1 = high_value({1,2,3,4,5,6,7});
int v2 = high_value({−1,2,v1,4,−9,20,v1});

A {}-list is the simplest way of dealing with homogeneous lists of varying lengths
...
If so, that case should be handled by a default constructor (§17
...
3)
...
For example:
auto x0 = {};
auto x1 = {1};
auto x2 = {1,2};
auto x3 = {1,2,3};
auto x4 = {1,2
...
For
example:
template
void f(T);
f({});
f({1});
f({1,2});
f({1,2,3});

// error: type of initializer is unknown
// error: an unqualiﬁed list does not match ‘‘plain T’’
// error: an unqualiﬁed list does not match ‘‘plain T’’
// error: an unqualiﬁed list does not match ‘‘plain T’’

I say ‘‘unfortunately’’ because this is a language restriction, rather than a fundamental rule
...

Similarly, we do not deduce the element type of a container represented as a template
...
To deduce T the compiler would ﬁrst have to decide that
the user really wanted a vector and then look into the deﬁnition of vector to see if it has a constructor that accepts {1,2,3}
...
2)
...
To call f2(), be
more speciﬁc:
f2(vector{1,2,3});
f2(vector{"Kona","Sidney"});

// OK
// OK

11
...
Instead of deﬁning a named class with an operator(), later making an object of that
class, and ﬁnally invoking it, we can use a shorthand
...
In the context of graphical user interfaces (and
elsewhere), such operations are often referred to as callbacks
...
4
...
4, §33
...
2)
...
The capture list is delimited by [] (§11
...
3)
...
The
parameter list is delimited by () (§11
...
4)
...
e
...
4
...
4)
...

• An optional return type declaration of the form −> type (§11
...
4)
...
The body is delimited by {} (§11
...
3)
...
The notion of ‘‘capture’’ of local variables is not provided for
functions
...

11
...
1 Implementation Model
Lambda expressions can be implemented in a variety of ways, and there are some rather effective
ways of optimizing them
...
Consider a relatively simple
example:

Section 11
...
1

Implementation Model

291

void print_modulo(const vector& v, ostream& os, int m)
// output v[i] to os if v[i]%m==0
{
for_each(begin(v),end(v),
[&os,m](int x) { if (x%m==0) os << x << '\n'; }
);
}

To see what this means, we can deﬁne the equivalent function object:
class Modulo_print {
ostream& os; // members to hold the capture list
int m;
public:
Modulo_print(ostream& s, int mm) :os(s), m(mm) {}
void operator()(int x) const
{ if (x%m==0) os << x << '\n'; }
};

// capture

The capture list, [&os,m], becomes two member variables and a constructor to initialize them
...
This use of & mirrors its use in function argument declarations
...
Since the lambda doesn’t
return a value, the operator()() is void
...
That’s by far the most common case
...
4
...
4)
...

An object of a class generated from a lambda is called a closure object (or simply a closure)
...

[&]),

the

11
...
2 Alternatives to Lambdas
That ﬁnal version of print_modulo() is actually quite attractive, and naming nontrivial operations is
generally a good idea
...

However, many lambdas are small and used only once
...
For example:

292

Select Operations

void print_modulo(const vector& v, ostream& os, int m)
// output v[i] to os if v[i]%m==0
{
class Modulo_print {
ostream& os; // members to hold the capture list
int m;
public:
Modulo_print (ostream& s, int mm) :os(s), m(mm) {}
void operator()(int x) const
{ if (x%m==0) os << x << '\n'; }
};

Chapter 11

// capture

for_each(begin(v),end(v),Modulo_print{os,m});
}

Compared to that, the version using the lambda is a clear winner
...
Doing so forces us to consider the design of the operation
a bit more carefully
...
4
...

Writing a for-loop is an alternative to using a lambda with a for_each()
...
However, for_each is a
rather special algorithm, and vector is a very speciﬁc container
...
The C++ range-for-statement speciﬁcally caters to the special
case of traversing a sequence from its beginning to its end
...
4
...
For example, using a for-statement to traverse a map gives a depth-ﬁrst
traversal
...
For example:
template
void print_modulo(const C& v, ostream& os, int m)
// output v[i] to os if v[i]%m==0
{
breadth_ﬁrst(begin(v),end(v),
[&os,m](int x) { if (x%m==0) os << x << '\n'; }
);
}

Thus, a lambda can be used as ‘‘the body’’ for a generalized loop/traversal construct represented as
an algorithm
...

The performance of a lambda as an argument to a traversal algorithm is equivalent (typically
identical) to that of the equivalent loop
...
The implication is that we have to base our choice between ‘‘algorithm plus
lambda’’ and ‘‘for-statement with body’’ on stylistic grounds and on estimates of extensibility and
maintainability
...
4
...
Lambdas allow that to
be done ‘‘inline’’ without having to name a function (or function object) and use it elsewhere
...
Such lambdas are deﬁned with the
empty lambda introducer []
...
begin(),v
...

sort(v
...
end(),[](int x, int y) { return abs(x)//
...

sort(v
...
end(),
[](int x, int y) { return sensitive ? x);
}

// error : can’t access sensitive

I used the lambda introducer []
...
The ﬁrst character of a lambda expression is
always [
...
This implies that no local names from the surrounding context can
be used in the lambda body
...

• [&]: implicitly capture by reference
...
All local variables are
accessed by reference
...
All local names can be used
...

• [capture-list]: explicit capture; the capture-list is the list of names of local variables to be
captured (i
...
, stored in the object) by reference or by value
...
Other variables are captured by value
...
as elements
...
The capture list can contain this
...

Variables named in the capture list are captured by value
...
The capture list cannot contain this
...
Variables named in the capture list are captured by reference
...
Only capture by reference allows modiﬁcation of variables
in the calling environment
...
For example:
void f(vector& v)
{
bool sensitive = true;
//
...
begin(),v
...
By not
specifying otherwise, we ensure that the capture of sensitive is done ‘‘by value’’; just as for argument passing, passing a copy is the default
...

The choice between capturing by value and by reference is basically the same as the choice for
function arguments (§12
...
We use a reference if we need to write to the captured object or if it is
large
...
4
...
1)
...

If you need to capture a variadic template (§28
...
For example:

Section 11
...
3

Capture

295

template ...
v)
{
auto helper = [&s,&v
...
)+h2(v
...

}

Beware that is it easy to get too clever about capture
...
When that’s the case, capture is usually the least typing but has the greatest
potential for confusion
...
4
...
1 Lambda and Lifetime
A lambda might outlive its caller
...
For example:
void setup(Menu& m)
{
//
...
add("draw triangle",[&]{ m
...

}

// probable disaster

Assuming that add() is an operation that adds a (name,action) pair to a menu and that the draw()
operation makes sense, we are left with a time bomb: the setup() completes and later – maybe minutes later – a user presses the draw triangle button and the lambda tries to access the long-gone local
variables
...

If a lambda might outlive its caller, we must make sure that all local information (if any) is
copied into the closure object and that values are returned through the return mechanism (§12
...
4)
or through suitable arguments
...
add("draw triangle",[=]{ m
...
4
...

[=]

and

[&]

as short-hand

11
...
3
...
For example:
template
ostream& operator<<(ostream& os, const pair& p)
{
return os << '{' << p
...
second << '}';
}

296

Select Operations

Chapter 11

void print_all(const map& m, const string& label)
{
cout << label << ":\n{\n";
for_each(m
...
end(),
[](const pair& p) { cout << p << '\n'; }
);
cout << "}\n";
}

Here, we don’t need to capture cout or the output operator for pair
...
4
...
3 Lambda and this
How do we access members of a class object from a lambda used in a member function? We can
include class members in the set of names potentially captured by adding this to the capture list
...
For
example, we might have a class for building up requests and retrieving results:
class Request {
function(const map&)> oper;
map values;
// arguments
map results;
// targets
public:
Request(const string& s);
// parse and store request
void execute()
{
[this]() { results=oper(values); }
}

// operation

// do oper to values yielding results

};

Members are always captured by reference
...
Unfortunately, [this] and [=] are incompatible
...
4
...

11
...
3
...
That is, the operator()() for the generated function object (§11
...
1) is a const member function
...
4
...
For example:
void algo(vector& v)
{
int count = v
...
begin(),v
...

Section 11
...
4

Call and Return

297

11
...
4 Call and Return
The rules for passing arguments to a lambda are the same as for a function (§12
...
1
...
In fact, with the exception of the rules for capture (§11
...
3)
most rules for lambdas are borrowed from the rules for functions and classes
...

Thus, the minimal lambda expression is []{}
...
Unfortunately, that is
not also done for a function
...
If a lambda
body consists of just a single return-statement, the lambda’s return type is the type of the return’s
expression
...
For example:
void g(double y)
{
[&]{ f(y); }
auto z1 = [=](int x){ return x+y; }
auto z2 = [=,y]{ if (y) return 1; else return 2; }
auto z3 =[y]() { return 1 : 2; }
auto z4 = [=,y]()−>int { if (y) return 1; else return 2; }

// return type is void
// return type is double
// error : body too complicated
// for return type deduction
// return type is int
// OK: explicit return type

}

When the sufﬁx return type notation is used, we cannot omit the argument list
...
4
...
However, it is deﬁned to be the type of a function object in the style presented in §11
...
1
...

Had two lambdas had the same type, the template instantiation mechanism might have gotten confused
...
In addition to using a lambda as an argument, we can use it to initialize a variable declared
auto or std::function where R is the lambda’s return type and AL is its argument list of types
(§33
...
3)
...

Instead, I can introduce a name and then use it:
void f(string& s1, string& s2)
{
function rev =
[&](char∗ b, char∗ e) { if (1
298

Select Operations

Chapter 11

rev(&s1[0],&s1[0]+s1
...
size());
}

Now, the type of rev is speciﬁed before it is used
...
size());
rev(&s2[0],&s2[0]+s2
...
For
example:
double (∗p1)(double) = [](double a) { return sqrt(a); };
double (∗p2)(double) = [&](double a) { return sqrt(a); };
double (∗p3)(int) = [](int a) { return sqrt(a); };

// error : the lambda captures
// error : argument types do not match

11
...
Many (arguably too
many) such conversions are done implicitly according to the language rules (§2
...
2, §10
...
For
example:
double d = 1234567890; // integer to ﬂoating-point
int i = d;
// ﬂoating-point to integer

In other cases, we have to be explicit
...
5
...
5)
• static_cast for reversing a well-deﬁned implicit conversion (§11
...
2)
• reinterpret_cast for changing the meaning of bit patterns (§11
...
2)
• dynamic_cast for dynamically checked class hierarchy navigation (§22
...
1)
• C-style casts, providing any of the named conversions and some combinations of those
(§11
...
3)
• Functional notation, providing a different notation for C-style casts (§11
...
4)
I have ordered these conversions in my order of preference and safety of use
...
For conversion between two scalar numeric types, I tend to use a homemade
explicit conversion function, narrow_cast, where a value might be narrowed:

Section 11
...
That is a generalization of the rule the language
applies to values in {} initialization (§6
...
5
...
For example:
void test(double d, int i, char∗ p)
{
auto c1 = narrow_cast(64);
auto c2 = narrow_cast(−64);
auto c3 = narrow_cast(264);

// will throw if chars are unsigned
// will throw if chars are 8-bit and signed

auto d1 = narrow_cast(1/3
...
0);
// will probably throw
auto c4 = narrow_cast(i);
auto f2 = narrow_cast<ﬂoat>(d);

// may throw
// may throw

auto p1 = narrow_cast(i);
auto i1 = narrow_cast(p);

// compile-time error
// compile-time error

auto d2 = narrow_cast(i);
auto i2 = narrow_cast(d);

// may throw (but probably will not)
// may throw

}

Depending on your use of ﬂoating-point numbers, it may be worthwhile to use a range test for
ﬂoating-point conversions, rather than !=
...
3
...
1) or
type traits (§35
...
1)
...
5
...
8
...
4)
...
0
// d1==0
...

300

Select Operations

Chapter 11

void f(int);
void f(double);
void g(int i, double d)
{
f(i);
f(double{i});

// call f(int)
// error : {} doesn’t do int to ﬂoating conversion

f(d);
f(int{d});
f(static_cast(d));

// call f(double)
// error : {} doesn’t truncate
// call f(int) with a truncated value

f(round(d));
// call f(double) with a rounded value
f(static_cast(lround(d))); // call f(int) with a rounded value
// if the d is overﬂows the int, this still truncates
}

I don’t consider truncation of ﬂoating-point numbers (e
...
, 7
...
If rounding is desirable, we can use the standardlibrary function round(); it performs ‘‘conventional 4/5 rounding,’’ such as 7
...
4 to 7
...
Consider:
static_assert(sizeof(int)==sizeof(double),"unexpected sizes");
int x = numeric_limits::max(); // largest possible integer
double d = x;
int y = x;

We will not get x==y
...
For example:
double d { 1234 };

double

with an integer literal that can be

// ﬁne

Explicit qualiﬁcation with the desired type does not enable ill-behaved conversions
...

}

// error: no char* to int conversion
// error: no char* to int* conversion

For T{v}, ‘‘reasonably well behaved’’ is deﬁned as having a ‘‘non-narrowing’’ (§10
...
3)
...
For example:

Section 11
...
1

Construction

301

template void f(const T&);
void g3()
{
f(int{});
f(complex{});
//
...
3
...

Thus, int{} is another way of writing 0
...
2
...
1, §17
...

Explicitly constructed unnamed objects are temporary objects, and (unless bound to a reference)
their lifetime is limited to the full expression in which they are used (§6
...
2)
...
2)
...
5
...
For example:
IO_device∗ d1 = reinterpret_cast(0Xff00); // device at 0Xff00

There is no way a compiler can know whether the integer 0Xff00 is a valid address (of an I/O device
register)
...
Explicit type conversion, often called casting, is occasionally essential
...

Another classical example of the need for explicit type conversion is dealing with ‘‘raw memory,’’ that is, memory that holds or will hold objects of a type not known to the compiler
...
2
...

}

// new allocation used as ints

A compiler does not know the type of the object pointed to by the void∗
...
It also does conversions deﬁned by constructors (§16
...
6, §18
...
3, §iso
...
2
...
4)
...
5
...
10)
...
5
...
11)
...
2
...
5
...
7)
...
Some static_casts are portable, but few reinterpret_casts are
...
If the target has at least as many bits as the original value, we can reinterpret_cast the result back to its original type and use it
...
Note that reinterpret_cast
is the kind of conversion that must be used for pointers to functions (§12
...
Consider:
char x = 'a';
int∗ p1 = &x;
int∗ p2 = static_cast(&x);
int∗ p3 = reinterpret_cast(&x);

// error : no implicit char* to int* conversion
// error : no implicit char* to int* conversion
// OK: on your head be it

struct B { /*
...
*/ };

// see §3
...
2 and §20
...
2

B∗ pb = new D;
D∗ pd = pb;
D∗ pd = static_cast(pb);

// OK: implicit conversion from D* to B*
// error : no implicit conversion from B* to D*
// OK

Conversions among class pointers and among class reference types are discussed in §22
...

If you feel tempted to use an explicit type conversion, take the time to consider if it is really
necessary
...
3
...
3
...
2
...
In many programs, explicit type conversion can be completely avoided; in others, its use can be localized to a
few routines
...
5
...
2
...
Unfortunately, the C-style cast can also cast from a pointer to a class to a
pointer to a private base of that class
...
This C-style cast is far more dangerous than the named conversion operators
because the notation is harder to spot in a large program and the kind of conversion intended by the
programmer is not explicit
...
Without knowing the exact types of T and e, you cannot tell
...
5
...
5
...
For example:

T

from a value

e

can be expressed by the functional notation

void f(double d)
{
int i = int(d);
// truncate d
complex z = complex(d); // make a complex from d
//
...
Unfortunately, for a built-in
type T, T(e) is equivalent to (T)e (§11
...
3)
...

void f(double d, char∗ p)
{
int a = int(d); // truncates
int b = int(p); // not portable
//
...

Prefer T{v} conversions for well-behaved construction and the named casts (e
...
, static_cast) for
other conversions
...
6 Advice
[1]
[2]
[3]
[4]
[5]
[6]
[7]
[8]
[9]
[10]
[11]
[12]
[13]
[14]

Prefer preﬁx ++ over sufﬁx ++; §11
...
4
...
2
...

Don’t put objects on the free store if you don’t have to; prefer scoped variables; §11
...
1
...
2
...

Use RAII; §11
...
1
...
4
...

Prefer a named function object to a lambda if the operation is generally useful; §11
...
2
...
4
...

For maintainability and correctness, be careful about capture by reference; §11
...
3
...

Let the compiler deduce the return type of a lambda; §11
...
4
...
5
...

Avoid explicit type conversion (casts); §11
...

When explicit type conversion is necessary, prefer a named cast; §11
...

Consider using a run-time checked cast, such as narrow_cast<>(), for conversion between
numeric types; §11
...

This page intentionally left blank

12
Functions
Death to all fanatics!
– Paradox

•

•

•

•
•
•
•

Function Declarations
Why Functions?; Parts of a Function Declaration; Function Deﬁnitions; Returning Values;
inline Functions; constexpr Functions; [[noreturn]] Functions; Local Variables
Argument Passing
Reference Arguments; Array Arguments; List Arguments; Unspeciﬁed Number of Arguments; Default Arguments
Overloaded Functions
Automatic Overload Resolution; Overloading and Return Type; Overloading and Scope;
Resolution for Multiple Arguments; Manual Overload Resolution
Pre- and Postconditions
Pointer to Function
Macros
Conditional Compilation; Predeﬁned Macros; Pragmas
Advice

12
...
Deﬁning a
function is the way you specify how an operation is to be done
...

A function declaration gives the name of the function, the type of the value returned (if any),
and the number and types of the arguments that must be supplied in a call
...
2
...

Argument types are checked and implicit argument type conversion takes place when necessary
...

A function declaration may contain argument names
...
As a return type, void means that the function does not return a value (§6
...
7)
...
For class member
functions (§2
...
2, §16
...
For example:
double f(int i, const Info&);
char& String::operator[](int);

// type: double(int,const Info&)
// type: char& String::(int)

12
...
1 Why Functions?
There is a long and disreputable tradition of writing very long functions – hundreds of lines long
...
Writers of
such functions seem to fail to appreciate one of the primary purposes of functions: to break up complicated computations into meaningful chunks and name them
...
The ﬁrst step to comprehensibility is to break computational tasks into comprehensible chunks (represented as functions and
classes) and name those
...
The C++ standard algorithms (e
...
, ﬁnd, sort, and iota) provide a good start (Chapter 32)
...

The number of errors in code correlates strongly with the amount of code and the complexity of
the code
...
Using a function
to do a speciﬁc task often saves us from writing a speciﬁc piece of code in the middle of other code;
making it a function forces us to name the activity and document its dependencies
...
6) and continues (§9
...
5)
...
g
...
6)
...
Bugs tend to creep in when we can view only part of an algorithm at a time
...
My ideal is a much smaller size still,
maybe an average of 7 lines
...
Where that cost
could be signiﬁcant (e
...
, for frequently used access functions, such as vector subscripting) inlining
can eliminate it (§12
...
5)
...

Section 12
...
2

Parts of a Function Declaration

307

12
...
2 Parts of a Function Declaration
In addition to specifying a name, a set of arguments, and a return type, a function declaration can
contain a variety of speciﬁers and modiﬁers
...
1
...
1
...
5
...
1)
• A linkage speciﬁcation, for example, static (§15
...
1
...
3
...
3
...
1)
• ﬁnal, indicating that it cannot be overriden in a derived class (§20
...
4
...
2
...
2
...
1, §16
...
9
...
1
...
2
...
A function deﬁnition is a function declaration in which the body of the function is presented
...
Unfortunately, to preserve C compatibility, a const is ignored at the highest level of an argument type
...

Function argument names are not part of the function type and need not be identical in different
declarations
...
Conversely, we can indicate that an argument is unused in a function deﬁnition by not naming it
...
In both cases, leaving the argument in place, although unused, ensures that callers are
not affected by the change
...
2):
• Constructors (§2
...
2, §16
...
5) are technicallly not functions; in particular, they don’t return
a value, can initialize bases and members (§17
...

• Destructors (§3
...
1
...
2) can’t be overloaded and can’t have their address taken
...
4
...
2
...

• Lambda expressions (§3
...
3, §11
...

12
...
4 Returning Values
Every function declaration contains a speciﬁcation of the function’s return type (except for constructors and type conversion functions)
...
However, a function declaration can also
be written using a syntax that places the return type after the argument list
...
1
...
The sufﬁx
return type is preceded by −>
...
For example:
template
auto product(const vector& x, const vector& y) −> decltype(x∗y);

However, the sufﬁx return syntax can be used for any function
...
4
...
4); it
is a pity those two constructs are not identical
...

A value must be returned from a function that is not declared void (however, main() is special;
see §2
...
1)
...
For example:
int f1() { }
void f2() { }

// error: no value returned
// OK

int f3() { return 1; }
void f4() { return 1; }

// OK
// error : return value in void function

int f5() { return; }
void f6() { return; }

// error : return value missing
// OK

A return value is speciﬁed by a return-statement
...

There can be more than one return-statement in a function:
int fac2(int n)
{
if (n > 1)
return n∗fac2(n−1);
return 1;
}

Like the semantics of argument passing, the semantics of function value return are identical to the
semantics of copy initialization (§16
...
6)
...
The type of a return expression is checked against the type of the returned type, and all standard and user-deﬁned type conversions are performed
...
The store is reused after the function returns, so a pointer to a local non-static variable should
never be returned
...

return &local; // bad
}

An equivalent error can occur when using references:
int& fr()
{
int local = 1;
//
...

There are no void values
...
For example:
void g(int∗ p);
void h(int∗ p)
{
//
...

A return-statement is one of ﬁve ways of exiting a function:
• Executing a return-statement
...

This is allowed only in functions that are not declared to return a value (i
...
, void functions)
and in main(), where falling off the end indicates successful completion (§12
...
4)
...
5)
...
5
...
1)
...
g
...
4)
...
e
...
1
...

12
...
5 inline Functions
A function can be deﬁned to be inline
...
1
...
A clever compiler can generate the constant 720 for a call fac(6)
...
, makes it impossible to guarantee that every call of an inline function is actually inlined
...
If you want a guarantee that a value is
computed at compile time, declare it constexpr and make sure that all functions used in its evaluation are constexpr (§12
...
6)
...
2)
...
In particular, an inline function still has
a unique address, and so do static variables (§12
...
8) of an inline function
...
g
...
2
...
2
...

12
...
6 constexpr Functions
In general, a function cannot be evaluated at compile time and therefore cannot be called in a constant expression (§2
...
3, §10
...
By specifying a function constexpr, we indicate that we want it to
be usable in constant expressions if given constant expressions as arguments
...
’’ When used in an object deﬁnition, it means
‘‘evaluate the initializer at compile time
...

}

To be evaluated at compile time, a function must be suitably simple: a

constexpr

function must

312

Functions

Chapter 12

consist of a single return-statement; no loops and no local variables are allowed
...
That is, a constexpr function is a pure function
...
4
...

A constexpr function allows recursion and conditional expressions
...
However, you’ll ﬁnd the
debugging gets unnecessarily difﬁcult and compile times longer than you would like unless you
restrict the use of constexpr functions to the relatively simple tasks for which they are intended
...
4
...

Like inline functions, constexpr functions obey the ODR (‘‘one-deﬁnition rule’’), so that deﬁnitions in the different translation units must be identical (§15
...
3)
...
1
...

12
...
6
...
However, a constexpr function can refer to nonlocal objects as long as it does not write to them
...
Of course, it cannot write through such references, but const reference parameters are as useful as ever
...
4) we ﬁnd:

Section 12
...
6
...

explicit constexpr complex(const complex&);
//
...
0};

The temporary variable that is logically constructed to hold the const reference argument simply
becomes a value internal to the compiler
...
For example:
constexpr const int∗ addr(const int& r) { return &r; }

// OK

However, doing so brings us away from the fundamental role of constexpr functions as parts of constant expression evaluation
...
Consider:
static const int x = 5;
constexpr const int∗ p1 = addr(x);
constexpr int xx = ∗p1;

// OK
// OK

static int y;
constexpr const int∗ p2 = addr(y);
constexpr int yy = ∗y;

// OK
// error : attempt to read a variable

constexpr const int∗ tp = addr(5);

// error : address of temporar y

12
...
6
...
This
implies that a branch not taken can require run-time evaluation
...

constexpr int val = check(f(x,y,z));

You might imagine low and high to be conﬁguration parameters that are known at compile time, but
not at design time, and that f(x,y,z) computes some implementation-dependent value
...
1
...
]] is called an attribute and can be placed just about anywhere in the C++ syntax
...
In addition, an attribute can be placed in front of a declaration
...
7
...
The other is [[carries_dependency]] (§41
...

Placing [[noreturn]] at the start of a function declaration indicates that the function is not
expected to return
...

What happens if the function returns despite a [[noreturn]] attribute is undeﬁned
...
1
...
A local variable or constant
is initialized when a thread of execution reaches its deﬁnition
...
If a local variable is declared static, a single,
statically allocated object (§6
...
2) will be used to represent that variable in all calls of the function
...
For example:
void f(int a)
{
while (a−−) {
static int n = 0;
int x = 0;

// initialized once
// initialized ’a’ times in each call of f()

cout << "n == " << n++ << ", x == " << x++ << '\n';
}
}
int main()
{
f(3);
}

This prints:
n == 0, x == 0
n == 1, x == 0
n == 2, x == 0

A static local variable allows the function to preserve information between calls without introducing a global variable that might be accessed and corrupted by other functions (see also §16
...
12)
...
3
...
6
...
That is, the C++ implementation
must guard the initialization of a local static variable with some kind of lock-free construct (e
...
, a
call_once; §42
...
3)
...
For example:

Section 12
...
8

int fn(int n)
{
static int n1 = n;
static int n2 = fn(n−1)+1;
return n;
}

Local Variables

315

// OK
// undeﬁned

A static local variable is useful for avoiding order dependencies among nonlocal variables
(§15
...
1)
...
4
...
4)
...
6), should you be foolhardy enough to use one, is the complete function, independent of which nested scope it may be in
...
2 Argument Passing
When a function is called (using the sufﬁx (), known as the call operator or application operator),
store is set aside for its formal arguments (also known as its parameters), and each formal argument is initialized by its corresponding actual argument
...
2
...
In particular,
the type of an actual argument is checked against the type of the corresponding formal argument,
and all standard and user-deﬁned type conversions are performed
...
For example:
int∗ ﬁnd(int∗ ﬁrst, int∗ last, int v)
// ﬁnd x in [ﬁrst:last)
{
while (ﬁrst!=last && ∗ﬁrst!=v)
++ﬁrst;
return ﬁrst;
}
void g(int∗ p, int∗ q)
{
int∗ pp = ﬁnd(p,q,'x');
//
...
The pointer is passed by value
...
2
...
2
...
2
...
The use of initializer lists is
described in §12
...
3 and the ways of passing arguments to template functions in §23
...
2 and
§28
...
2
...
2
...
Consider:

++ref

incre-

void g()
{
int i = 1;
int j = 1;
f(i,j);
}

The call f(i,j) will increment j but not i
...
As mentioned in §7
...
2
...
It can,
however, be noticeably more efﬁcient to pass a large object by reference than to pass it by value
...
5)
}

The absence of const in the declaration of a reference argument is taken as a statement of intent to
modify the variable:
void g(Large& arg);

// assume that g() modiﬁes arg

Similarly, declaring a pointer argument const tells readers that the value of an object pointed to by
that argument is not changed by the function
...

Note that the semantics of argument passing are different from the semantics of assignment
...

Following the rules for reference initialization, a literal, a constant, and an argument that
requires conversion can be passed as a const T& argument, but not as a plain (non-const) T& argument
...
2
...

ﬂoat fsqrt(const ﬂoat&); // Fortran-style sqrt taking a reference argument
void g(double d)
{
ﬂoat r = fsqrt(2
...
0f
// pass reference to r
// pass reference to temp holding static_cast<ﬂoat>(d)

Disallowing conversions for non-const reference arguments (§7
...
For example:
void update(ﬂoat& i);
void g(double d, ﬂoat r)
{
update(2
...
Usually, that would come as an unpleasant surprise to the programmer
...
As described in §7
...
For example:
void f(vector&);
void f(const vector&);
void f(vector&&);

// (non-const) lvalue reference argument
// const lvalue reference argument
// rvalue reference argument

void g(vector& vi, const vector& cvi)
{
f(vi);
// call f(vector&)
f(vci);
// call f(const vector&)
f(vector{1,2,3,4}); // call f(vector&&);
}

We must assume that a function will modify an rvalue argument, leaving it good only for destruction or reassignment (§17
...
The most obvious use of rvalue references is to deﬁne move
constructors and move assignments (§3
...
2, §17
...
2)
...

Please note that for a template argument T, the template argument type deduction rules give T&&
a signiﬁcantly different meaning from X&& for a type X (§23
...
2
...
For template arguments, an
rvalue reference is most often used to implement ‘‘perfect forwarding’’ (§23
...
2
...
6
...

318

Functions

Chapter 12

How do we choose among the ways of passing arguments? My rules of thumb are:
[1] Use pass-by-value for small objects
...

[3] Return a result as a return value rather than modifying an object through an argument
...
3
...
5
...
5
...
1)
...

[6] Use pass-by-reference only if you have to
...
7
...
7
...

12
...
2 Array Arguments
If an array is used as a function argument, a pointer to its initial element is passed
...
This implies
that an assignment to an element of an array argument changes the value of an element of the argument array
...

Instead, a pointer is passed (by value)
...
For example:
void odd(int∗ p);
void odd(int a[]);
void odd(int buf[1020]);

These three declarations are equivalent and declare the same function
...
1
...
The rules and techniques for passing multidimensional arrays can be found in §7
...
3
...
This is a major source of errors, but
there are several ways of circumventing this problem
...
g
...
4)
...
For example:
void compute1(int∗ vec_ptr, int vec_size);

// one way

At best, this is a workaround
...
4
...
4), array (§34
...
1), or map (§4
...
3, §31
...
3)
...
For example:

Section 12
...
2

Array Arguments

319

void f(int(&r)[4]);
void g()
{
int a1[] = {1,2,3,4};
int a2[] = {1,2};
f(a1);
f(a2);

// OK
// error : wrong number of elements

}

Note that the number of elements is part of a reference-to-array type
...
The main use of references to arrays
is in templates, where the number of elements is then deduced
...

}
int a1[10];
double a2[100];
void g()
{
f(a1);
f(a2);
}

// T is int; N is 10
// T is double; N is 100

This typically gives rise to as many function deﬁnitions as there are calls to f() with distinct array
types
...
3), but often arrays of pointers can be used instead,
and they need no special treatment
...

12
...
3 List Arguments
A {}-delimited list can be used as an argument to a parameter of:
[1] Type std::initializer_list, where the values of the list can be implicitly converted to T
[2] A type that can be initialized with the values provided in the list
[3] A reference to an array of T, where the values of the list can be implicitly converted to T
Technically, case [2] covers all examples, but I ﬁnd it easier to think of the three cases separately
...
For example:
template
void f(initializer_list);
struct S {
int a;
string s;
};
void f(S);
template
void f(T (&r)[N]);
void f(int);
void g()
{
f({1,2,3,4});
f({1,"MKS"});
f({1});
}

// T is int and the initializer_list has size() 4
// calls f(S)
// T is int and the initializer_list has size() 1

The reason that a function with an initializer_list argument take priority is that it could be very confusing if different functions were chosen based on the number of elements of a list
...
4, §17
...
4
...

Section 12
...
3

List Arguments

321

If there is a function with an initializer-list argument in scope, but the argument list isn’t a
match for that, another function can be chosen
...

Note that these rules apply to std::initializer_list arguments only
...

12
...
4 Unspeciﬁed Number of Arguments
For some functions, it is not possible to specify the number and type of all arguments expected in a
call
...
6): this allows us to handle an arbitrary number of arbitrary
types in a type-safe manner by writing a small template metaprogram that interprets the
argument list to determine its meaning and take appropriate actions
...
2
...
This allows us to handle an arbitrary
number of arguments of a single type in a type-safe manner
...

[3] Terminate the argument list with the ellipsis (
...
’’ This allows us to handle an arbitrary number of (almost) arbitrary types by
using some macros from
...
However, this mechanism has been
used from the earliest days of C
...
For example:
int printf(const char∗
...
3) must have at least one argument, a C-style string, but may or may not have others
...
In the case of printf(), the ﬁrst argument is a format string containing special character
sequences that allow printf() to handle other arguments correctly; %s means ‘‘expect a char∗ argument’’ and %d means ‘‘expect an int argument
...

For example:
#include
int main()
{
std::printf("My name is %s %s\n",2);
}

This is not valid code, but most compilers will not catch this error
...

322

Functions

Chapter 12

Clearly, if an argument has not been declared, the compiler does not have the information
needed to perform the standard type checking and type conversion for it
...
This is not necessarily what the programmer expects
...
Overloaded functions, functions using default arguments, functions taking
initializer_list arguments, and variadic templates can be used to take care of type checking in most
cases when one would otherwise consider leaving argument types unspeciﬁed
...

The most common use of the ellipsis is to specify an interface to C library functions that were
deﬁned before C++ provided alternatives:
int fprintf(FILE∗, const char∗
...
);

// from
// from UNIX header

A standard set of macros for accessing the unspeciﬁed arguments in such functions can be found in

...
The idea is to compose the error message by passing each word as a separate C-style string argument
...
);
extern char∗ itoa(int, char[]); // int to alpha
int main(int argc, char∗ argv[])
{
switch (argc) {
case 1:
error(0,argv[0],nullptr);
break;
case 2:
error(0,argv[0],argv[1],nullptr);
break;
default:
char buffer[8];
error(1,argv[0],"with",itoa(argc−1,buffer),"arguments",nullptr);
}
//
...
It is popular in C, but not
part of the C standard
...

Note that using the integer 0 as the terminator would not have been portable: on some implementations, the integer 0 and the null pointer do not have the same representation (§6
...
8)
...

Section 12
...
4

Unspeciﬁed Number of Arguments

323

The error() function could be deﬁned like this:
#include
void error(int severity
...
The macro va_start takes the name of
the va_list and the name of the last formal argument as arguments
...
In each call, the programmer must supply a type; va_arg()
assumes that an actual argument of that type has been passed, but it typically has no way of ensuring that
...
The reason is that va_start() may modify the stack in such a way that a return cannot successfully be done; va_end() undoes any such modiﬁcations
...
For example:
switch (argc) {
case 1:
error(0,{argv[0]});
break;
case 2:
error(0,{argv[0],argv[1]});
break;
default:
error(1,{argv[0],"with",to_string(argc−1),"arguments"});
}

324

Functions

Chapter 12

The int-to-string conversion function to_string() is provided by the standard library (§36
...
5)
...
push_back(argv[i]);
return res
}
int main(int argc, char∗ argv[])
{
auto args = arguments(argc,argv);
error((args
...

}

The helper function, arguments(), is trivial, and main() and error() are simple
...
That would allow later
improvements of error()
...

12
...
5 Default Arguments
A general function often needs more arguments than are necessary to handle simple cases
...
2
...
Consider class complex from §3
...
1
...

};

// construct complex from two scalars
// construct complex from one scalar

The actions of complex’s constructors are quite trivial, but logically there is something odd about
having three functions (here, constructors) doing essentially the same task
...
2
...
We could deal with the repetitiveness
by considering one of the constructors ‘‘the real one’’ and forward to that (§17
...
3):
complex(double r, double i) :re{r}, im{i} {}
complex(double r) :complex{2,0} {}
complex() :complex{0,0} {}

// construct complex from two scalars
// construct complex from one scalar
// default complex: {0,0}

Say we wanted to add some debugging, tracing, or statistics-gathering code to
have a single place to do so
...

The intent of having a single constructor plus some shorthand notation is now explicit
...
For example:
class X {
public:
static int def_arg;
void f(int =def_arg);
//
...
f();
// maybe f(7)
a
...
f();
// f(9)
}

Default arguments that can change value are most often best avoided because they introduce subtle
context dependencies
...
For example:
int f(int, int =0, char∗ =nullptr);// OK
int g(int =0, int =0, char∗);
// error
int h(int =0, int, char∗ =nullptr);
// error

Note that the space between the ∗ and the = is signiﬁcant (∗= is an assignment operator; §10
...

For example:
void f(int x = 7);
void f(int = 7);
void f(int = 8);

// error: cannot repeat default argument
// error: different default arguments

326

Functions

Chapter 12

void g()
{
void f(int x = 9);
//
...

12
...
Using the same name for operations on different types is called overloading
...
That is, there is only one
name for addition, +, yet it can be used to add values of integer and ﬂoating-point types and combinations of such types
...
For
example:
void print(int);
// print an int
void print(const char∗); // print a C-style string

As far as the compiler is concerned, the only thing functions of the same name have in common is
that name
...
Thus, overloaded function names are primarily a notational convenience
...

When a name is semantically signiﬁcant, this convenience becomes essential
...
2
...
1), and in
generic programming (§4
...

Templates provide a systematic way of deﬁning sets of overloaded functions (§23
...

12
...
1 Automatic Overload Resolution
When a function fct is called, the compiler must determine which of the functions named fct to
invoke
...
The idea is to invoke the function that is the best match to
the arguments and give a compile-time error if no function is the best match
...
0);
print(1);
}

// print(long)
// print(double)
// error, ambiguous: print(long(1)) or print(double(1))?

Section 12
...
1

Automatic Overload Resolution

327

To approximate our notions of what is reasonable, a series of criteria are tried in order:
[1] Exact match; that is, match using no or only trivial conversions (for example, array name
to pointer, function name to pointer to function, and T to const T)
[2] Match using promotions; that is, integral promotions (bool to int, char to int, short to int,
and their unsigned counterparts; §10
...
1) and ﬂoat to double
[3] Match using standard conversions (e
...
, int to double, double to int, double to long double,
Derived∗ to Base∗ (§20
...
2
...
5))
[4] Match using user-deﬁned conversions (e
...
, double to complex; §18
...
in a function declaration (§12
...
4)
If two matches are found at the highest level where a match is found, the call is rejected as ambiguous
...
5)
...
The call print('a') invokes print(char) because 'a'
is a char (§6
...
3
...
The reason to distinguish between conversions and promotions is that we want
to prefer safe promotions, such as char to int, over unsafe conversions, such as int to char
...
3
...

Overload resolution is independent of the order of declaration of the functions considered
...
5
...
There are separate rules for overloading when a
{}-list is used (initializer lists take priority; §12
...
3, §17
...
4
...
5
...
1)
...
So, why bother? Consider the alternative to overloading
...
Without overloading, we
must deﬁne several functions with different names:

328

Functions

Chapter 12

void print_int(int);
void print_char(char);
void print_string(const char∗);

// C-style string

void g(int i, char c, const char∗ p, double d)
{
print_int(i);
// OK
print_char(c);
// OK
print_string(p);
// OK
print_int(c);
print_char(i);
print_string(i);
print_int(d);

// OK? calls print_int(int(c)), prints a number
// OK? calls print_char(char(i)), narrowing
// error
// OK? calls print_int(int(d)), narrowing

}

Compared to the overloaded print(), we have to remember several names and remember to use those
correctly
...
5), and generally
encourages the programmer to focus on relatively low-level type issues
...
It can also lead to errors
...
In particular, two calls rely on error-prone narrowing (§2
...
2, §10
...

Thus, overloading can increase the chances that an unsuitable argument will be rejected by the
compiler
...
3
...
The reason is to keep resolution for an individual operator (§18
...
1, §18
...
5) or function call context-independent
...

12
...
3 Overloading and Scope
Overloading takes place among the members of an overload set
...
For
example:

Section 12
...
3

Overloading and Scope

329

void f(int);
void g()
{
void f(double);
f(1);
// call f(double)
}

Clearly, f(int) would have been the best match for f(1), but only f(double) is in scope
...
As always, intentional
hiding can be a useful technique, but unintentional hiding is a source of surprises
...
For example:
struct Base {
void f(int);
};
struct Derived : Base {
void f(double);
};
void g(Derived& d)
{
d
...
3
...
4
...
2
...
Argument-dependent lookup (§14
...
4) can
also lead to overloading across namespaces
...
3
...
For example:
int pow(int, int);
double pow(double, double);
complex pow(double, complex);
complex pow(complex, int);
complex pow(complex, complex);
void k(complex z)
{
int i = pow(2,2);
double d = pow(2
...
0);
complex z2 = pow(2,z);
complex z3 = pow(z,2);
complex z4 = pow(z,z);
}

// invoke pow(int,int)
// invoke pow(double,double)
// invoke pow(double,complex)
// invoke pow(complex,int)
// invoke pow(complex,complex)

330

Functions

Chapter 12

In the process of choosing among overloaded functions with two or more arguments, a best match
is found for each argument using the rules from §12
...
A function that is the best match for one
argument and a better or equal match for all other arguments is called
...
For example:
void g()
{
double d = pow(2
...
0),2) or pow(2
...
0 is the best match for the ﬁrst argument of pow(double,double) and
2 is the best match for the second argument of pow(int,int)
...
3
...
For
example:
void f1(char);
void f1(long);
void f2(char∗);
void f2(int∗);
void k(int i)
{
f1(i);
f2(0);
}

// ambiguous: f1(char) or f1(long)?
// ambiguous: f2(char*) or f2(int*)?

Where possible, consider the set of overloaded versions of a function as a whole and see if it makes
sense according to the semantics of the function
...
For example, adding
inline void f1(int n) { f1(long(n)); }

would resolve all ambiguities similar to f1(i) in favor of the larger type long int
...
For example:
f2(static_cast(0));

However, this is most often simply an ugly stopgap
...

Some C++ novices get irritated by the ambiguity errors reported by the compiler
...

12
...
Some of these expectations are expressed
in the argument types, but others depend on the actual values passed and on relationships among

Section 12
...
The compiler and linker can ensure that arguments are of the right types, but it is
up to the programmer to decide what to do about ‘‘bad’’ argument values
...
For example:
int area(int len, int wid)
/*
calculate the area of a rectangle

precondition: len and wid are positive
postcondition: the return value is positive
postcondition: the return value is the area of a rectange with sides len and wid
*/
{
return len∗wid;
}

Here, the statements of the pre- and postconditions are longer than the function body
...
For example, we learn that 0 and −12 are not considered valid arguments
...

What should we do about a call area(numeric_limits::max(),2)?
[1] Is it the caller’s task to avoid it? Yes, but what if the caller doesn’t?
[2] Is it the implementer’s task to avoid it? If so, how is an error to be handled?
There are several possible answers to these questions
...
It is also difﬁcult for an implementer to cheaply, efﬁciently, and
completely check preconditions
...
For now, just note that some pre- and postconditions are
easy to check (e
...
, len is positive and len∗wid is positive)
...
For example, how do we test ‘‘the return value is the area of a rectangle with sides
len and wid’’? This is a semantic constraint because we have to know the meaning of ‘‘area of a
rectangle,’’ and just trying to multiply len and wid again with a precision that precluded overﬂow
could be costly
...
This is not uncommon
...
Mechanisms for documenting and enforcing conditions are discussed in §13
...

If a function depends only on its arguments, its preconditions are on its arguments only
...
g
...
In essence, we have to consider every nonlocal value
read as an implicit argument to a function
...

332

Functions

Chapter 12

The writer of a function has several alternatives, including:
[1] Make sure that every input has a valid result (so that we don’t have a precondition)
...

[3] Check that the precondition holds and throw an exception if it does not
...

If a postconditon fails, there was either an unchecked precondition or a programming error
...
4
discusses ways to represent alternative strategies for checking
...
5 Pointer to Function
Like a (data) object, the code generated for a function body is placed in memory somewhere, so it
has an address
...

However, for a variety of reasons – some related to machine architecture and others to system
design – a pointer to function does not allow the code to be modiﬁed
...
The pointer obtained by taking the address of
a function can then be used to call the function
...
*/ }
void (∗efct)(string);

// pointer to function taking a string argument and returning nothing

void f()
{
efct = &error;
efct("error");
}

// efct points to error
// call error through efct

The compiler will discover that efct is a pointer and call the function pointed to
...
Similarly, using & to get the address of a function
is optional:
void (∗f1)(string) = &error;
void (∗f2)(string) = error;
void g()
{
f1("Vasa");
(∗f1)("Mary Rose");
}

// OK: same as = error
// OK: same as = &error

// OK: same as (*f1)("Vasa")
// OK: as f1("Mary Rose")

Pointers to functions have argument types declared just like the functions themselves
...
For example:
void (∗pf)(string);
void f1(string);
int f2(string);
void f3(int∗);

// pointer to void(string)
// void(string)
// int(string)
// void(int*)

Section 12
...

You can convert a pointer to function to a different pointer-to-function type, but you must cast
the resulting pointer back to its original type or strange things may happen:
using P1 = int(∗)(int∗);
using P2 = void(∗)(void);
void f(P1 pf)
{
P2 pf2 = reinterpret_cast(pf)
pf2();
P1 pf1 = reinterpret_cast(pf2);
int x = 7;
int y = pf1(&x);
//
...
The
reason is that the result of using a pointer to function of the wrong type is so unpredictable and system-dependent
...
Because C does not have
function objects (§3
...
3) or lambda expressions (§11
...
For example, we can provide the comparison operation needed
by a sorting function as a pointer to function:
using CFT = int(const void∗, const void∗);
void ssort(void∗ base, size_t n, size_t sz, CFT cmp)
/*
Sor t the "n" elements of vector "base" into increasing order
using the comparison function pointed to by "cmp"
...

Shell sort (Knuth, Vol3, pg84)
*/

334

Functions

Chapter 12

{
for (int gap=n/2; 0for (int i=gap; i!=n; i++)
for (int j=i−gap; 0<=j; j−=gap) {
char∗ b = static_cast(base);
// necessar y cast
char∗ pj = b+j∗sz;
// &base[j]
char∗ pjg = b+(j+gap)∗sz;
// &base[j+gap]
if (cmp(pjg,pj)<0) {
// swap base[j] and base[j+gap]:
for (int k=0; k!=sz; k++) {
char temp = pj[k];
pj[k] = pjg[k];
pjg[k] = temp;
}
}
}
}

The ssort() routine does not know the type of the objects it sorts, only the number of elements (the
array size), the size of each element, and the function to call to perform a comparison
...
Real
programs use qsort(), the C++ standard-library algorithm sort (§32
...

This style of code is common in C, but it is not the most elegant way of expressing this algorithm in
C++ (see §23
...
3
...
1)
...
M
...
",
"Szymanski T
...
",
"Schryer N
...
",
"Schryer N
...
",
"Kernighan B
...
",
};

"dmr",
"ravi",
"tgs",
"nls",
"nls",
"bwk",

11271,
11272,
11273,
11274,
11275,
11276

void print_id(vector& v)
{
for (auto& x : v)
cout << x
...
id << '\t' << x
...
A comparison function
must return a negative value if its ﬁrst argument is less than the second, zero if the arguments are
equal, and a positive number otherwise:

Section 12
...
This means that you cannot avoid the ugly and error-prone casts by writing:
int cmp3(const User∗ p, const User∗ q) // Compare ids
{
return strcmp(p−>id,q−>id);
}

The reason is that accepting cmp3 as an argument to ssort() would violate the guarantee that
will be called with arguments of type const User∗ (see also §15
...
6)
...
begin(), head
...
name ...
begin(), head
...
dept ...
If the explicit use of begin() and
annoying, it can be eliminated by using a version of sort() that takes a container (§14
...
5):

end()

is

sort(heads,[](const User& x, const User& y) { return x
...
name; });

You can take the address of an overloaded function by assigning to or initializing a pointer to function
...
For
example:
void f(int);
int f(char);
void (∗pf1)(int) = &f;
int (∗pf2)(char) = &f;
void (∗pf3)(char) = &f;

// void f(int)
// int f(char)
// error: no void f(char)

It is also possible to take the address of member functions (§20
...

A pointer to a noexcept function can be declared noexcept
...
2
...
Neither linkage speciﬁcation
nor noexcept may appear in type aliases:
using Pc = extern "C" void(int);
using Pn = void(int) noexcept;

// error : linkage speciﬁcation in alias
// error : noexcept in alias

12
...
The ﬁrst rule about macros is:
don’t use them unless you have to
...
Because they rearrange the program text before
the compiler proper sees it, macros are also a major problem for many programming support tools
...
If you must use macros, please read the reference manual for your
own implementation of the C++ preprocessor carefully and try not to be too clever
...
The syntax of macros is
presented in §iso
...
3
...
6
...
3
...

A simple macro is deﬁned like this:
#deﬁne NAME rest of line

Section 12
...
For example:
named = NAME

will expand into
named = rest of line

A macro can also be deﬁned to take arguments
...
They will replace x and y when MAC()
is expanded
...
Only the expanded form of a macro is seen by the compiler, so an error in a
macro will be reported when the macro is expanded, not when it is deﬁned
...

Here are some plausible macros:
#deﬁne CASE break;case
#deﬁne FOREVER for(;;)

Here are some completely unnecessary macros:
#deﬁne PI 3
...
3
...
For
example:
#deﬁne MIN(a,b) (((a)<(b))?(a):(b))

This handles the simpler syntax problems (which are often caught by compilers), but not the problems with side effects
...
For example:
#deﬁne M2(a) something(a)

/* thoughtful comment */

Using macros, you can design your own private language
...
Furthermore, the preprocessor is a very simple-minded macro processor
...
The auto, constexpr, const,
decltype, enum, inline, lambda expressions, namespace, and template mechanisms can be used as
better-behaved alternatives to many traditional uses of preprocessor constructs
...
A string can be created
by concatenating two strings using the ## macro operator
...
For example:
#deﬁne printx(x) cout << #x " = " << x << '\n';
int a = 7;
string str = "asdf";

Section 12
...
Adjacent string
literals are concatenated (§7
...
2)
...
This
affords some protection against undesired macros
...

The argument list (‘‘replacement list’’) of a macro can be empty:
#deﬁne EMPTY() std::cout<<"empty\n"
EMPTY();
// print "empty\n"
EMPTY;
// error: macro replacement list missing

I have a hard time thinking of uses of an empty macro argument list that are not error-prone or
malicious
...
For example:
#deﬁne err_print(
...
) means that
the output is:

__VA_ARGS__

represents the arguments actually passed as a string, so

error: The answer 54

12
...
1 Conditional Compilation
One use of macros is almost impossible to avoid
...
For example:
int f(int a
#ifdef arg_two
,int b
#endif
);

Unless a macro called arg_two has been #deﬁned , this produces:
int f(int a
);

This example confuses tools that assume sane behavior from the programmer
...
See also §15
...
3
...
For example:

def

struct Call_info {
Node∗ arg_one;
Node∗ arg_two;
//
...

12
...
2 Predeﬁned Macros
A few macros are predeﬁned by the compiler (§iso
...
8, §iso
...
4
...
Its value is 201103L
in a C++11 program; previous C++ standards have lower values
...

• __TIME__: time in ‘‘hh:mm:ss’’ format
...

• __LINE__: source line number within the current source ﬁle
...

• __STDC_HOSTED__: 1 if the implementation is hosted (§6
...
1); otherwise 0
...
1) might have a code value that differs from its value as an ordinary character literal
• __STDCPP_STRICT_POINTER_SAFETY__: 1 if the implementation has strict pointer safety
(§34
...

• __STDCPP_THREADS__: 1 if a program can have more than one thread of execution; otherwise undeﬁned
...
For example, NDEBUG is deﬁned unless
the compilation is done in (some implementation-speciﬁc) ‘‘debug mode’’ and is used by the
assert() macro (§13
...
This can be useful, but it does imply that you can’t be sure of the meaning
of a program just by reading its source text
...
6
...
6
...

Obviously, the standard cannot specify how such facilities are provided, but one standard syntax is
a line of tokens preﬁxed with the preprocessor directive #pragma
...

12
...
1
...
1
...
1
...
1
...

If a function may have to be evaluated at compile time, declare it constexpr; §12
...
6
...
1
...

Use pass-by-value for small objects; §12
...
1
...
2
...

Return a result as a return value rather than modifying an object through an argument;
§12
...
1
...
2
...

Pass a pointer if ‘‘no object’’ is a valid alternative (and represent ‘‘no object’’ by nullptr);
§12
...
1
...
2
...

Use const extensively and consistently; §12
...
1
...
2
...

Avoid passing arrays as pointers; §12
...
2
...
2
...

Avoid unspeciﬁed numbers of arguments (
...
2
...

Use overloading when functions perform conceptually the same task on different types;
§12
...

When overloading on integers, provide functions to eliminate common ambiguities; §12
...
5
...
4
...
5
...
6
...
6
...

– Winston S
...
1 Error Handling
This chapter presents error handling using exceptions
...
Consequently, this chapter presents the exceptionsafety guarantees that are central to recovery from run-time errors and the Resource Acquisition Is
Initialization (RAII) technique for resource management using constructors and destructors
...

The language facilities and techniques presented here address problems related to the handling
of errors in software; the handling of asynchronous events is a different topic
...
Such parts of a program are often separately developed
...
’’ A library is just ordinary code, but in
the context of a discussion of error handling it is worth remembering that a library designer often
cannot even know what kind of programs the library will become part of:
• The author of a library can detect a run-time error but does not in general have any idea
what to do about it
...

The discussion of exceptions focuses on problems that need to be handled in long-running systems,
systems with stringent reliability requirements, and libraries
...
For example,
I would not apply every technique recommended here to a two-page program written just for
myself
...

13
...
1 Exceptions
The notion of an exception is provided to help get information from the point where an error is
detected to a point where it can be handled
...
A function that wants
to handle a kind of problem indicates that by catching the corresponding exception (§2
...
3
...

• A called component that cannot complete its assigned task reports its failure to do so by
throwing an exception using a throw-expression
...

if (/* could perform the task */)
return result;
else
throw Some_error{};
}

Section 13
...
1

Exceptions

345

The taskmaster() asks do_task() to do a job
...
Otherwise, do_task() must report a failure by throwing some exception
...
For example,
do_task() may call other functions to do a lot of subtasks, and one of those may throw because it
can’t do its assigned subtask
...

A called function cannot just return with an indication that an error happened
...
The exception-handling mechanism
is integrated with the constructor/destructor mechanisms and the concurrency mechanisms to help
ensure that (§5
...
The exception-handling mechanism:
• Is an alternative to the traditional techniques when they are insufﬁcient, inelegant, or errorprone
• Is complete; it can be used to handle all errors detected by ordinary code
• Allows the programmer to explicitly separate error-handling code from ‘‘ordinary code,’’
thus making the program more readable and more amenable to tools
• Supports a more regular style of error handling, thus simplifying cooperation between separately written program fragments
An exception is an object thrown to represent the occurrence of an error
...
That way, we minimize the chances of two unrelated libraries using the same
value, say 17, to represent different errors, thereby throwing our recovery code into chaos
...
Thus, the simplest way of deﬁning an exception is to deﬁne a class
speciﬁcally for a kind of error and throw that
...

}

If that gets tedious, the standard library deﬁnes a small hierarchy of exception classes (§13
...
2)
...
Its type represents the kind of
error, and whatever data it holds represents the particular occurrence of that error
...
5
...

13
...
2 Traditional Error Handling
Consider the alternatives to exceptions for a function detecting a problem that cannot be handled
locally (e
...
, an out-of-range access) so that an error must be reported to a caller
...
This is a pretty drastic approach
...
For example, in most situations we should at
least write out a decent error message or log the error before terminating
...
A library that unconditionally terminates cannot
be used in a program that cannot afford to crash
...
This is not always feasible because there is often no acceptable
‘‘error value
...
At a minimum, we would have to modify get_int() to return a pair
of values
...
This can easily double the size of a program (§13
...
7)
...

Consequently, this approach is rarely used systematically enough to detect all errors
...
3) returns a negative value if an output or encoding error occurred, but
programmers essentially never test for that
...

Return a legal value and leave the program in an ‘‘error state
...
For
example, many standard C library functions set the nonlocal variable errno to indicate an
error (§43
...
3):
double d = sqrt(−1
...
0 isn’t an acceptable
argument for a ﬂoating-point square root function
...
Furthermore, the use of nonlocal variables for
recording error conditions doesn’t work well in the presence of concurrency
...
For example:
if (something_wrong) something_handler(); // and possibly continue here

This must be some other approach in disguise because the problem immediately becomes
‘‘What does the error-handling function do?’’ Unless the error-handling function can completely resolve the problem, the error-handling function must in turn either terminate the
program, return with some indication that an error had occurred, set an error state, or throw
an exception
...

Section 13
...
3

Muddling Through

347

13
...
3 Muddling Through
One aspect of the exception-handling scheme that will appear novel to some programmers is that
the ultimate response to an unhandled error (an uncaught exception) is to terminate the program
...
Thus, exception handling makes programs more ‘‘brittle’’ in the sense that more care and effort must be taken to get a
program to run acceptably
...
Where termination is unacceptable, we can catch all exceptions (§13
...
2
...

Thus, an exception terminates a program only if a programmer allows it to terminate
...
Where termination is an acceptable response, an uncaught
exception will achieve that because it turns into a call of terminate() (§13
...
2
...
Also, a noexcept
speciﬁer (§13
...
1
...

Sometimes, people try to alleviate the unattractive aspects of ‘‘muddling through’’ by writing
out error messages, putting up dialog boxes asking the user for help, etc
...
In the hands of nondevelopers, a library that asks the (possibly absent) user/operator for help is unacceptable
...
If a user has to be
informed, an exception handler can compose a suitable message (e
...
, in Finnish for Finnish users
or in XML for an error-logging system)
...
Only a part of the system that has some idea of the context in which the program runs has
any chance of composing a meaningful error message
...
The C++ exception-handling mechanism provides the programmer with a way of handling errors where they are most naturally handled, given the structure of a system
...
However, exceptions are not the cause of that complexity
...

13
...
4 Alternative Views of Exceptions
‘‘Exception’’ is one of those words that means different things to different people
...
In particular, it is intended to support error handling in programs composed of independently developed components
...
Can an event that happens most times a program is run be considered
exceptional? Can an event that is planned for and handled be considered an error? The answer to
both questions is ‘‘yes
...
’’

348

Exception Handling

Chapter 13

13
...
4
...
Asynchronous events, such as keyboard interrupts and power failures, are not necessarily exceptional and are not handled directly by this mechanism
...
Many systems offer mechanisms, such as signals, to deal with asynchrony, but because
these tend to be system-dependent, they are not described here
...
1
...
2 Exceptions That Are Not Errors
Think of an exception as meaning ‘‘some part of the system couldn’t do what it was asked to do’’
(§13
...
1, §13
...

Exception throws should be infrequent compared to function calls or the structure of the system
has been obscured
...

If an exception is expected and caught so that it has no bad effects on the behavior of the program, then how can it be an error? Only because the programmer thinks of it as an error and of the
exception-handling mechanisms as tools for handling errors
...
Consider a binary tree search function:
void fnd(Tree∗ p, const string& s)
{
if (s == p−>str) throw p;
if (p−>left) fnd(p−>left,s);
if (p−>right) fnd(p−>right,s);
}

// found s

Tree∗ ﬁnd(Tree∗ p, const string& s)
{
try {
fnd(p,s);
}
catch (Tree∗ q) {
// q->str==s
return q;
}
return 0;
}

This actually has some charm, but it should be avoided because it is likely to cause confusion and
inefﬁciencies
...

When this is done, code is clearly separated into two categories: ordinary code and error-handling
code
...
Furthermore, the implementations of the exception
mechanisms are optimized based on the assumption that this simple model underlies the use of
exceptions
...
Anything that helps preserve a clear model of what is an
error and how it is handled should be treasured
...
1
...
1
...
However, we must reluctantly conclude that there are programs that for practical and historical reasons cannot use exceptions
...
In the absence of tools that can accurately estimate the maximum time for an exception to propagate from a throw to a catch, alternative
error-handling methods must be used
...
g
...
g
...
2, §4
...

In such cases, we are thrown back onto ‘‘traditional’’ (pre-exception) techniques
...
However, I can point to two popular techniques:
• To mimic RAII, give every class with a constructor an invalid() operation that returns some
error_code
...
If the constructor
fails to establish the class invariant, it ensures that no resource is leaked and invalid() returns
a nonzero error_code
...
A user can then systematically test invalid() after each construction of an object and
engage in suitable error handling in case of failure
...
invalid()) {
//
...

}
//
...
4
...
A user can then systematically test the error_code after
each function call and engage in suitable error handling in case of failure
...
second) {
//
...

}
auto val = v
...

}

Variations of this scheme have been reasonably successful, but they are clumsy compared to using
exceptions in a systematic manner
...
1
...
The assumption is that the two parts of the program are written independently and that the part of the program that handles the exception often can do something sensible about the error
...
That is, the various parts
of the program must agree on how exceptions are used and where errors are dealt with
...

This implies that the error-handling strategy is best considered in the earliest phases of a design
...
Something complicated would not be consistently adhered to in an area as inherently
tricky as error recovery
...
Each level copes with as many errors as it can
without getting too contorted and leaves the rest to higher levels
...

Furthermore, terminate() supports this view by providing an escape if the exception-handling mechanism itself is corrupted or if it has been incompletely used, thus leaving exceptions uncaught
...

Not every function should be a ﬁrewall
...
The reasons that this will not work vary from program to program and from programmer to programmer
...

[2] The overhead in time and space is too great for the system to run acceptably (there will be
a tendency to check for the same errors, such as invalid arguments, over and over again)
...

[4] This purely local notion of ‘‘reliability’’ leads to complexities that actually become a burden to overall system reliability
...
Thus, major libraries, subsystems, and
key interface functions should be designed in this way
...

Usually, we don’t have the luxury of designing all of the code of a system from scratch
...
To do this we must
address a variety of concerns relating to the way a program fragment manages resources and the
state in which it leaves the system after an error
...

Occasionally, it is necessary to convert from one style of error reporting to another
...
1
...
local cleanup, if possible and necessary
...
) {
//
...

errno = E_CPLPLFCTBLEWIT;
}
}

In such cases, it is important to be systematic enough to ensure that the conversion of error-reporting styles is complete
...

Error handling should be – as far as possible – hierarchical
...
Such requests set
up cycles in the system dependencies
...

13
...
7 Exceptions and Efﬁciency
In principle, exception handling can be implemented so that there is no run-time overhead when no
exception is thrown
...
Doing so without adding signiﬁcant memory overhead while
maintaining compatibility with C calling sequences, debugger conventions, etc
...
However, please remember that the alternatives to exceptions are not free either
...

Consider a simple function f() that appears to have nothing to do with exception handling:
void f()
{
string buf;
cin>>buf;
//
...

Had g() not thrown an exception, it would have had to report its error some other way
...

if (g(1)) {
if (h(s)) {
free(s);
return true;
}
else {
free(s);
return false;
}
}
else {
free(s);
return false;
}
}

Using a local buffer for s would simplify the code by eliminating the calls to free(), but then we’d
have range-checking code instead
...

People don’t usually handle errors this systematically, though, and it is not always critical to do
so
...

The noexcept speciﬁer (§13
...
1
...
Consider:
void g(int) noexcept;
void h(const string&) noexcept;

Now, the code generated for f() can possibly be improved
...

In particular, a standard-library implementer knows that only a few standard C library functions
(such as atexit() and qsort()) can throw, and can take advantage of that fact to generate better code
...
For example, it might have been converted to use the C++ operator new, which can
throw bad_alloc, or it might call a C++ library that throws an exception
...

Section 13
...
2 Exception Guarantees
To recover from an error – that is, to catch an exception and continue executing a program – we
need to know what can be assumed about the state of the program before and after the attempted
recovery action
...
Therefore, we call an operation exceptionsafe if that operation leaves the program in a valid state when the operation is terminated by throwing an exception
...
’’ For practical design using exceptions, we must also break down the
overly general ‘‘exception-safe’’ notion into a few speciﬁc guarantees
...
4
...
2, §17
...
1)
...
So, by valid state we mean that a
constructor has completed and the destructor has not yet been entered
...
That is, if two pieces of nonlocal data are assumed
to have a speciﬁc relationship, we must consider that an invariant and our recovery action must preserve it
...
size()==vy
...
However, that was only stated in a comment, and compilers do not read comments
...

Before a throw, a function must place all constructed objects in valid states
...
For example, a string may be left as the empty
string or a container may be left unsorted
...

The C++ standard library provides a generally useful conceptual framework for design for
exception-safe program components
...
In particular, the basic invariants of every
built-in and standard-library type guarantee that you can destroy an object or assign to it
after every standard-library operation (§iso
...
6
...
1)
...
This guarantee is provided for key operations,
such as push_back(), single-element insert() on a list, and uninitialized_copy()
...
This guarantee is provided for a
few simple operations, such as swap() of two containers and pop_back()
...
17
...
5
...

Violating a standard-library requirement, such as having a destructor exit by throwing an exception,
is logically equivalent to violating a fundamental language rule, such as dereferencing a null
pointer
...

Both the basic guarantee and the strong guarantee require the absence of resource leaks
...
In particular, an operation that throws
an exception must not only leave its operands in well-deﬁned states but must also ensure that every
resource that it acquired is (eventually) released
...
For example:
void f(int i)
{
int∗ p = new int[10];
//
...

}

Remember that memory isn’t the only kind of resource that can leak
...
Files, locks, network connections, and threads are examples of system resources
...

The C++ language rules for partial construction and destruction ensure that exceptions thrown
while constructing subobjects and members will be handled correctly without special attention
from standard-library code (§17
...
3)
...

In general, we must assume that every function that can throw an exception will throw one
...
When analyzing code for potential errors, simple,
highly structured, ‘‘stylized’’ code is the ideal; §13
...

13
...
– it is often essential for the future running of the system that the
resource be properly released
...
For example:

Section 13
...
use f
...
Exactly the same problem can occur in languages that do not support exception handling
...
Even an ordinary return-statement could exit use_ﬁle without closing f
...
use f
...
) {
// catch every possible exception
fclose(f);
throw;
}
fclose(f);
}

The code using the ﬁle is enclosed in a try-block that catches every exception, closes the ﬁle, and
rethrows the exception
...
Worse
still, such code becomes signiﬁcantly more complex when several resources must be acquired and
released
...
The general form of the problem looks like
this:
void acquire()
{
// acquire resource 1
//
...
use resources
...

// release resource 1
}

It is typically important that resources are released in the reverse order of their acquisition
...
Thus, we can handle such resource acquisition and release problems using objects of classes
with constructors and destructors
...
c_str(),a}
{}
explicit File_ptr(FILE∗ pp)
// assume ownership of pp
:p{pp}
{
if (p==nullptr) throw runtime_error("File_ptr: nullptr"};
}
//
...

˜File_ptr() { fclose(p); }
operator FILE∗() { return p; }
};

We can construct a File_ptr given either a FILE∗ or the arguments required for fopen()
...
File_ptr
throws an exception if it cannot open a ﬁle because otherwise every operation on the ﬁle handle
would have to test for nullptr
...
use f
...
That is, the exception-handling mechanisms enable us to remove
the error-handling code from the main algorithm
...

This technique for managing resources using local objects is usually referred to as ‘‘Resource
Acquisition Is Initialization’’ (RAII; §5
...
This is a general technique that relies on the properties
of constructors and destructors and their interaction with exception handling
...
3

Resource Management

357

It is often suggested that writing a ‘‘handle class’’ (a RAII class) is tedious so that providing a
nicer syntax for the catch(
...
The problem with that
approach is that you need to remember to ‘‘catch and correct’’ the problem wherever a resource is
acquired in an undisciplined way (typically dozens or hundreds of places in a large program),
whereas the handler class need be written only once
...
Then and only then
will stack unwinding (§13
...
1) call the destructor for the object
...
An array is constructed to the
extent that its elements have been constructed (and only fully constructed elements are destroyed
during unwinding)
...
When that
cannot be achieved, a well-written constructor restores – as far as possible – the state of the system
to what it was before creation
...
This can be simply
achieved by applying the RAII technique to the members
...
3
...
This acquisition might fail and throw an exception
...
Furthermore, this should be achieved without imposing a burden of complexity on the programmer
...
3
...
The acquisition of a resource is represented by the initialization of the local object that
represents the resource:
class Locked_ﬁle_handle {
File_ptr p;
unique_lock lck;
public:
X(const char∗ ﬁle, mutex& m)
: p{ﬁle,"rw"},
// acquire ‘‘ﬁle’’
lck{m}
// acquire ‘‘m’’
{}
//
...
The user
doesn’t have to keep track at all
...

This implies that where this simple model for acquisition of resources is adhered to, the author
of the constructor need not write explicit exception-handling code
...
Compared to ad hoc memory management
using new (and possibly also delete), this saves lots of work and avoids lots of errors
...
2
...
3) to avoid leaks
...
3
...
Again and again, people have invented ‘‘ﬁnally’’ language constructs for writing arbitrary code to clean up after an exception
...
First, we deﬁne a class
that will execute an arbitrary action from its destructor
...

Next, we deﬁne a function that conveniently deduces the type of an action:
template
Final_action ﬁnally(F f)
{
return Final_action(f);
}

Finally, we can test ﬁnally():
void test()
// handle undiciplined resource acquisition
// demonstrate that arbitrary actions are possible
{
int∗ p = new int{7};
// probably should use a unique_ptr (§5
...
3
...

It is generally a good idea to place a guard close to the deﬁnition of whatever it is guarding
...
The connection between ﬁnally() actions and the resources they manipulate is still ad hoc and implicit compared to the use of RAII for resource handles, but using ﬁnally()
is far better than scattering cleanup code around in a block
...
5
...
It says what is to be done upon exit
from a scope, saving the programmer from trying to write code at each of the potentially many
places from which the thread of control might exit the scope
...
4 Enforcing Invariants
When a precondition for a function (§12
...

Similarly, when a constructor cannot establish its class invariant (§2
...
3
...
2
...
In those cases, I typically throw exceptions
...
1
...

• Terminate the program: Violating a precondition is a serious design error, and the program
must not proceed in the presence of such errors
...

Why would anyone choose one of these alternatives? The ﬁrst approach often relates to the need
for performance: systematically checking preconditions can lead to repeated tests of logically
unnecessary conditions (for example, if a caller has correctly validated data, millions of tests in
thousands of called functions may be logically redundant)
...
It may be worthwhile to suffer repeated crashes during testing to gain that performance
...
For some systems, typically systems completely under the control of a single organization,
that can be a realistic aim
...
That is, making sure that recovery is complete
imposes unacceptable complexity on the system design and implementation
...
For example, it is not unreasonable to consider
program termination acceptable if it is easy to rerun the program with inputs and parameters that
make repeated failure unlikely
...

Realistically, many systems use a mix of exceptions and these two alternative approaches
...
Program structure can be radically different depending on whether (localized) recovery is an aim
...
For example, I often throw an exception to
ensure some error logging or to produce a decent error message before terminating or re-initializing
a process (e
...
, from a catch(
...

A variety of techniques are used to express checks of desired conditions and invariants
...
An assertion is simply a logical expression that is assumed to be
true
...
Looking at a variety of systems, I see a variety of needs when it comes to
expressing assertions:
• We need to choose between compile-time asserts (evaluated by the compiler) and run-time
asserts (evaluated at run time)
...

• No code should be generated unless some logical condition is true
...
Usually, the logical
condition is something like a debug ﬂag, a level of checking, or a mask to select among
asserts to enforce
...

Not every system has a need for or supports every alternative
...
6
...
If
the assertion fails, the compiler writes out an error message containing the (failed) assertion,
the source ﬁle name, and the source ﬁle line number and terminates the program
...
4
...
3)
...

Where assert() and static_assert() are insufﬁcient, we could use ordinary code for checking
...

}

Section 13
...
Are we:
• Evaluating the conditions under which we test? (Yes, the 2 ...
1
...
2
...
)
Worse, the precondition testing (or invariant testing) can easily get dispersed in other code and thus
be harder to spot and easier to get wrong
...
What follows here is a (possibly slightly overelaborate) mechanism for
expressing a variety of assertions and a variety of responses to failures
...

}

The idea is to test whenever an assertion has a ‘‘level’’ lower than or equal to current_level
...
The current_level and current_mode are constants because the idea is to generate no code whatsoever for an assertion unless
we have made a decision to do so
...

The programmer will use Assert::dynamic() to make assertions:
namespace Assert {
//
...
str();
}

362

Exception Handling

Chapter 13

template
void dynamic(bool assertion, const string& message ="Assert::dynamic failed")
{
if (assertion)
return;
if (current_mode == Assert_mode::throw_)
throw Except{message};
if (current_mode == Assert_mode::terminate_)
std::terminate();
}
template<>
void dynamic(bool, const string&)
{
}

// do nothing

void dynamic(bool b, const string& s)
{
dynamic(b,s);
}

// default action

void dynamic(bool b)
{
dynamic(b);
}

// default message

}

I chose the name Assert::dynamic (meaning ‘‘evaluate at run time’’) to contrast with static_assert
(meaning ‘‘evaluate at compile time’’; §2
...
3
...

Further implementation trickery could be used to minimize the amount of code generated
...
This Assert
is not part of the standard and is presented primarily as an illustration of the problems and the
implementation techniques
...

We can use Assert::dynamic like this:
void f(int n)
// n should be in [1:max)
{
Assert::dynamic(
(n<=0 || max//
...
6
...
I can’t hide them from the user’s view by placing them inside the implementation of
Assert where they belong
...
Similarly, if we are
willing to use the default assertion level, we don’t need to mention the level explicitly:

Section 13
...

}

I do not recommend obsessing about the amount of text needed to express an assertion, but by
using a namespace directive (§14
...
3) and the default message, we can get to a minimum:
void f(int n)
// n should be in [1:max)
{
dynamic(n<=0||max//
...
g
...
That way, you can
have a debug version of a system that tests extensively and enters the debugger and a production
version that does hardly any testing
...
For
example, with Assert the obvious convention is that assertions marked as level zero will always be
checked
...
Also, even if all else works perfectly, having a few ‘‘sanity checks’’ left to deal with hardware failures can be wise
...

The writer of a library or reusable component usually does not have the luxury of terminating
unconditionally
...

As usual, destructors should not throw, so don’t use a throwing Assert() in a destructor
...
5 Throwing and Catching Exceptions
This section presents exceptions from a language-technical point of view
...
5
...
For example:
class No_copy {
No_copy(const No_copy&) = delete;
};
class My_error {
//
...
6
...
5
...
This temporary may be further copied several times before it is caught: the exception is passed
(back) from called function to calling function until a suitable handler is found
...
The data in the exception object – if any – is typically used to produce error messages or to help recovery
...
In each scope exited, the destructors are invoked so that every fully constructed object is properly destroyed
...

}
}
void g()
{
string s = "excess";
{
string s = "or";
h();
}
}
void h()
{
string s = "not";
throw My_error{};
string s2 = "at all";
}

After the throw in h(), all the strings that were constructed are destroyed in the reverse order of their
construction: "not", "or", "excess", "in", but not "at all", which the thread of control never reached,
and not "Byron", which was unaffected
...
5
...
Exceptions containing a few words are very common
...
g
...
Some of the most common exceptions carry no information; the name of the type is sufﬁcient to report the error
...

if (something_wrong)
throw Some_error{};
}

There is a small standard-library hierarchy of exception types (§13
...
2) that can be used either
directly or as base classes
...

For example:
void g(int n)
// throw some exception
{
if (n)
throw std::runtime_error{"I give up!"};
else
throw My_error2{};
}
void f(int n)
// see what exception g() throws
{
try {
void g(n);
}
catch (std::exception& e) {
cerr << e
...
5
...
1 noexcept Functions
Some functions don’t throw exceptions and some really shouldn’t
...
For example:
double compute(double) noexcept; // may not throw an exception

366

Exception Handling

Chapter 13

Now no exception will come out of compute()
...
The programmer need not worry about providing
try-clauses (for dealing with failures in a noexcept function) and an optimizer need not worry about
control paths from exception handling
...
What happens if the
programmer ‘‘lied’’ so that a noexcept function deliberately or accidentally threw an exception that
wasn’t caught before leaving the noexcept function? Consider:
double compute(double x) noexcept;
{
string s = "Courtney and Anya";
vector tmp(10);
//
...
In
that case, the program terminates
...
4
...
3)
...
It is implementation-deﬁned
whether destructors from scopes between the throw and the noexcept (e
...
, for s in compute()) are
invoked
...

By adding a noexcept speciﬁer, we indicate that our code was not written to cope with a throw
...
5
...
2 The noexcept Operator
It is possible to declare a function to be conditionally noexcept
...
I may want to write this if my_fct() copies its argument
...
g
...

The predicate in a noexcept() speciﬁcation must be a constant expression
...

The standard library provides many type predicates that can be useful for expressing the conditions under which a function may throw an exception (§35
...

What if the predicate we want to use isn’t easily expressed using type predicates only? For
example, what if the critical operation that may or may not throw is a function call f(x)? The noexcept() operator takes an expression as its argument and returns true if the compiler ‘‘knows’’ that it
cannot throw and false otherwise
...
5
...
2

The noexcept Operator

367

The double mention of noexcept looks a bit odd, but noexcept is not a common operator
...

A noexcept(expr) operator does not go to heroic lengths to determine whether expr can throw; it
simply looks at every operation in expr and if they all have noexcept speciﬁcations that evaluate to
true, it returns true
...

Conditional noexcept speciﬁcations and the noexcept() operator are common and important in
standard-library operations that apply to containers
...
20
...
2):
template
void swap(T (&a)[N], T (&b)[N]) noexcept(noexcept(swap(∗a, ∗b)));

13
...
1
...
For example:
void f(int) throw(Bad,Worse); // may only throw Bad or Worse exceptions
void g(int) throw();
// may not throw

An empty exception speciﬁcation throw() is deﬁned to be equivalent to noexcept (§13
...
1
...
That
is, if an exception is thrown, the program terminates
...
The default effect of an unexpected
exception is to terminate the program (§30
...
1
...
A nonempty throw speciﬁcation is hard to use
well and implies potentially expensive run-time checks to determine if the right exception is
thrown
...
Don’t use it
...

13
...
2 Catching Exceptions
Consider:
void f()
{
try {
throw E{};
}
catch(H) {
// when do we get here?
}
}

The handler is invoked:
[1] If H is the same type as E
[2] If H is an unambiguous public base of E
[3] If H and E are pointer types and [1] or [2] holds for the types to which they refer
[4] If H is a reference and [1] or [2] holds for the type to which H refers
In addition, we can add const to the type used to catch an exception in the same way that we can

368

Exception Handling

Chapter 13

add it to a function parameter
...

In principle, an exception is copied when it is thrown (§13
...
The implementation may apply a
wide variety of strategies for storing and transmitting exceptions
...
2
...

Note the possibility of catching an exception by reference
...
For examples, see §13
...
2
...
4
...
1
...

The {} in both the try-part and a catch-clause of a try-block are real scopes
...
For example:
void g()
{
int x1;
try {
int x2 = x1;
//
...

}
catch(
...

}
++x1;
++x2;
++x3;

// OK
// error: x2 not in scope
// error: x3 not in scope

}

The ‘‘catch everything’’ clause, catch(
...
5
...
2
...
5
...
1 Rethrow
Having caught an exception, it is common for a handler to decide that it can’t completely handle
the error
...
Thus, an error can be handled where it is most appropriate
...
For example:

Section 13
...
2
...
code that might throw an exception
...
handle it
...
do what can be done here
...
A rethrow may occur in a catch-clause or in
a function called from a catch-clause
...
5
...
5) will be called
...

The exception rethrown is the original exception caught and not just the part of it that was
accessible as an exception
...
Had I written throw err; instead
of the simpler throw;, the exception would have been sliced (§17
...
1
...

13
...
2
...
4
...
1)
...
do something
...
cleanup
...
However, the standard-library exceptions are just
one set of exception types
...
If someone (unwisely) threw an int or an exception from some application-speciﬁc hierarchy,
it would not be caught by the handler for std::exception&
...
For example, if m() is supposed
to leave some pointers in the state in which it found them, then we can write code in the handler to

370

Exception Handling

Chapter 13

give them acceptable values
...
, indicates ‘‘any argument’’ (§12
...
4),
so catch(
...
’’ For example:
void m()
{
try {
//
...

}
catch (
...
cleanup
...
5
...
3 Multiple Handlers
A try-block may have multiple catch-clauses (handlers)
...
The handlers are tried in order
...

}
catch (std::ios_base::failure) {
//
...
4
...
1)
...
handle any standard-librar y exception (§30
...
1
...

}
catch (
...
handle any other exception (§13
...
2
...

}
}

The compiler knows the class hierarchy, so it can warn about many logical mistakes
...

}
catch (
...
handle every exception (§13
...
2
...

}
catch (std::exception& e) {
//
...
4
...
1)
...
5
...
3

Multiple Handlers

371

catch (std::bad_cast) {
//
...
2
...

}
}

Here, the exception is never considered
...
Matching exception types to catchclauses is a (fast) run-time operation and is not as general as (compile-time) overload resolution
...
5
...
4 Function try-Blocks
The body of a function can be a try-block
...
do something
...
} {
//
...

}

For most functions, all we gain from using a function try-block is a bit of notational convenience
...
4)
...
However, the
constructor itself can catch such exceptions by enclosing the complete function body – including
the member initializer list – in a try-block
...

public:
X(int,int);
//
...

}
catch (std::exception& err) { // exceptions thrown for vi and vs are caught here
//
...
Similarly, we can catch exceptions
thrown by member destructors in a destructor (though a destructor should never throw)
...
Also, other
member objects will either not be constructed or already have had their destructors invoked as part
of the stack unwinding
...
The default action is to rethrow the original exception when we ‘‘fall off the
end’’ of the catch-clause (§iso
...
3)
...

13
...
2
...
The guiding principles are:
• Don’t throw an exception while handling an exception
...

If the exception-handling implementation catches you doing either, it will terminate your program
...
Note that an exception is considered handled immediately upon entry
into a catch-clause
...
5
...
1) or throwing a new exception from within
a catch-clause is considered a new throw done after the original exception has been handled
...

The speciﬁc rules for calling terminate() are (§iso
...
5
...
g
...
In addition, a user can call terminate() if less
drastic approaches are infeasible
...

By default, terminate() will call abort() (§15
...
3)
...
If that is not acceptable, the user can provide a terminate
handler function by a call std::set_terminate() from :

Section 13
...
2
...

set_terminate(old); // restore the old terminate handler
}

The return value is the previous function given to set_terminate()
...
The intent is for terminate() to be a drastic measure to be applied when the error recovery
strategy implemented by the exception-handling mechanism has failed and it is time to go to
another level of a fault tolerance strategy
...
Even writing
an error message using cerr must be assumed to be hazardous
...
A throw or even a return before set_terminate(old) will leave
my_handler in place when it wasn’t meant to be
...
3)
...
If it tries to, terminate() will call abort()
...
The function exit() can be used to
exit a program with a return value that indicates to the surrounding system whether the exit is normal or abnormal (§15
...
3)
...
On some systems, it is essential that the destructors are not
called so that the program can be resumed from the debugger
...

If you want to ensure cleanup when an otherwise uncaught exception happens, you can add a
catch-all handler (§13
...
2
...

For example:
int main()
try {
//
...
handle my error
...
) {
//
...
5
...
There is no way of catching exceptions thrown during initialization or destruction of namespace and thread-local variables
...

When an exception is caught, the exact point where it was thrown is generally not known
...
In some C++ development environments, for some programs, and for some people, it might
therefore be preferable not to catch exceptions from which the program isn’t designed to recover
...
4) for an example of how one might encode the location of a throw into the
thrown exception
...
5
...
3
...
2), std::terminate() (§13
...
2
...
So, if
we don’t want an error in a thread to stop the whole program, we must catch all errors from which
we would like to recover and somehow report them to a part of the program that is interested in the
results of the thread
...
) (§13
...
2
...

We can transfer an exception thrown on one thread to a handler on another thread using the
standard-library function current_exception() (§30
...
1
...
For example:
try {
//
...

}
catch(
...
set_exception(current_exception());
}

This is the basic technique used by packaged_task to handle exceptions from user code (§5
...
5
...

13
...

Obviously, a vector implementation relies on many language facilities provided to support the
implementation and use of classes
...
However, a good understanding of the use of exceptions in C++ requires a more
extensive example than the code fragments so far in this chapter
...
6

A vector Implementation

375

The basic tools available for writing exception-safe code are:
• The try-block (§13
...

• The support for the ‘‘Resource Acquisition Is Initialization’’ technique (§13
...

The general principles to follow are to
• Never let go of a piece of information before its replacement is ready for use
...

That way, we can always back out of an error situation
...

Knowing what to look for in an application takes experience
...
2) and always to provide the basic guarantee
...
For example, if I write a simple data analysis program for my
own use, I’m usually quite willing to have the program terminate in the unlikely event of memory
exhaustion
...
In particular, the techniques for providing basic exception safety, such as deﬁning and checking invariants (§13
...
It follows that the overhead of providing the basic exception-safety guarantee (§13
...

13
...
1 A Simple vector
A typical implementation of vector (§4
...
1, §31
...
2
...
The default allocator (§34
...
1) uses new and delete to acquire and release memory
...

};

Consider ﬁrst a naive implementation of the constructor that initializes a
tialized to val:

vector

to

n

elements ini-

template
vector::vector(size_type n, const T& val, const A& a) // warning: naive implementation
:alloc{a}
// copy the allocator
{
elem = alloc
...
4)
space = last = elem+n;
for (T∗ p = elem; p!=last; ++p)
a
...
4)
}

There are two potential sources of exceptions here:
[1] allocate() may throw an exception if no memory is available
...

What about the copy of the allocator? We can imagine that it throws, but the standard speciﬁcally
requires that it does not do that (§iso
...
6
...
5)
...

In both cases of a throw, no vector object is created, so vector’s destructor is not called (§13
...

When allocate() fails, the throw will exit before any resources are acquired, so all is well
...
Worse still, the copy constructor for T might throw an exception after correctly constructing a few elements but before constructing them all
...

Section 13
...
1

A Simple vector

377

To handle this problem, we could keep track of which elements have been constructed and
destroy those (and only those) in case of an error:
template
vector::vector(size_type n, const T& val, const A& a)
// elaborate implementation
:alloc{a}
// copy the allocator
{
elem = alloc
...
construct(p,val);
last = space = p;
}
catch (
...
destroy(q);
alloc
...
4)

// destroy constructed elements
// free memory
// rethrow

}

Note that the declaration of p is outside the try-block; otherwise, we would not be able to access it
in both the try-part and the catch-clause
...
In a good C++ implementation, this overhead is negligible compared to the cost of allocating memory and initializing elements
...

The main part of this constructor is a repeat of the implementation of std::uninitialized_ﬁll():
template
void uninitialized_ﬁll(For beg, For end, const T& x)
{
For p;
try {
for (p=beg; p!=end; ++p)
::new(static_cast(&∗p)) T(x);
}
catch (
...
2
...
2
...
5
...
1)

The curious construct &∗p takes care of iterators that are not pointers
...
Together with the explicitly

378

Exception Handling

Chapter 13

global ::new, the explicit cast to void∗ ensures that the standard-library placement function (§17
...
4)
is used to invoke the constructor, and not some user-deﬁned operator new() for T∗s
...
construct() in the vector constructors are simply syntactic sugar for this placement new
...
destroy() call simply hides explicit destruction (like (&∗q)−>˜T())
...

Fortunately, we don’t have to invent or implement uninitialized_ﬁll(), because the standard library
provides it (§32
...
6)
...
Consequently, the standard library provides uninitialized_ﬁll(), uninitialized_ﬁll_n(), and uninitialized_copy()
(§32
...
6), which offer the strong guarantee (§13
...

The uninitialized_ﬁll() algorithm does not protect against exceptions thrown by element destructors or iterator operations (§32
...
6)
...

The uninitialized_ﬁll() algorithm can be applied to many kinds of sequences
...
1
...

Using uninitialized_ﬁll(), we can simplify our constructor:
template
vector::vector(size_type n, const T& val, const A& a)
// still a bit messy
:alloc(a)
// copy the allocator
{
elem = alloc
...
) {
alloc
...

The constructor rethrows a caught exception
...
All standard-library containers
have this property
...
This is in contrast to major parts of a system (‘‘modules’’) that generally need
to take responsibility for all exceptions thrown
...
Achieving this may involve grouping exceptions into hierarchies (§13
...
2) and using catch(
...
5
...
2)
...
6
...
In fact, it is unnecessarily difﬁcult because there is an alternative: The

Section 13
...
2

Representing Memory Explicitly

379

‘‘Resource Acquisition Is Initialization’’ technique (§13
...
In this case, the key resource
required by the vector is memory to hold its elements
...
allocate(n)}, space{elem+n}, last{elem+n} { }
˜vector_base() { alloc
...
Class vector_base deals with
memory for a type T, not objects of type T
...

The vector_base is designed exclusively to be part of the implementation of vector
...
alloc},
elem{a
...
space},
last{a
...
elem = a
...
last = nullptr; // no longer owns any memor y
}
template
vector_base::& vector_base::operator=(vector_base&& a)
{
swap(∗this,a);
return ∗this;
}

380

Exception Handling

Chapter 13

This deﬁnition of the move assignment uses swap() to transfer ownership of any memory allocated
for elements
...

Given vector_base, vector can be deﬁned like this:
template >
class vector {
vector_base vb;
void destroy_elements();
public:
using size_type = unsigned int;

// the data is here

explicit vector(size_type n, const T& val = T(), const A& = A());
vector(const vector& a);
vector& operator=(const vector& a);

// copy constructor
// copy assignment

vector(vector&& a);
vector& operator=(vector&& a);

// move constructor
// move assignment

˜vector() { destroy_elements(); }
size_type size() const { return vb
...
elem; }
size_type capacity() const { return vb
...
elem; }
void reserve(size_type);

// increase capacity

void resize(size_type, T = {});
void clear() { resize(0); }
void push_back(const T&);

// change the number of elements
// make the vector empty
// add an element at the end

//
...
elem; p!=vb
...
2
...
space=vb
...
This implies that if an
element destructor throws an exception, the vector destruction fails
...
5
...
5)
...
There is no
really good way to protect against exceptions thrown from destructors, so the library makes no
guarantees if an element destructor throws (§13
...

Section 13
...
2

Representing Memory Explicitly

381

Now the constructor can be simply deﬁned:
template
vector::vector(size_type n, const T& val, const A& a)
:vb{a,n}
// allocate space for n elements
{
uninitialized_ﬁll(vb
...
elem+n,val); // make n copies of val
}

The simpliﬁcation achieved for this constructor carries over to every vector operation that deals
with initialization or allocation
...
alloc,a
...
begin(),a
...
elem);
}

This style of constructor relies on the fundamental language rule that when an exception is thrown
from a constructor, subobjects (including bases) that have already been completely constructed will
be properly destroyed (§13
...
The uninitialized_ﬁll() algorithm and its cousins (§13
...
1) provide the
equivalent guarantee for partially constructed sequences
...
vb)}
// transfer ownership
{
}

// move constructor

The vector_base move constructor will set the argument’s representation to ‘‘empty
...
However, I don’t know if some programmer has been
playing games with std::move()
...
6
...
First
consider a straightforward implementation:

382

Exception Handling

Chapter 13

template
vector& vector::operator=(const vector& a)
// offers the strong guarantee (§13
...
size());
// get memory
uninitialized_copy(a
...
end(),b
...
We can avoid repetition:
template
vector& vector::operator=(const vector& a)
{
vector temp {a};
// copy allocator
std::swap(∗this,temp);
// swap representations
return ∗this;
}

// offers the strong guarantee (§13
...

The reason that the standard-library swap() (§35
...
2) works for vector_bases is that we deﬁned
vector_base move operations for swap() to use
...
Essentially, they are just two different ways of specifying the same set of operations
...

Note that I did not test for self-assignment, such as v=v
...
This obviously handles self-assignment correctly
...

In either case, two potentially signiﬁcant optimizations are missing:
[1] If the capacity of the vector assigned to is large enough to hold the assigned vector, we
don’t need to allocate new memory
...

Implementing these optimizations, we get:
template
vector& vector::operator=(const vector& a)
// optimized, basic guarantee (§13
...
size()) { // allocate new vector representation:
vector temp {a};
// copy allocator
swap(∗this,temp);
// swap representations
return ∗this;
// implicitly destroy the old value
}

Section 13
...
3

Assignment

if (this == &a) return ∗this;

383

// optimize self assignment

size_type sz = size();
size_type asz = a
...
alloc = a
...
alloc;
// copy the allocator
if (asz<=sz) {
copy(a
...
begin()+asz,vb
...
elem+asz; p!=vb
...
2
...
begin(),a
...
elem);
uninitialized_copy(a
...
end(),vb
...
space = vb
...
Obviously, the complexity of the code is far higher
...
However, I do so mostly to show how it is done because here it is only an
optimization
...
5
...
Thus, if
T::operator=() throws an exception during copy(), the vector being assigned to need not be a copy of
the vector being assigned, and it need not be unchanged
...
It is also plausible that an element – the element that was being copied when T::operator=() threw an exception – ends up with a
value that is neither the old value nor a copy of the corresponding element in the vector being
assigned
...

The standard-library vector assignment offers the (weaker) basic exception-safety guarantee of
this last implementation – and its potential performance advantages
...
For example:
template
void safe_assign(vector& a, const vector& b)
// simple a = b
{
vector temp{b};
// copy the elements of b into a temporar y
swap(a,temp);
}

Alternatively, we could simply use call-by-value (§12
...

384

Exception Handling

Chapter 13

13
...
4 Changing Size
One of the most useful aspects of vector is that we can change its size to suit our needs
...
push_back(x), which adds an x at the end of v, and
v
...

13
...
4
...
In other words, reserve() increases the capacity() of a vector
...
We could try the trick from the unoptimized assignment (§13
...
3):
template
void vector::reserve(size_type newalloc)
// ﬂawed ﬁrst attempt
{
if (newalloc<=capacity()) return;
// never decrease allocation
vector v(capacity());
// make a vector with the new capacity
copy(elem,elem+size(),v
...
However, not all types have a default
value, so this implementation is ﬂawed
...
So let us optimize:
template
void vector::reserve(size_type newalloc)
{
if (newalloc<=capacity()) return;
vector_base b {vb
...
elem);
swap(vb,b);
} // implicitly release old space

// never decrease allocation
// get new space
// move elements
// install new base

The problem is that the standard library doesn’t offer uninitialized_move(), so we have to write it:
template
Out uninitialized_move(In b, In e, Out oo)
{
for (; b!=e; ++b,++oo) {
new(static_cast(&∗oo)) T{move(∗b)}; // move construct
b−>˜T();
// destroy
}
return b;
}

In general, there is no way of recovering the original state from a failed move, so I don’t try to
...
However, it is simple and for the vast
majority of cases it is fast
...

Section 13
...
4
...
3
...

Remember that a move operation should not throw
...
A throw from a
move operation is rare, unexpected, and damaging to normal reasoning about code
...
The standard-library move_if_noexcept() operations may be of help here (§35
...
1)
...

13
...
4
...
Given reserve(), the implementation resize() is fairly simple
...
Conversely, if the number of elements decrease, we must destroy the surplus elements:
template
void vector::resize(size_type newsize, const T& val)
{
reserve(newsize);
if (size()uninitialized_ﬁll(elem+size(),elem+newsize,val);
else
destroy(elem
...
space = vb
...
elem+newsize;
}

// construct new elements: [size():newsize)
// destroy sur plus elements: [newsize:size())

There is no standard destroy(), but that easily written:
template
void destroy(In b, In e)
{
for (; b!=e; ++b)
// destroy [b:e)
b−>˜T();
}

13
...
4
...
alloc
...
elem[size()],val);
++vb
...
If that happens,

386

Exception Handling

Chapter 13

the value of the vector remains unchanged, with space left unincremented
...

This deﬁnition of push_back() contains two ‘‘magic numbers’’ (2 and 8)
...
As it happens, these are not unreasonable or uncommon values
...
The factor two is larger than the mathematically optimal factor to minimize average
memory use (1
...

13
...
4
...

The approach of gaining exception safety through ordering and the RAII technique (§13
...
More
problems with exception safety arise from a programmer ordering code in unfortunate ways than
from lack of speciﬁc exception-handling code
...

Exceptions introduce possibilities for surprises in the form of unexpected control ﬂows
...
It is relatively simple to look at such code and
ask, ‘‘Can this line of code throw an exception, and what happens if it does?’’ For large functions
with complicated control structures, such as complicated conditional statements and nested loops,
this can be hard
...
3)
...
Simple, stylized code is easier to understand, easier to get
right, and easier to generate good code for
...
The standard does not require an implementation
to be exactly like the one presented here
...

tialized_copy())
...
7 Advice
[1]
[2]
[3]
[4]

Develop an error-handling strategy early in a design; §13
...

Throw an exception to indicate that you cannot perform an assigned task; §13
...
1
...
1
...
2
...
1
...

Section 13
...
1
...

Use hierarchical error handling; §13
...
6
...
1
...

Don’t try to catch every exception in every function; §13
...
6
...
2, §13
...

Provide the strong guarantee unless there is a reason not to; §13
...
6
...
2
...
2
...
3
...
1
...

Use the ‘‘Resource Acquisition Is Initialization’’ technique to manage resources; §13
...

Minimize the use of try-blocks; §13
...

Not every program needs to be exception-safe; §13
...

Use ‘‘Resource Acquisition Is Initialization’’ and exception handlers to maintain invariants;
§13
...
2
...

Prefer proper resource handles to the less structured ﬁnally; §13
...
1
...
4
...
4
...
4
...
5
...
1
Don’t use exception speciﬁcation; §13
...
1
...

Catch exceptions that may be part of a hierarchy by reference; §13
...
2
...
5
...
2
...
5
...
2, §13
...
2
...

Don’t destroy information before you have its replacement ready; §13
...

Leave operands in valid states before throwing an exception from an assignment; §13
...

Never let an exception escape from a destructor; §13
...

Keep ordinary code and error-handling code separate; §13
...
1, §13
...
4
...

Beware of memory leaks caused by memory allocated by new not being released in case of
an exception; §13
...

Assume that every exception that can be thrown by a function will be thrown; §13
...

A library shouldn’t unilaterally terminate a program
...
4
...
Instead, throw an exception and let a caller decide; §13
...
3
...
D
...
Safety; Namespace Aliases; Namespace Composition; Composition and
Selection; Namespaces and Overloading; Versioning; Nested Namespaces; Unnamed Namespaces; C Headers
Advice

14
...
Functions (§2
...
1, Chapter 12) and
classes (§3
...
4, Chapter 15) provide coarser grain
...
C++ does not provide a single language feature supporting the
notion of a module; there is no module construct
...

This chapter and the next deal with the coarse structure of a program and its physical representation as source ﬁles
...

Consider some of the problems that can arise when people fail to design for modularity
...
*/ };
class Line : public Shape { /*
...
*/ };
class Text : public Shape { /*
...
*/ };
class Word { /*
...
*/ };
class Text { /*
...

Assume (realistically enough) that the facilities of Graph_lib are deﬁned in a header (§2
...
1),
Graph_lib
...
h
...
h"
#include "Text_lib
...

Just #includeing those headers causes a slurry of error messages: Line, Text, and open() are deﬁned
twice in ways that a compiler cannot disambiguate
...

There are many techniques for dealing with such name clashes
...
g
...
g
...
Each of these techniques (also known as ‘‘workarounds’’ and ‘‘hacks’’) works in some cases, but they are not general and can be inconvenient to
use
...
4)
...
2

Namespaces

391

14
...
The members of a namespace are in the
same scope and can refer to each other without special notation, whereas access from outside the
namespace requires explicit notation
...
g
...
For example, we might call the graph
library Graph_lib:
namespace Graph_lib {
class Shape { /*
...
*/ };
class Poly_line: public Shape { /*
...
*/ };

// connected sequence of lines
// text label

Shape operator+(const Shape&, const Shape&);

// compose

Graph_reader open(const char∗);

// open ﬁle of Shapes

}

Similarly, the obvious name for our text library is Text_lib:
namespace Text_lib {
class Glyph { /*
...
*/ };
class Line { /*
...
*/ };

// sequence of Glyphs
// sequence of Words
// sequence of Lines

File∗ open(const char∗);

// open text ﬁle

Word operator+(const Line&, const Line&);

// concatenate

}

As long as we manage to pick distinct namespace names, such as Graph_lib and Text_lib (§14
...
2),
we can now compile the two sets of declarations together without name clashes
...
They should be seen as a logical unit, for example, ‘‘the graphics library’’ or
‘‘the text manipulation library,’’ similar to the way we consider the members of a class
...

A namespace is a (named) scope
...
For example:
class Glyph { /*
...
*/ };
namespace Text_lib {
class Glyph { /*
...
*/ };

// sequence of Glyphs

392

Namespaces

Chapter 14

class Line { /*
...
*/ }; // sequence of Lines
File∗ open(const char∗);

// open text ﬁle

Word operator+(const Line&, const Line&);

// concatenate

}
Glyph glyph(Line& ln, int i);

// ln[i]

Here, the

Word and Line in the declaration of Text_lib::operator+() refer to Text_lib::Word and
Text_lib::Line
...
Conversely, the Glyph and
Line in the declaration of the global glyph() refer to the global ::Glyph and ::Line
...

To refer to members of a namespace, we can use its fully qualiﬁed name
...
2
...
2
...
2
...

14
...
1 Explicit Qualiﬁcation
A member can be declared within a namespace deﬁnition and deﬁned later using the namespacename :: member-name notation
...

}

// deﬁnition

We cannot declare a new member of a namespace outside a namespace deﬁnition using the qualiﬁer
syntax (§iso
...
3
...
2)
...
For example:

Section 14
...
1

Explicit Qualiﬁcation

void Parser::logical(bool);
double Parser::trem(bool);
double Parser::prim(int);

393

// error : no logical() in Parser
// error : no trem() in Parser (misspelling)
// error : Parser ::prim() takes a bool argument (wrong type)

A namespace is a scope
...
Thus, ‘‘namespace’’ is a very
fundamental and relatively simple concept
...
The global scope is a namespace and can be explicitly referred to using ::
...
2)
...
2
...
Consider:
#include
#include
#include
std::vector split(const std::string& s)
// split s into its whitespace-separated substrings
{
std::vector res;
std::istringstream iss(s);
for (std::string buf; iss>>buf;)
res
...
In particular, we repeat std::string four
times in this small example
...
push_back(buf);
return res;
}

A using-declaration introduces a synonym into a scope
...

When used for an overloaded name, a using-declaration applies to all the overloaded versions
...
3
...

14
...
3 using-Directives
In the split() example (§14
...
2), we still had three uses of std:: left after introducing a synonym for
std::string
...
That can be
achieved by providing a using-declaration for each name from the namespace, but that’s tedious
and requires extra work each time a new name is added to or removed from the namespace
...
For example:
using namespace std;

// make every name from std accessible

vector split(const string& s)
// split s into its whitespace-separated substrings
{
vector res;
istringstream iss(s);
for (string buf; iss>>buf;)
res
...
4)
...
This is the technique used to access standard-library facilities throughout this book
...

Section 14
...
3

using-Directives

395

Within a function, a using-directive can be safely used as a notational convenience, but care
should be taken with global using-directives because overuse can lead to exactly the name clashes
that namespaces were introduced to avoid
...
*/ };
class Line : Shape { /*
...
*/ };
class Text : Shape { /*
...
*/ };
class Word { /*
...
*/ };
class Text { /*
...
In particular, we can use names that do not clash, such as Glyph and Shape
...
For example:
Text txt;
File∗ fp = open("my_precious_data");

// error : ambiguous
// error : ambiguous

Consequently, we must be careful with using-directives in the global scope
...
g
...

14
...
4 Argument-Dependent Lookup
A function taking an argument of user-deﬁned type X is more often than not deﬁned in the same
namespace as X
...
For example:

396

Namespaces

Chapter 14

namespace Chrono {
class Date { /*
...

// make string representation

}
void f(Chrono::Date d, int i)
{
std::string s = format(d);
std::string t = format(i);
}

// Chrono::format()
// error : no format() in scope

This lookup rule (called argument-dependent lookup or simply ADL) saves the programmer a lot of
typing compared to using explicit qualiﬁcation, yet it doesn’t pollute the namespace the way a
using-directive (§14
...
3) can
...
2
...
3
...

Note that the namespace itself needs to be in scope and the function must be declared before it
can be found and used
...
For example:
void f(Chrono::Date d, std::string s)
{
if (d == s) {
//
...

}
}

In such cases, we look for the function in the scope of the call (as ever) and in the namespaces of
every argument (including each argument’s class and base classes) and do the usual overload resolution (§12
...
In particular, for the call d==s, we look for operator== in the
scope surrounding f(), in the std namespace (where == is deﬁned for string), and in the Chrono
namespace
...
See also §18
...
5
...
2
...
2
...
For example:
namespace N {
struct S { int i };
void f(S);
void g(S);
void h(int);
}

Section 14
...
4

Argument-Dependent Lookup

397

struct Base {
void f(N::S);
};
struct D : Base {
void mf();
void g(N::S x)
{
f(x);
// call Base::f()
mf(x);
// call D::mf()
h(1);
// error: no h(int) available
}
};

In the standard, the rules for argument-dependent lookup are phrased in terms of associated namespaces (§iso
...
4
...
Basically:
• If an argument is a class member, the associated namespaces are the class itself (including
its base classes) and the class’s enclosing namespaces
...

• If an argument is a built-in type, there are no associated namespaces
...
For example, the search for a declaration of a function f() does not have a
preference for functions in a namespace in which f() is called (the way it does for functions in a
class in which f() is called):
namespace N {
template
void f(T, int); // N::f()
class X { };
}
namespace N2 {
N::X x;
void f(N::X, unsigned);
void g()
{
f(x,1);
}

// calls N::f(X,int)

}

It may seem obvious to choose N2::f(), but that is not done
...

Conversely, examples have been seen where a function in the caller’s namespace is chosen but the
programmer expected a better function from a known namespace to be used (e
...
, a standard-library
function from std)
...
See also §26
...
6
...
2
...
For example:
namespace A {
int f();
// now A has member f()
}
namespace A {
int g();
// now A has two members, f() and g()
}

That way, the members of a namespace need not be placed contiguously in a single ﬁle
...
For example, consider a header
ﬁle written without the use of namespaces:
// my header:
void mf();
void yf();
int mg();
//
...
This
can be rewritten without reordering the declarations:
// my header:
namespace Mine {
void mf();
// my function
//
...

}

When writing new code, I prefer to use many smaller namespaces (see §14
...
However, that is often impractical when converting major pieces of software to use namespaces
...
3 provides an example
...
4
...

Section 14
...
3 Modularization and Interfaces
Any realistic program consists of a number of separate parts
...

Consider the desk calculator example from §10
...
It can be viewed as composed of ﬁve parts:
[1] The parser, doing syntax analysis: expr(), term(), and prim()
[2] The lexer, composing tokens out of characters: Kind, Token, Token_stream, and ts
[3] The symbol table, holding (string,value) pairs: table
[4] The driver: main() and calculate()
[5] The error handler: error() and number_of_errors
This can be represented graphically:
driver
error handler

parser
lexer
symbol table

where an arrow means ‘‘using
...
In fact, the calculator was conceived as three parts, with the driver
and error handler added for completeness
...
Ideally, most of the details of a module are unknown to its users
...
For example, the parser directly relies on the lexer’s interface
(only), rather than on the complete lexer
...
This can be presented graphically like this:
driver
parser interface

parser implementation

lexer interface

lexer implementation

symbol table interface

error handler

symbol table implementation

A dashed line means ‘‘implements
...
That done, the code will be simple, efﬁcient, comprehensible, maintainable, etc
...

400

Namespaces

Chapter 14

The following subsections show how the logical structure of the desk calculator program can be
made clear, and §15
...
The calculator is a tiny program, so in ‘‘real life’’ I wouldn’t bother using namespaces and separate compilation (§2
...
1, §15
...
Making the structure of
the calculator explicit is simply an illustration of techniques useful for larger programs without
drowning in code
...

Error handling permeates the structure of a program
...
C++ provides exceptions to decouple the
detection and reporting of errors from the handling of errors (§2
...
3
...

There are many more notions of modularity than the ones discussed in this chapter and the next
...
3, Chapter 41) or
processes to represent important aspects of modularity
...
I consider these notions of modularity largely independent and orthogonal
...
The hard problem is to provide safe,
convenient, and efﬁcient communication across module boundaries
...
3
...
That is, if some declarations logically belong together according to some criteria, they can be put in a common namespace to
express that fact
...
For
example, the declarations of the parser from the desk calculator (§10
...
1) may be placed in a namespace Parser:
namespace Parser {
double expr(bool);
double prim(bool get) { /*
...
*/ }
double expr(bool get) { /*
...
2
...

The input part of the desk calculator could also be placed in its own namespace:
namespace Lexer {
enum class Kind : char { /*
...
*/ };
class Token_stream { /*
...
3
...
*/ }
}
int main() { /*
...
*/ }
}

This use of namespaces makes explicit what the lexer and the parser provide to a user
...
If function
bodies are included in the declaration of a realistically sized namespace, you typically have to wade
through screenfuls of information to ﬁnd what services are offered, that is, to ﬁnd the interface
...
I don’t consider that a good solution
...

Here is a version of the Parser with the interface separated from the implementation:
namespace Parser {
double prim(bool);
double term(bool);
double expr(bool);
}
double Parser::prim(bool get) { /*
...
*/ }
double Parser::expr(bool get) { /*
...
Users will see only the interface containing declarations
...

Ideally, every entity in a program belongs to some recognizable logical unit (‘‘module’’)
...
The exception is main(), which must be global in order for
the compiler to recognize it as special (§2
...
1, §15
...

402

Namespaces

Chapter 14

14
...
2 Implementations
What will the code look like once it has been modularized? That depends on how we decide to
access code in other namespaces
...
However, for names in other namespaces, we
have to choose among explicit qualiﬁcation, using-declarations, and using-directives
...
If we use explicit qualiﬁcation, we get:
double Parser::prim(bool get)
{
if (get) Lexer::ts
...
current()
...
current()
...
get();
return v;
}
case Lexer::Kind::name:
{
double& v = Table::table[Lexer::ts
...
string_value];
if (Lexer::ts
...
kind == Lexer::Kind::assign) v = expr(true); // ’=’ seen: assignment
return v;
}
case Lexer::Kind::minus:
// unar y minus
return −prim(true);
case Lexer::Kind::lp:
{
double e = expr(true);
if (Lexer::ts
...
kind != Lexer::Kind::rp) return Error::error(" ')' expected");
Lexer::ts
...
I didn’t use Parser:: because that would be
redundant within namespace Parser
...
3
...
get();

Implementations

403

// handle primaries

switch (ts
...
kind) {
case Kind::number:
// ﬂoating-point constant
{
double v = ts
...
number_value;
ts
...
current()
...
get()
...
current()
...
get();
// eat ’)’
return e;
}
default:
return error("primary expected");
}
}

My guess is that the using-declarations for Lexer:: were worth it, but that the value of the others was
marginal
...

So, the tradeoff among explicit qualiﬁcation, using-declarations, and using-directives must be
made on a case-by-case basis
...

[2] If some qualiﬁcation is common for a particular name from a namespace, use a using-declaration for that name
...

Don’t use explicit qualiﬁcation for names in the same namespace as the user
...
3
...
Instead, that Parser declares the set of declarations that is needed to
write the individual parser functions conveniently
...

The functions implementing the parser should see whichever interface we decided on as the best
for expressing those functions’ shared environment
...

We could give the user’s interface and the implementer’s interface different names, but (because
namespaces are open; §14
...
5) we don’t have to
...
3
...
Had we decided to use a separate implementation namespace, the design would not have
looked different to users:

Section 14
...
3

Interfaces and Implementations

405

namespace Parser { // user interface
double expr(bool);
}
namespace Parser_impl {
using namespace Parser;

// implementer interface

double prim(bool);
double term(bool);
double expr(bool);
using namespace Lexer; // use all facilities offered by Lexer
using Error::error;
using Table::table;
}

or graphically:
Parser

(user interface)
Parser_impl

Driver

code

(implementer interface)

Parser

code

For larger programs, I lean toward introducing _impl interfaces
...
Had this
interface been for a realistically sized module in a real system, it would change more often than the
interface seen by users
...

14
...
This section examines technical aspects of
composing code out of namespaces
...
4
...
Safety
A using-declaration adds a name to a local scope
...
For example:
namespace X {
int i, j, k;
}

406

Namespaces

Chapter 14

int k;
void f1()
{
int i = 0;
using namespace X;
i++;
j++;
k++;
::k++;
X::k++;
}
void f2()
{
int i = 0;
using X::i;
using X::j;
using X::k;
i++;
j++;
k++;

// make names from X accessible
// local i
// X::j
// error : X’s k or the global k?
// the global k
// X’s k

// error: i declared twice in f2()
// hides global k

// X::j
// X::k

}

A locally declared name (declared either by an ordinary declaration or by a using-declaration) hides
nonlocal declarations of the same name, and any illegal overloading of the name is detected at the
point of declaration
...
Global names are not given preference over names
from namespaces made accessible in the global scope
...

When libraries declaring many names are made accessible through using-directives, it is a signiﬁcant advantage that clashes of unused names are not considered errors
...
4
...

}
A::String s1 = "Grieg";
A::String s2 = "Nielsen";

However, long namespace names can be impractical in real code:

Section 14
...
2

namespace American_Telephone_and_Telegraph {
//
...
For example:
namespace Lib = Foundation_library_v2r11;
//
...
By using
Lib rather than Foundation_library_v2r11 directly, you can update to version ‘‘v3r02’’ by changing
the initialization of the alias Lib and recompiling
...
On the other hand, overuse of aliases (of any kind) can lead to confusion
...
4
...
For example:
namespace His_string {
class String { /*
...

}
namespace Her_vector {
template
class Vector { /*
...

}

408

Namespaces

Chapter 14

namespace My_lib {
using namespace His_string;
using namespace Her_vector;
void my_fct(String&);
}

Given this, we can now write the program in terms of My_lib:
void f()
{
My_lib::String s = "Byron";
//
...

my_fct(vs[5]);
//
...

Only if we need to deﬁne something do we need to know the real namespace of an entity:
void My_lib::ﬁll(char c)
{
//
...

}

// OK: ﬁll() declared in His_string

void My_lib::my_fct(String& v)// OK: String is My_lib::String, meaning His_string::String
{
//
...

Together with the #include mechanism (§15
...
2), the composition techniques presented here and in
the following subsections provide strong support for this
...
4
...
4
...
With these mechanisms, we can provide access to a
variety of facilities in such a way that we resolve name clashes and ambiguities arising from their
composition
...
*/ };
template
class Vector { /*
...

}
namespace Her_lib {
template
class Vector { /*
...
*/ };
//
...
*/ };
//
...
4
...
Consequently, a user of My_lib will see the name clashes for String and Vector
resolved in favor of His_lib::String and Her_lib::Vector
...

Usually, I prefer to leave a name unchanged when including it into a new namespace
...
However, sometimes a new name
is needed or simply nice to have
...
*/ };
//
...
4
...
5)
...
4
...
3) works across namespaces
...
For example:
// old A
...

// old B
...

// old user
...
h"
#include "B
...
h
}

This program can be upgraded to a version using namespaces without changing the actual code:
// new A
...

}
// new B
...

}

Section 14
...
5

Namespaces and Overloading

411

// new user
...
h"
#include "B
...
h

Had we wanted to keep user
...
However, it is usually best to avoid using-directives in header ﬁles, because
putting them there greatly increases the chances of name clashes
...
For example, people
often wonder why they have to explicitly mention a sequence to manipulate a container using a
standard-library algorithm
...
begin(),v
...
2), but manipulating a container is by far the most common case
...
begin(),c
...
begin(),c
...
Those are of
course implemented using std::sort() from

Notesale: Turn your study into money

Already a Member? >

Search for notes by fellow students, in your own course and all over the country.

My Basket

Document Preview