Software Engineering 1: Lecture Notes 
 Barry McMullin 
 Last Modified: 12th April 1995 
 

Note that this complete document is available in
 LaTex  swe1lctr.tex  and plain
 ASCII  swe1lctr.txt  forms, to allow downloading and
offline browsing.

 Introduction 

This document is one component of the hypermedia documentation for the
course  Software Engineering 1 
 http://www.eeng.dcu.ie/ 7Emcmullin/swe1/swe1root/swe1root.html . It
presents additional material to complement what is covered in the
lectures, and in the assigned textbook  textbook . Note
carefully that the material presented here is    additional  to the
textbook, not a substitute for it!

All chapter and page numbers in this document refer to the textbook.


 General Comments on Lectures 

I find the subject of software engineering endlessly
fascinating, and get a great kick out of successfully solving a
programming problem. The down side is that this subject can be
almost equally endlessly annoying and frustrating, particularly
when you are trying to come to grips with it for the very first
time.  This course has been evolving over the past several years,
but one basic principle has remained constant: I see my role not
as a teacher or instructor, but as a    facilitator .

The reason is that software engineering is a subject which is
dominated by    doing . It cannot be primarily learned by
listening or reading, though both of these can play a subsidiary
role; it must be learned by taking control, and    doing . The
frustration of the subject lies in the fact that, particularly at
first, this doing is necessarily uninformed and feels very much
like thrashing around in the dark - and will frequently end in
failure; but the joy lies in the fact that the failures are
always ultimately understandable and correctable - and
this understanding and correction delivers tremendous satisfaction.

Given this emphasis on doing, I consider the time you will spend
in the laboratory (both scheduled lab sessions, and private study
time) as being the most central and important component of the
course. Nonetheless, a small lecture component is still useful to
complement this practical work, and to provide overall structure
and    pacing  to the course.

There are two scheduled lectures and one tutorial each week. In
practise, I expect all three sessions to be tutorial in nature. This
reflects an increasing emphasis on active, personal, study time. The
lectures will closely follow the textbook. I work on the assumption that
you will have studied the assigned material in the textbook, and any
related notes provided here,    before  each lecture. In this way
the lecture sessions can be interactive in nature.

Note that attendance at lectures for this course is monitored. While it
is entirely under your own control and discretion whether to attend or
not, information collected on attendance will be made available to the
examination board and may be taken into account in considering
borderline decisions.

Delivering the lectures on this basis has several significant
advantages:
 
  I am relieved of the tedium of presenting extensive,
repetitive, and usually rather boring, notes in the lectures.
  Perhaps more importantly, you are freed of the tedium (not
to mention writer's cramp) of transcribing them.
  The lectures can be used to allow discussion,
clarification (and sometimes even correction!) of the textbook
material. Further, should you choose to play an active role in
the lectures, they can be used to explore those topics which you
find particularly interesting or difficult in more detail.
  Since all the written materials for the course are
available to you independently of attendance at lectures, you can
decide for yourself whether or when it is worth your while to
attend the lectures.  For example, if you happen to already be
proficient in certain aspects of software engineering, and are
satisfied that you completely understand the assigned materials,
then you may well decide to miss certain lectures and allocate
the time for some other, more pressing, studies. However, I recommend
that you adopt this position only with    extreme caution! 
  Conversely, should you miss a lecture for some reason
outside your control, or otherwise fall behind in the course,
then because the written materials are all available to you


you will have the
flexibility to schedule in extra study time to make up.
 

In summary, my intention in designing this course is to   
empower  you, the student. That is, I hope that you will be
able to enjoy and benefit from this course precisely in
proportion to the energy and effort you are willing to invest in
it. No more, no less.


 Background 

The course is introduced with background discussion of computers,
computer programming, and some demonstration of particular computer
tools which will be used during the course.

 Chapter 1 pp. 1 10 

 Overview 

Chapter 1 is short, but very important. If you can successfully
master it, and complete its one associated exercise (p. 10)
then you will have laid down an excellent foundation for tackling
the substantive topics which will follow - and they will follow
very quickly indeed! Conversely, if you experience any difficulty
with anything in this chapter, this is your
opportunity to get your questions answered quickly.

In brief, chapter 1 introduces a problem (the calculation of
repayments on a loan) which can be solved "by hand" (i.e.  by
manual calculation), but which can more conveniently be solved by
programming a computer, in a suitable language, "once and for
all".  The bulk of the chapter is then concerned with presenting
and dissecting such a program, written in  

 First Step: Understand the problem! 

 
  I think that there is only one way to science - or
to philosophy, for that matter: to meet a problem, to see its
beauty and fall in love with it; to get married to it, and to
live with it happily, till death do ye part - unless you should
meet another and even more fascinating problem, or unless,
indeed, you should obtain a solution.  But even if you do obtain
a solution, you may then discover, to your delight, the existence
of a whole family of enchanting though perhaps difficult problem
children for whose welfare you may work, with a purpose, to the
end of your days.

 
Realism and the Aim of Science 
Preface 1956, p. 8 
   Karl Popper 
 
 
Your first step in tackling any software engineering project is
to ensure that you fully understand the statement of the problem.
Indeed, many senior software engineers (classified as   
Systems Analysts ) specialise in nothing    but  clearly
stating the problems to be solved by software systems!

Alcock presents a formula for calculating loan repayments.  Ask
yourself do you    believe  this formula before going any
further. If you    do  believe it, try to say exactly   
why .  In any case, try to    test  the formula in any way
you can dream up.  You may not be able to prove that it is
definitely    right , but you should be able to think of at
least some    partial  tests which it should pass. For
example, does it give the correct answer for the limiting case of
the principle being   0? Can you think of any other
comparable tests?

Next, actually carry out the sequence of instructions which
Alcock lists on p. 2, for at least a few different cases. Carry
out these instructions as exactly and literally as you possibly
can (you will need to get a friend to act as the "client", and
then alternate roles).  If you find anything in the instructions
which you think is obscure or ambiguous - which you cannot
immediately see how to carry out - then make a note of it.


 What's a "high-level" computer language? 

The next sections of the chapter present a   program to make a
computer carry out, in effect, the same sequence of actions to
calculate loan repayments.  Note that essentially the
same   program will work on practically any computer. This is a
key advantage of using a so-called "high-level" language - it
can be automatically translated or "compiled" into a form which
can be executed on practically any computer.  However, the
precise procedures for entering the program, and arranging for it
to be translated and executed will generally vary from computer
to computer.  For this reason, Alcock cannot and does not give
detailed instructions for how to do this; however, you are
given the appropriate detailed instructions, for the computers
which you will be using, in the notes for
   Laboratory Session 2   lab2 .

Alcock next dissects this simple program in great detail, to
explain, at least in a superficial sense, what its components
are, and how they are interpreted by the computer.  Ask
yourself at this point if you can explain why a    special 
language such as   is necessary at all - why can the computer not
simply understand the original instructions written in English on
p. 1?

 Watch for the new technical terms! 

Note that Alcock introduces a variety of technical terms in this
chapter - such as    directive ,    block ,    statement 
etc.  He generally italicises such technical terms on the first
occasion on which they are used. This specialised technical
language is very important in this subject - as, indeed, it is in
most other engineering subjects; if you do not realise which
terms are being used with specialised meanings, and what those
meanings are, from the very start, then you will quickly find
that you are becoming very confused.

So, I suggest you acquire a notebook to use exclusively as a glossary
for the duration of this course. Organise it alphabetically.  Each time
you come across a word which seems to be used in a special technical
sense, write it in, together with a definition in your own words, and an
example or two.  You may well have to go back over certain definitions
at various times, and revise or correct them - but that is perfectly OK.
If there are words that seem to have a technical meaning, but you cannot
quite see what the meaning is, then e-mail  feedback  a question,
or ask in the lecture.

Start constructing your glossary now, by going through the
whole of Chapter 1 carefully.

 Caution: On Program Style 

In dissecting the initial example program Alcock comments that:
 
  "You have complete freedom of layout. Statements may be typed
  several to a line, one per line, one to several lines."
 

His point here is that as far as the computer - or more
precisely, the   compiler - is concerned, many details of
layout or formatting of the program are irrelevant. This is true.
However, you should notice that programs have effectively two
parallel, but quite distinct, purposes:

 
  To represent a suitable sequence of instructions which
(after compilation) the computer can execute to accomplish some
task. This is the role which Alcock explicitly recognises and
discusses.
  To    document  that set of instructions so that some
   person  can understand how the computer is accomplishing
the task. This becomes vitally important whenever it is necessary to
modify or extend the behaviour of a program - or to correct
previously undetected malfunctions.  Alcock does not explicitly
refer to this purpose of the program at all.
 

It should be clear that, in its role as    documentation  for
another human being, the details of layout and formatting of a
program are    very  important. A poorly formatted program
may be perfectly "correct" - in the sense of generating the
correct answers to the problem at hand - but may be virtually
impossible for a human being (even the original author) to
decipher if it becomes necessary to enhance it (or even to decide
whether it may malfunction in certain circumstances).

Another aspects of program design which is similarly constrained
is the choice of    identifiers  - names for variables and
other objects which the program uses.  The computer is largely
insensitive to particular identifiers, as long as, in each
context, they unambiguously identify the "correct" object. In
particular, the computer does not care whether an identifier is
"meaningful" in human terms.  The names  P ,  Principal 
and  bZ34ywT  are all equally valid to the
machine, as along as they are used consistently (unambiguously).
However, the net effect if entirely different from the point of
view of someone trying to understand how the program works. So
try to get into the habit, from the very start, of taking care
with the formatting and layout of your programs, and with your
choice of identifiers. These things may not directly affect the
functioning of your program, but they will seriously affect how
long it actually takes you to get the program working
satisfactorily!

 Chapter 2 pp. 11 19 

 What's all this about "Boolean" variables? 

Alcock is at pains to point out that   does    not  support
"Boolean" variables. What does this mean?

Well, certain other programming languages (notably    Pascal  and
its descendants) support a special kind of variable which can only
hold or be assigned one of    two  distinct values - usually
referred to as  TRUE  and  FALSE . These are
  called "Boolean" variables in reference to the notable 19th
  century mathematician    George Boole . Boole's greatest work
  was entitled    An Investigation of the Laws of Thought on
  Which Are Founded the Mathematical Theories of Logic and
  Probabilities , published in 1854. Boole reduced logic (the
  analysis of the truth or falsity of complex propositions) to a
  simple algebra, thereby incorporating logic into mathematics.
  Boole's two-valued, or binary, algebra is the simplest form of
  his more general    boolean algebra . 
Now Boolean variables would be quite different in kind from the
numeric variables we have seen so far in   Admittedly, as we
shall see in more detail later, the total number of distinct values
that can be accommodated in any type of   variable
(such as type  int  or type  float ) is always    finite ,
but it is still much bigger than two!

So why are we worried about all this anyway?

Well, it turns out that, in many programs, it is desirable for
the program to have a "choice" of things to do, depending on a
variety of factors that only become known when the program is
actually executing.  In effect this means that we want the
program to be able to evaluate some proposition or condition and
classify it as being "true" or "false".

To allow this, a programming language normally provides operators
to test or compare various conditions - such as whether certain
numbers are equal, or one is greater than the other etc.  If the
language supports Boolean variables as such, then the result of
applying or evaluating such an operator would be one of the
Boolean values normally denoted  TRUE  or  FALSE . In
  however, there is no special Boolean data type, so, instead, the
so-called    logical  operators simply generate   
numeric  results, with the number  0  being
interpreted as signalling "false" and the number  1 
signalling "true". Technically, the result of applying
  a   logical operator is of the particular   numeric data
  type called  int . 

The    logical  operators would never generate any value
other than  0  or  1 . However, where, in effect, a
Boolean value is examined or used for some purpose in a  
program,    any     int  value may be introduced, with the
convention that all non-zero values (including  1  of course)
are classified or interpreted as meaning "true".

 Making a statement! 

Now that we have some logical operators available to us, how can
we use their results to control the flow of execution in a
program?

The answer is that there are special kinds of       
statement  where the exact effect of the statement can depend
on a value generated at execution time (usually by a logical
operator). But what    is  a "statement" anyway? Let's back
up and review the concept a little.

We have seen examples of   "statements" already, in the
introductory chapter; they were things like:

 
  R = Rpct / 100;
 

or:

 
  printf("  Principal, Rate
 

These are all what we might term "simple" statements:
they are the components of a program that are carried
out, in sequence, when the program is executed. Technically,
these particular statements are all examples of what are called
   expression  statements: that is, as we shall see later, we
can always think of the execution of these statements as
involving, in effect, the evaluation of an "expression".

In any case, the "statement" notion is much broader than is
accommodated by just expression statements.  We shall see other
more complex kinds of statement shortly. But first note that
statements in general should be distinguished from the other
major "kinds" of program component. As well as statements,
other program components we have seen are    comments ,   
preprocessor directives , and    declarations .

   Comments  are simple pieces of text delimited by the
special tokens  /*  and  */  which are ignored by the
compiler, but serve to provide some additional information to a
   person  reading the program:

 
/* WOTCOST; Computes the cost of a loan. */
 

   Directives  are the components of a program signalled by the
 
control the way the rest of the program is translated or
compiled; we say that they play a role only at "compile time".
The only example we have seen so far is the
 

   include <stdio.h>
 

Its effect is to tell the compiler to also compile an extra
separate file (in this case, one called  stdio.h ) as an
integral part of the compilation of our own file.

   Declarations , roughly speaking, "prepare the ground" for
statements to be subsequently executed. So far, the only
declarations we have seen have been concerned with    creating
variables  with given names and types, e.g.  as follows:

 
  float P, Rpct, R, M;
 

Recall that declarations and statements can act as   
sub-\/ components: in particular, they may be combined to form a
   block , by enclosing them within braces; and a block, in
turn, can form part of a    function definition . This notion of
   hierarchical structure , with components within components
within components, is a very important and pervasive one in the
design of all complex engineering systems.

For now, note that technically a block is, itself, regarded as a
special kind of    statement . In fact, it is also
alternatively referred to as a    compound statement . The
significance of this is that anywhere in a program where it is
"grammatically" (or "syntactically") legal to place a
"simple" statement then it will also be legal to put in a whole
block, comprising an indefinitely large group of declarations and
statements. It follows, logically, that a block may itself be a
component of a larger more encompassing block and so on! Granted,
there is no obvious reason why this might be useful yet, but
we'll get there in due course!

 Branching and Looping 

So far we have two kinds of statement: "expression" statements,
and "block" or "compound" statements. Expression statements
are just executed. The (simple) statements within a block are
executed in simple sequence.  As yet we have no way of making the
execution of a program depend on, or vary in sympathy with, a
logical value determined at execution time.

There are two principle kinds of variation we typically might
want: branching (or, as Alcock says,    selection ) and
looping.  In branching we have    alternative  things we
might want our program to do; in looping we want to do a
particular thing    repeatedly  - over and over again.   
provides several different ways of achieving both of these
effects; but for the time being, it is sufficient to have at
least one way of doing each in our   armoury, and that is what
Alcock now introduces. He presents the  if - else  kind of
statement to achieve branching, and the  for  statement to
achieve looping or "iteration".

Examine the formats which Alcock specifies for the  if-else 
and  for  statements very carefully. Notice that, in each
case, the statement format contains one or more other statements
as    components  nested within it. In the case of an   
if  statement, the nested statement is to be executed if and
only if the    expression  value fed into the  if  is
"true" (non-zero); in the case of  if-else  exactly one or
the other of the nested statements will be executed depending on
whether the    expression  is classified as "true" or
"false"; and in the case of the  for , the nested statement
is what gets executed repeatedly.

 
   My point here is to draw your
attention to the fact that the nested statement is grammatically,
or "syntactically", allowed to be any kind of statement at
all. 
 

So, as Alcock shows in his example, the statement nested within
an  if  can, itself, be another  if  statement (complete
with a further, nested, statement); equally, the nested statement
could be a  for  statement or, indeed, a    compound 
statement. The latter means that, with one  if  statement, we
can control execution of a whole block of statements - simply by
enclosing them in braces to make a single compound statement.

Furthermore, the nesting of statements within statements can, in
theory, be made as deep as we like!  However, in practice,
programs get rather difficult for human beings to understand if
statements get nested    too  deeply within each other - so
don't get carried away with this!

 A Shady Character? 

A computer clearly has to be able to deal with "text" as
well as "numbers", if we are going to use it for general
purpose information processing.  We need to be able to print out
headings, or textual prompts at the very least. More generally,
we want to be able to carry out "processing" on textual
information: sorting, searching, formatting etc.

Text is ultimately made up of basic components we call   
characters .

But what exactly counts as a "character"?

In the English speaking world we tend to be happy with upper and
lower case alphabetics, some digits, and a selection of
miscellaneous brackets, punctuation, and other "special"
characters. Even that still leaves room for tremendous variety in
   style  and    size .

But to handle text in most languages of the European
Union we need to add a large variety of    accented 
characters ( a ,  u ) etc.; and if we move further afield,
things get even more complicated: in the middle East we need
arabic and hebrew alphabets, in the CIS we need (at least) the
cyrillic character set, and in the far East we need even more
elaborate character sets to handle Japanese, the various kinds of
Chinese, and so on.

Further, if we want to deal with mathematical notation, we
need to be able to deal with things like sub- and super-scripts
( x^ y_1  ), and so forth.

So what should count as a "character", or even how many
"distinct" characters we might want to deal with, is a fairly
difficult question to deal with.

However, we have to start somewhere!

In the case of computers, the start was historically based on
good old fashioned typewriters, or, more strictly,   
teletypewriters . Teletypewriters, or teletypes, were automatic
electric typewriters which could be remotely hooked together, so
that whatever was typed on one was automatically printed on the
other also. In the early days of computers, teletype machines
were already fairly freely available, and were a convenient
mechanism for both keying input information into a computer, and
for printing output information from the computer.

However, it was in the nature of teletype technology that they
could only support a rather limited range of characters:
typically just the upper and lower case roman alphabet, digits 0
to 9, and a variety of "special" characters. Furthermore, if
these devices were to work properly with each other it was
essential that they all adhere to some single   
standard  - both for the characters to be supported, and the
way they should be encoded (something akin to the "dots" and
"dashes" of the earlier    Morse  code).

A set of just 96 such characters, plus 32 non-printable
"control" characters, together with a standardised
way of encoding these into electrical impulses (as strings of
"highs" and "lows", or "ones" and "zeros" - i.e.  binary
numbers) was therefore devised, and became a    de facto 
standard, known as the    American Standard Code for
Information Interchange , or  ASCII  for short.

Because it only allows 96 characters, based on the roman
alphabet,  ASCII  is very limited.  It doesn't include any
accented characters, or even the   sign. It doesn't allow
for variations in size, or style. It can't deal properly with
mathematical notation. And it does not address non-roman
alphabets at all.

Nonetheless,  ASCII  has become a sort of "lowest common
denominator" or    lingua franca  of computers.  They can
virtually all deal with  ASCII  encoded text.

In particular, the text that makes up computer programs is
normally limited to the  ASCII  character set (or some even
more restrictive character set), to ensure that the programs can
be potentially processed on the widest possible variety of
computers.

Now, the  ASCII  code is certainly not the    only  way
of representing textual material in computers, and the standard
for the   language does not absolutely require either that the
characters making up a   program, or the characters making up
textual material being processed by the program, must be encoded
in  ASCII . Instead, the standard simply stipulates certain
restrictions on the encoding, which are summarised by
Alcock. Alcock rather misleadingly talks of certain
  characters being    stored  in a certain "order" and/or
  "contiguously"; what he means is that characters are   
  encoded  in a certain order or contiguously. 
 ASCII  text satisfies these restrictions, and also has some
additional properties which Alcock notes.

As it happens, in the implementation of the   language which
you will be using, characters are, indeed, encoded in  ASCII 
and these properties do hold. However, the point to be made here
is that in general, in your programs, you should not    rely 
on these latter properties (specific to the  ASCII  coding),
since they are not    guaranteed  to hold on all computers
implementing the   language - except, of course, where your
program is specifically intended    only  to work with
 ASCII  encoded text.

 The    newline  Character - and other
vagaries! 

One special problem in dealing with characters is how to represent the
idea of a "line break" or a "new line". In the  ASCII  character set
there are actually two separate numeric codes associated with the idea
of line breaking - one called "carriage return" the other called "line
feed". The terminology goes back to mechanical typewriters and
teletypes, where moving on to a new line required two separate actions:
the "carriage" carrying the roller and paper had to be moved back to the
extreme right (so that the left hand edge of the paper was relocated
under the printing position)    and  the paper had to be "fed"
through the roller by the correct amount for a new line.  Of course,
these quaint historical origins have rather little to do with the
operation of modern computer displays; but the distinction between
moving the print position (the cursor)    down  a line, and moving it
   back  to the left hand margin still exists, and so the distinct
functions of the two  ASCII  control codes still apply.
Nonetheless, it is conventional in   programming to have a single
control code which is denoted by the sequence    n  and which
has the effect of    both  moving down a line,    and  moving back
to the left hand margin. This special character is called the   
newline  character.

Why cannot a newline simply be represented in the   file by,
literally, inserting a new line? Why do we need this special sequence
   n ?  The basic answer is that line breaks are used in a  
program to try to lay it out in a legible manner; and that if we put in
line breaks in the   file where what we really want is to have the
program    emit  a line break at run time, then this will disrupt and
interfere with the formatting of the   program.  In fact, within  
string constants, which is where line breaks are usually desired, it is
   illegal  to insert a literal line break, and a compiler error would
be generated.  So it is necessary to invent some kind of more or less
artificial way of indicating the idea of a line break, or the "newline"
character within a   string, and    n  is the way it is
done.  Note that when you write a statement like:

 
    printf(" );
 

then the string which is actually stored internally in the computer at
run time (and passed as an argument to  printf() ) contains just
   one  character - the  ASCII  newline character, and    not 
the two separate  ASCII  characters backslash and the letter
 n . In turn, what the  printf()  function actually "sends" to
the screen is    not  the two separate characters     and
 n , but rather the single  ASCII     newline 
character. Its actually even more complicated than this, but I
  will spare you any further details! 

The sequence    n  is called an "escape" sequence: the idea is
that the character  n  is "escaping" from its normal interpretation,
and is, instead, being given a special interpretation.  Its a "sequence"
because the  n  must be preceded by the special "escape" character,
which, in this case, is the backslash character,    . The use of an
escape sequence to represent a special  ASCII  character (such as
newline) introduces a difficulty of its own: what if we want to actually
put the backslash character itself into a string?  How can we   
prevent  it being interpreted as an "escape" character?  Well, the
answer is obvious enough: if we want to incorporate, literally, the
backslash character     into a   string, we simply escape this
escape character itself: i.e., we write    .  Thus to print the
backslash character using  printf()  we would write:

 
    printf(" );
 

As well as the newline character, denoted    n  there are a
variety of other special  ASCII  characters which can be represented
in   by escape sequences.  Try out these sequences for yourself, and
try to establish their effect:    r ,    b ,
   t .

 Variables, Data Types and Arrays 

A "simple" variable in   can store or record a    single  "value".
What    kind  of value depends on the    data type : variable of
type  int  can hold positive or negative integers, up to about   
32000 ; type  long  variables can hold integers up to about   
2  10^ 9  ;  float  variables can hold positive or negative
rational numbers, with a precision of about 9 significant digits, and a
range up to about  10^   38  .  The type  char  is a little
stranger: its values can be regarded    either  as integers (with a
range from 0 to 255)    or  as  ASCII  characters, represented in
single quotes, e.g.,  'a' ,  'Z' ,  '9' ,  '+' ,
 '('  and so on.

But the basic point remains that one variable can store or record just
one value.

Now, if we want to store a whole lot of values (and this is a very
common requirement) we could simply declare as many separate, simple,
variables as are needed.  This is a perfectly satisfactory approach for
many purposes.

BUT: in many cases, we don't just want a whole lot of separate values -
we want our program to be able to automatically scan or iterate or
repeat some operation over all these values. If the values are all
stored in separate variables this will be very clumsy, if not impossible
to code.  This is so because each repetition or iteration would need
to refer to a different variable - with a different    name .  That
means that the   code for each iteration is necessarily different.  If
I want to do the same thing to three variables called  x ,  y 
and  z , then I will have to program (at least) three separate
statements.  Granted, the statements will be almost identical - but not
quite    exactly  identical, because the relevant variable name will
have to be changed from  x  to  y  to  z . This means I
can't possibly achieve the required repetition with, say, a  for 
statement: because there    is  just one substatement controlled as
the substatement of a  for , and that substatement would have to
name some particular one of the three variables  x ,  y  or
 z .

This may not be too much of a problem if I just want to repeat something
over two or three different variables - it will not be too much of a
problem simply to copy the relevant   source statements, and edit the
separate copies to deal with the different variables
as appropriate.  But what if the repetition is to be over 50, or 100, or
even 1000 different values?  Then duplicating, and varying, the   code
for each different variable is going to be very laborious, and very
error prone.

Well, the   language provides a special mechanism for dealing with
this kind of situation.  If you want a set of data items,    all of the
same type , then instead of declaring a separate variable for each one,
you can declare a single    array  variable:

 
    int foobar 100 ;
 

The square brackets on the declaration signal that  foobar 
is some kind of array; the number inside the square brackets says how
many elements are in the array; and the type -  int  in the case -
gives the type of the    elements .  So this declaration creates a
single "array" variable, called  foobar ; but  foobar  in turn,
is actually made up of a whole lot (100 to be precise) of individual,
simple,  int  variables.

Note that arrays already have an advantage over separate variables, in
that they are much more concise to declare. In the example above, if we
did not have the array mechanism, we would have had to declare 100
separate variables - something like this:

 
    int foobar1, foobar2, foobar3, foobar4, foobar5;
    int foobar6, foobar7, foobar8, foobar9, foobar10;
 

and so on!

Once an array is declared, the individual elements can be accessed by
indexing the array name.  That is, we can use statements like:

 
    foobar 10  = 42;
    foobar 24  = foobar 5  * foobar 4  * 163;
    printf("This is element 9 of foobar: 
 

Array indices in   always start at zero.  So if an array has 100
elements, the valid indices run from 0 to 99.

Note carefully at this stage that an array is a quite different   
kind  of object from its elements.  The kinds of things you can
typically do to a complete array are quite different from the kinds of
things you can do to its elements.  Thus, in the case of  foobar 
above, it is perfectly reasonable to write:

 
    foobar 0  = foobar 0  + 10;
    printf("
 

which has the effect of increasing the value of the zero'th element of
 foobar  by 10.  But it would be silly to write:

 
    foobar = foobar + 10;
    printf("
 

 foobar  itself - the whole array, as opposed to one of its elements
- is not a number (even though all its elements    are ).  Adding 10
to an array is simply not a sensible or meaningful kind of thing to try
to do to a complete array. Similarly, if we give  printf()  a format
specification,  " i" , which signals to it to expect a further
argument of type  int , then it would be totally confusing to give
it a whole    array  of  int  values instead.

Are there    any  operations which it makes sense to carry out or use
on a complete array, as opposed to individual elements of it? Well, there
are, but they are relatively few.  The only one we will have immediate
use for is in passing a complete array to a function - somewhat as was
attempted above with  printf() . But whereas  printf()  has not
been designed to be capable of accepting a whole array as a single
argument, it is perfectly possible, and useful, to write your    own 
functions which can accept such array arguments. This
  possibility is not pursued in detail by Alcock at this stage, but is
  raised implicitly in Exercise 4 of the chapter. 

In the example given above, the elements of  foobar  are simply of
type  int .  But the elements of an array can, themselves, be arrays
- and so on.  This allows the creation of data objects which can be
thought of as multi-dimensional arrays:

 
    int two_dim_foobar 20  30 ;
 

This makes  foobar  an array with 20 elements, where each of these
elements is, in turn, an array with 30 elements, and each of these is a
simple variable of type  int . The elements can then be accessed as
you would expect, but now needing two indices to identify a particular,
simple,  int  element:

 
  two_dim_foobar 10  5  = foobar 6  / foobar 7 ;
 

Multidimensional arrays turn out to be useful in many engineering
applications.  For example, if I am writing a program to plot points on
a graph, I could conveniently set up a two dimensional array to record
which points are marked, and which left blank. Similarly, Alcock's
  MATMUL   matmul.c 
program provides a business calculation example where two
dimensional arrays are useful.

Anyway: so far we have just looked at declaring array variables, and
accessing particular elements.  While this simplifies the declarations
somewhat, it is not yet clear that it addresses the original problem -
our desire to be able to easily and conveniently repeat a single
operation over a whole set of different variables, without having to
code a separate statement for each one. Thus, we can now refer to
array elements  foobar 0   through  foobar 99  , instead of, say,
to the completely separate variables named  foobar0 ,  foobar1 
and so on down to  foobar99 . But how does this make it easier to
repeat operations over these elements?

The key notion here is that the index of an array is not limited to
being just a particular, literal, number, such as in  foobar 7  .
The index is technically allowed to be an arbitrary    expression .
Thus we could refer to  foobar 2+20  , or  foobar (10+3)*2  .
This is still not terribly useful. But, in an expression, we can put
   variables .  Thus we can have a program fragment like:

 
    int foobar 100 ;
    int index;
    .
    .
    .
    index = 6;
    .
    .
    .
    printf("
 

This will have the effect of printing element 6 of  foobar . What
advantage does this have over simply writing  printf(" i",
foobar 6 ) ? Not much yet, but now consider    this  code fragment:

 
    int foobar 100 ;
    int index;
    .
    .
    .
    index = 6;
    printf("
    index = index + 1;
    printf("
 

This has the effect of printing elements  6  and  7  of
 foobar . Still, so what? The so what is that, if you examine this
fragment carefully, you will see that the two  printf()  statements,
even though they print out two    different  elements of  foobar ,
are actually textually    identical  statements.  Not just "almost"
the same, but strictly identical.  Their different    effects , when
they are executed, arise because the effect depends on the value of the
variable  index  at the time of execution - and that actually
changes between the two statements in this example.

What's so significant about the two  printf()  statements being
absolutely identical? Well, because of this    absolute  identity, it
is not necessary to actually repeat them in the   source file at all:
the two separate invocations of this statement can be collapsed down as
the substatement of a  for  statement as follows:

 
    for (index = 6; index <= 7; index++)
      printf("
 

Granted, it is not terrible exciting just to replace a single
duplication of a statement.  But it becomes terribly useful if we have a
large number of duplications.  For example, suppose we want to print out
   all  the elements of  foobar  rather than just elements  6 
and  7 ? Instead of writing something like:

 
    printf("
    printf("
    .
    .
    .
    printf("
 

with 100 separate statements, each differing only ever so slightly from
the others, we can have a single  for  statement:

 
    for (index = 0; index <= 99; index++)
      printf("
 

Now    that's  a real benefit.

Make sure you understand that this kind of collapsing down, using an
iteration statement like  for , only works if there is absolutely no
variation in the    text  of the statement being repeated - although,
of course, there can be a large variation in the    effect  of that
text each time it is executed. Go back over the example above, and satisfy
yourself that you cannot achieve the same effect - collapsing down to a
single  for  statement - if the program were using 100 separate
variables with separate names instead of an array. Note that the
  names of separate variables    must  be textually different, even if
  only "slightly" so. 

Note also that, although I have built up this example using the specific
idea that the thing we want to iterate or repeat is the printing out of
an element of the array, the concept and argument does not rely in any
way on just    what  it is we want to repeat.  Exactly similar
considerations would have applied if we wanted to increment each
element, or take its square root, or add it into a cumulative total, or
whatever. Alcock's two example programs
  MATMUL   matmul.c  and
  BUBBLE   bubble.c  provide, of course, more
comprehensive examples of the same ideas.  It is a good idea to at least
attempt to see how you might try implementing either of these programs
if you did not have the possibility of using arrays open to you, but
were forced, instead, to simply declare as many separate, simple,
variables as are necessary. You should find that, in that case, you
cannot use  for  statements to repeat the required operations over
different variables, and you would therefore be forced to "manually"
duplicate blocks of   code many times over, and then edit each copy to
deal with slightly different variable(s).  The whole thing becomes
incredibly cumbersome!

 The Dreaded Array Indexing Bugs 

It is vitally important in   programming to ensure that you never use
an array index which is out of range - for example, if  foobar  is
an array declared as having 100 elements, then referring to
 foobar 100  , or  foobar -6   would be indexing errors.  For
somewhat obscure and technical reasons, the   compiler cannot detect
such errors, nor, in general, can they be automatically detected at run
time. But the effects of such errors tend to be highly variable, very
strange, and potentially very confusing.  The nett result is that array
indexing errors are among the most difficult to find and correct in  
programming.  It is very important, therefore, to get into the habit
very early on of being very careful indeed about array indices.

This is straightforward enough with simple numeric indices, such as
 foobar 10   and so on. Although, for beginning programmers it is
still very common to make errors "at the margin" - i.e. to forget that
the valid indices start at zero instead of 1, or make some related
mistake. So even with a simple indexing expression such as
 foobar j   it is important to carefully examine how the value of
 j  is being generated or manipulated to make sure that it cannot go
out of range.

But what about more complicated index expressions, such as in:

 
    foobar (x + y) * z  = w + 1;
 

If you put something like this in a program, you need to very carefully
examine what the values of  x ,  y  and  z  can possibly be,
and try to guarantee that there are absolutely    no  circumstances in
which the index expression might yield a value which is outside the
bounds of the particular array. In practice this kind of analysis is
difficult and error prone.  It is better to adopt a style of   
defensive programming  where you deliberately try to make the program
itself warn you if something is going wrong.  Thus, the statement above
might be replaced with the following:

 
  index = (x + y) * z;
  if( (index < 0) || (index >= 100)
   
    printf("  indexing bug - Controlled Crash!!! );
    exit();
   
  foobar index  = w + 1;
 

The operator  ||  is called    logical OR  and will evaluate as
 TRUE  (non-zero) if either of its operands is  TRUE .  The
function  exit()  is a library function which causes the program to
immediately and unconditionally terminate. The prototype for
  the  exit()  function is in the header file  stdlib.h , so you
  should   include  this header at the top of your file if you
  intend to use the  exit()  function. 

So the effect of this code is that the index is first calculated and
stored in the variable  index . Then this value is compared with the
valid limits.  If it turns out, for whatever reason, to be invalid, a
warning message is printed and the program is terminated. Of course, if
the index is valid, then execution simply continues around the  if 
statement, no message is printed, the program is not terminated, and the
index is used to actually index into the array.

At first sight this defensive programming may seem like a lot of
additional effort. But to repeat: array indexing bugs    do  happen;
and if you have not adopted this kind of defensive programming strategy,
they are    extremely  difficult to track down and isolate.  Thus the
small additional effort of the defensive checks is usually far
outweighed by the reduction in time (not to mention sweat and tears)
required to debug the program. Of course, some judgement will always be
required as to whether to add extra checking code - but if in doubt, it
is always safer to add it.  There is also a question over what to do if
an indexing error    is  detected: the example above shows just about
the crudest possible reaction.  There are much more sophisticated
possible approaches to reacting to unexpected events in a program (so
called "exception handling").  By all means, feel free to experiment
with formulating your own approaches to this if you think you can
improve on the very crude mechanism shown above. In particular,
  you might have a look at the standard library facilities for "process
  control" described in pagese 171 176 of    Illustrating C 
  - though this is not for the faint hearted! 

 Chapter 2 pp. 20 26 

 What's a function anyway? 

In   a    function  is a unit of code that can be used, or called,
or invoked, from some other place in the program.  Better names for this
idea, used in other languages, are "subprogram", "subroutine", or
"procedure". But, for reasons which, no doubt, seemed sensible at the
time, the designers of   plumped for the word "function" and I am
afraid we are stuck with it!

Functions are handy because they allow you to break a program down into
manageable chunks, where each chunk is quite small (typically 10 20
lines) and does just one or two definite things. Thus each chunk is
pretty readable or understandable in its own right.  If a single
function starts getting too complicated, then we chop some coherent
piece of it out and wrap it up in a function of its own, which now is
merely "called" by the original function.

Functions are also handy because they support "re-usability" in a
program.  Thus, if there is a particular generic kind of manipulation
which has to be done in two or three, or 20 different situations in a
program, we can write a function to do this manipulation; then each time
we need that manipulation, we call the function, instead of repeating
all the detailed code.

In fact, we have been using functions all along already, without really
noticing it.  The block of code in our programs with the following
outline:

 
    int main(void)
     
    .
    .
    .
     
 
is technically the definition of a    function  whose name is
 main() . In general when I refer to the name of a function
  I include a pair of parentheses after the name, as in
   main() . I adopt this convention so that, whenever I use a name,
  you can easily tell whether it is intended as the name of a function
  or of a variable. In fact, the compiler works much the same way: when
  it sees a name in an expression, it looks to see whether the name is
  followed by a left parenthesis, and, if so, it knows that the name is
  supposed to refer to some function. 

All   functions have names, just like variables.  When you are
defining your own functions, you get to pick the names, again just as
with variables.

But the function name  main()  is special in this regard.    
Every    program is required to have a function called  main() .
Normally functions get invoked or executed by some    other  function
calling them, or transferring control to them.  But if this were true of
   all  functions then we would have an infinite regress, with no way
for    any  function to get started in the first place.  What is
special about the function called  main()  is that it does not have
to be called or started up by any prior function - it is started up
automatically as the first function to be executed when the whole
program is started executing. In fact, it will normally cause    very
bad things  to happen if  main()     is  actually called by any
other function...

OK, so, in a sort of a way we have already seen what a function   
definition  looks like, in the form of the definition of the
 main()  function.  We have also seen what function    calls  or
   invocations  look like, because we have used some of the so-called
   standard library functions , such as  printf() ,
 scanf() ,  sin() ,  cos() , and so on.  So calling a
function basically involves just putting its    name  into a statement
(or, more precisely, into an    expression ).  When (or if) the
statement gets executed, then that will cause the function to be
invoked, and control will pass, temporarily, into the called function;
that function will do its thing, and then control will pass back to
where the function was called (the so-called "calling site").

Thus, consider this simple program:
 
     include <stdio.h>

    int main(void)
     
      printf("Help: I'm trapped in the computer!!! );

      return(0);
     
 
When the program is started, execution will start at the  main() 
function.  This is why a function called  main()  must be present:
otherwise the computer has no way of knowing where in our program
execution    should  start.  This may not seem like much of a problem
at the moment: after all, our program only has one function, so that is
"obviously" where execution should start - regardless of what the
function is called.  But very soon we will see programs where the
program consists of more than one function and some ambiguity would then
necessarily arise. Even still you might think the computer could simply
start execution at the first function that is defined (or the last for
that matter) rather that looking for a function of a particular name.
Indeed, with many other programming languages that    is  the approach
adopted.  But, again for reasons best known to the original   language
designers, they decided to take the approach of insisting that a
function with the specific name  main()  must always be present, and
that that is where execution will start...

Anyway: in the example above execution starts at the function
 main() .  The very first statement of this function is then a call,
or invocation, of the standard library function called  printf() .
So execution is transferred to that function.  In the meantime, the
 main()  function is essentially put into suspended animation. So
 printf()  is executed, which, in this case, means that the string
 "Help: I'm trapped in the computer!!!    n" 
will be printed out on the
screen. Once  printf()  has done that, it "terminates", or, more
technically, executes a  return  statement, which returns control to
the calling site - in this case, the function  main() . So now,
 main()  becomes re-animated, and continues execution with its next
statement.  This is a  return  statement which causes  main() 
itself to terminate. Now, as just explained, when a function executes a
 return  this normally causes control to return to wherever the
function was called    from ; but this is a bit tricky with
 main()  because, technically,  main()  was not called from   
anywhere  (or not, at least, from any other    function ).  In
practice, when the  main()  function does a  return  then the
whole program is terminated, and control is returned to the "external
environment" - in our case the Turbo-C++ IDE.  This makes a sort of
sense since  main()   was the first, or "top-level" function to be
invoked, and in this contrived way, its "calling site" simply    is 
the "external environment"...

So: be sure you have the basic idea of a function as a unit or chunk of
code that can be "invoked".  With this facility at our disposal, a
complete program need no longer consist of just one function, called
 main()  (together with whatever standard library functions
 main()  calls).  Instead, once  main()  starts to get a bit
long or complicated, we break down the work that  main()  is doing,
and the statements that constitute it,
into some set of more or less coherent sub-blocks.  Each of these is
then hived off into a "function" of its own.  We will give each such
function some name; this should be as meaningfull as possible, but we
have essentially complete freedom in this (as we have in naming
variables).  Whereas the name of the top-level function is constrained
to be precisely  main() , the functions called by  main()  can
have any names we like.  These additional functions which we name and
write are called    user-defined  functions to distinguish them from
standard library functions.

In the simplest case then, the outline of a   program now looks
something like this:

 
    void function1(void)
     
      /* Statements making up function1() appear
         here... */
         .
         .
         .

      return;
     

    void function2(void)
     
      /* Statements making up function2() appear
         here... */
         .
         .
         .

      return;
     

    void function3(void)
     
      /* Statements making up function3() appear
         here... */
         .
         .
         .

      return;
     

    int main(void)
     
      function1();
      function2();
      function3();

      return 0;
     
 
This program consists of the definitions of four functions called
(rather unimaginatively!)  function1() ,  function2() ,
 function3()  and  main() . When the program is started up,
execution will begin at  main()  (even though  main()  is
actually the    last  function to be defined!); execution immediately
transfers to  function1() , and  main()  is suspended; when
control is  return 'ed from  function1()  to  main() , then
 main()  goes on to call  function2()  and is again suspended
while  function2  is executed; similarly, when  function2() 
executes its  return  statement control comes back to  main() ,
and then  function3()  is invoked; finally,  function3() 
executes its  return , control comes back to  main() , and
 main()  itself executes a  return , causing the complete
program to terminate, and control comes back to the IDE.

Be clear that you understand how this function call/return mechanism
allows  main()  to be broken down into smaller, hopefully simpler,
chunks. In this, admittedly contrived, example, the operation of
 main()  itself has actually become trivial - it just calls the
functions  function1() ,  function2()  and  function3()  in
sequence. All the complicated stuff that might have been in  main() 
has been split up, and moved into the "sub-" functions.

Of course, this process of breaking things down can be repeated
indefinitely.  Thus, in the example above, if the function
 function1()  turned out to be still too complicated it might be
broken down into smaller pieces still:

 
    void function1a(void)
     
      /* Statements making up function1a() appear
         here... */
         .
         .
         .

      return;
     

    void function1b(void)
     
      /* Statements making up function1b() appear
         here... */
         .
         .
         .

      return;
     

    void function1(void)
     
      function1a();
      function1b();

      return;
     
 
And, of course, it might turn out that any of these functions can
conveniently be used (or called) multiple times, possibly from multiple
other functions.  In the outline above, it might turn out that
 function1b() , say, could usefully be called both by
 function1()  and by  function3() , for example.  Indeed, if we
do a good job in designing the program - in deciding just exactly how
"best" to break it down - we may be able to achieve quite a high level
of such re-usability!

Note that in laying out the outline example program above, I have chosen
to place the function definitions is essentially "reverse" order - if
 function1()  would call  function1a()  at run-time, then I have
placed the definition of  function1a()     before  the definition
of  function1()  in the   source file.  This ordering is not
absolutely required, but it is an excellent rule of thumb - and you
should stick to it unless and until you have some very good reason for
diverging from it.  But be clear on this point: the order in which the
function definitions appear in the source file has no effect whatsoever
on the order in which the functions will actually be invoked.  The
function called  main()  will    always  be the first function
invoked, regardless of whether it is positioned at the top, the middle,
or the bottom, of the source file! And the next function to be
invoked will depend exclusively on what statement in  main()  causes
a function to be invoked; and whatever function that actually is, it
will be invoked next regardless of whether it is located in the source
file before or after the definition of the function  main() .  The
ordering of the functions in the source file affects how the   
compiler  translates the file - and it is to facilitate the compiler
that I give you the rule of thumb of ordering the definitions in reverse
order to the order in which they are invoked.  Ordering them in this way
ensures that the compiler never has to attempt to translate a call or
   invocation  of a function without already having translated the
   definition  of the function - and it turns out that this makes the
compiler's job easier, and can avoid a variety of compile time problems
and errors.  But regardless of whether this ordering is adopted or not, it
will not affect the order of function    execution  at run-time!

 Moving Data Around 

All right: so far the concern here has been with the    concept  of a
  function, and an outline of how they can be defined and used.  Now
we need to get down to some more nitty gritty specifics!

In general, if we want to decompose a program into a set of functions
then, for the functions to co-operate or interact properly they will
need to, in some sense "share" data between them.  By "data" here I
essentially mean variables.  How can the required data sharing be
achieved?

  supports two quite different models for sharing or exchanging data
between functions.  We will consider them in turn.

 Thinking Global... 

In this model the data (the variables) are made
"global": global variables are declared before, and outside of, the
first function definition in the source file.  Global variables are
visible and accessible to    all  functions.  Thus, if one function
wants to call another sub-function I use the term
  "sub-function" loosely in this kind of context to distinguish between
  one "calling" function and a second function which is "called".
  However, "sub-function" is not a technical term of the   language:
  to   all functions are "equal" - there is no kind of
  precise distinction between "functions" and "sub-functions". 
and have that sub-function carry out some
operation on some particular data values, the calling function can first
store the data values in some global variables, then call the
sub-function; the sub-function can then access the variables and do the
processing; if some "result" is generated, it can be made available to
the calling function by storing it in another global variable.

Have a look at the file  TRI-GLBL.C  tri-glbl.c .
This is a somewhat contrived example program, showing functional
decomposition and the use of global variables.  Examine the program and
try to understand how it is supposed to work. Note that, given my rule
of thumb of defining functions in reverse order, the sensible way to
read a   source file is to start at the    bottom  and work
backwards.  That is, you should generally start by looking at the
 main()  function.  Try to get some understanding of it in isolation
first. Then start looking at the functions which  main()  calls to
get some extra detail on what is going on. Then, if necessary (it does
not arise in this particular program) you can have a look at the
functions which are called by that function in turn, and so on,
effectively working your way back toward the top of the source file.

Play with  TRI-GLBL.C  tri-glbl.c .
In particular, experiment with single
stepping through it.  Try to predict where execution is going to go
next at each stage, and then check whether you are right.  Use the
 Watch  window to monitor the variables. Again, try to predict when
changes are going to happen, and see if you get it right. Notice how
execution transfers to a sub-function, then returns to the calling site.
Notice how the global variables are allowing information to be accessed
or shared between different functions.  Try varying the program
somewhat: e.g.  to calculate the area of a rectangle, or square; or to
repeat the calculation for a number of times; or to initially print out
some kind of "welcome" message before doing any calculations; or extend
the program to be able to calculate areas of several different shapes -
so that it first has to ask for the kind of shape to be identified (e.g.
saying "Enter 1 for triangle, 2 for square:" or something like that),
and then read the appropriate data for the particular shape, and do the
appropriate calculation. Note that the function for printing
  out the result will not have to vary or be modified... 

Note that in  TRI  none of the (sub-)functions actually have
a  return  statement as such.  A feature of   functions is that if
execution reaches the end of the function definition - i.e.  the brace
which closes the definition - then the function will be terminated
anyway and control will automatically return to the calling site, just as
if a  return  statement had been executed.  Thus, if you have a
simple  return  statement as the very last statement in a function
definition it can actually be omitted - and usually is.  By a "simple"
 return  statement I mean one without an operand - unlike the
 return  statement in the  main()  function which has an operand
(namely  0 ). The significance of this operand will be explained
shortly: for now the point to note is that, if a  return  statement
does have an operand, then, even if it    is  the last statement of a
function definition, this  return  statement cannot be omitted.

 Thinking Local... 

OK, so much for using global variable for co-ordinating or sharing data
access between functions in a   function. What is the alternative?

The alternative mechanism arises if we choose to make the data -
the variables -    local  to individual functions.  Local
variables are declared immediately after the opening brace of a
function definition. Local variables are visible and accessible
   only  within the function in which they are defined. Indeed,
the storage for the variables is only allocated at all when the
function is invoked and is deallocated again when the function
terminates - so the variables only actually "exist" as long as
the function is active.

Now it follows directly from the "privacy" of local variables that they
cannot be used to share data between functions.  Therefore, if our
functions are using local variables, and if we need to share this data,
we need some mechanism for "passing" the data from one function to
another.  This generally arises just precisely when one function is
calling another (call the latter the sub-function).  Actually there are
then two sub-issues: getting data    from  the function to the sub-function,
and getting data (results)    back  to the calling site from the
sub-function.    provides separate mechanisms, with rather different
characteristics to deal with these two directions of data flow - from
calling site to called function, and from called function back to
calling site.

Note that in any particular case of one function calling
another we may need to transfer data in only one of the two directions,
so they really are quite separate. Indeed, as we have already seen with
the use of global variables above, we can have functional decomposition
of a program without using    either  of these data transfer
mechanisms. In designing a program - in breaking it up into functions -
each case has to be assessed individually.

 A (Short) Digression 

We will look at the separate data transfer mechanisms in turn,
below; but first: a digression.  A common confusion among
students is between the notion of moving data around    inside 
a program - between the functions making up a program - and
moving data between the program and the outside world (screen,
keyboard, diskette files etc.).  Granted these two things are
both involved with moving data around. And granted, it turns out
that to do the latter (move data into or out of the program) we
also have to do the former (move data around    within  the
program - specifically between our own user-defined functions and
those standard library functions, such as  printf()  and
 scanf() , which actual do the direct exchange of data with
the outside world). But still, the two concepts are quite
different and should be kept distinct. We very frequently do the
former without the latter (i.e.  move data around    within 
the program without exchanging data with the outside world). I
will use (and reserve) the terms "input" and "output", and also
"reading" and "writing", with the technical connotation of moving
data between the program and the outside world.  When dealing
with data movement    within  the program I will use an
alternative set of terms, to be introduced below.  I encourage
you to adopt this more precise vocabulary also.

 Getting data    into  a function 

All right: let's consider moving data from a function to a sub-function -
from a calling site to a called function.  We have already seen examples
of this in using the standard library functions: in the function call we
simply list the data to be passed in, separated by commas if necessary,
and within brackets, thus:

 
    printf("Hi there...");
 

Here the function  printf()  is being called, and the string
 "Hi there..."  is being passed in. Similarly, with:

 
    x = sin(y);
 

In this case the function  sin()  is being called, and the value of
the variable  y  is being passed in to it.  Again, with:

 
    printf("The answer is 
 

The function  printf()  is being called, but now two separate data
items are being passed in - the string  "The answer is  i" , and
the value of the variable  result .  These two are separated by a
comma.

These values passed in to a function are called    arguments  to the
function.  They are, if you like, the values which the function is
supposed to operate on or process in some way. And, as already noted, a
function may not need any arguments - if, for that particular function,
what it is supposed to do does not need or rely on any data values to be
processed.

In general, function arguments are actually    expressions .  That is,
any argument can involve a more or less complicated expression.  In that
case, the expression is    evaluated , to yield a resulting value, and
that value is what is actually passed into the function as the argument
value.  Here is an example:

 
    printf("
 

In this case the second argument is actually the expression
 ((x * x) + (y * y))/(z + 3.141) .  When execution reaches
this invocation of  printf() , the expression is evaluated
first, and the resulting value is what actually gets passed in to
 printf() .  Note that  printf()  itself neither knows
nor cares how this value was generated -    all  it sees is the
resulting value.  This is what is described as "passing by
value"; it is discussed by Alcock on page 21, and is specifically
described by him as "fundamental to the   language". Be clear
about this particular point: since   only allows values to be
passed as arguments, nothing that a called function does with an
argument given to it can affect or change the original source of
the argument. Consider an example. Suppose there is a function
called  foobar()  which expects to be given an argument of
type  int .  Then a fragment of code where  foobar()  is
called might look like this:

 
    int main(void)
     
      int i, j, k;
      .
      .
      .
      foobar((k + i) * 15);
      foobar(j);
      .
      .
      .
     
 
In both calls to  foobar()  the argument is technically an
expression and is evaluated before  foobar()  is actually
started up.  This is clear enough with the argument  (k + i)
* 15 , but it is true also even in the trivial case of the argument
being simply  j .  In the latter case what happens is   
not  that the    variable   j  gets passed to
 foobar() , but only that the current    value  of  j 
is passed in.  The significance is that  foobar()  absolutely
cannot alter or modify the    variable   j ; in fact,
 foobar()  doesn't even know of the existence of such a
variable!    All   foobar()  knows is the value it is given
- it does not know whether that value resulted simply from taking
the value of a single variable, or from evaluating some
arbitrarily more complicated expression.

Some languages    do  support a mechanism whereby whole
variables    can , in effect, be passed as arguments.  In that
case, the called function can be given access to a variable
actually belonging to the calling site (not just a copy of its
current value); and the called function can, if it wishes,   
change  the value of that variable, and that change will be
visible back at the calling site when control returns there.
Such a facility is a called "passing by reference" - because, in
effect, what is passed is a reference or handle whereby the
called function can get at the original variable, rather than
just a copy of its value.  BUT: the   language does    not 
support passing by reference - it    only  supports passing by
value. A technical exception here is in the case of   
  array  arguments.   has some special rules for what happens
  when an array name is used in an expression (without any index)
  - such as when an array is specified as an argument.  The nett
  effect is that arrays "sort of" get passed by reference: but
  the details of this are beyond the scope of the current
  discussion. 
Fortunately, it turns out  that this is not a significant
limitation because the effect of passing by reference can be
"simulated" using passing by value, in combination with certain
other features of the   language. However (as should be clear)
this issue only becomes important in the context of trying to get
data    back  from a called function to the calling site - so
it will be considered again below.

As noted with  printf()  above, a function can accept a number of
arguments.  In general, functions are designed so that they expect some
definite, unique, number of arguments, be it 0, 1, 10, or whatever.  In
that case the function    must  be called with just that number of
arguments - no more, no less. Both  printf()  and
   scanf()  are exceptions to this rule - they can deal with
  varying numbers of arguments.  But this is highly exceptional:
  most of the library functions we deal with will only accept
  fixed numbers of arguments; and    all  of the user defined
  functions covered in this course will only accept fixed numbers
  of arguments. 

Function calls are actually even more restrictive than this
in general.  Not only must the call provide precisely the correct
number of arguments, but they must be of the correct    data
types .  Remember: the data type defines the "kind" of
information represented by a data value or a variable - such as
 int ,  double  etc.  So if a function has been designed
to accept two arguments, the first of type  int , the second
of type  float , then every call or invocation of that
function must include precisely two arguments - no more, no less
- and they must be of precisely those types. Again,
   printf()  and  scanf()  are exceptions to this general
  rule; but also again, it is beyond the scope of this course to
  consider this very exceptional behaviour in any further
  detail! 

The compiler will usually try to enforce these rules - i.e. it
will try to check every invocation of a function to see that the
arguments match the design or specification of the function in
both number and type(s).  But the compiler can obviously only do
this correctly if it    knows  how many arguments, of what
types, are expected. In the case of user-defined functions it
   will  know these things if it has processed the function
   definition  before it sees any invocation of it.  This
condition will be satisfied if the program file is laid out as
suggested earlier - in "reverse" order - which, of course, was
precisely the reason for suggesting that ordering.

The situation with library functions is a little
more complicated.  The whole idea of library functions is that
they have been pre-written and compiled ahead of time, so, by
definition, their    definitions  will not be available to the
compiler.  But although the full definitions of the library
functions are not available, short "interface specifications"
   are  provided.  These interface specifications tell the
compiler, for each function, exactly what arguments are expected,
and of what types.  Such a specification, for one function, is
called a    function prototype .  Function prototypes for the
standard library functions are provided in the standard   
header files  - files like  stdio.h  and so on.  So: if you
want to use or invoke any standard library function in your
program, you must ensure that an appropriate header file is first
scanned by the compiler, so that the compiler will then be able
to check that the invocation is legal.  This is done with the
  include  directive.  If you are ever in doubt as to which
header file to include to provide the prototype for a particular
library function, look up the function in the Turbo-C++ on-line
help system - the header file will always be listed
there. In some cases, the prototype for a particular
  function may be included in several different header files; in
  such a case you just have to insure that at least one of them
  is   include 'd in your file. 

Having said all that, be warned: the ability of the compiler to
detect mismatches between the arguments    expected  by a function
and the arguments actually    provided , is still less than
perfect. You should still take some care yourself in coding any
function call, and not rely completely on the compiler to check
these things for you!

 
   So the format of a function call or function invocation is
always    function  followed by a left bracket -    (  -
followed by zero, one, or more argument expressions, separated by
commas, followed by a right bracket -    ) .  The arguments must
match what is expected by the function in both number and types.
The compiler will try to check for this if - and only if - it has
earlier processed either a full definition of the function, or at
least a prototype, typically found in a   include 'd header
file. 
 

So much for the    invocation  of functions with arguments - but how
does this get represented in the function    definition ?

In all the function definitions given so far (including
definitions of  main() ), the functions did not accept any
arguments.  This was denoted by the keyword  void , placed
between the brackets after the function name:

 
  void foo(void)
       /*  ^^^^ This denotes that foo()
                will not accept *any*
                arguments! */
   
    /* Statements making up foo() go here... */
    .
    .
    .
   
 
Note carefully that, in this example, the keyword  void 
appears    before  the function name    also  - but in that
position it has quite a different significance, nothing to do
with the    arguments  which  foo()  accepts. We'll
consider this again later.

OK: that was the special case of a function which does not accept
any arguments.  Where a function    does  accept arguments,
this is denoted or coded by replacing the keyword  void  with
a list of names and types for the arguments: these are
technically called the    parameters  rather than the
arguments.  The difference is this: the    arguments  are the
values which get generated at the calling site; the   
parameters  are effectively a special kind of variable, belonging
to the called function, into which the argument values are copied
when the function is invoked.  If you like, all the calling site
can see are the arguments (or argument expressions) - it cannot
see the parameters as such; and all the called function can see
are the parameters (and their values) - it cannot see the
original expressions, or arguments, from which they were derived.
In this sense, arguments and parameters are like two views of the
same thing: but the calling site only has the argument view, and
the called function only has the parameter view.

The format for a parameter list, in a function definition is
something like this:

 
    void foobar(int max, int min, double ecstasy)
 

This heading part of the function definition says that
 foobar()  will accept exactly three arguments.
 foobar()  will refer to these as  max ,  min  and
 ecstasy .  The first two must be of type  int , the
third of type  double .  Whenever  foobar()  is invoked,
the arguments at the calling site will be evaluated; parameters
called  max ,  min  and  ecstasy  will be created;
and the argument values will be copied into them; then execution
will proceed with the statements in the body of  foobar .
The parameters are then effectively a special kind of local
variable belonging to  foobar .  Within  foobar()  they
can be used just like any normal variable.  In particular, their
values can be referenced, as one would normally reference the
value of a variable - by using its name in an expression.
Thus, a fragment of  foobar()  might be the following:

 
    void foobar(int max, int min, double ecstasy)
     
      .
      .
      .
      if (ecstasy < 0.0) printf("  dear...");
      else
        if (ecstasy > 100.0) printf("  don't believe it!");
      .
      .
      .
   
 
Since parameters are a form of local variable you can, of course,
   change  their values: but by doing so you overwrite the
original value - the argument - passed in from the calling site.
So this should only be done with care! Note also that, when the
function terminates, the parameters, just like all other local
variables, are disposed of, and their values lost.  Changes to a
parameter within a called function, just like changes to a local
variable, have absolutely no effect on anything at the calling
site.

 Getting data back    out  of a function 

All right: that's passing data    into  a function, both from the
point of view of the calling site (argument expressions) and
from the point of view of defining the called function (parameter
variables). Now we turn to the complementary question of getting
information    back  from the called function to the calling
site.

The simplest, and arguably most convenient, mechanism provided
for this in   is the use of so-called "return" values.
Essentially, a function can terminate using a form of the
 return  statement which specifies a value (it actually has a
complete expression in general: but this will be evaluated to
yield some particular, single, value, of some particular type).
If a function terminates in this way, the specified value is
retained and made available back at the calling site.  In effect,
wherever the function name appeared at the calling site, its
name gets replaced by the returned value, and execution
then continues at the calling site.

Consider this simple example, using the library function
 sin() :

 
    x = sin(0.5);
 

In this case the function  sin()  is invoked (incidentally
being passed one argument, the  double  value  0.5 ).
 sin()  is executed, and when it terminates, it does so using
a  return  statement.  That  return  statement specifies
a return value.  Back at the calling site, the name of the
function is effectively replaced by this value, and execution
continues, so that this value is then assigned to the variable
called  x . When I say that the name of the function
  is "replaced" by the returned value, I do not, of course, mean
  literally that the text of your program is altered. Rather I
  mean that the way to understand what happens next at execution
  time is to imagine the return value appearing in the place
  occupied by the function name. 

That was a simple example.  Here is a somewhat more complicated
one:

 
    printf("Answer: 
 

When this is executed, what happens? Well,  printf()  cannot
be started before its arguments have been evaluated.  To evaluate
the second argument,  sqrt()  (a function for extracting
square roots) must be invoked - but again, that cannot happen
until    its  argument has been evaluated.  So the calls to the
 sin()  and  cos()  functions have to happen first. It
does not matter which is done first - lets say it is  sin() .
Its argument is evaluated - simple, it is just the value of the
variable  x .  This is passed in and  sin()  gets
executed.  In due course it terminates with a  return 
statement, specifying a return value.  This is retained in
temporary storage at the calling site. The argument for
 cos()  is then evaluated - again simple, just the value of
 y . When  cos()  terminates, again with a  return 
statement, its return value comes back to the calling site and
gets added to the value returned earlier from  sin() .  This
is now the argument value for  sqrt()  so  sqrt()  is
started up, with the value passed in to it.  Again, in due
course,  sqrt()  terminates with a  return  statement,
and specifies a return value.  This value is multiplied by
 2 , and the resulting value is the second argument for
 printf() , so  printf()  can now finally be invoked. It
will do its thing (printing out the text  Answer:  followed
by the value of its second argument, regarded - correctly - as a
 double ),  and will then terminate.
 printf()  actually also ends with a  return  statement
which specifies a return value, and this will indeed be passed
back to the calling site (this return value signals whether
 printf()  executed without detecting any difficulties). In
this particular case, the return value is not used at the calling
site - it is not, for example, stored in a variable, or used in
some further expression evaluation. So it is simply discarded,
and execution proceeds to the next statement.

Phew!

That covers how return values are used at the calling site, and
we have also given the gist of how the return value is
implemented in the function definition - namely by using a
 return  statement with an appropriate operand expression.
But let's just detail this with another example:

 
    double silly_sum(double apples, double oranges)
     
      return (apples + oranges);
     
 
Note that the  void  that we were previously placing in front
of the function name, in the heading, has now been replaced by
 double .  This is a general rule.  The word that comes   
before  the function name in the heading specifies the    type 
of return value (if any) which the function will yield.  If the
function does not yield any value, then the return type must be
specified here as  void , as indeed, we have generally been
doing up to now.  But if the function    does  return a value,
then the type of that value must be advertised or notified ahead of
time, by specifying it here in the heading. So, in our case, just
from the heading, we know that the function  silly () 
must terminate using a  return  statement, with an operand
which is of type  double .  If there is a subsequent
inconsistency in the definition of  silly ()  - if it
does not contain any  return  statement, or it has a
 return  statement with no operand, or it has a  return 
statement with an operand which is not of type  double  -
then the compiler will complain bitterly.

As an aside, note that we have consistently been defining the
function  main()  as if it returns a value of type  int ,
and then actually ending it with a  return  statement
specifying a return value of  0 :

 
    int main(void)
     
      .
      .
      .
      return 0;
     
 
This is, to say the least, a trifle odd.  main()  is not
being called by any other function - it is automatically started
as the very start of program execution. We already mentioned
that, for that very reason, it would not make sense for the
parameter list of  main()  to be anything other than
 void .  Yet, notwithstanding that, we are defining
 main()  as if it makes sense for it to be passing a return
value back to some calling site. Why is this?

Well, this is yet another of those funny historical oddities of
   The essential idea is that the return value from
 main()  is passed back to the enclosing "environment" -
wherever the program was started up from.  In our case, this is
the Turbo-C++ IDE. The general idea is that this return value
could be used to signal whether program execution was
"successful" or not - with  0  signalling success, and
anything else indicating some kind of failure.  However, this
idea of signalling success or failure to the enclosing
environment doesn't actually serve any useful purpose in the case
of running programs under the Turbo-C++ IDE.  Using it under
other environments (where it might be useful) is beyond the scope
of this course.  In fact, for our purposes, it would be legal, and
preferable, to define  main()  as not returning any value at
all, and omit any  return  statement from it:

 
    void main(void)
     
      .
      .
      .
     
 
Unfortunately Alcock has insisted in showing  main()  as
yielding a return value of type  int , so, in the interests
of consistency with his presentation, I have stuck with that...

OK: time for another substantial example. Let's revisit the
triangle area calculating program, but now avoiding the use of
global variables, and moving data around with arguments and
return statements instead.  Have a look at
 TRI-LCL.C  tri-lcl.c . Compare it with
 TRI-GLBL.C  tri-glbl.c . Again experiment
with single
stepping through it.  Try to predict where execution is going to go
next at each stage, and then check whether you are right.  Use the
 Watch  window to monitor the variables (including the
various function parameters). Note that because
the variables are now all    local , they only exist at all when the
relevant function is active.  Furthermore, it is a characteristic
of the  Watch  window that only variables of the function
which currently contains the execution highlight bar are
  visible. Other functions may be "active" -
   main()  is active all the time, for example - but
  be temporarily suspended while a sub-function is executing. Their
  variables still exist, and will become visible again in the
   Watch  window as execution returns to them... 
Again, try to predict when changes are going to happen, and see
if you get it right. Notice how the parameters and return values
are allowing information to be moved around between the different
functions.  Again, try varying the program somewhat: e.g.  to
calculate the area of a rectangle, or square etc.

Now notice one particular difference between
 TRI-GLBL.C  tri-glbl.c  and
 TRI-LCL.C  tri-lcl.c .
In  TRI-GLBL.C  tri-glbl.c  I used a single function
( read () ) to prompt for and read in both the
dimensions of the triangle; whereas, in
 TRI-LCL.C  tri-lcl.c  I have broken this down into
two separate functions,  read ()  and  read () .
Stop for a moment, and see if you can figure out why I made this
change...

Well, if I had retained the original structure of
 TRI-GLBL.C  tri-glbl.c ,
but without using global variables, then the
function  read ()  would have to somehow return
   two  distinct pieces of information.  But that cannot be
done with the  return  mechanism in  OK: this is
  a little white lie.  It    can  be done, but only using the
  mechanisms of  struct  data types. This is beyond the scope
  of the current discussion, but have a browse in Chapter 8 of
     Illustrating C  if you want to explore this further... 
The  return  statement can only have one operand, and thus
only a single value can be passed back to the calling site.
Indeed, if you think about the way the returned value is handled
at the calling site (being inserted wherever the function name
appeared) it should be clear why this must be so: if more than
one value was returned then there would be no unambiguous way of
deciding what to do at the calling site.

So: I elected to re-organise the program to remove the need for a
function which would return more than one value - by replacing it
with a pair of functions, each of which individually only return
one value.

This works fine.  But it also gives me an excuse for mentioning
an alternative way for getting information back from a called
function to the calling site - a way which does not suffer the
limitation of the  return  mechanism, which can only handle a
single value.

First recall the earlier discussion of "passing by value" and
"passing by reference".  Clearly, if we had a mechanism for
passing by reference we could use this to get as many distinct
items of data as we like back from a function: we just pass ("by
reference") that number of variables    into  the function; the
function then stashes the relevant results in these variables;
and when the function terminates, and control returns to the
calling site, those results can be accessed - as many of them as
you like.

But, of course,   does not support passing by reference. So
what can we do?

The answer is that we can achieve the effect of passing by
reference, in a somewhat roundabout way: we can pass, as values,
and using the normal   passing by value mechanism, the   
addresses  of some variables.  Using these addresses, the
function can then, indirectly, get at the variables, and deposit
results in them - exactly as if the variables    had  been
passed by reference.

In fact, we have already been implicitly using this mechanism,
though without drawing too much attention to it.  It is the way
we have been getting information back from the standard library
function  scanf() :

 
  scanf("
 

In this case the additional arguments to  scanf()  are   
addresses  or    pointers  to variables.  We generate the
address of a variable by preceding its name with the "address-of"
operator, denoted with the ampersand character,   & .

Of course, this mechanism is used by  scanf()  precisely
because  scanf()  is    supposed  to be able to get more
than one item of information back to the calling site.  In fact,
a single call to  scanf()  can be used to pass an
indefinitely large number of separate items back, just according
to how the format specification string is laid out.

Can you use this mechanism in your own user-defined functions?

Yes - but it is quite messy.  To define a function of your own
which uses this mechanism you need to know two new things: how to
specify that the type of a parameter is an address or pointer;
and given an address or pointer, how to "de-reference" it -
access the thing pointed at.  The program
 TRI-LCL1.C  tri-lcl1.c  gives an example, if
you want to have a look - but we will not consider this in detail
here.

 Preferences? 

We have seen two radically different styles of getting
information into and out of functions: the use of global
variables and the use of parameters and return values.  Which is
best?

Unfortunately there is no simple answer. Both kinds of mechanism
are provided in   precisely because, in different circumstances
they each have their own advantages.

For people just starting out to learn to program the global
variable mechanism is conceptually easier to deal with, and
actually syntactically easier to use in programs.  Therefore I
recommend that you start out by using this mechanism, unless you
have some specific reason for doing otherwise.

However, quite quickly you will probably discover some of the
drawbacks of using global variables.  At that point it is worth
getting familiar with both parameter and return value mechanisms
and trying them out.

Once you are technically happy that you can use both kinds of
style then, with experience, you will get a feel for the
particular circumstances in which each is appropriate.  But:
until you are comfortable with judging this I recommend that you
actually try to avoid or minimise your use of global variables.
You will find that this requires you to think rather harder about
how to design your programs; and will also make the design and
coding somewhat more cumbersome at times.  However, it is a fact
that the use of global variables makes programs prone to a
variety of subtle problems and bugs - due to non-obvious and
un-intended interactions between functions, via changes in global
variables. Such bugs are particularly difficult to isolate and
correct. Therefore you will find that avoiding the use of global
variables - except where they have some overwhelming advantage to
recommend them - will save you a lot of program development time
in the long run.

 Recursion 

Alcock chooses to introduce recursion toward the end of
Chapter 2.  I'm not sure I agree with the need for this.
Recursion - functions which invoke themselves - is an interesting
programming concept, and it does permit very elegant solutions to
certain programming problems.  On the other hand, there is never
a situation where the same effect as recursion cannot be achieved
using conventional iteration mechanisms, such as  for 
statements.

In any case, all I will add to Alcock's discussion are the
following brief comments:

 
  When    any  function is invoked, storage is allocated for its
local variables (including parameters, if any).

  This storage will be deallocated when the function
terminates.

  Recursive functions are a special kind of function. This
may seem trite, but it follows that you need to understand the
   general  concept of a function reasonably well before you
try to understand the idea of a recursive function...

  If a function is invoked again while it is still already
active - i.e.  if a recursive invocation is made - then
additional storage is allocated to hold a new set of local
variables for this new invocation. Each invocation thus has an
entirely separate set of variables and parameters - even though,
within each invocation, these are referred to by the same names.
There is no confusion because the computer keeps careful track of
which invocation is being executed, and therefore which copies of
variables or parameters to refer to.

  At first sight it seems that recursive function invocation
must be fatal: surely the computer will be caught in an infinite
regression, invoking ever more copies of the same function, until
something horrible happens (like it runs out of storage to hold
any more variables) and the whole thing crashes? This certainly
   can  happen; but recursion can also be made benign.  It is
made benign by ensuring that, at each nested invocation,   
something  is changing (typically a parameter value); and at
some, finite, point, this will result in a copy of the function
which does    not  make a further, nested, recursive call to
itself, but instead terminates.  That then allows its caller to
terminate, and its caller in turn, so that the whole nest of
recursive calls can finally unwind, and return control to
wherever the very first instance of the function was called from.

  But recursion will only be benign and controlled in this
way if you, the software engineer, make it so.  Recursion is
intrinsically dangerous stuff - use it with extreme caution!

 
 Copyright 

This Hypermedia Document is copyrighted,   1994, 1995, by
 Barry McMullin 
 http://www.eeng.dcu.ie/ 7Emcmullin/home.html .

Permission is hereby granted to access, copy, or store this work, in
whole or in part, for purposes of individual private study only. The
work may    not  be accessed or copied, in whole or in part, for
commercial purposes, except with the prior written permission of the
author.