WebSVN – tendra.SVN – /branches/tendra4/doc/port/port3.html

<!-- Crown Copyright (c) 1998 -->
<HTML>
<HEAD>
<TITLE>TDF and Portability: Portability</TITLE>
</HEAD>
<BODY TEXT="#000000" BGCOLOR="#FFFFFF" LINK="#0000FF" VLINK="#400080" ALINK="#FF0000">
<A NAME=S2>
<H1>TDF and Portability</H1>
<H3>January 1998</H3>
<A HREF="port4.html"><IMG SRC="../images/next.gif" ALT="next section"></A>
<A HREF="port1.html"><IMG SRC="../images/prev.gif" ALT="previous section"></A>
<A HREF="port1.html"><IMG SRC="../images/top.gif" ALT="current document"></A>
<A HREF="../index.html"><IMG SRC="../images/home.gif" ALT="TenDRA home page">
</A>
<IMG SRC="../images/no_index.gif" ALT="document index"><P>
<HR>
<DL>
<DT><A HREF="#S3"><B>2.1</B> - Portable Programs</A><DD>
<DL>
<DT><A HREF="#S4"><B>2.1.1</B> - Definitions and Preliminary Discussion</A><DD>
<DT><A HREF="#S5"><B>2.1.2</B> - Separation and Combination of Code</A><DD>
<DT><A HREF="#S6"><B>2.1.3</B> - Application Programming Interfaces</A><DD>
<DT><A HREF="#S7"><B>2.1.4</B> - Compilation Phases</A><DD>
</DL>
<DT><A HREF="#S8"><B>2.2</B> - Portability Problems</A><DD>
<DL>
<DT><A HREF="#S9"><B>2.2.1</B> - Programming Problems</A><DD>
<DT><A HREF="#S10"><B>2.2.2</B> - Code Transformation Problems</A><DD>
<DT><A HREF="#S11"><B>2.2.3</B> - Code Combination Problems</A><DD>
<DT><A HREF="#S12"><B>2.2.4</B> - API Problems</A><DD>
<DL>
<DT><A HREF="#S13"><B>2.2.4.1</B> - API Checking</A><DD>
<DT><A HREF="#S14"><B>2.2.4.2</B> - API Implementation Errors</A><DD>
<DT><A HREF="#S15"><B>2.2.4.3</B> - System Header Problems</A><DD>
<DT><A HREF="#S16"><B>2.2.4.4</B> - System Library Problems</A><DD>
</DL>
</DL>
<DT><A HREF="#S17"><B>2.3</B> - APIs and Portability</A><DD>
<DL>
<DT><A HREF="#S18"><B>2.3.1</B> - Target Dependent Code</A><DD>
<DT><A HREF="#S19"><B>2.3.2</B> - Making APIs Explicit</A><DD>
<DT><A HREF="#S20"><B>2.3.3</B> - Choosing an API</A><DD>
<DT><A HREF="#S21"><B>2.3.4</B> - Alternative Program Versions</A><DD>
</DL>
</DL>
<HR>

<H1>2.  Portability</H1>
We start by examining some of the problems involved in the writing
of portable programs. Although the discussion is very general, and
makes no mention of TDF, many of the ideas introduced are of importance
in the second half of the paper, which deals with TDF.<P>
<A NAME=S3>
<HR><H2>2.1.  Portable Programs</H2>
<A NAME=S4>
<H3>2.1.1.  Definitions and Preliminary Discussion</H3>
Let us firstly say what we mean by a portable program. A program is
portable to a number of machines if it can be compiled to give the
same functionality on all those machines. Note that this does not
mean that exactly the same source code is used on all the machines.
One could envisage a program written in, say, 68020 assembly code
for a certain machine which has been translated into 80386 assembly
code for some other machine to give a program with exactly equivalent
functionality. This would, under our definition, be a program which
is portable to these two machines. At the other end of the scale,
the C program:<P>
<PRE>
        #include &lt;stdio.h&gt;
        
        int main ()
        {
            fputs ( &quot;Hello world\n&quot;, stdout ) ;
            return ( 0 ) ;
        }
</PRE>
which prints the message, &quot;Hello world&quot;, onto the standard
output stream, will be portable to a vast range of machines without
any need for rewriting. Most of the portable programs we shall be
considering fall closer to the latter end of the spectrum - they will
largely consist of target independent source with small sections of
target dependent source for those constructs for which target independent
expression is either impossible or of inadequate efficiency.<P>
Note that we are defining portability in terms of a set of target
machines and not as some universal property. The act of modifying
an existing program to make it portable to a new target machine is
called porting. Clearly in the examples above, porting the first program
would be a highly complex task involving almost an entire rewrite,
whereas in the second case it should be trivial.<P>
<A NAME=S5>
<H3>2.1.2.  Separation and Combination of Code</H3>
So why is the second example above more portable (in the sense of
more easily ported to a new machine) than the first? The first, obvious,
point to be made is that it is written in a high-level language, C,
rather than the low-level languages, 68020 and 80386 assembly codes,
used in the first example. By using a high-level language we have
abstracted out the details of the processor to be used and expressed
the program in an architecture neutral form. It is one of the jobs
of the compiler on the target machine to transform this high-level
representation into the appropriate machine dependent low-level representation.
<P>
The second point is that the second example program is not in itself
complete. The objects <CODE>fputs</CODE> and <CODE>stdout</CODE>,
representing the procedure to output a string and the standard output
stream respectively, are left undefined. Instead the header <CODE>stdio.h</CODE>
is included on the understanding that it contains the specification
of these objects.<P>
A version of this file is to be found on each target machine. On a
particular machine it might contain something like:<P>
<PRE>
        typedef struct {
            int __cnt ;
            unsigned char *__ptr ;
            unsigned char *__base ;
            short __flag ;
            char __file ;
        } FILE ;
        
        extern FILE __iob [60] ;
        #define stdout ( &amp;__iob [1] )
        
        extern int fputs ( const char *, FILE * ) ;
</PRE>
meaning that the type <CODE>FILE</CODE> is defined by the given structure,
<CODE>__iob</CODE> is an external array of 60 <CODE>FILE</CODE>'s,
<CODE>stdout</CODE> is a pointer to the second element of this array,
and that <CODE>fputs</CODE> is an external procedure which takes a
<CODE>const char *</CODE> and a <CODE>FILE *</CODE> and returns an
<CODE>int</CODE>. On a different machine, the details may be different
(exactly what we can, or cannot, assume is the same on all target
machines is discussed below).<P>
These details are fed into the program by the pre-processing phase
of the compiler. (The various compilation phases are discussed in
more detail later - see Fig. 1.) This is a simple, preliminary textual
substitution. It provides the definitions of the type <CODE>FILE</CODE>
and the value <CODE>stdout</CODE> (in terms of <CODE>__iob</CODE>),
but still leaves the precise definitions of <CODE>__iob</CODE> and
<CODE>fputs</CODE> still unresolved (although we do know their types).
The definitions of these values are not provided until the final phase
of the compilation - linking - where they are linked in from the precompiled
system libraries.<P>
Note that, even after the pre-processing phase, our portable program
has been transformed into a target dependent form, because of the
substitution of the target dependent values from <CODE>stdio.h</CODE>.
If we had also included the definitions of <CODE>__iob</CODE> and,
more particularly, <CODE>fputs</CODE>, things would have been even
worse - the procedure for outputting a string to the screen is likely
to be highly target dependent.<P>
To conclude, we have, by including <CODE>stdio.h</CODE>, been able
to effectively separate the target independent part of our program
(the main program) from the target dependent part (the details of
<CODE>stdout</CODE> and <CODE>fputs</CODE>). It is one of the jobs
of the compiler to recombine these parts to produce a complete program.<P>
<A NAME=S6>
<H3>2.1.3.  Application Programming Interfaces</H3>
As we have seen, the separation of the target dependent sections of
a program into the system headers and system libraries greatly facilitates
the construction of portable programs. What has been done is to define
an interface between the main program and the existing operating system
on the target machine in abstract terms. The program should then be
portable to any machine which implements this interface correctly.<P>
The interface for the &quot;Hello world&quot; program above might
be described as follows : defined in the header <CODE>stdio.h</CODE>
are a type <CODE>FILE</CODE> representing a file, an object <CODE>stdout</CODE>
of type <CODE>FILE *</CODE> representing the standard output file,
and a procedure <CODE>fputs</CODE> with prototype:<P>
<PRE>
        int fputs ( const char *s, FILE *f ) ;
</PRE>
which prints the string <CODE>s</CODE> to the file <CODE>f</CODE>.
This is an example of an Application Programming Interface (API).
Note that it can be split into two aspects, the syntactic (what they
are) and the semantic (what they mean). On any machine which implements
this API our program is both syntactically correct and does what we
expect it to.<P>
The benefit of describing the API at this fairly high level is that
it leaves scope for a range of implementation (and thus more machines
which implement it) while still encapsulating the main program's requirements.
<P>
In the example implementation of <CODE>stdio.h</CODE> above we see
that this machine implements this API correctly syntactically, but
not necessarily semantically. One would have to read the documentation
provided on the system to be sure of the semantics.<P>
Another way of defining an API for this program would be to note that
the given API is a subset of the ANSI C standard. Thus we could take
ANSI C as an &quot;off the shelf&quot; API. It is then clear that
our program should be portable to any ANSI-compliant machine.<P>
It is worth emphasising that all programs have an API, even if it
is implicit rather than explicit. However it is probably fair to say
that programs without an explicit API are only portable by accident.
We shall have more to say on this subject later.<P>
<A NAME=S7>
<H3>2.1.4.  Compilation Phases</H3>
The general plan for how to write the extreme example of a portable
program, namely one which contains no target dependent code, is now
clear. It is shown in the compilation diagram in Fig. 1 which represents
the traditional compilation process. This diagram is divided into
four sections. The left half of the diagram represents the actual
program and the right half the associated API. The top half of the
diagram represents target independent material - things which only
need to be done once - and the bottom half target dependent material
- things which need to be done on every target machine.<P>
FIGURE 1.  Traditional Compilation Phases 
<BR>
<CENTER>
<IMG SRC="../images/trad_scheme.gif">
</CENTER>
<BR>
So, we write our target independent program (top left), conforming
to the target independent API specification (top right). All the compilation
actually takes place on the target machine. This machine must have
the API correctly implemented (bottom right). This implementation
will in general be in two parts - the system headers, providing type
definitions, macros, procedure prototypes and so on, and the system
libraries, providing the actual procedure definitions. Another way
of characterising this division is between syntax (the system headers)
and semantics (the system libraries).<P>
The compilation is divided into three main phases. Firstly the system
headers are inserted into the program by the pre-processor. This produces,
in effect, a target dependent version of the original program. This
is then compiled into a binary object file. During the compilation
process the compiler inserts all the information it has about the
machine - including the Application Binary Interface (ABI) - the sizes
of the basic C types, how they are combined into compound types, the
system procedure calling conventions and so on. This ensures that
in the final linking phase the binary object file and the system libraries
are obeying the same ABI, thereby producing a valid executable. (On
a dynamically linked system this final linking phase takes place partially
at run time rather than at compile time, but this does not really
affect the general scheme.)<P>
The compilation scheme just described consists of a series of phases
of two types ; code combination (the pre-processing and system linking
phases) and code transformation (the actual compilation phases). The
existence of the combination phases allows for the effective separation
of the target independent code (in this case, the whole program) from
the target dependent code (in this case, the API implementation),
thereby aiding the construction of portable programs. These ideas
on the separation, combination and transformation of code underlie
the TDF approach to portability.<P>
<A NAME=S8>
<HR><H2>2.2.  Portability Problems</H2>
We have set out a scheme whereby it should be possible to write portable
programs with a minimum of difficulties. So why, in reality, does
it cause so many problems? Recall that we are still primarily concerned
with programs which contain no target dependent code, although most
of the points raised apply by extension to all programs.<P>
<A NAME=S9>
<H3>2.2.1.  Programming Problems</H3>
A first, obvious class of problems concern the program itself. It
is to be assumed that as many bugs as possible have been eliminated
by testing and debugging on at least one platform before a program
is considered as a candidate for being a portable program. But for
even the most self-contained program, working on one platform is no
guarantee of working on another. The program may use undefined behaviour
- using uninitialised values or dereferencing null pointers, for example
- or have built-in assumptions about the target machine - whether
it is big-endian or little-endian, or what the sizes of the basic
integer types are, for example. This latter point is going to become
increasingly important over the next couple of years as 64-bit architectures
begin to be introduced. How many existing programs implicitly assume
a 32-bit architecture?<P>
Many of these built-in assumptions may arise because of the conventional
porting process. A program is written on one machine, modified slightly
to make it work on a second machine, and so on. This means that the
program is &quot;biased&quot; towards the existing set of target machines,
and most particularly to the original machine it was written on. This
applies not only to assumptions about endianness, say, but also to
the questions of API conformance which we will be discussing below.<P>
Most compilers will pick up some of the grosser programming errors,
particularly by type checking (including procedure arguments if prototypes
are used). Some of the subtler errors can be detected using the <B>-Wall</B>
option to the Free Software Foundation's GNU C Compiler (<CODE>gcc</CODE>)
or separate program checking tools such as <CODE>lint</CODE>, for
example, but this remains a very difficult area.<P>
<A NAME=S10>
<H3>2.2.2.  Code Transformation Problems</H3>
We now move on from programming problems to compilation problems.
As we mentioned above, compilation may be regarded as a series of
phases of two types : combination and transformation. Transformation
of code - translating a program in one form into an equivalent program
in another form - may lead to a variety of problems. The code may
be transformed wrongly, so that the equivalence is broken (a compiler
bug), or in an unexpected manner (differing compiler interpretations),
or not at all, because it is not recognised as legitimate code (a
compiler limitation). The latter two problems are most likely when
the input is a high level language, with complex syntax and semantics.<P>
Note that in Fig. 1 all the actual compilation takes place on the
target machine. So, to port the program to <I>n</I> machines, we need
to deal with the bugs and limitations of <I>n</I>, potentially different,
compilers. For example, if you have written your program using prototypes,
it is going to be a large and rather tedious job porting it to a compiler
which does not have prototypes (this particular example can be automated;
not all such jobs can). Other compiler limitations can be surprising
- not understanding the <CODE>L</CODE> suffix for long numeric literals
and not allowing members of enumeration types as array indexes are
among the problems drawn from my personal experience.<P>
The differing compiler interpretations may be more subtle. For example,
there are differences between ANSI and &quot;traditional&quot; C which
may trap the unwary. Examples are the promotion of integral types
and the resolution of the linkage of static objects.<P>
Many of these problems may be reduced by using the &quot;same&quot;
compiler on all the target machines. For example, <CODE>gcc</CODE>
has a single front end (C -&gt; RTL) which may be combined with an
appropriate back end (RTL -&gt; target) to form a suitable compiler
for a wide range of target machines. The existence of a single front
end virtually eliminates the problems of differing interpretation
of code and compiler quirks. It also reduces the exposure to bugs.
Instead of being exposed to the bugs in <I>n</I> separate compilers,
we are now only exposed to bugs in one half-compiler (the front end)
plus <I>n</I> half-compilers (the back ends) - a total of <I>( n +
1 ) / 2</I>. (This calculation is not meant totally seriously, but
it is true in principle.) Front end bugs, when tracked down, also
only require a single workaround.<P>
<A NAME=S11>
<H3>2.2.3.  Code Combination Problems</H3>
If code transformation problems may be regarded as a time consuming
irritation, involving the rewriting of sections of code or using a
different compiler, the second class of problems, those concerned
with the combination of code, are far more serious.<P>
The first code combination phase is the pre-processor pulling in the
system headers. These can contain some nasty surprises. For example,
consider a simple ANSI compliant program which contains a linked list
of strings arranged in alphabetical order. This might also contain
a routine:<P>
<PRE>
        void index ( char * ) ;
</PRE>
which adds a string to this list in the appropriate position, using
<CODE>strcmp</CODE> from <CODE>string.h</CODE> to find it. This works
fine on most machines, but on some it gives the error:<P>
<PRE>
        Only 1 argument to macro 'index'
</PRE>
The reason for this is that the system version of <CODE>string.h</CODE>
contains the line:<P>
<PRE>
        #define index ( s, c ) strchr ( s, c )
</PRE>
But this is nothing to do with ANSI, this macro is defined for compatibility
with BSD.<P>
In reality the system headers on any given machine are a hodge podge
of implementations of different APIs, and it is often virtually impossible
to separate them (feature test macros such as <CODE>_POSIX_SOURCE</CODE>
are of some use, but are not always implemented and do not always
produce a complete separation; they are only provided for &quot;standard&quot;
APIs anyway). The problem above arose because there is no transitivity
rule of the form : if program <I>P</I> conforms to API <I>A</I>, and
API <I>B</I> extends <I>A</I>, then <I>P</I> conforms to <I>B</I>.
The only reason this is not true is these namespace problems.<P>
A second example demonstrates a slightly different point. The POSIX
standard states that <CODE>sys/stat.h</CODE> contains the definition
of the structure <CODE>struct stat</CODE>, which includes several
members, amongst them:<P>
<PRE>
        time_t st_atime ;
</PRE>
representing the access time for the corresponding file. So the program:<P>
<PRE>
        #include &lt;sys/types.h&gt;
        #include &lt;sys/stat.h&gt;
        
        time_t st_atime ( struct stat *p )
        {
            return ( p-&gt;st_atime ) ;
        }
</PRE>
should be perfectly valid - the procedure name <CODE>st_atime</CODE>
and the field selector <CODE>st_atime</CODE> occupy different namespaces
(see however the appendix on namespaces and APIs below). However at
least one popular operating system has the implementation:<P>
<PRE>
        struct stat {
            ....
            union {
                time_t st__sec ;
                timestruc_t st__tim ;
            } st_atim ;
            ....
        } ;
        #define st_atime st_atim.st__sec
</PRE>
This seems like a perfectly legitimate implementation. In the program
above the field selector <CODE>st_atime</CODE> is replaced by <CODE>st_atim.st__sec
</CODE> by the pre-processor, as intended, but unfortunately so is
the procedure name <CODE>st_atime</CODE>, leading to a syntax error.<P>
The problem here is not with the program or the implementation, but
in the way they were combined. C does not allow individual field selectors
to be defined. Instead the indiscriminate sledgehammer of macro substitution
was used, leading to the problem described.<P>
Problems can also occur in the other combination phase of the traditional
compilation scheme, the system linking. Consider the ANSI compliant
routine:<P>
<PRE>
        #include &lt;stdio.h&gt;
        
        int open ( char *nm )
        {
            int c, n = 0 ;
            FILE *f = fopen ( nm, &quot;r&quot; ) ;
            if ( f == NULL ) return ( -1 ) ;
            while ( c = getc ( f ), c != EOF ) n++ ;
            ( void ) fclose ( f ) ;
            return ( n ) ;
        }
</PRE>
which opens the file <CODE>nm</CODE>, returning its size in bytes
if it exists and -1 otherwise. As a quick porting exercise, I compiled
it under six different operating systems. On three it worked correctly;
on one it returned -1 even when the file existed; and on two it crashed
with a segmentation error.<P>
The reason for this lies in the system linking. On those machines
which failed the library routine <CODE>fopen</CODE> calls (either
directly or indirectly) the library routine <CODE>open</CODE> (which
is in POSIX, but not ANSI). The system linker, however, linked my
routine <CODE>open</CODE> instead of the system version, so the call
to <CODE>fopen</CODE> did not work correctly.<P>
So code combination problems are primarily namespace problems. The
task of combining the program with the API implementation on a given
platform is complicated by the fact that, because the system headers
and system libraries contain things other than the API implementation,
or even because of the particular implementation chosen, the various
namespaces in which the program is expected to operate become &quot;polluted&quot;.
<P>
<A NAME=S12>
<H3>2.2.4.  API Problems</H3>
We have said that the API defines the interface between the program
and the standard library provided with the operating system on the
target machine. There are three main problems concerned with APIs.
The first, how to choose the API in the first place, is discussed
separately. Here we deal with the compilation aspects : how to check
that the program conforms to its API, and what to do about incorrect
API implementations on the target machine(s).<P>
<A NAME=S13>
<H4>2.2.4.1.  API Checking</H4>
The problem of whether or not a program conforms to its API - not
using any objects from the operating system other than those specified
in the API, and not making any unwarranted assumptions about these
objects - is one which does not always receive sufficient attention,
mostly because the necessary checking tools do not exist (or at least
are not widely available). Compiling the program on a number of API
compliant machines merely checks the program against the system headers
for these machines. For a genuine portability check we need to check
against the abstract API description, thereby in effect checking against
all possible implementations.<P>
Recall from above that the system headers on a given machine are an
amalgam of all the APIs it implements. This can cause programs which
should compile not to, because of namespace clashes; but it may also
cause programs to compile which should not, because they have used
objects which are not in their API, but which are in the system headers.
For example, the supposedly ANSI compliant program:<P>
<PRE>
        #include &lt;signal.h&gt;
        int sig = SIGKILL ;
</PRE>
will compile on most systems, despite the fact that <CODE>SIGKILL</CODE>
is not an ANSI signal, because <CODE>SIGKILL</CODE> is in POSIX, which
is also implemented in the system <CODE>signal.h</CODE>. Again, feature
test macros are of some use in trying to isolate the implementation
of a single API from the rest of the system headers. However they
are highly unlikely to detect the error in the following supposedly
POSIX compliant program which prints the entries of the directory
<CODE>nm</CODE>, together with their inode numbers:<P>
<PRE>
        #include &lt;stdio.h&gt;
        #include &lt;sys/types.h&gt;
        #include &lt;dirent.h&gt;
        
        void listdir ( char *nm )
        {
            struct dirent *entry ;
            DIR *dir = opendir ( nm ) ;
            if ( dir == NULL ) return ;
            while ( entry = readdir ( dir ), entry != NULL ) {
                printf ( &quot;%s : %d\n&quot;, entry-&gt;d_name, ( int ) entry-&gt;d_ino ) ;
            }
            ( void ) closedir ( dir ) ;
            return ;
        }
</PRE>
This is not POSIX compliant because, whereas the <CODE>d_name</CODE>
field of <CODE>struct dirent</CODE> is in POSIX, the <CODE>d_ino</CODE>
field is not. It is however in XPG3, so it is likely to be in many
system implementations.<P>
The previous examples have been concerned with simply telling whether
or not a particular object is in an API. A more difficult, and in
a way more important, problem is that of assuming too much about the
objects which are in the API. For example, in the program:<P>
<PRE>
        #include &lt;stdio.h&gt;
        #include &lt;stdlib.h&gt;
        
        div_t d = { 3, 4 } ;
        
        int main ()
        {
            printf ( &quot;%d,%d\n&quot;, d.quot, d.rem ) ;
            return ( 0 ) ;
        }
</PRE>
the ANSI standard specifies that the type <CODE>div_t</CODE> is a
structure containing two fields, <CODE>quot</CODE> and <CODE>rem</CODE>,
of type <CODE>int</CODE>, but it does not specify which order these
fields appear in, or indeed if there are other fields. Therefore the
initialisation of <CODE>d</CODE> is not portable. Again, the type
<CODE>time_t</CODE> is used to represent times in seconds since a
certain fixed date. On most systems this is implemented as <CODE>long</CODE>,
so it is tempting to use <CODE>( t &amp; 1 )</CODE> to determine for
a <CODE>time_t</CODE> <CODE>t</CODE> whether this number of seconds
is odd or even. But ANSI actually says that <CODE>time_t</CODE> is
an arithmetic, not an integer, type, so it would be possible for it
to be implemented as <CODE>double</CODE>. But in this case <CODE>(
t &amp; 1 )</CODE> is not even type correct, so it is not a portable
way of finding out whether <CODE>t</CODE> is odd or even.<P>
<A NAME=S14>
<H4>2.2.4.2.  API Implementation Errors</H4>
Undoubtedly the problem which causes the writer of portable programs
the greatest headache (and heartache) is that of incorrect API implementations.
However carefully you have chosen your API and checked that your program
conforms to it, you are still reliant on someone (usually the system
vendor) having implemented this API correctly on the target machine.
Machines which do not implement the API at all do not enter the equation
(they are not suitable target machines), what causes problems is incorrect
implementations. As the implementation may be divided into two parts
- system headers and system libraries - we shall similarly divide
our discussion. Inevitably the choice of examples is personal; anyone
who has ever attempted to port a program to a new machine is likely
to have their own favourite examples.<P>
<A NAME=S15>
<H4>2.2.4.3.  System Header Problems</H4>
Some header problems are immediately apparent because they are syntactic
and cause the program to fail to compile. For example, values may
not be defined or be defined in the wrong place (not in the header
prescribed by the API).<P>
A common example (one which I have to include a workaround for in
virtually every program I write) is that <CODE>EXIT_SUCCESS</CODE>
and <CODE>EXIT_FAILURE</CODE> are not always defined (ANSI specifies
that they should be in <CODE>stdlib.h</CODE>). It is tempting to change
<CODE>exit (EXIT_FAILURE)</CODE> to <CODE>exit (1)</CODE> because
&quot;everyone knows&quot; that <CODE>EXIT_FAILURE</CODE> is 1. But
this is to decrease the portability of the program because it ties
it to a particular class of implementations. A better workaround would
be:<P>
<PRE>
        #include &lt;stdlib.h&gt;
        #ifndef EXIT_FAILURE
        #define EXIT_FAILURE 1
        #endif
</PRE>
which assumes that anyone choosing a non-standard value for <CODE>EXIT_FAILURE
</CODE> is more likely to put it in <CODE>stdlib.h</CODE>. Of course,
if one subsequently came across a machine on which not only is <CODE>EXIT_FAILURE
</CODE> not defined, but also the value it should have is not 1, then
it would be necessary to resort to <CODE>#ifdef machine_name</CODE>
statements. The same is true of all the API implementation problems
we shall be discussing : non-conformant machines require workarounds
involving conditional compilation. As more machines are considered,
so these conditional compilations multiply.<P>
As an example of things being defined in the wrong place, ANSI specifies
that <CODE>SEEK_SET</CODE>, <CODE>SEEK_CUR</CODE> and <CODE>SEEK_END</CODE>
should be defined in <CODE>stdio.h</CODE>, whereas POSIX specifies
that they should also be defined in <CODE>unistd.h</CODE>. It is not
uncommon to find machines on which they are defined in the latter
but not in the former. A possible workaround in this case would be:<P>
<PRE>
        #include &lt;stdio.h&gt;
        #ifndef SEEK_SET
        #include &lt;unistd.h&gt;
        #endif
</PRE>
Of course, by including &quot;unnecessary&quot; headers like <CODE>unistd.h
</CODE> the risk of namespace clashes such as those discussed above
is increased.<P>
A final syntactic problem, which perhaps should belong with the system
header problems above, concerns dependencies between the headers themselves.
For example, the POSIX header <CODE>unistd.h</CODE> declares functions
involving some of the types <CODE>pid_t</CODE>, <CODE>uid_t</CODE>
etc, defined in <CODE>sys/types.h</CODE>. Is it necessary to include
<CODE>sys/types.h</CODE> before including <CODE>unistd.h</CODE>, or
does <CODE>unistd.h</CODE> automatically include <CODE>sys/types.h</CODE>?
The approach of playing safe and including everything will normally
work, but this can lead to multiple inclusions of a header. This will
normally cause no problems because the system headers are protected
against multiple inclusions by means of macros, but it is not unknown
for certain headers to be left unprotected. Also not all header dependencies
are as clear cut as the one given, so that what headers need to be
included, and in what order, is in fact target dependent.<P>
There can also be semantic errors in the system headers : namely wrongly
defined values. The following two examples are taken from real operating
systems. Firstly the definition:<P>
<PRE>
        #define DBL_MAX 1.797693134862316E+308
</PRE>
in <CODE>float.h</CODE> on an IEEE-compliant machine is subtly wrong
- the given value does not fit into a <CODE>double</CODE> - the correct
value is:<P>
<PRE>
        #define DBL_MAX 1.7976931348623157E+308
</PRE>
Again, the type definition:<P>
<PRE>
        typedef int size_t ; /* ??? */
</PRE>
(sic) is not compliant with ANSI, which says that <CODE>size_t</CODE>
is an unsigned integer type. (I'm not sure if this is better or worse
than another system which defines <CODE>ptrdiff_t</CODE> to be <CODE>unsigned
int</CODE> when it is meant to be signed. This would mean that the
difference between any two pointers is always positive.) These particular
examples are irritating because it would have cost nothing to get
things right, correcting the value of <CODE>DBL_MAX</CODE> and changing
the definition of <CODE>size_t</CODE> to <CODE>unsigned int</CODE>.
These corrections are so minor that the modified system headers would
still be a valid interface for the existing system libraries (we shall
have more to say about this later). However it is not possible to
change the system headers, so it is necessary to build workarounds
into the program. Whereas in the first case it is possible to devise
such a workaround:<P>
<PRE>
        #include &lt;float.h&gt;
        #ifdef machine_name
        #undef DBL_MAX
        #define DBL_MAX 1.7976931348623157E+308
        #endif
</PRE>
for example, in the second, because <CODE>size_t</CODE> is defined
by a <CODE>typedef</CODE> it is virtually impossible to correct in
a simple fashion. Thus any program which relies on the fact that <CODE>size_t
</CODE> is unsigned will require considerable rewriting before it
can be ported to this machine.<P>
<A NAME=S16>
<H4>2.2.4.4.  System Library Problems</H4>
The system header problems just discussed are primarily syntactic
problems. By contrast, system library problems are primarily semantic
- the provided library routines do not behave in the way specified
by the API. This makes them harder to detect. For example, consider
the routine:<P>
<PRE>
        void *realloc ( void *p, size_t s ) ;
</PRE>
which reallocates the block of memory <CODE>p</CODE> to have size
<CODE>s</CODE> bytes, returning the new block of memory. The ANSI
standard says that if <CODE>p</CODE> is the null pointer, then the
effect of <CODE>realloc ( p, s )</CODE> is the same as <CODE>malloc
( s )</CODE>, that is, to allocate a new block of memory of size <CODE>s</CODE>.
This behaviour is exploited in the following program, in which the
routine <CODE>add_char</CODE> adds a character to the expanding array,
<CODE>buffer</CODE>:<P>
<PRE>
        #include &lt;stdio.h&gt;
        #include &lt;stdlib.h&gt;
        
        char *buffer = NULL ;
        int buff_sz = 0, buff_posn = 0 ;
        
        void add_char ( char c )
        {
            if ( buff_posn &gt;= buff_sz ) {
                buff_sz += 100 ;
                buffer = ( char * ) realloc ( ( void * ) buffer, buff_sz * sizeof ( char ) ) ;
                if ( buffer == NULL ) {
                    fprintf ( stderr, &quot;Memory allocation error\n&quot; ) ;
                    exit ( EXIT_FAILURE ) ;
                }
            }
            buffer [ buff_posn++ ] = c ;
            return ;
        }
</PRE>
On the first call of <CODE>add_char</CODE>, <CODE>buffer</CODE> is
set to a real block of memory (as opposed to <CODE>NULL</CODE>) by
a call of the form <CODE>realloc ( NULL, s )</CODE>. This is extremely
convenient and efficient - if it was not for this behaviour we would
have to have an explicit initialisation of <CODE>buffer</CODE>, either
as a special case in <CODE>add_char</CODE> or in a separate initialisation
routine.<P>
Of course this all depends on the behaviour of <CODE>realloc ( NULL,
s )</CODE> having been implemented precisely as described in the ANSI
standard. The first indication that this is not so on a particular
target machine might be when the program is compiled and run on that
machine for the first time and does not perform as expected. To track
the problem down will demand time debugging the program.<P>
Once the problem has been identified as being with <CODE>realloc</CODE>
a number of possible workarounds are possible. Perhaps the most interesting
is to replace the inclusion of <CODE>stdlib.h</CODE> by the following:<P>
<PRE>
        #include &lt;stdlib.h&gt;
        #ifdef machine_name
        #define realloc ( p, s )\
            ( ( p ) ? ( realloc ) ( p, s ) : malloc ( s ) )
        #endif
</PRE>
where <CODE>realloc ( p, s )</CODE> is redefined as a macro which
is the result of the procedure <CODE>realloc</CODE> if <CODE>p</CODE>
is not null, and <CODE>malloc ( s )</CODE> otherwise. (In fact this
macro will not always have the desired effect, although it does in
this case. Why (exercise)?)<P>
The only alternative to this trial and error approach to finding API
implementation problems is the application of personal experience,
either of the particular target machine or of things that are implemented
wrongly by many machines and as such should be avoided. This sort
of detailed knowledge is not easily acquired. Nor can it ever be complete:
new operating system releases are becoming increasingly regular and
are on occasions quite as likely to introduce new implementation errors
as to solve existing ones. It is in short a &quot;black art&quot;.<P>
<A NAME=S17>
<HR><H2>2.3.  APIs and Portability</H2>
We now return to our discussion of the general issues involved in
portability to more closely examine the role of the API.<P>
<A NAME=S18>
<H3>2.3.1.  Target Dependent Code</H3>
So far we have been considering programs which contain no conditional
compilation, in which the API forms the basis of the separation of
the target independent code (the whole program) and the target dependent
code (the API implementation). But a glance at most large C programs
will reveal that they do contain conditional compilation. The code
is scattered with <CODE>#if</CODE>'s and <CODE>#ifdef</CODE>'s which,
in effect, cause the pre-processor to construct slightly different
programs on different target machines. So here we do not have a clean
division between the target independent and the target dependent code
- there are small sections of target dependent code spread throughout
the program.<P>
Let us briefly consider some of the reasons why it is necessary to
introduce this conditional compilation. Some have already been mentioned
- workarounds for compiler bugs, compiler limitations, and API implementation
errors; others will be considered later. However the most interesting
and important cases concern things which need to be done genuinely
differently on different machines. This can be because they really
cannot be expressed in a target independent manner, or because the
target independent way of doing them is unacceptably inefficient.<P>
Efficiency (either in terms of time or space) is a key issue in many
programs. The argument is often advanced that writing a program portably
means using the, often inefficient, lowest common denominator approach.
But under our definition of portability it is the functionality that
matters, not the actual source code. There is nothing to stop different
code being used on different machines for reasons of efficiency.<P>
To examine the relationship between target dependent code and APIs,
consider the simple program:<P>
<PRE>
        #include &lt;stdio.h&gt;
        
        int main ()
        {
        #ifdef mips
            fputs ( &quot;This machine is a mips\n&quot;, stdout ) ;
        #endif
            return ( 0 ) ;
        }
</PRE>
which prints a message if the target machine is a mips. What is the
API of this program? Basically it is the same as in the &quot;Hello
world&quot; example discussed in sections 2.1.1</A> and 2.1.2</A>,
but if we wish the API to fully describe the interface between the
program and the target machine, we must also say that whether or not
the macro <CODE>mips</CODE> is defined is part of the API. Like the
rest of the API, this has a semantic aspect as well as a syntactic
- in this case that <CODE>mips</CODE> is only defined on mips machines.
Where it differs is in its implementation. Whereas the main part of
the API is implemented in the system headers and the system libraries,
the implementation of either defining, or not defining, <CODE>mips</CODE>
ultimately rests with the person performing the compilation. (In this
particular example, the macro <CODE>mips</CODE> is normally built
into the compiler on mips machines, but this is only a convention.)<P>
So the API in this case has two components : a system-defined part
which is implemented in the system headers and system libraries, and
a user-defined part which ultimately relies on the person performing
the compilation to provide an implementation. The main point to be
made in this section is that introducing target dependent code is
equivalent to introducing a user-defined component to the API. The
actual compilation process in the case of programs containing target
dependent code is basically the same as that shown in Fig. 1. But
whereas previously the vertical division of the diagram also reflects
a division of responsibility - the left hand side is the responsibility
of the programmer (the person writing the program), and the right
hand side of the API specifier (for example, a standards defining
body) and the API implementor (the system vendor) - now the right
hand side is partially the responsibility of the programmer and the
person performing the compilation. The programmer specifies the user-defined
component of the API, and the person compiling the program either
implements this API (as in the mips example above) or chooses between
a number of alternative implementations provided by the programmer
(as in the example below).<P>
Let us consider a more complex example. Consider the following program
which assumes, for simplicity, that an <CODE>unsigned int</CODE> contains
32 bits:<P>
<PRE>
        #include &lt;stdio.h&gt;
        #include &quot;config.h&quot;
        
        #ifndef SLOW_SHIFT
        #define MSB ( a ) ( ( unsigned char ) ( a &gt;&gt; 24 ) )
        #else
        #ifdef BIG_ENDIAN
        #define MSB ( a ) *( ( unsigned char * ) &amp;( a ) )
        #else
        #define MSB ( a ) *( ( unsigned char * ) &amp;( a ) + 3 )
        #endif
        #endif
        
        unsigned int x = 100000000 ;
        
        int main ()
        {
            printf ( &quot;%u\n&quot;, MSB ( x ) ) ;
            return ( 0 ) ;
        }
</PRE>
The intention is to print the most significant byte of <CODE>x</CODE>.
Three alternative definitions of the macro <CODE>MSB</CODE> used to
extract this value are provided. The first, if <CODE>SLOW_SHIFT</CODE>
is not defined, is simply to shift the value right by 24 bits. This
will work on all 32-bit machines, but may be inefficient (depending
on the nature of the machine's shift instruction). So two alternatives
are provided. An <CODE>unsigned int</CODE> is assumed to consist of
four <CODE>unsigned char</CODE>'s. On a big-endian machine, the most
significant byte is the first of these <CODE>unsigned char</CODE>'s;
on a little-endian machine it is the fourth. The second definition
of <CODE>MSB</CODE> is intended to reflect the former case, and the
third the latter.<P>
The person compiling the program has to choose between the three possible
implementations of <CODE>MSB</CODE> provided by the programmer. This
is done by either defining, or not defining, the macros <CODE>SLOW_SHIFT</CODE>
and <CODE>BIG_ENDIAN</CODE>. This could be done as command line options,
but we have chosen to reflect another commonly used device, the configuration
file. For each target machine, the programmer provides a version of
the file <CODE>config.h</CODE> which defines the appropriate combination
of the macros <CODE>SLOW_SHIFT</CODE> and <CODE>BIG_ENDIAN</CODE>.
The person performing the compilation simply chooses the appropriate
<CODE>config.h</CODE> for the target machine.<P>
There are two possible ways of looking at what the user-defined API
of this program is. Possibly it is most natural to say that it is
<CODE>MSB</CODE>, but it could also be argued that it is the macros
<CODE>SLOW_SHIFT</CODE> and <CODE>BIG_ENDIAN</CODE>. The former more
accurately describes the target dependent code, but is only implemented
indirectly, via the latter.<P>
<A NAME=S19>
<H3>2.3.2.  Making APIs Explicit</H3>
As we have said, every program has an API even if it is implicit rather
than explicit. Every system header included, every type or value used
from it, and every library routine used, adds to the system-defined
component of the API, and every conditional compilation adds to the
user-defined component. What making the API explicit does is to encapsulate
the set of requirements that the program has of the target machine
(including requirements like, I need to know whether or not the target
machine is big-endian, as well as, I need <CODE>fputs</CODE> to be
implemented as in the ANSI standard). By making these requirements
explicit it is made absolutely clear what is needed on a target machine
if a program is to be ported to it. If the requirements are not explicit
this can only be found by trial and error. This is what we meant earlier
by saying that a program without an explicit API is only portable
by accident.<P>
Another advantage of specifying the requirements of a program is that
it may increase their chances of being implemented. We have spoken
as if porting is a one-way process; program writers porting their
programs to new machines. But there is also traffic the other way.
Machine vendors may wish certain programs to be ported to their machines.
If these programs come with a list of requirements then the vendor
knows precisely what to implement in order to make such a port possible.<P>
<A NAME=S20>
<H3>2.3.3.  Choosing an API</H3>
So how does one go about choosing an API? In a sense the user-defined
component is easier to specify than the system-defined component because
it is less tied to particular implementation models. What is required
is to abstract out what exactly needs to be done in a target dependent
manner and to decide how best to separate it out. The most difficult
problem is how to make the implementation of this API as simple as
possible for the person performing the compilation, if necessary providing
a number of alternative implementations to choose between and a simple
method of making this choice (for example, the <CODE>config.h</CODE>
file above). With the system-defined component the question is more
likely to be, how do the various target machines I have in mind implement
what I want to do? The abstraction of this is usually to choose a
standard and widely implemented API, such as POSIX, which provides
all the necessary functionality.<P>
The choice of &quot;standard&quot; API is of course influenced by
the type of target machines one has in mind. Within the Unix world,
the increasing adoption of Open Standards, such as POSIX, means that
choosing a standard API which is implemented on a wide variety Unix
boxes is becoming easier. Similarly, choosing an API which will work
on most MSDOS machines should cause few problems. The difficulty is
that these are disjoint worlds; it is very difficult to find a standard
API which is implemented on both Unix and MSDOS machines. At present
not much can be done about this, it reflects the disjoint nature of
the computer market.<P>
To develop a similar point : the drawback of choosing POSIX (for example)
as an API is that it restricts the range of possible target machines
to machines which implement POSIX. Other machines, for example, BSD
compliant machines, might offer the same functionality (albeit using
different methods), so they should be potential target machines, but
they have been excluded by the choice of API. One approach to the
problem is the &quot;alternative API&quot; approach. Both the POSIX
and the BSD variants are built into the program, but only one is selected
on any given target machine by means of conditional compilation. Under
our &quot;equivalent functionality&quot; definition of portability,
this is a program which is portable to both POSIX and BSD compliant
machines. But viewed in the light of the discussion above, if we regard
a program as a program-API pair, it could be regarded as two separate
programs combined on a single source code tree. A more interesting
approach would be to try to abstract out what exactly the functionality
which both POSIX and BSD offer is and use that as the API. Then instead
of two separate APIs we would have a single API with two broad classes
of implementations. The advantage of this latter approach becomes
clear if wished to port the program to a machine which implements
neither POSIX nor BSD, but provides the equivalent functionality in
a third way.<P>
As a simple example, both POSIX and BSD provide very similar methods
for scanning the entries of a directory. The main difference is that
the POSIX version is defined in <CODE>dirent.h</CODE> and uses a structure
called <CODE>struct dirent</CODE>, whereas the BSD version is defined
in <CODE>sys/dir.h</CODE> and calls the corresponding structure <CODE>struct
direct</CODE>. The actual routines for manipulating directories are
the same in both cases. So the only abstraction required to unify
these two APIs is to introduce an abstract type, <CODE>dir_entry</CODE>
say, which can be defined by:<P>
<PRE>
        typedef struct dirent dir_entry ;
</PRE>
on POSIX machines, and:<P>
<PRE>
        typedef struct direct dir_entry ;
</PRE>
on BSD machines. Note how this portion of the API crosses the system-user
boundary. The object <CODE>dir_entry</CODE> is defined in terms of
the objects in the system headers, but the precise definition depends
on a user-defined value (whether the target machine implements POSIX
or BSD).<P>
<A NAME=S21>
<H3>2.3.4.  Alternative Program Versions</H3>
Another reason for introducing conditional compilation which relates
to APIs is the desire to combine several programs, or versions of
programs, on a single source tree. There are several cases to be distinguished
between. The reuse of code between genuinely different programs does
not really enter the argument : any given program will only use one
route through the source tree, so there is no real conditional compilation
per se in the program. What is more interesting is the use of conditional
compilation to combine several versions of the same program on the
same source tree to provide additional or alternative features.<P>
It could be argued that the macros (or whatever) used to select between
the various versions of the program are just part of the user-defined
API as before. But consider a simple program which reads in some numerical
input, say, processes it, and prints the results. This might, for
example, have POSIX as its API. We may wish to optionally enhance
this by displaying the results graphically rather than textually on
machines which have X Windows, the compilation being conditional on
some boolean value, <CODE>HAVE_X_WINDOWS</CODE>, say. What is the
API of the resultant program? The answer from the point of view of
the program is the union of POSIX, X Windows and the user-defined
value <CODE>HAVE_X_WINDOWS</CODE>. But from the implementation point
of view we can either implement POSIX and set <CODE>HAVE_X_WINDOWS</CODE>
to false, or implement both POSIX and X Windows and set <CODE>HAVE_X_WINDOWS
</CODE> to true. So what introducing <CODE>HAVE_X_WINDOWS</CODE> does
is to allow flexibility in the API implementation.<P>
This is very similar to the alternative APIs discussed above. However
the approach outlined will really only work for optional API extensions.
To work in the alternative API case, we would need to have the union
of POSIX, BSD and a boolean value, say, as the API. Although this
is possible in theory, it is likely to lead to namespace clashes between
POSIX and BSD.<P>
<HR><H2>Appendix: Namespaces and APIs</H2>
Namespace problems are amongst the most difficult faced by standard
defining bodies (for example, the ANSI and POSIX committees) and they
often go to great lengths to specify which names should, and should
not, appear when certain headers are included. (The position is set
out in D. F. Prosser, <I>Header and name space rules for UNIX systems</I>
(private communication), USL, 1993.)<P>
For example, the intention, certainly in ANSI, is that each header
should operate as an independent sub-API. Thus <CODE>va_list</CODE>
is prohibited from appearing in the namespace when <CODE>stdio.h</CODE>
is included (it is defined only in <CODE>stdarg.h</CODE>) despite
the fact that it appears in the prototype:<P>
<PRE>
        int vprintf ( char *, va_list ) ;
</PRE>
This seeming contradiction is worked round on most implementations
by defining a type <CODE>__va_list</CODE> in <CODE>stdio.h</CODE>
which has exactly the same definition as <CODE>va_list</CODE>, and
declaring <CODE>vprintf</CODE> as:<P>
<PRE>
        int vprintf ( char *, __va_list ) ;
</PRE>
This is only legal because <CODE>__va_list</CODE> is deemed not to
corrupt the namespace because of the convention that names beginning
with <CODE>__</CODE> are reserved for implementation use.<P>
This particular namespace convention is well-known, but there are
others defined in these standards which are not generally known (and
since no compiler I know tests them, not widely adhered to). For example,
the ANSI header <CODE>errno.h</CODE> reserves all names given by the
regular expression:<P>
<PRE>
        E[0-9A-Z][0-9a-z_A-Z]+
</PRE>
against macros (i.e. in all namespaces). By prohibiting the user from
using names of this form, the intention is to protect against namespace
clashes with extensions of the ANSI API which introduce new error
numbers. It also protects against a particular implementation of these
extensions - namely that new error numbers will be defined as macros.<P>
A better example of protecting against particular implementations
comes from POSIX. If <CODE>sys/stat.h</CODE> is included names of
the form:<P>
<PRE>
        st_[0-9a-z_A-Z]+
</PRE>
are reserved against macros (as member names). The intention here
is not only to reserve field selector names for future extensions
to <CODE>struct stat</CODE> (which would only affect API implementors,
not ordinary users), but also to reserve against the possibility that
these field selectors might be implemented by macros. So our <CODE>st_atime
</CODE> example in section 2.2.3</A> is strictly illegal because the
procedure name <CODE>st_atime</CODE> lies in a restricted namespace.
Indeed the namespace is restricted precisely to disallow this program.<P>
As an exercise to the reader, how many of your programs use names
from the following restricted namespaces (all drawn from ANSI, all
applying to all namespaces)?<P>
<PRE>
        is[a-z][0-9a-z_A-Z]+                            (ctype.h)
        to[a-z][0-9a-z_A-Z]+                            (ctype.h)
        str[a-z][0-9a-z_A-Z]+                           (stdlib.h)
</PRE>
With the TDF approach of describing APIs in abstract terms using the
<CODE>#pragma token</CODE> syntax most of these namespace restrictions
are seen to be superfluous. When a target independent header is included
precisely the objects defined in that header in that version of the
API appear in the namespace. There are no worries about what else
might happen to be in the header, because there is nothing else. Also
implementation details are separated off to the TDF library building,
so possible namespace pollution through particular implementations
does not arise.<P>
Currently TDF does not have a neat way of solving the <CODE>va_list</CODE>
problem. The present target independent headers use a similar workaround
to that described above (exploiting a reserved namespace). (See the
footnote in section 3.4.1.1.)<P>
None of this is intended as criticism of the ANSI or POSIX standards.
It merely shows some of the problems that can arise from the insufficient
separation of code.<P>
<HR>
<P><I>Part of the <A HREF="../index.html">TenDRA Web</A>.<BR>Crown
Copyright &copy; 1998.</I></P>
</BODY>
</HTML>
Subversion Repositories tendra.SVN

(root)/branches/tendra4/doc/port/port3.html – Rev 3