testfloat/notes/testfloat.txt

1.1Sross
1.1SrossTestFloat Release 2a General Documentation
1.1Sross
1.1SrossJohn R. Hauser
1.1Sross1998 December 16
1.1Sross
1.1Sross
1.1Sross-------------------------------------------------------------------------------
1.1SrossIntroduction
1.1Sross
1.1SrossTestFloat is a program for testing that a floating-point implementation
1.1Srossconforms to the IEC/IEEE Standard for Binary Floating-Point Arithmetic.
1.1SrossAll standard operations supported by the system can be tested, except for
1.1Srossconversions to and from decimal.  Any of the following machine formats can
1.1Srossbe tested:  single precision, double precision, extended double precision,
1.1Srossand/or quadruple precision.
1.1Sross
1.1SrossTestFloat actually comes in two variants:  one is a program for testing
1.1Srossa machine's floating-point, and the other is a program for testing
1.1Srossthe SoftFloat software implementation of floating-point.  (Information
1.1Srossabout SoftFloat can be found at the SoftFloat Web page, `http://
1.1SrossHTTP.CS.Berkeley.EDU/~jhauser/arithmetic/SoftFloat.html'.)  The version that
1.1Srosstests SoftFloat is expected to be of interest only to people compiling the
1.1SrossSoftFloat sources.  However, because the two versions share much in common,
1.1Srossthey are discussed together in all the TestFloat documentation.
1.1Sross
1.1SrossThis document explains how to use the TestFloat programs.  It does not
1.1Srossattempt to define or explain the IEC/IEEE Standard for floating-point.
1.1SrossDetails about the standard are available elsewhere.
1.1Sross
1.1SrossThe first release of TestFloat (Release 1) was called _FloatTest_.  The old
1.1Srossname has been obsolete for some time.
1.1Sross
1.1Sross
1.1Sross-------------------------------------------------------------------------------
1.1SrossLimitations
1.1Sross
1.1SrossTestFloat's output is not always easily interpreted.  Detailed knowledge
1.1Srossof the IEC/IEEE Standard and its vagaries is needed to use TestFloat
1.1Srossresponsibly.
1.1Sross
1.1SrossTestFloat performs relatively simple tests designed to check the fundamental
1.1Srosssoundness of the floating-point under test.  TestFloat may also at times
1.1Srossmanage to find rarer and more subtle bugs, but it will probably only find
1.1Srosssuch bugs by accident.  Software that purposefully seeks out various kinds
1.1Srossof subtle floating-point bugs can be found through links posted on the
1.1SrossTestFloat Web page (`http://HTTP.CS.Berkeley.EDU/~jhauser/arithmetic/
1.1SrossTestFloat.html').
1.1Sross
1.1Sross
1.1Sross-------------------------------------------------------------------------------
1.1SrossContents
1.1Sross
1.1Sross    Introduction
1.1Sross    Limitations
1.1Sross    Contents
1.1Sross    Legal Notice
1.1Sross    What TestFloat Does
1.1Sross    Executing TestFloat
1.1Sross    Functions Tested by TestFloat
1.1Sross        Conversion Functions
1.1Sross        Standard Arithmetic Functions
1.1Sross        Remainder and Round-to-Integer Functions
1.1Sross        Comparison Functions
1.1Sross    Interpreting TestFloat Output
1.1Sross    Variations Allowed by the IEC/IEEE Standard
1.1Sross        Underflow
1.1Sross        NaNs
1.1Sross        Conversions to Integer
1.1Sross    TestFloat Options
1.1Sross        -help
1.1Sross        -list
1.1Sross        -level <num>
1.1Sross        -errors <num>
1.1Sross        -errorstop
1.1Sross        -forever
1.1Sross        -checkNaNs
1.1Sross        -precision32, -precision64, -precision80
1.1Sross        -nearesteven, -tozero, -down, -up
1.1Sross        -tininessbefore, -tininessafter
1.1Sross    Function Sets
1.1Sross    Contact Information
1.1Sross
1.1Sross
1.1Sross
1.1Sross-------------------------------------------------------------------------------
1.1SrossLegal Notice
1.1Sross
1.1SrossTestFloat was written by John R. Hauser.
1.1Sross
1.1SrossTHIS SOFTWARE IS DISTRIBUTED AS IS, FOR FREE.  Although reasonable effort
1.1Srosshas been made to avoid it, THIS SOFTWARE MAY CONTAIN FAULTS THAT WILL AT
1.1SrossTIMES RESULT IN INCORRECT BEHAVIOR.  USE OF THIS SOFTWARE IS RESTRICTED TO
1.1SrossPERSONS AND ORGANIZATIONS WHO CAN AND WILL TAKE FULL RESPONSIBILITY FOR ANY
1.1SrossAND ALL LOSSES, COSTS, OR OTHER PROBLEMS ARISING FROM ITS USE.
1.1Sross
1.1Sross
1.1Sross-------------------------------------------------------------------------------
1.1SrossWhat TestFloat Does
1.1Sross
1.1SrossTestFloat tests a system's floating-point by comparing its behavior with
1.1Srossthat of TestFloat's own internal floating-point implemented in software.
1.1SrossFor each operation tested, TestFloat generates a large number of test cases,
1.1Srossmade up of simple pattern tests intermixed with weighted random inputs.
1.1SrossThe cases generated should be adequate for testing carry chain propagations,
1.1Srossplus the rounding of adds, subtracts, multiplies, and simple operations like
1.1Srossconversions.  TestFloat makes a point of checking all boundary cases of the
1.1Srossarithmetic, including underflows, overflows, invalid operations, subnormal
1.1Srossinputs, zeros (positive and negative), infinities, and NaNs.  For the
1.1Srossinteresting operations like adds and multiplies, literally millions of test
1.1Srosscases can be checked.
1.1Sross
1.1SrossTestFloat is not remarkably good at testing difficult rounding cases for
1.1Srossdivisions and square roots.  It also makes no attempt to find bugs specific
1.1Srossto SRT divisions and the like (such as the infamous Pentium divide bug).
1.1SrossSoftware that tests for such failures can be found through links on the
1.1SrossTestFloat Web page, `http://HTTP.CS.Berkeley.EDU/~jhauser/arithmetic/
1.1SrossTestFloat.html'.
1.1Sross
1.1SrossNOTE!
1.1SrossIt is the responsibility of the user to verify that the discrepancies
1.1SrossTestFloat finds actually represent faults in the system being tested.
1.1SrossAdvice to help with this task is provided later in this document.
1.1SrossFurthermore, even if TestFloat finds no fault with a floating-point
1.1Srossimplementation, that in no way guarantees that the implementation is bug-
1.1Srossfree.
1.1Sross
1.1SrossFor each operation, TestFloat can test all four rounding modes required
1.1Srossby the IEC/IEEE Standard.  TestFloat verifies not only that the numeric
1.1Srossresults of an operation are correct, but also that the proper floating-point
1.1Srossexception flags are raised.  All five exception flags are tested, including
1.1Srossthe inexact flag.  TestFloat does not attempt to verify that the floating-
1.1Srosspoint exception flags are actually implemented as sticky flags.
1.1Sross
1.1SrossFor machines that implement extended double precision with rounding
1.1Srossprecision control (such as Intel's 80x86), TestFloat can test the add,
1.1Srosssubtract, multiply, divide, and square root functions at all the standard
1.1Srossrounding precisions.  The rounding precision can be set equivalent to single
1.1Srossprecision, to double precision, or to the full extended double precision.
1.1SrossRounding precision control can only be applied to the extended double-
1.1Srossprecision format and only for the five standard arithmetic operations:  add,
1.1Srosssubtract, multiply, divide, and square root.  Other functions can be tested
1.1Srossonly at full precision.
1.1Sross
1.1SrossAs a rule, TestFloat is not particular about the bit patterns of NaNs that
1.1Srossappear as function results.  Any NaN is considered as good a result as
1.1Srossanother.  This laxness can be overridden so that TestFloat checks for
1.1Srossparticular bit patterns within NaN results.  See the sections _Variations_
1.1Sross_Allowed_by_the_IEC/IEEE_Standard_ and _TestFloat_Options_ for details.
1.1Sross
1.1SrossNot all IEC/IEEE Standard functions are supported by all machines.
1.1SrossTestFloat can only test functions that exist on the machine.  But even if
1.1Srossa function is supported by the machine, TestFloat may still not be able
1.1Srossto test the function if it is not accessible through standard ISO C (the
1.1Srossprogramming language in which TestFloat is written) and if the person who
1.1Srosscompiled TestFloat did not provide an alternate means for TestFloat to
1.1Srossinvoke the machine function.
1.1Sross
1.1SrossTestFloat compares a machine's floating-point against the SoftFloat software
1.1Srossimplementation of floating-point, also written by me.  SoftFloat is built
1.1Srossinto the TestFloat executable and does not need to be supplied by the user.
1.1SrossIf SoftFloat is wanted for some other reason (to compile a new version
1.1Srossof TestFloat, for instance), it can be found separately at the Web page
1.1Sross`http://HTTP.CS.Berkeley.EDU/~jhauser/arithmetic/SoftFloat.html'.
1.1Sross
1.1SrossFor testing SoftFloat itself, the TestFloat package includes a program that
1.1Srosscompares SoftFloat's floating-point against _another_ software floating-
1.1Srosspoint implementation.  The second software floating-point is simpler and
1.1Srossslower than SoftFloat, and is completely independent of SoftFloat.  Although
1.1Srossthe second software floating-point cannot be guaranteed to be bug-free, the
1.1Srosschance that it would mimic any of SoftFloat's bugs is remote.  Consequently,
1.1Srossan error in one or the other floating-point version should appear as an
1.1Srossunexpected discrepancy between the two implementations.  Note that testing
1.1SrossSoftFloat should only be necessary when compiling a new TestFloat executable
1.1Srossor when compiling SoftFloat for some other reason.
1.1Sross
1.1Sross
1.1Sross-------------------------------------------------------------------------------
1.1SrossExecuting TestFloat
1.1Sross
1.1SrossTestFloat is intended to be executed from a command line interpreter.  The
1.1Sross`testfloat' program is invoked as follows:
1.1Sross
1.1Sross    testfloat [<option>...] <function>
1.1Sross
1.1SrossHere square brackets ([]) indicate optional items, while angled brackets
1.1Sross(<>) denote parameters to be filled in.
1.1Sross
1.1SrossThe `<function>' argument is a name like `float32_add' or `float64_to_int32'.
1.1SrossThe complete list of function names is given in the next section,
1.1Sross_Functions_Tested_by_TestFloat_.  It is also possible to test all machine
1.1Srossfunctions in a single invocation.  The various options to TestFloat are
1.1Srossdetailed in the section _TestFloat_Options_ later in this document.  If
1.1Sross`testfloat' is executed without any arguments, a summary of TestFloat usage
1.1Srossis written.
1.1Sross
1.1SrossTestFloat will ordinarily test a function for all four rounding modes, one
1.1Srossafter the other.  If the rounding mode is not supposed to have any affect
1.1Srosson the results--for instance, some operations do not require rounding--only
1.1Srossthe nearest/even rounding mode is checked.  For extended double-precision
1.1Srossoperations affected by rounding precision control, TestFloat also tests all
1.1Srossthree rounding precision modes, one after the other.  Testing can be limited
1.1Srossto a single rounding mode and/or rounding precision with appropriate options
1.1Sross(see _TestFloat_Options_).
1.1Sross
1.1SrossAs it executes, TestFloat writes status information to the standard error
1.1Srossoutput, which should be the screen by default.  In order for this status to
1.1Srossbe displayed properly, the standard error stream should not be redirected
1.1Srossto a file.  The discrepancies TestFloat finds are written to the standard
1.1Srossoutput stream, which is easily redirected to a file if desired.  Ordinarily,
1.1Srossthe errors TestFloat reports and the ongoing status information appear
1.1Srossintermixed on the same screen.
1.1Sross
1.1SrossThe version of TestFloat for testing SoftFloat is called `testsoftfloat'.
1.1SrossIt is invoked the same as `testfloat',
1.1Sross
1.1Sross    testsoftfloat [<option>...] <function>
1.1Sross
1.1Srossand operates similarly.
1.1Sross
1.1Sross
1.1Sross-------------------------------------------------------------------------------
1.1SrossFunctions Tested by TestFloat
1.1Sross
1.1SrossTestFloat tests all operations required by the IEC/IEEE Standard except for
1.1Srossconversions to and from decimal.  The operations are
1.1Sross
1.1Sross-- Conversions among the supported floating-point formats, and also between
1.1Sross   integers (32-bit and 64-bit) and any of the floating-point formats.
1.1Sross
1.1Sross-- The usual add, subtract, multiply, divide, and square root operations
1.1Sross   for all supported floating-point formats.
1.1Sross
1.1Sross-- For each format, the floating-point remainder operation defined by the
1.1Sross   IEC/IEEE Standard.
1.1Sross
1.1Sross-- For each floating-point format, a ``round to integer'' operation that
1.1Sross   rounds to the nearest integer value in the same format.  (The floating-
1.1Sross   point formats can hold integer values, of course.)
1.1Sross
1.1Sross-- Comparisons between two values in the same floating-point format.
1.1Sross
1.1SrossDetailed information about these functions is given below.  In the function
1.1Srossnames used by TestFloat, single precision is called `float32', double
1.1Srossprecision is `float64', extended double precision is `floatx80', and
1.1Srossquadruple precision is `float128'.  TestFloat uses the same names for
1.1Srossfunctions as SoftFloat.
1.1Sross
1.1Sross- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
1.1SrossConversion Functions
1.1Sross
1.1SrossAll conversions among the floating-point formats and all conversion between
1.1Srossa floating-point format and 32-bit and 64-bit signed integers can be tested.
1.1SrossThe conversion functions are:
1.1Sross
1.1Sross   int32_to_float32      int64_to_float32
1.1Sross   int32_to_float64      int64_to_float32
1.1Sross   int32_to_floatx80     int64_to_floatx80
1.1Sross   int32_to_float128     int64_to_float128
1.1Sross
1.1Sross   float32_to_int32      float32_to_int64
1.1Sross   float32_to_int32      float64_to_int64
1.1Sross   floatx80_to_int32     floatx80_to_int64
1.1Sross   float128_to_int32     float128_to_int64
1.1Sross
1.1Sross   float32_to_float64    float32_to_floatx80   float32_to_float128
1.1Sross   float64_to_float32    float64_to_floatx80   float64_to_float128
1.1Sross   floatx80_to_float32   floatx80_to_float64   floatx80_to_float128
1.1Sross   float128_to_float32   float128_to_float64   float128_to_floatx80
1.1Sross
1.1SrossThese conversions all round according to the current rounding mode as
1.1Srossnecessary.  Conversions from a smaller to a larger floating-point format are
1.1Srossalways exact and so require no rounding.  Conversions from 32-bit integers
1.1Srossto double precision or to any larger floating-point format are also exact,
1.1Srossand likewise for conversions from 64-bit integers to extended double and
1.1Srossquadruple precisions.
1.1Sross
1.1SrossISO/ANSI C requires that conversions to integers be rounded toward zero.
1.1SrossSuch conversions can be tested with the following functions that ignore any
1.1Srossrounding mode:
1.1Sross
1.1Sross   float32_to_int32_round_to_zero    float32_to_int64_round_to_zero
1.1Sross   float64_to_int32_round_to_zero    float64_to_int64_round_to_zero
1.1Sross   floatx80_to_int32_round_to_zero   floatx80_to_int64_round_to_zero
1.1Sross   float128_to_int32_round_to_zero   float128_to_int64_round_to_zero
1.1Sross
1.1SrossTestFloat assumes that conversions from floating-point to integer should
1.1Srossraise the invalid exception if the source value cannot be rounded to a
1.1Srossrepresentable integer of the desired size (32 or 64 bits).  If such a
1.1Srossconversion overflows, TestFloat expects the largest integer with the same
1.1Srosssign as the operand to be returned.  If the floating-point operand is a NaN,
1.2SandvarTestFloat allows either the largest positive or largest negative integer to
1.1Srossbe returned.
1.1Sross
1.1Sross- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
1.1SrossStandard Arithmetic Functions
1.1Sross
1.1SrossThe following standard arithmetic functions can be tested:
1.1Sross
1.1Sross   float32_add    float32_sub    float32_mul    float32_div    float32_sqrt
1.1Sross   float64_add    float64_sub    float64_mul    float64_div    float64_sqrt
1.1Sross   floatx80_add   floatx80_sub   floatx80_mul   floatx80_div   floatx80_sqrt
1.1Sross   float128_add   float128_sub   float128_mul   float128_div   float128_sqrt
1.1Sross
1.1SrossThe extended double-precision (`floatx80') functions can be rounded to
1.1Srossreduced precision under rounding precision control.
1.1Sross
1.1Sross- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
1.1SrossRemainder and Round-to-Integer Functions
1.1Sross
1.1SrossFor each format, TestFloat can test the IEC/IEEE Standard remainder and
1.1Srossround-to-integer functions.  The remainder functions are:
1.1Sross
1.1Sross   float32_rem
1.1Sross   float64_rem
1.1Sross   floatx80_rem
1.1Sross   float128_rem
1.1Sross
1.1SrossThe round-to-integer functions are:
1.1Sross
1.1Sross   float32_round_to_int
1.1Sross   float64_round_to_int
1.1Sross   floatx80_round_to_int
1.1Sross   float128_round_to_int
1.1Sross
1.1SrossThe remainder functions are always exact and so do not require rounding.
1.1Sross
1.1Sross- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
1.1SrossComparison Functions
1.1Sross
1.1SrossThe following floating-point comparison functions can be tested:
1.1Sross
1.1Sross   float32_eq    float32_le    float32_lt
1.1Sross   float64_eq    float64_le    float64_lt
1.1Sross   floatx80_eq   floatx80_le   floatx80_lt
1.1Sross   float128_eq   float128_le   float128_lt
1.1Sross
1.1SrossThe abbreviation `eq' stands for ``equal'' (=); `le' stands for ``less than
1.1Srossor equal'' (<=); and `lt' stands for ``less than'' (<).
1.1Sross
1.1SrossThe IEC/IEEE Standard specifies that the less-than-or-equal and less-than
1.1Srossfunctions raise the invalid exception if either input is any kind of NaN.
1.1SrossThe equal functions, for their part, are defined not to raise the invalid
1.1Srossexception on quiet NaNs.  For completeness, the following additional
1.1Srossfunctions can be tested if supported:
1.1Sross
1.1Sross   float32_eq_signaling    float32_le_quiet    float32_lt_quiet
1.1Sross   float64_eq_signaling    float64_le_quiet    float64_lt_quiet
1.1Sross   floatx80_eq_signaling   floatx80_le_quiet   floatx80_lt_quiet
1.1Sross   float128_eq_signaling   float128_le_quiet   float128_lt_quiet
1.1Sross
1.1SrossThe `signaling' equal functions are identical to the standard functions
1.1Srossexcept that the invalid exception should be raised for any NaN input.
1.1SrossLikewise, the `quiet' comparison functions should be identical to their
1.1Srosscounterparts except that the invalid exception is not raised for quiet NaNs.
1.1Sross
1.1SrossObviously, no comparison functions ever require rounding.  Any rounding mode
1.1Srossis ignored.
1.1Sross
1.1Sross- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
1.1Sross
1.1Sross
1.1Sross-------------------------------------------------------------------------------
1.1SrossInterpreting TestFloat Output
1.1Sross
1.1SrossThe ``errors'' reported by TestFloat may or may not really represent errors
1.1Srossin the system being tested.  For each test case tried, TestFloat performs
1.1Srossthe same floating-point operation for the two implementations being compared
1.1Srossand reports any unexpected difference in the results.  The two results could
1.1Srossdiffer for several reasons:
1.1Sross
1.1Sross-- The IEC/IEEE Standard allows for some variation in how conforming
1.1Sross   floating-point behaves.  Two implementations can occasionally give
1.1Sross   different results without either being incorrect.
1.1Sross
1.1Sross-- The trusted floating-point emulation could be faulty.  This could be
1.1Sross   because there is a bug in the way the enulation is coded, or because a
1.1Sross   mistake was made when the code was compiled for the current system.
1.1Sross
1.1Sross-- TestFloat may not work properly, reporting discrepancies that do not
1.1Sross   exist.
1.1Sross
1.1Sross-- Lastly, the floating-point being tested could actually be faulty.
1.1Sross
1.1SrossIt is the responsibility of the user to determine the causes for the
1.1Srossdiscrepancies TestFloat reports.  Making this determination can require
1.1Srossdetailed knowledge about the IEC/IEEE Standard.  Assuming TestFloat is
1.1Srossworking properly, any differences found will be due to either the first or
1.1Srosslast of these reasons.  Variations in the IEC/IEEE Standard that could lead
1.1Srossto false error reports are discussed in the section _Variations_Allowed_by_
1.1Sross_the_IEC/IEEE_Standard_.
1.1Sross
1.1SrossFor each error (or apparent error) TestFloat reports, a line of text
1.1Srossis written to the default output.  If a line would be longer than 79
1.1Srosscharacters, it is divided.  The first part of each error line begins in the
1.1Srossleftmost column, and any subsequent ``continuation'' lines are indented with
1.1Srossa tab.
1.1Sross
1.1SrossEach error reported by `testfloat' is of the form:
1.1Sross
1.1Sross    <inputs>  soft: <output-from-emulation>  syst: <output-from-system>
1.1Sross
1.1SrossThe `<inputs>' are the inputs to the operation.  Each output is shown as a
1.1Srosspair:  the result value first, followed by the exception flags.  The `soft'
1.1Srosslabel stands for ``software'' (or ``SoftFloat''), while `syst' stands for
1.1Sross``system,'' the machine's floating-point.
1.1Sross
1.1SrossFor example, two typical error lines could be
1.1Sross
1.1Sross    800.7FFF00  87F.000100  soft: 001.000000 ....x  syst: 001.000000 ...ux
1.1Sross    081.000004  000.1FFFFF  soft: 001.000000 ....x  syst: 001.000000 ...ux
1.1Sross
1.1SrossIn the first line, the inputs are `800.7FFF00' and `87F.000100'.  The
1.1Srossinternal emulation result is `001.000000' with flags `....x', and the
1.1Srosssystem result is the same but with flags `...ux'.  All the items composed of
1.1Srosshexadecimal digits and a single period represent floating-point values (here
1.1Srosssingle precision).  These cases were reported as errors because the flag
1.1Srossresults differ.
1.1Sross
1.1SrossIn addition to the exception flags, there are seven data types that may
1.1Srossbe represented.  Four are floating-point types:  single precision, double
1.1Srossprecision, extended double precision, and quadruple precision.  The
1.1Srossremaining three types are 32-bit and 64-bit two's-complement integers and
1.1SrossBoolean values (the results of comparison operations).  Boolean values are
1.1Srossrepresented as a single character, either a `0' or a `1'.  32-bit integers
1.1Srossare written as 8 hexadecimal digits in two's-complement form.  Thus,
1.1Sross`FFFFFFFF' is -1, and `7FFFFFFF' is the largest positive 32-bit integer.
1.1Sross64-bit integers are the same except with 16 hexadecimal digits.
1.1Sross
1.1SrossFloating-point values are written in a correspondingly primitive form.
1.1SrossDouble-precision values are represented by 16 hexadecimal digits that give
1.1Srossthe raw bits of the floating-point encoding.  A period separates the 3rd and
1.1Sross4th hexadecimal digits to mark the division between the exponent bits and
1.1Srossfraction bits.  Some notable double-precision values include:
1.1Sross
1.1Sross    000.0000000000000    +0
1.1Sross    3FF.0000000000000     1
1.1Sross    400.0000000000000     2
1.1Sross    7FF.0000000000000    +infinity
1.1Sross
1.1Sross    800.0000000000000    -0
1.1Sross    BFF.0000000000000    -1
1.1Sross    C00.0000000000000    -2
1.1Sross    FFF.0000000000000    -infinity
1.1Sross
1.1Sross    3FE.FFFFFFFFFFFFF    largest representable number preceding +1
1.1Sross
1.1SrossThe following categories are easily distinguished (assuming the `x's are not
1.1Srossall 0):
1.1Sross
1.1Sross    000.xxxxxxxxxxxxx    positive subnormal (denormalized) numbers
1.1Sross    7FF.xxxxxxxxxxxxx    positive NaNs
1.1Sross    800.xxxxxxxxxxxxx    negative subnormal numbers
1.1Sross    FFF.xxxxxxxxxxxxx    negative NaNs
1.1Sross
1.1SrossQuadruple-precision values are written the same except with 4 hexadecimal
1.1Srossdigits for the sign and exponent and 28 for the fraction.  Notable values
1.1Srossinclude:
1.1Sross
1.1Sross    0000.0000000000000000000000000000    +0
1.1Sross    3FFF.0000000000000000000000000000     1
1.1Sross    4000.0000000000000000000000000000     2
1.1Sross    7FFF.0000000000000000000000000000    +infinity
1.1Sross
1.1Sross    8000.0000000000000000000000000000    -0
1.1Sross    BFFF.0000000000000000000000000000    -1
1.1Sross    C000.0000000000000000000000000000    -2
1.1Sross    FFFF.0000000000000000000000000000    -infinity
1.1Sross
1.1Sross    3FFE.FFFFFFFFFFFFFFFFFFFFFFFFFFFF    largest representable number
1.1Sross                                             preceding +1
1.1Sross
1.1SrossExtended double-precision values are a little unusual in that the leading
1.1Srosssignificand bit is not hidden as with other formats.  When correctly
1.1Srossencoded, the leading significand bit of an extended double-precision value
1.1Srosswill be 0 if the value is zero or subnormal, and will be 1 otherwise.
1.1SrossHence, the same values listed above appear in extended double-precision as
1.1Srossfollows (note the leading `8' digit in the significands):
1.1Sross
1.1Sross    0000.0000000000000000    +0
1.1Sross    3FFF.8000000000000000     1
1.1Sross    4000.8000000000000000     2
1.1Sross    7FFF.8000000000000000    +infinity
1.1Sross
1.1Sross    8000.0000000000000000    -0
1.1Sross    BFFF.8000000000000000    -1
1.1Sross    C000.8000000000000000    -2
1.1Sross    FFFF.8000000000000000    -infinity
1.1Sross
1.1Sross    3FFE.FFFFFFFFFFFFFFFF    largest representable number preceding +1
1.1Sross
1.1SrossThe representation of single-precision values is unusual for a different
1.1Srossreason.  Because the subfields of standard single-precision do not fall
1.1Srosson neat 4-bit boundaries, single-precision outputs are slightly perturbed.
1.1SrossThese are written as 9 hexadecimal digits, with a period separating the 3rd
1.1Srossand 4th hexadecimal digits.  Broken out into bits, the 9 hexademical digits
1.1Srosscover the single-precision subfields as follows:
1.1Sross
1.1Sross    x000 .... ....  .  .... .... .... .... .... ....    sign       (1 bit)
1.1Sross    .... xxxx xxxx  .  .... .... .... .... .... ....    exponent   (8 bits)
1.1Sross    .... .... ....  .  0xxx xxxx xxxx xxxx xxxx xxxx    fraction  (23 bits)
1.1Sross
1.1SrossAs shown in this schematic, the first hexadecimal digit contains only
1.1Srossthe sign, and will be either `0' or `8'.  The next two digits give the
1.1Srossbiased exponent as an 8-bit integer.  This is followed by a period and
1.1Sross6 hexadecimal digits of fraction.  The most significant hexadecimal digit
1.1Srossof the fraction can be at most a `7'.
1.1Sross
1.1SrossNotable single-precision values include:
1.1Sross
1.1Sross    000.000000    +0
1.1Sross    07F.000000     1
1.1Sross    080.000000     2
1.1Sross    0FF.000000    +infinity
1.1Sross
1.1Sross    800.000000    -0
1.1Sross    87F.000000    -1
1.1Sross    880.000000    -2
1.1Sross    8FF.000000    -infinity
1.1Sross
1.1Sross    07E.7FFFFF    largest representable number preceding +1
1.1Sross
1.1SrossAgain, certain categories are easily distinguished (assuming the `x's are
1.1Srossnot all 0):
1.1Sross
1.1Sross    000.xxxxxx    positive subnormal (denormalized) numbers
1.1Sross    0FF.xxxxxx    positive NaNs
1.1Sross    800.xxxxxx    negative subnormal numbers
1.1Sross    8FF.xxxxxx    negative NaNs
1.1Sross
1.1SrossLastly, exception flag values are represented by five characters, one
1.1Srosscharacter per flag.  Each flag is written as either a letter or a period
1.1Sross(`.') according to whether the flag was set or not by the operation.  A
1.1Srossperiod indicates the flag was not set.  The letter used to indicate a set
1.1Srossflag depends on the flag:
1.1Sross
1.1Sross    v    invalid flag
1.1Sross    z    division-by-zero flag
1.1Sross    o    overflow flag
1.1Sross    u    underflow flag
1.1Sross    x    inexact flag
1.1Sross
1.1SrossFor example, the notation `...ux' indicates that the underflow and inexact
1.1Srossexception flags were set and that the other three flags (invalid, division-
1.1Srossby-zero, and overflow) were not set.  The exception flags are always shown
1.1Srossfollowing the value returned as the result of the operation.
1.1Sross
1.1SrossThe output from `testsoftfloat' is of the same form, except that the results
1.1Srossare labeled `true' and `soft':
1.1Sross
1.1Sross    <inputs>  true: <simple-software-result>  soft: <SoftFloat-result>
1.1Sross
1.1SrossThe ``true'' result is from the simpler, slower software floating-point,
1.1Srosswhich, although not necessarily correct, is more likely to be right than
1.1Srossthe SoftFloat (`soft') result.
1.1Sross
1.1Sross
1.1Sross-------------------------------------------------------------------------------
1.1SrossVariations Allowed by the IEC/IEEE Standard
1.1Sross
1.1SrossThe IEC/IEEE Standard admits some variation among conforming
1.1Srossimplementations.  Because TestFloat expects the two implementations being
1.1Srosscompared to deliver bit-for-bit identical results under most circumstances,
1.1Srossthis leeway in the standard can result in false errors being reported if
1.1Srossthe two implementations do not make the same choices everywhere the standard
1.1Srossprovides an option.
1.1Sross
1.1Sross- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
1.1SrossUnderflow
1.1Sross
1.1SrossThe standard specifies that the underflow exception flag is to be raised
1.1Srosswhen two conditions are met simultaneously:  (1) _tininess_ and (2) _loss_
1.1Sross_of_accuracy_.  A result is tiny when its magnitude is nonzero yet smaller
1.1Srossthan any normalized floating-point number.  The standard allows tininess to
1.1Srossbe determined either before or after a result is rounded to the destination
1.1Srossprecision.  If tininess is detected before rounding, some borderline cases
1.1Srosswill be flagged as underflows even though the result after rounding actually
1.1Srosslies within the normal floating-point range.  By detecting tininess after
1.1Srossrounding, a system can avoid some unnecessary signaling of underflow.
1.1Sross
1.1SrossLoss of accuracy occurs when the subnormal format is not sufficient
1.1Srossto represent an underflowed result accurately.  The standard allows
1.1Srossloss of accuracy to be detected either as an _inexact_result_ or as a
1.1Sross_denormalization_loss_.  If loss of accuracy is detected as an inexact
1.1Srossresult, the underflow flag is raised whenever an underflowed quantity
1.1Srosscannot be exactly represented in the subnormal format (that is, whenever the
1.1Srossinexact flag is also raised).  A denormalization loss, on the other hand,
1.1Srossoccurs only when the subnormal format is not able to represent the result
1.1Srossthat would have been returned if the destination format had infinite range.
1.1SrossSome underflowed results are inexact but do not suffer a denormalization
1.1Srossloss.  By detecting loss of accuracy as a denormalization loss, a system can
1.1Srossonce again avoid some unnecessary signaling of underflow.
1.1Sross
1.1SrossThe `-tininessbefore' and `-tininessafter' options can be used to control
1.1Srosswhether TestFloat expects tininess on underflow to be detected before or
1.1Srossafter rounding.  (See _TestFloat_Options_ below.)  One or the other is
1.1Srossselected as the default when TestFloat is compiled, but these command
1.1Srossoptions allow the default to be overridden.
1.1Sross
1.1SrossMost (possibly all) systems detect loss of accuracy as an inexact result.
1.1SrossThe current version of TestFloat can only test for this case.
1.1Sross
1.1Sross- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
1.1SrossNaNs
1.1Sross
1.1SrossThe IEC/IEEE Standard gives the floating-point formats a large number of
1.1SrossNaN encodings and specifies that NaNs are to be returned as results under
1.1Srosscertain conditions.  However, the standard allows an implementation almost
1.1Srosscomplete freedom over _which_ NaN to return in each situation.
1.1Sross
1.1SrossBy default, TestFloat does not check the bit patterns of NaN results.  When
1.1Srossthe result of an operation should be a NaN, any NaN is considered as good
1.1Srossas another.  This laxness can be overridden with the `-checkNaNs' option.
1.1Sross(See _TestFloat_Options_ below.)  In order for this option to be sensible,
1.1SrossTestFloat must have been compiled so that its internal floating-point
1.1Srossimplementation (SoftFloat) generates the proper NaN results for the system
1.1Srossbeing tested.
1.1Sross
1.1Sross- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
1.1SrossConversions to Integer
1.1Sross
1.1SrossConversion of a floating-point value to an integer format will fail if the
1.1Srosssource value is a NaN or if it is too large.  The IEC/IEEE Standard does not
1.1Srossspecify what value should be returned as the integer result in these cases.
1.1SrossMoreover, according to the standard, the invalid exception can be raised or
1.1Srossan unspecified alternative mechanism may be used to signal such cases.
1.1Sross
1.1SrossTestFloat assumes that conversions to integer will raise the invalid
1.1Srossexception if the source value cannot be rounded to a representable integer.
1.1SrossWhen the conversion overflows, TestFloat expects the largest integer with
1.1Srossthe same sign as the operand to be returned.  If the floating-point operand
1.2Sandvaris a NaN, TestFloat allows either the largest positive or largest negative
1.1Srossinteger to be returned.  The current version of TestFloat provides no means
1.1Srossto alter these conventions.
1.1Sross
1.1Sross- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
1.1Sross
1.1Sross
1.1Sross-------------------------------------------------------------------------------
1.1SrossTestFloat Options
1.1Sross
1.1SrossThe `testfloat' (and `testsoftfloat') program accepts several command
1.1Srossoptions.  If mutually contradictory options are given, the last one has
1.1Srosspriority.
1.1Sross
1.1Sross- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
1.1Sross-help
1.1Sross
1.1SrossThe `-help' option causes a summary of program usage to be written, after
1.1Srosswhich the program exits.
1.1Sross
1.1Sross- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
1.1Sross-list
1.1Sross
1.1SrossThe `-list' option causes a list of testable functions to be written,
1.1Srossafter which the program exits.  Some machines do not implement all of the
1.1Srossfunctions TestFloat can test, plus it may not be possible to test functions
1.1Srossthat are inaccessible from the C language.
1.1Sross
1.1SrossThe `testsoftfloat' program does not have this option.  All SoftFloat
1.1Srossfunctions can be tested by `testsoftfloat'.
1.1Sross
1.1Sross- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
1.1Sross-level <num>
1.1Sross
1.1SrossThe `-level' option sets the level of testing.  The argument to `-level' can
1.1Srossbe either 1 or 2.  The default is level 1.  Level 2 performs many more tests
1.1Srossthan level 1.  Testing at level 2 can take as much as a day (even longer for
1.1Sross`testsoftfloat'), but can reveal bugs not found by level 1.
1.1Sross
1.1Sross- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
1.1Sross-errors <num>
1.1Sross
1.1SrossThe `-errors' option instructs TestFloat to report no more than the
1.1Srossspecified number of errors for any combination of function, rounding mode,
1.1Srossetc.  The argument to `-errors' must be a nonnegative decimal number.  Once
1.1Srossthe specified number of error reports has been generated, TestFloat ends the
1.1Srosscurrent test and begins the next one, if any.  The default is `-errors 20'.
1.1Sross
1.1SrossAgainst intuition, `-errors 0' causes TestFloat to report every error it
1.1Srossfinds.
1.1Sross
1.1Sross- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
1.1Sross-errorstop
1.1Sross
1.1SrossThe `-errorstop' option causes the program to exit after the first function
1.1Srossfor which any errors are reported.
1.1Sross
1.1Sross- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
1.1Sross-forever
1.1Sross
1.1SrossThe `-forever' option causes a single operation to be repeatedly tested.
1.1SrossOnly one rounding mode and/or rounding precision can be tested in a single
1.1Srossinvocation.  If not specified, the rounding mode defaults to nearest/even.
1.1SrossFor extended double-precision operations, the rounding precision defaults
1.1Srossto full extended double precision.  The testing level is set to 2 by this
1.1Srossoption.
1.1Sross
1.1Sross- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
1.1Sross-checkNaNs
1.1Sross
1.1SrossThe `-checkNaNs' option causes TestFloat to verify the bitwise correctness
1.1Srossof NaN results.  In order for this option to be sensible, TestFloat must
1.1Srosshave been compiled so that its internal floating-point implementation
1.1Sross(SoftFloat) generates the proper NaN results for the system being tested.
1.1Sross
1.1SrossThis option is not available to `testsoftfloat'.
1.1Sross
1.1Sross- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
1.1Sross-precision32, -precision64, -precision80
1.1Sross
1.1SrossFor extended double-precision functions affected by rounding precision
1.1Srosscontrol, the `-precision32' option restricts testing to only the cases
1.1Srossin which rounding precision is equivalent to single precision.  The other
1.1Srossrounding precision options are not tested.  Likewise, the `-precision64'
1.1Srossand `-precision80' options fix the rounding precision equivalent to double
1.1Srossprecision or extended double precision, respectively.  These options are
1.1Srossignored for functions not affected by rounding precision control.
1.1Sross
1.1SrossThese options are not available if extended double precision is not
1.1Srosssupported by the machine or if extended double precision functions cannot be
1.1Srosstested.
1.1Sross
1.1Sross- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
1.1Sross-nearesteven, -tozero, -down, -up
1.1Sross
1.1SrossThe `-nearesteven' option restricts testing to only the cases in which the
1.1Srossrounding mode is nearest/even.  The other rounding mode options are not
1.1Srosstested.  Likewise, `-tozero' forces rounding to zero; `-down' forces
1.1Srossrounding down; and `-up' forces rounding up.  These options are ignored for
1.1Srossfunctions that are exact and thus do not round.
1.1Sross
1.1Sross- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
1.1Sross-tininessbefore, -tininessafter
1.1Sross
1.1SrossThe `-tininessbefore' option indicates that the system detects tininess
1.1Srosson underflow before rounding.  The `-tininessafter' option indicates that
1.1Srosstininess is detected after rounding.  TestFloat alters its expectations
1.1Srossaccordingly.  These options override the default selected when TestFloat was
1.1Srosscompiled.  Choosing the wrong one of these two options should cause error
1.1Srossreports for some (not all) functions.
1.1Sross
1.1SrossFor `testsoftfloat', these options operate more like the rounding precision
1.1Srossand rounding mode options, in that they restrict the tests performed by
1.1Sross`testsoftfloat'.  By default, `testsoftfloat' tests both cases for any
1.1Srossfunction for which there is a difference.
1.1Sross
1.1Sross- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
1.1Sross
1.1Sross
1.1Sross-------------------------------------------------------------------------------
1.1SrossFunction Sets
1.1Sross
1.1SrossJust as TestFloat can test an operation for all four rounding modes in
1.1Srosssequence, multiple operations can be tested with a single invocation of
1.1SrossTestFloat.  Three sets are recognized:  `-all1', `-all2', and `-all'.  The
1.1Srossset `-all1' comprises all one-operand functions; `-all2' is all two-operand
1.1Srossfunctions; and `-all' is all functions.  A function set can be used in place
1.1Srossof a function name in the TestFloat command line, such as
1.1Sross
1.1Sross    testfloat [<option>...] -all
1.1Sross
1.1Sross
1.1Sross-------------------------------------------------------------------------------
1.1SrossContact Information
1.1Sross
1.1SrossAt the time of this writing, the most up-to-date information about
1.1SrossTestFloat and the latest release can be found at the Web page `http://
1.1SrossHTTP.CS.Berkeley.EDU/~jhauser/arithmetic/TestFloat.html'.
1.1Sross
1.1Sross