Page 20

Chapter 1. Introduction

calculations is the bit, all other numbers such as rational, fractions and irrational are

represented, and often represented by an approximation, by real numbers. As it is clear

a standard had to be established in order to represent these numbers so as to be used

in computer science. The term ﬂoating point refers to the fact that a number’s radix

point in computers, can ”ﬂoat”. That is, it can be placed anywhere relative to the

signiﬁcant digits of the number. This position is indicated as the exponent component

in the internal representation, and ﬂoating point can thus be thought of as a computer

realization of scientiﬁc notation. All ﬂoating point numbers are represented with the

following formula:

Signif icantDigits ∗ base

exponent

(1.1)

The numbers are, in general, represented approximately to a ﬁxed number of signiﬁcant

digits (the signiﬁcand) and scaled using an exponent. The base for the scaling is normally

2, 10 or 16. The idea of ﬂoating-point representation over intrinsically integer ﬁxed-point

numbers, which consist purely of signiﬁcand, is that expanding it with the exponent

component achieves greater range. For instance, to represent large values, e.g. distances

between galaxies, there is no need to keep all 39 decimal places down to femtometre-

resolution (employed in particle physics). Assuming that the best resolution is in light

years, only the 9 most signiﬁcant decimal digits matter, whereas the remaining 30 digits

carry pure noise, and thus can be safely dropped. This represents a savings of 100 bits

of computer data storage. Instead of these 100 bits, much fewer are used to represent

the scale (the exponent), e.g. 8 bits or 2 decimal digits. Given that one number can

encode both astronomic and subatomic distances with the same nine digits of accuracy,

but because a 9-digit number is 100 times less accurate than the 11 digits reserved

for scale, this is considered a trade-oﬀ exchanging range for precision. The example

of using scaling to extend the dynamic range reveals another contrast with ﬁxed-point

numbers: Floating-point values are not uniformly spaced. Small values, close to zero,

can be represented with much higher resolution (e.g. one femtometre) than large ones

because a greater scale (e.g. light years) must be selected for encoding signiﬁcantly

larger values.[1] That is, ﬂoating-point numbers cannot represent point coordinates with

atomic accuracy at galactic distances, only close to the origin.