We shall represent a monochrome (luminance) image by a matrix x whose elements are x(n), where n = (n1, n2) is the integer vector of row and column indexes. The energy of x is defined as

$$\text{Energy of } x = \sum_{n} x^2(n) \qquad (1)$$
where the sum is performed over all n in x.

Figure 1 shows the main blocks in any image coding system. The decoder is the inverse of the encoder. The three encoder blocks perform the following tasks:
- Energy compression - This is usually a transformation or filtering process which aims to concentrate a high proportion of the energy of the image x into as few samples (coefficients) of y as possible, while preserving the total energy of x in y. This minimises the number of non-zero samples of y which need to be transmitted for a given level of distortion in the reconstructed image x̂.
- Quantisation - This represents the samples of y to a given level of accuracy in the integer matrix q. The quantiser step size controls the tradeoff between distortion and bit rate and may be adapted to take account of human visual sensitivities. The inverse quantiser reconstructs ŷ, the best estimate of y, from q.
- Entropy coding - This encodes the integers in q into a serial bit stream d, using variable-length entropy codes which attempt to minimise the total number of bits in d, based on the statistics (PDFs) of various classes of samples in q.
The energy compression / reconstruction and the entropy coding / decoding processes are normally all lossless. Only the quantiser introduces loss and distortion: ŷ is a distorted version of y, and hence x̂ is a distorted version of x. In the absence of quantisation, if ŷ = y, then x̂ = x.
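To make the lossy step concrete, here is a minimal sketch of a uniform quantiser and its inverse with step size Q (the function names and the toy matrix are illustrative only, not taken from any particular codec):

```python
import numpy as np

def quantise(y, Q):
    """Uniform quantiser: round each sample of y to the nearest multiple of Q."""
    return np.round(y / Q).astype(int)       # the integer matrix q

def inverse_quantise(q, Q):
    """Reconstruct the best estimate of y from the integers q."""
    return q * Q

# Toy example: the only loss in the whole chain comes from this rounding step.
y = np.array([[0.3, -4.2], [7.9, 1.1]])
Q = 2.0
q = quantise(y, Q)
y_hat = inverse_quantise(q, Q)
print(q)
print(np.abs(y - y_hat).max())               # distortion is bounded by Q/2
```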
Use of Laplacian PDFs in Image Compression
It is found to be appropriate and convenient to model the distribution of many types of transformed image coefficients by Laplacian distributions. It is appropriate because much real data is approximately modeled by the Laplacian probability density function (PDF), and it is convenient because the mathematical form of the Laplacian PDF is simple enough to allow some useful analytical results to be derived.
A Laplacian PDF is a back-to-back pair of exponential decays and is given by:

$$p(x) = \frac{1}{2 x_0}\, e^{-|x| / x_0}$$

where x0 is the equivalent of a time constant which defines the width of the PDF from the centre to the 1/e points. The initial scaling factor ensures that the area under p(x) is unity, so that it is a valid PDF. Figure 1 shows the shape of p(x).

The mean of this PDF is zero and the variance is given by:
$$v(x_0) = \int_{-\infty}^{\infty} x^2 p(x)\, dx = 2 \int_{0}^{\infty} \frac{x^2}{2 x_0}\, e^{-x / x_0}\, dx = 2 x_0^2 \qquad (2)$$

(using integration by parts twice). Hence the standard deviation is:

$$\sigma = \sqrt{2 x_0^2} = \sqrt{2}\, x_0 \qquad (3)$$
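As a quick numerical check of Equation (2) (a sketch only; the value of x0 is arbitrary and chosen purely for illustration):

```python
import numpy as np

x0 = 5.0                                   # illustrative width parameter
dx = x0 / 1000.0
x = np.arange(-50 * x0, 50 * x0, dx)       # grid wide enough for the tails to be negligible
p = np.exp(-np.abs(x) / x0) / (2 * x0)     # Laplacian PDF

area     = np.sum(p) * dx                  # ~1, so p(x) is a valid PDF
variance = np.sum(x * x * p) * dx          # ~2 * x0**2, matching Equation (2)
print(area, variance, 2 * x0**2)
print(np.sqrt(variance), np.sqrt(2) * x0)  # standard deviation ~ sqrt(2) * x0
```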
Given the variance (power) of a subimage of transformed pels, we may calculate x0 and hence determine the PDF of the subimage, assuming a Laplacian shape. We now show that, if we quantise the subimage using a uniform quantiser with step size Q, we can calculate the entropy of the quantised samples and thus estimate the bit rate needed to encode the subimage in bits/pel. This is a powerful analytical tool as it shows how the compressed bit rate relates directly to the energy of a subimage. The vertical dashed lines in Figure 1 show the decision thresholds for a typical quantiser for the case when Q = 2x0.

First we analyse the probability of a pel being quantised to each step of the quantiser. This is given by the area under p(x) between each adjacent pair of quantiser thresholds.
- Probability of being at step 0, p0 = Pr[−Q/2 < x < Q/2] = 2 Pr[0 < x < Q/2]
- Probability of being at step k, pk = Pr[(k − 1/2)Q < x < (k + 1/2)Q]
First, for x2 ≥ x1 ≥ 0, we calculate:

$$\Pr[x_1 < x < x_2] = \int_{x_1}^{x_2} p(x)\, dx = -\tfrac{1}{2} e^{-x/x_0} \Big|_{x_1}^{x_2} = \tfrac{1}{2}\left(e^{-x_1/x_0} - e^{-x_2/x_0}\right)$$

Therefore,

$$p_0 = 2 \cdot \tfrac{1}{2}\left(1 - e^{-Q/2x_0}\right) = 1 - e^{-Q/2x_0} \qquad (4)$$

and, for k ≥ 1,
$$p_k = \tfrac{1}{2}\left(e^{-(k-\frac{1}{2})Q/x_0} - e^{-(k+\frac{1}{2})Q/x_0}\right) = \sinh\!\left(\frac{Q}{2 x_0}\right) e^{-k Q / x_0} \qquad (5)$$

By symmetry, if k is nonzero, $p_{-k} = p_k = \sinh\!\left(\frac{Q}{2 x_0}\right) e^{-|k| Q / x_0}$.
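These expressions are easy to sanity-check numerically. The sketch below (with arbitrary illustrative values of x0 and Q) confirms that Equation (5) matches direct integration of p(x) and that the step probabilities sum to one:

```python
import numpy as np

x0, Q = 10.0, 20.0                          # illustrative values (here Q = 2 * x0)

def prob_interval(x1, x2):
    """Pr[x1 < x < x2] for 0 <= x1 <= x2, from the integral worked out above."""
    return 0.5 * (np.exp(-x1 / x0) - np.exp(-x2 / x0))

p0 = 1.0 - np.exp(-Q / (2 * x0))            # Equation (4)

def pk(k):
    """Probability of being quantised to step k >= 1, Equation (5)."""
    return np.sinh(Q / (2 * x0)) * np.exp(-k * Q / x0)

print(pk(1), prob_interval(0.5 * Q, 1.5 * Q))        # the two should agree
print(p0 + 2 * sum(pk(k) for k in range(1, 200)))    # ~1.0, as probabilities must
```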
Now we can calculate the entropy of the subimage:

$$H = -\sum_{k=-\infty}^{\infty} p_k \log_2 p_k = -p_0 \log_2 p_0 - 2 \sum_{k=1}^{\infty} p_k \log_2 p_k \qquad (6)$$

To make the evaluation of the summation easier when we substitute for pk, we let $p_k = \alpha r^k$ where $\alpha = \sinh\!\left(\frac{Q}{2 x_0}\right)$ and $r = e^{-Q/x_0}$. Therefore,
$$\sum_{k=1}^{\infty} p_k \log_2 p_k = \sum_{k=1}^{\infty} \alpha r^k \log_2(\alpha r^k) = \sum_{k=1}^{\infty} \alpha r^k \left(\log_2 \alpha + k \log_2 r\right) = \alpha \log_2 \alpha \sum_{k=1}^{\infty} r^k + \alpha \log_2 r \sum_{k=1}^{\infty} k r^k \qquad (7)$$

Now $\sum_{k=1}^{\infty} r^k = \frac{r}{1-r}$ and, differentiating by r: $\sum_{k=1}^{\infty} k r^{k-1} = \frac{1}{(1-r)^2}$. Therefore,
$$\sum_{k=1}^{\infty} p_k \log_2 p_k = \alpha \log_2 \alpha\, \frac{r}{1-r} + \alpha \log_2 r\, \frac{r}{(1-r)^2} = \frac{\alpha r}{1-r} \left(\log_2 \alpha + \frac{\log_2 r}{1-r}\right) \qquad (8)$$

and

$$p_0 \log_2 p_0 = (1 - \sqrt{r}) \log_2(1 - \sqrt{r}) \qquad (9)$$
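Equation (8) can be checked against a direct numerical summation (a sketch with arbitrary illustrative values of x0 and Q):

```python
import numpy as np

x0, Q = 10.0, 15.0                            # illustrative values
alpha = np.sinh(Q / (2 * x0))
r = np.exp(-Q / x0)

# Direct summation of sum_{k>=1} p_k * log2(p_k) with p_k = alpha * r**k
direct = sum(alpha * r**k * np.log2(alpha * r**k) for k in range(1, 500))

# Closed form from Equation (8)
closed = (alpha * r / (1 - r)) * (np.log2(alpha) + np.log2(r) / (1 - r))

print(direct, closed)                         # the two should agree
```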
Hence the entropy is given by:

$$H = -(1 - \sqrt{r}) \log_2(1 - \sqrt{r}) - \frac{2 \alpha r}{1-r} \left(\log_2 \alpha + \frac{\log_2 r}{1-r}\right) \qquad (10)$$

Because both α and r are functions of Q/x0, H is a function of just Q/x0 too. We expect that, for constant Q, as the energy of the subimage increases, the entropy will also increase approximately logarithmically, so we plot H against x0/Q in dB in Figure 2. This shows that our expectations are borne out.

We can show this in theory by considering the case when x0/Q ≫ 1, when we find that:

$$\alpha \approx \frac{Q}{2 x_0}, \qquad r \approx 1 - \frac{Q}{x_0} \approx 1 - 2\alpha, \qquad \sqrt{r} \approx 1 - \alpha$$

Using the approximation $\log_2(1 - \varepsilon) \approx -\frac{\varepsilon}{\ln 2}$ for small ε, it is then fairly straightforward to show that

$$H \approx -\log_2 \alpha + \log_2 e \approx \log_2\!\left(\frac{2 e\, x_0}{Q}\right)$$

We denote this approximation as Ha in Figure 2, which shows how close to H the approximation is, for x0 > Q (i.e. for x0/Q > 0 dB).

We can compare the entropies calculated using
Equation 10 with those that were calculated from the bandpass subimage histograms, as given in these figures describing Haar transform energies and entropies:
level 1 energies,
level 2 energies,
level 3 energies, and
level 4 energies. (The Lo-Lo subimages have PDFs which are more uniform and do not fit the Laplacian model well.) The values of
x0 are calculated from:
$$x_0 = \frac{\sigma}{\sqrt{2}} = \sqrt{\frac{\text{subimage energy}}{2 \times (\text{no of pels in subimage})}}$$
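In code this is simply (a sketch; the function name is illustrative):

```python
import numpy as np

def x0_from_energy(energy, n_pels):
    """Laplacian width parameter x0 = sigma / sqrt(2), estimated from subimage energy."""
    return np.sqrt(energy / (2.0 * n_pels))

# Level-1 Hi-Lo subimage from Table 1 below: energy 4.56e6 over 16384 pels
print(x0_from_energy(4.56e6, 16384))   # ~11.8, as tabulated
```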
The following table (Table 1) shows this comparison:
| Transform level | Subimage type | Energy (×10⁶) | No of pels | x0 | Laplacian entropy (bit/pel) | Measured entropy (bit/pel) |
| --- | --- | --- | --- | --- | --- | --- |
| 1 | Hi-Lo | 4.56 | 16384 | 11.80 | 2.16 | 1.71 |
| 1 | Lo-Hi | 1.89 | 16384 | 7.59 | 1.58 | 1.15 |
| 1 | Hi-Hi | 0.82 | 16384 | 5.09 | 1.08 | 0.80 |
| 2 | Hi-Lo | 7.64 | 4096 | 30.54 | 3.48 | 3.00 |
| 2 | Lo-Hi | 2.95 | 4096 | 18.98 | 2.81 | 2.22 |
| 2 | Hi-Hi | 1.42 | 4096 | 13.17 | 2.31 | 1.75 |
| 3 | Hi-Lo | 13.17 | 1024 | 80.19 | 4.86 | 4.52 |
| 3 | Lo-Hi | 3.90 | 1024 | 43.64 | 3.99 | 3.55 |
| 3 | Hi-Hi | 2.49 | 1024 | 34.87 | 3.67 | 3.05 |
| 4 | Hi-Lo | 15.49 | 256 | 173.9 | 5.98 | 5.65 |
| 4 | Lo-Hi | 6.46 | 256 | 112.3 | 5.35 | 4.75 |
| 4 | Hi-Hi | 3.29 | 256 | 80.2 | 4.86 | 4.38 |
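The Laplacian-entropy column can be reproduced from Equation (10). The quantiser step size used for the table is not stated in this section, so the sketch below treats it as an assumption; Q = 15 appears to reproduce the tabulated values:

```python
import numpy as np

def laplacian_entropy(x0, Q):
    """Entropy in bit/pel of a uniformly quantised Laplacian source, Equation (10)."""
    alpha = np.sinh(Q / (2 * x0))
    r = np.exp(-Q / x0)
    sr = np.sqrt(r)
    return (-(1 - sr) * np.log2(1 - sr)
            - (2 * alpha * r / (1 - r)) * (np.log2(alpha) + np.log2(r) / (1 - r)))

Q = 15.0   # assumed step size (not given in the text); it matches the tabulated entropies
for x0, tabulated in [(11.80, 2.16), (30.54, 3.48), (173.9, 5.98)]:
    print(x0, round(laplacian_entropy(x0, Q), 2), tabulated)
```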
We see that the entropies calculated from the energy via the Laplacian PDF method (second column from the right) are approximately 0.5 bit/pel greater than the entropies measured from the Lenna subimage histograms. This is because the actual PDFs have heavier tails than the exponentially decreasing tails of the Laplacian model. More accurate entropies can be obtained if x0 is instead estimated from the mean absolute values of the pels in each subimage. For a Laplacian PDF we can show that
$$\text{Mean absolute value} = \int_{-\infty}^{\infty} |x|\, p(x)\, dx = 2 \int_{0}^{\infty} \frac{x}{2 x_0}\, e^{-x/x_0}\, dx = x_0 \qquad (11)$$

This gives values of x0 that are about 20% lower than those calculated from the energies, and the calculated entropies are then within approximately 0.2 bit/pel of the measured entropies.
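For data that really is Laplacian the two estimates of x0 agree, so the discrepancy above is purely a symptom of the heavier tails of the real histograms. A minimal sketch with synthetic Laplacian samples (sample size and seed are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(0)
x0 = 10.0                                    # illustrative width parameter
samples = rng.laplace(loc=0.0, scale=x0, size=1_000_000)

print(np.mean(np.abs(samples)))              # ~x0, as in Equation (11)
print(np.sqrt(np.mean(samples**2) / 2.0))    # energy-based estimate, also ~x0 for true Laplacian data
```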