Chapter 5D Abridged (Simpson's Rule)

Unfortunately this description of a primitive function using the integral for f begs the question of whether this function is computable in some direct way. Certainly most of the examples we have discussed have had primitive functions that can be described using elementary functions, but there are many important functions for which there is no easily computable elementary primitive function. For example the function `e^{-x^2}`, which plays an important role in the study of probability, does not an elementary primitive function. In some situations the value of the integrand may not be described algebraically, and only graphical or numerical information about the function may be available to estimate the related definite integral. Many rational functions do not have an easily described primitive function because of the practical difficulties of finding the precise roots of the polynomials. (See Chapter VII for more on integrating rational functions.) In fact, sometimes a primitive function defined by the integral becomes so important to applications that it is named and studied as carefully as the elementary trignometric functions. One very important example of this type in probability theory is the distribution function called the error function, denoted erf, defined by
erf`(x)=2/{sqrt(pi)}int_0^x e^{-t^2}dt`.

Since the definite integral was defined initially using estimates and limits, it is sensible to think that there might be other methods giving more accurate estimates of the definite integral for less effort. In this section we will investigate some alternatives for estimating the definite integral. In the process we will try to understand both the motivation for these estimates and the justification for believing these estimates actually do approximate the definite integral. We will look at some examples where we can find the definite integral's value exactly using the fundamental theorem of calculus. These examples provide some foundation for comparing the exact result with estimates to see how well a particular method performs when the result is known and comparing the performance of different algorithms. In other examples the fundamental theorem of calculus will not help evaluate the definite integral. In these cases it is truly necesssary to find more efficient methods for approximating its value and knowing something about the accuracy of these estimates.

General Principles: The key motivations for estimating the definite integral are geometric or physical, connected to the interpretation of the definite integral for positive functions visually as an area or dynamically as the estimation of the distance travelled by an object moving. In either case the ideas first use a model for the estimation on a single interval. This model is then reproduced to estimate the integral for sub-intervals of shorter length. These estimates are accumulated to a sum of estimates which provides the improved estimate of the integral for the original interval. In some sense the estimates work by combining two slogans: "what's good for one is good for all" and "the more the merrier!" The accumulation of many small estimates each determined in the same fashion with small errors generally improves the quality of the approximation. So the fundamental approach to estimations of the definite integral follows a 3 step procedure:

Single Interval Estimate Method: Choose a method to make an estimate of the area (or distance travelled) for the single interval [a,b] based on some limited but computable information about the function f for that interval.

For example, use the area of the rectangle with height `f(b)` and base `b-a`, so we would estimate with `f(b) (b-a)`. See Figure 1. This method uses the height of the graph of the function at the right hand endpoint, `f(b)`, for the estimation of the area of the rectangle. This is in contrast to the Euler sum which uses the height at the left hand endpoint of the interval, `f(a)`.

Or equivalently, use the distance travelled at velocity `f(b)` for time `b-a` to motivate the same estimate, `f(b) (b-a)`. See Figure ***. In the motion interpretation this uses the velocity at the arrival time, `f(b)`, to estimate the distance travelled during the time interval instead of the velocity at the initial time, `f(a)`, as used by Euler's method.

Partition: Cut (partition) the given interval `[a,b]` into N pieces of the same size,`{b-a}/N=dx=Delta x`, and use the estimation method to estimate the area (distance travelled) for these smaller intervals and accumulate these estimates by adding to arrive at an estimate for the total area (or distance travelled). This sum may be simplified for computation using some algebra.

Continuing the example, we would let `x_k=a+k Delta x` for `k`= 0, 1, 2,..., N. Then consider intervals `[x_{k-1}, x_k]` for `k`= 1, 2, 3,..., N. Evaluate f at the right hand endpoint, `x_k`, when `k` = 1, 2, 3,..., N and multiply by the length `Delta x`, and then add these to give the estimate

`f(x_1)Delta x +f(x_2)Delta x +...+ f(x_N)Delta x =Sigma_{k=1}^N f(x_k)Delta x`

This formula for the estimate can be simplified algebraically by recognizing the common factor of `Delta x`. So the estimate can be computed with the formula

`[f(x_1)+f(x_2) +...+ f(x_N)]Delta x = [Sigma_{k=1}^N f(x_k)]Delta x `

The graphical visualization is very suggestive here. A positive continuous function gives a picture with a region in the plane cut into N pieces, each estimated by a rectangle with base of length

`Delta x` and height determined by the graph of the function at the right hand endpoint. See Figure ***. This estimate is called the right hand endpoint (Euler) estimate.

Justify: We would like to justify that these estimates actually approach the desired integral as N gets large. To do this we connect our method either to the defining approximations of the definite integral or to some other estimate known to approximate the definite integral.

In the example, the estimating sum `Sigma_{k=1}^N f(x_k)Delta x` can be compared with the euler sum
`Sigma_{k=0}^{N-1}f(x_k) Delta x` used to define the definite integral. Consider that the difference between these two estimates using the same number of segments, N, is

`Sigma_{k=1}^N f(x_k)Delta x` - `Sigma_{k=0}^{N-1}f(x_k) Delta x = f(x_N)Delta x - f(x_0)Delta x = [f(b)-f(a)]Delta x.`

Again the area interpretation can be helpful here. See Figure***. Now when `N` is large, `Delta x = {b-a}/N ->0` , so the difference between these two estimates is also approaches 0. The definition of the definite integral tells us that when the definite integral exists, `Sigma_{k=0}^{N-1}f(x_k) Delta x -> int_a^b f(x) dx`.
Hence the estimate `Sigma_{k=1}^{N}f(x_k) Delta x -> int_a^b f(x) dx.`

Simpson's Rule for estimating `int_a^b f(x) dx` is developed by using quadratic functions (parabolas) to estimate the integrand f. There are many quadratic functions `P(x)=Ax^2+Bx+C` where `P(a)=f(a)` and `P(b)=f(b)` and whose graph therefore coincides at two points with the graph of `f`. To determine a single quadratic function for Simpson's rule, we consider the unique quadratic that also agrees with `f` at the midpoint `m=(a+b)/2`, that is, `P(m) = f(m)`.

Thus for the initial single interval estimate we approximate `int_a^b f(x) dx` with `int_a^b P(x) dx = int_a^b Ax^2 + Bx+ C dx` where A, B, and C are determined by `P(a)=f(a), P(b)=f(b)`, and `P(m)=f(m)`.[ See Figure V.D.1.]

Figure V.D.1

Now `int_a^b Ax^2 + Bx+ C dx = {Ax^3}/3 + {Bx^2}/2 + C| _a^b`
` {A(b^3-a^3)}/3 +{B(b^2-a^2)}/2 +C (b-a) = {(b-a)}/6 [2A(b^2+ba+a^2) +3B(b+a) + 6C].`

When we plug `a, m,` and `b` into `f(x)` we obtain
`f(a) = Aa^2 + Ba + C`,
`f(m) = A ({a+b}/2)^2 + B{a+b}/2 + C = 1/4[Aa^2 + 2Aab + A b^2 + 2B( a+b) + 4C]`,
`f(b) = Ab^2 + Bb + C`.

With substitution and a little algebraic manipulation we arrive at the following rather simple equation relating the estimating quadratic integral to the values of `f` at `a`,` m`, and `b`.

`int_a^b Ax^2 + Bx + C dx = {b-a}/6 [f(a) + 4f(m) + f(b)].`

If `h={b-a}/2` and `a=x_0, m=x_1,` and `b=x_2` then this becomes

`int_a^b Ax^2 + Bx + C dx = h/3 [f(x_0) + 4f(x_1) + f(x_2)].`

To generalize this work we partition the interval `[a,b]` into `2k=N` subintervals of length `h=(b-a)/{2k}=(b-a)/N` by letting `x_i=a+ih`. We estimate the integral of f for each subinterval as we just did so that for each of the `k` intervals `[x_ {2i},x_{2i+2}]` we have

`int_{x_{2i}}^{x_{2i+2}}f(x)dx~~int_{x_{2i}}^{x_{2i+2}} Ax^2 + Bx + C dx = h/3 [f(x_{2i}) + 4f(x_{2i+1}) + f(x_{2i+2})].`
or
`int_{x_{2i}}^{x_{2i+2}}f(x)dx~~int_{x_{2i}}^{x_{2i+2}} Ax^2 + Bx + C dx = {b-a}/{6k} [f(x_{2i}) + 4f(x_{2i+1}) + f(x_{2i+2})].`

Solution: Begin as usual by finding `{b-a}/N = {2-0}/4=1/2`. Thus `x_0= 0, x_1= 0.5, x_2= 1, x_3 = 1.5`, and `x_4 = 2`. Following the last formula for Simpson's Rule we have

You can also look at the following GeoGebra spreadsheet for `f (x) = x^4` . You can change `f` using the Input Box.

xi	`f(x)=x^3	Simpson Factor	Summand
0	0	1	0
0.5	0.125	4	0.5
1	1	2	2
1.5	3.375	4	13.5
2	8	1	8
		Simpson Sum = 24
		Simpson Estimate = `1/6` Sum = 4

V.D Other Methods of Estimating The Definite Integral (Abridged)