Oscillation of a Function
In a previous post we obtained the Riemann's condition of integrability using the concept of upper and lower Darboux sums. We observed that in order that a function be Riemann integrable on interval [a,b] it was necessary (and sufficient) to make the sum U(P,f)−L(P,f)=n∑k=1(Mk−mk)(xk−xk−1) arbitrarily small for some partition P={x0,x1,x2,…,xn} of [a,b].Now it is obvious that this sum can be made arbitrarily smaller if both the parts (Mk−mk) and (xk−xk−1) can be made arbitrarily smaller. Making the Δxk=xk−xk−1 small represents no hurdle, we just need to choose partitions of sufficiently small norms.
It is the difference Mk−mk which requires further probing. Technically this difference is called the oscillation of function f on interval [xk−1,xk]. More formally we define the oscillation of a function f on an interval I (denoted by Ωf(I)) as follows: Ωf(I)=sup{f(x)−f(y)∣x,y∈I} Clearly Ωf(I) exists and is non-negative provided f is bounded in I. It should be obvious that Ωf(I1)≤Ωf(I2) whenever I1⊆I2. It also makes sense to define the oscillation of a function f at a certain point. Let f be defined in a certain neighborhood of a point a. Then we define the oscillation of f at a (denoted by ωf(a)) as follows: ωf(a)=limh→0+Ωf((a−h,a+h)) The above limit always exists (provided f is bounded in neighborhood of a) because when h→0+ the oscillation decreases and is always non-negative. It should also be obvious that ωf(a)=0 if and only if the function f is continuous at a.
We next need to understand that if the oscillation of a function at a certain point a is small then its oscillation is also small at points in the neighborhood of a. To be precise we have the following property:
If for some given ϵ>0 we have ωf(a)<ϵ then there is a neighborhood I of a such that ωf(x)<ϵ for all x∈I.
To see why this is the case we need to understand that if ωf(a)=A<ϵ then there is a neighborhood I of a such that Ωf(I)<(A+ϵ)/2. Since I is an open interval, if x∈I then we can find a neighborhood of x which is wholly contained in I and in this neighborhood the oscillation of f does not exceed (A+ϵ)/2 and hence the oscillation ωf(x)≤(A+ϵ)/2<ϵ.
From the above it follows that:
If f is bounded in [a,b] then for any given ϵ>0 the set of points Iϵ={x∣x∈[a,b],ωf(x)<ϵ} is open.
and therefore it is quite obvious that:
If f is bounded on [a,b] then for any given ϵ>0 the set of points Jϵ={x∣x∈[a,b],ωf(x)≥ϵ} is closed.
Also we need another result (whose proof is reminiscent of the proof for uniform continuity) on oscillation of a function:
If a function f is bounded on [a,b] and there is an ϵ>0 such that ωf(x)<ϵ for all x∈[a,b] then we can find a partition of [a,b] such that the oscillation of f in each of the sub-intervals generated by this partition is less then ϵ.
Clearly if ωf(x)<ϵ then there is a neighborhood Ix of x in which the oscillation of f is less than ϵ. Together all these neighborhoods Ix form an open cover for [a,b] and hence by Heine-Borel Principle there is a finite set of neighborhoods Ix which covers the interval [a,b]. Naturally these neighborhoods must overlap and the end points of these neighborhoods partition the interval [a,b] into a finite number of sub-intervals. Let δ be a positive number less than the length of least of these sub-intervals. Then any sub-interval of length less than δ is contained in some interval Ix and hence the oscillation of f in this sub-interval is less than ϵ. We thus only need to form a partition of [a,b] whose norm is less than δ and then oscillation of f in each of the sub-intervals generated by P is less than ϵ.
Condition for Integrability
Let a function f be bounded in [a,b] and let σ>0 be given. If we consider a partition P={x0,x1,x2,…,xn} then there will be certain sub-intervals [xk−1,xk] in which the oscillation of f is less than σ and in the remaining sub-intervals the oscillation of f will be greater than or equal to σ. Then the sum U(P,f)−L(P,f)=n∑k=1(Mk−mk)Δxk can be split into two sums S1,S2 where S1 is based on sub-intervals where the oscillation of f is less than σ and the sum S2 is based on the remaining sub-interval. Clearly then we have S1≤σ(b−a),S2≥σ∑Δxk=σLσ where the sum Lσ=∑Δxk in bove equation is based on the sub-intervals where the oscillation of f is greater than or equal to σ. If f is integrable on [a,b] then we must be able to make the difference U(P,f)−L(P,f) less than ϵ for any given ϵ>0. Clearly this would mean that S2<ϵ so that σLσ≤S2<ϵ. Thus we have Lσ<ϵ/σ.So we see that in order that a function be integrable on an interval it is necessary that given any σ>0 we must be able to make the sum of lengths of sub-intervals where oscillation of f is greater or equal to σ arbitrarily small. Riemann understood that this condition is also sufficient for integrability. Let ϵ>0 be given. Also let M=Ωf([a,b]). We set σ=ϵ/(2(b−a)) and we choose a partition P such that the sum of lengths of sub-intervals where the oscillation of f is greater than or equal to σ is less than ϵ/2M. Clearly then we have S1<ϵ2(b−a)⋅(b−a)=ϵ2,S2<M⋅ϵ2M=ϵ2 so that U(P,f)−L(P,f)<ϵ. What we have established above is the following:
Let f be a function bounded on [a,b]. Then f is integrable on [a,b] if and only if for any given σ>0,λ>0 it is possible to find a partition P of [a,b] such that the sum of lengths of sub-intervals of P where the oscillation of f exceeds σ is less than λ.
Lebesgue's Criterion of Integrability
In the above condition of integrability by Riemann we are forced to consider the lengths of sub-intervals satisfying some particular property. Namely we want to constrain the length of sub-intervals where the oscillation of the function is not small. We also know that if a function is continuous at a point then its oscillation at that point is zero (and conversely). Therefore the sub-intervals where the oscillation is not small should contain points of discontinuity of the function. So in effect Riemann condition implies that we should be able to cover the points of discontinuity by a set of intervals whose total length is arbitrarily small. This important idea was first formalized by Henri Lebesgue and was given the name of "measure".More formally we say that a set of points is of measure zero if for any given ϵ>0 it can be covered by a countable collection of open intervals the total sum of whose lengths is less than ϵ.
It is trivial that a set consisting of a single point is of measure zero. Also if we have a countable number of sets Si each of which is of measure zero, then their union is also of measure zero. Clearly given ϵ>0 we can cover the set Si by a countable collection of intervals whose total length is less than ϵ/2i and hence the union of all sets Si can be covered by a countable collection of intervals whose total length is less than ∑(ϵ/2i)=ϵ. It is thus clear that any countable set is of measure zero and in particular the set of all rational numbers is of measure zero. There also exist uncountable sets which are of measure zero and we shall indicate one famous example of such a set later.
So Riemann condition seems to suggest that the set of discontinuities of an integrable function should be of measure zero. This condition was established by Lebesgue and can be stated as follows:
Let f be a function bounded on [a,b]. Then f is Riemann integrable on [a,b] if and only if the set of discontinuities of f in [a,b] is of measure zero.
It is easy to observe that the set of discontinuities of f can be expressed as a union D=∞⋃i=1Di where Di={x∣x∈[a,b],ωf(x)≥1i} If D is not of measure zero then there is some set Di which is not of measure zero. Clearly in that case there exists an ϵ>0 such that Di can not be covered by any collection of intervals whose total length is less than ϵ. If P is a partition of [a,b] we can split the difference U(P,f)−L(P,f)=∑(Mk−mk)(xk−xk−1) into two sums S1,S2 such that S1 is based on those sub-intervals which don't contain any point of Di. Then clearly sub-intervals in S2 form a cover for the points of Di and hence in each of these sub-intervals the oscillation of f is greater than or equal to 1/i. Also the total sum of the lengths of these sub-intervals is not less than ϵ. Therefore it follows that U(P,f)−L(P,f)≥S2≥ϵ/i for any partition P of [a,b]. Clearly this means that the function f is not Riemann integrable on [a,b].
Next we prove that if set D is of measure zero then f is integrable on [a,b]. This is bit subtle in terms of exact formal argument and the reader must pay attention to what follows. Since D is of measure zero and Di⊆D for all i, it follows that each Di is of measure zero. Hence for each i the set Di can be covered by a countable collection of open intervals the sum of whose lengths is less than 1/i. Since the set Di is closed and bounded, Heine-Borel Principle applies here (this although requires a stronger form of the principle than we have established in previous post). Therefore set Di can be covered by a finite number of open intervals the sum of whose lengths is less than 1/i. Let the union of these open intervals be denoted by Ai and let Bi denote set of points which are in [a,b] but not in Ai. Then clearly Bi is made up of a finite number of closed intervals. If I is one of the intervals making up Bi then we have ωf(x)<1/i for all x∈I. Hence the interval I can be further partitioned into sub-intervals such that oscillation of f in each of these sub-intervals in less than 1/i. Thus we observe that the set Bi can be partitioned into sub-intervals such that the oscillation of f in each of these sub-intervals is less than 1/i. The end points of these sub-intervals make a partition P of [a,b]. We then express the difference U(P,f)−L(P,f)=∑(Mk−mk)(xk−xk−1)=S1+S2 where S1 is based on those sub-intervals which contain points of Di and S2 is based on the remaining sub-intervals. Clearly then we see that sub-intervals corresponding to S1 are covered by Ai and here Mk−mk≤M−m and ∑Δxk<1/i so that S1<(M−m)/i. The sub-intervals corresponding to S2 are contained in Bi and here we have Mk−mk<1/i and \sum \Delta x_{k} \leq (b - a) so that S_{2} < (b - a) / i. It follows that U(P, f) - L(P, f) < (M - m + b - a) / i Thus corresponding to any positive integer i we have a partition P such that U(P, f) - L(P, f) < (M - m + b - a) / i It follows that f is integrable on [a, b].
If a property holds on a certain set of points A except for a set of points of measure zero then we say that the property holds almost everywhere (abbreviated frequently as a.e.) on set A. Thus the Lebesgue's criterion can be expressed concisely as follows:
A bounded function f is integrable on [a, b] if and only if it is continuous almost everywhere in [a, b].
This result can be considered as the beginning of the theory of Measure and Integration as developed by Henri Lebesgue. We shall have occasion to deal with the rich and elegant theory developed by Lebesgue in later posts. For now we shall be content to give an example of an uncountable set which is of measure zero.
The Cantor Set
Let A_{0} = [0, 1]. We remove the middle third open interval (1/3, 2/3) from A_{0} to get A_{1} = [0, 1/3] \cup [2/3, 1]. Thus A_{1} is composed of 2 intervals whose total length is 1 - 1/3 = 2/3. Now we remove the middle third open interval from each of these two intervals to get A_{2} which consists now of 4 closed intervals whose total length is 1 - 1/3 - 2/9 = 4/9. We proceed in this similar fashion indefinitely to generate sets A_{i} which consists of 2^{i} closed intervals whose total length is 1 - \left(\frac{1}{3} + \frac{2}{9} + \frac{4}{27} + \cdots + \frac{2^{i - 1}}{3^{i}}\right) = \left(\frac{2}{3}\right)^{i} Let A = \bigcap_{i = 0}^{\infty}A_{i} It is obvious that 0 \in A, 1 \in A and hence set A is non-empty. The set A is called Cantor set.Clearly A can be covered by A_{i} whose length is (2/3)^{i} and (2/3)^{i} \to 0 as i \to \infty, it follows that set A is of measure zero. It needs to be shown that set A is uncountable. If we observe the construction of set A deeply we will see that it consists of those numbers of interval [0, 1] whose ternary expansion (as opposed to decimal expansion) consists of only 0 or 2 but not 1. Such numbers can't be arranged in a sequence. If they could be arranged in sequence say x_{1}, x_{2}, \ldots then we could write \begin{align} x_{1} &= 0.x_{11}x_{12}x_{13}\ldots\notag\\ x_{2} &= 0.x_{21}x_{22}x_{23}\ldots\notag\\ x_{3} &= 0.x_{31}x_{32}x_{33}\ldots\notag\\ \cdots\notag\end{align} where the representation is in ternary and each of x_{ij} is either 0 or 2. Clearly we can now construct a number y = 0.y_{1}y_{2}y_{3}\ldots such that y_{n} = 2 - x_{nn}. Then clearly y_{n} \neq x_{nn} for any n. Thus clearly y belongs to A, but does not belong to sequence \{x_{n}\}. This famous technique of Cantor called diagonal slash shows that the set A is uncountable.
Print/PDF Version
Thankyou for the tutorial to riemann integral. Is it possible to add tutorial regarding lebesgue integral or measure theory too. --Rajendra
Anonymous
February 6, 2014 at 9:12 PM@Rajendra
I am working on a series regarding Measure Theory and Lebesgue Integration. But I find that the proofs of various theorems are quite lengthy and bit technical to explain. Perhaps it will make for a very long series consisting of 8-10 posts.
Paramanand
February 7, 2014 at 9:24 AM