# Cantor set

﻿
Cantor set

In mathematics, the Cantor set, introduced by German mathematician Georg Cantor in 1883 [Georg Cantor (1883) "Über unendliche, lineare Punktmannigfaltigkeiten V" [On infinite, linear point-manifolds (sets)] , "Mathematische Annalen", vol. 21, pages 545-591.] [H.-O. Peitgen, H. Jürgens, and D. Saupe, "Chaos and Fractals: New Frontiers of Science 2nd ed." (N.Y., N.Y.: Springer Verlag, 2004), page 65.] (but discovered in 1875 by Henry John Stephen Smith [Henry J.S. Smith (1875) “On the integration of discontinuous functions.” "Proceedings of the London Mathematical Society", Series 1, vol. 6, pages 140-153.] [The “Cantor set” was also discovered by Paul du Bois-Reymond (1831-1889). See footnote on page 128 of: Paul du Bois-Reymond (1880) “Der Beweis des Fundamentalsatzes der Integralrechnung,” "Mathematische Annalen", vol. 16, pages 115-128. The “Cantor set” was also discovered in 1881 by Vito Volterra (1860-1940). See: Vito Volterra (1881) “Alcune osservazioni sulle funzioni punteggiate discontinue” [Some observations on point-wise discontinuous functions] , "Giornale di Matematiche", vol. 19, pages 76-86.] [José Ferreirós, "Labyrinth of Thought: A History of Set Theory and Its Role in Modern Mathematics" (Basel, Switzerland: Birkhäuser Verlag, 1999), pages 162-165.] [Ian Stewart, "Does God Play Dice?: The New Mathematics of Chaos"] ), is a set of points lying on a single line segment that has a number of remarkable and deep properties. Through consideration of it, Cantor and others helped lay the foundations of modern general topology. Although Cantor himself defined the set in a general, abstract way, the most common modern construction is the Cantor ternary set, built by removing the middle thirds of a line segment. Cantor himself only mentioned the ternary construction in passing, as an example of a more general idea, that of a perfect set that is nowhere dense.

Construction of the ternary set

The Cantor ternary set is created by repeatedly deleting the open middle thirds of a set of line segments. One starts by deleting the open middle third (1/3, 2/3) from the interval [0, 1] , leaving two line segments: [0, 1/3] ∪ [2/3, 1] . Next, the open middle third of each of these remaining segments is deleted, leaving four line segments: [0, 1/9] ∪ [2/9, 1/3] ∪ [2/3, 7/9] ∪ [8/9, 1] . This process is continued ad infinitum, where the nth set is $frac\left\{C_\left\{n-1\left\{3\right\} cup left\left(frac\left\{2\right\}\left\{3\right\}+frac\left\{C_\left\{n-1\left\{3\right\} ight\right)$. The Cantor ternary set contains all points in the interval [0, 1] that are not deleted at any step in this infinite process.

The first six steps of this process are illustrated below.

Some research papers describe the explicit formula of Cantor ternary set in detail [Mohsen Soltanifar, "On A sequence of cantor Fractals", Rose Hulman Undergraduate Mahematics Journal, Vol 7, No 1, paper 9, 2006.] [Mohsen Soltanifar, "A Different Description of A Family of Middle-a Cantor Sets", American Journal of Undergraduate Research, Vol 5, No 2,pp 9 - 12, 2006.] .

Composition

Since the Cantor set is defined as the set of points not excluded, the proportion (i.e., measure) of the unit interval remaining can be found by total length removed. This total is the geometric progression

:$sum_\left\{n=0\right\}^infty frac\left\{2^n\right\}\left\{3^\left\{n+1 = frac\left\{1\right\}\left\{3\right\} + frac\left\{2\right\}\left\{9\right\} + frac\left\{4\right\}\left\{27\right\} + frac\left\{8\right\}\left\{81\right\} + cdots = frac\left\{1\right\}\left\{3\right\}left\left(frac\left\{1\right\}\left\{1-frac\left\{2\right\}\left\{3 ight\right) = 1.$

So that the proportion left is 1 &ndash; 1 = 0.

(Intuitively, one could imagine the geometric series as being base-3 decimals, so that 0.2222... repeating equals 1 just as in base 10 0.999... repeating equals 1.)

This calculation shows that the Cantor set cannot contain any interval of non-zero length. In fact, it may seem surprising that there should be anything left &mdash; after all, the sum of the lengths of the removed intervals is equal to the length of the original interval. However a closer look at the process reveals that there must be something left, since removing the "middle third" of each interval involved removing open sets (sets that do not include their endpoints). So removing the line segment (1/3, 2/3) from the original interval [0, 1] leaves behind the points 1/3 and 2/3. Subsequent steps do not remove these (or other) endpoints, since the intervals removed are always internal to the intervals remaining. So the Cantor set is not empty, and in fact contains an infinite number of points.

It may appear that "only" the endpoints are left, but that is not the case either. The number 1/4, for example is in the bottom third, so it is not removed at the first step, and is in the top third of the bottom third, and is in the bottom third of "that", and in the "top" third of "that", and so on ad infinitum&mdash;alternating between top and bottom thirds. Since it is never in one of the middle thirds, it is never removed, and yet it is also not one of the endpoints of any middle third. The number 3/10 is also in the Cantor set and is not an endpoint.

In the sense of cardinality, "most" members of the Cantor set are not endpoints of deleted intervals.

Properties

Cardinality

It can be shown that there are as many points left behind in this process as there were that were removed, and that therefore, the Cantor set is uncountable. To see this, we show that there is a function "f" from the Cantor set "C" to the closed interval [0,1] that is surjective (i.e. "f" maps from "C" onto [0,1] ) so that the cardinality of "C" is no less than that of [0,1] . Since "C" is a subset of [0,1] , its cardinality is also no greater, so the two cardinalities must in fact be equal.

To construct this function, consider the points in the [0, 1] interval in terms of base 3 (or ternary) notation. In this notation, 1/3 can be written as 0.13 and 2/3 can be written as 0.23, so the middle third (to be removed) contains the numbers with ternary numerals of the form 0.1xxxxx...3 where xxxxx...3 is strictly between 00000...3 and 22222...3. So the numbers remaining after the first step consists of
* Numbers of the form 0.0xxxxx...3
* 1/3 = 0.13 = 0.022222...3 (This alternative recurring representation of a number with a terminating numeral occurs in any positional system.)
* 2/3 = 0.122222...3 = 0.23
* Numbers of the form 0.2xxxxx...3All of which can be stated as those numbers with a ternary numeral 0.0xxxxx...3 or 0.2xxxxx...3

The second step removes numbers of the form 0.01xxxx...3 and 0.21xxxx...3, and (with appropriate care for the endpoints) it can be concluded that the remaining numbers are those with a ternary numeral whose first "two" digits are not 1. Continuing in this way, for a number not to be excluded at step "n", it must have a ternary representation whose "n"th digit is not 1. For a number to be in the Cantor set, it must not be excluded at any step, it must have a numeral consisting entirely of 0s and 2s. It is worth emphasising that numbers like 1, 1/3 = 0.13 and 7/9 = 0.213 are in the Cantor set, as they have ternary numerals consisting entirely of 0s and 2s: 1 = 0.2222...3, 1/3 = 0.022222...3 and 7/9 = 0.2022222...3. So while a number in "C" may have either a terminating or a recurring ternary numeral, one of its representations will consist entirely of 0s and 2s. It has been conjectured that all algebraic irrational numbers are normal and if true, this would imply that all members of the Cantor set are either rational or transcendental.

The function from "C" to [0,1] is defined by taking the numeral that does consist entirely of 0s and 2s, replacing all the 2s by 1s, and interpreting the sequence as a binary representation of a real number. In a formula,

:$f left\left( sum_\left\{k=1\right\}^infty a_k 3^\left\{-k\right\} ight\right) = sum_\left\{k=1\right\}^infty \left(a_k/2\right) 2^\left\{-k\right\}.$

For any number "y" in [0,1] , its binary representation can be translated into a ternary representation of a number "x" in "C" by replacing all the 1s by 2s. With this, "f"("x") = "y" so that "y" is in the range of "f". For instance if "y" = 3/5 = 0.100110011001...2, we write "x" = 0.200220022002...3 = 7/10. Consequently "f" is surjective; however, "f" is "not" injective &mdash; interestingly enough, the values for which "f"("x") coincides are those at opposing ends of one of the "middle thirds" removed. For instance, 7/9 = 0.2022222...3 and 8/9 = 0.2200000...3 so "f"(7/9) = 0.101111...2 = 0.112 = "f"(8/9).

So there are as many points in the Cantor set as there are in [0, 1] , and the Cantor set is uncountable (see Cantor's diagonal argument). However, the set of endpoints of the removed intervals is countable, so there must be uncountably many numbers in the Cantor set which are not interval endpoints. As noted above, one example of such a number is ¼, which can be written as 0.02020202020...3 in ternary notation.

The Cantor set contains as many points as the interval from which it is taken, yet itself contains no interval. (Actually, the irrational numbers have the same property, but the Cantor set has the additional property of being closed, so it is not even dense in any interval, unlike the irrational numbers, which are dense everywhere.)

Self-similarity

The Cantor set is the prototype of a fractal. It is self-similar, because it is equal to two copies of itself, if each copy is shrunk by a factor of 3 and translated. More precisely, there are two functions, the left and right self-similarity transformations, $f_L\left(x\right)=x/3$ and $f_R\left(x\right)=\left(2+x\right)/3$, which leave the Cantor set invariant: $f_L\left(C\right)=f_R\left(C\right)=C$.

Repeated iteration of $f_L$ and $f_R$ can be visualized as an infinite binary tree. That is, at each node of the tree, one may consider the subtree to the left or to the right. Taking the set $\left\{f_L,f_R\right\}$ together with function composition forms a monoid, the dyadic monoid.

The automorphisms of the binary tree are its hyperbolic rotations, and are given by the modular group. Thus, the Cantor set is a homogeneous space in the sense that for any two points $x$ and $y$ in the Cantor set $C$, there exists a homeomorphism $h:C o C$ with $h\left(x\right)=y$. These homeomorphisms can be expressed explicitly, as Mobius transformations.

The Hausdorff dimension of the Cantor set is equal to ln(2)/ln(3) = log3(2).

Topological and analytical properties

As the above summation argument shows, the Cantor set is uncountable but has Lebesgue measure 0. Since the Cantor set is the complement of a union of open sets, it itself is a closed subset of the reals, and therefore a complete metric space. Since it is also totally bounded, the Heine-Borel theorem says that it must be compact.

For any point in the Cantor set and any arbitrarily small neighborhood of the point, there is some other number with a ternary numeral of only 0s and 2s, as well as numbers whose ternary numerals contain 1s. Hence, every point in the Cantor set is an accumulation point, but none is an interior point. A closed set in which every point is an accumulation point is also called a perfect set in topology, while a closed subset of the interval with no interior points is nowhere dense in the interval.

Every point of the Cantor set is a cluster point of the Cantor set. Every point of the Cantor set is also a cluster point of the complement of the Cantor set.

For two points in the Cantor set, there will be some ternary digit where they differ &mdash; one d will have 0 and the other 2. By splitting the Cantor set into "halves" depending on the value of this digit, one obtains a partition of the Cantor set into two closed sets that separate the original two points. In the relative topology on the Cantor set, the points have been separated by a clopen set. Consequently the Cantor set is totally disconnected. As a compact totally disconnected Hausdorff space, the Cantor set is an example of a Stone space.

As a topological space, the Cantor set is naturally homeomorphic to the product of countably many copies of the space $\left\{0, 1\right\}$, where each copy carries the discrete topology. This is the space of all sequences in two digits: $2^mathbb\left\{N\right\}=\left\{x_nvert x_nin \left\{0,1\right\}, nin mathbb\left\{N\right\}\right\}$, which can also be identified with the set of 2-adic integers. The basis for the open sets of the product topology are cylinder sets; the homeomorphism maps these to the subspace topology that the Cantor set inherits from the natural topology on the real number line. This characterization of the Cantor space as a product of compact spaces gives a second proof that Cantor space is compact, via Tychonoff's theorem.

From the above characterization, the Cantor set is homeomorphic to the p-adic integers, and, if one point is removed from it, to the p-adic numbers.

The Cantor set can be endowed with a metric, the p-adic metric. Given two sequences $\left\{x_n\right\},\left\{y_n\right\}in 2^mathbb\left\{N\right\}$, the distance between them may be given by $d\left(\left\{x_n\right\},\left\{y_n\right\}\right) = 1/k$, where $k$ is the smallest index such that $x_k e y_k$; if there is no such index, then the two sequences are the same, and one defines the distance to be zero. This turns the Cantor set into a metric space.

Every nonempty totally-disconnected perfect compact metric space is homeomorphic to the Cantor set. See Cantor space for more on spaces homeomorphic to the Cantor set.

The Cantor set is sometimes regarded as universal in the category of compact metric spaces as any compact metric space is a continuous image of the Cantor set; however this construction is not unique so the Cantor set is not universal in the precise categorical sense. The "universal" property has important applications in functional analysis, where it is sometimes known as the representation theorem for compact metric spaces [Stephen Willard, "General Topology", Addison-Wesley Publishing Company, 1968.] .

Variants

Instead of repeatedly removing the middle third of every piece as in the Cantor set, we could also keep removing any other fixed percentage (other than 0% and 100%) from the middle. The resulting sets are all homeomorphic to the Cantor set and also have Lebesgue measure 0. In the case where the middle 8/10 of the interval is removed, we get a remarkably accessible case &mdash; the set consists of all numbers in [0,1] that can be written as a decimal consisting entirely of 0s and 9s.

By removing progressively smaller percentages of the remaining pieces in every step, one can also construct sets homeomorphic to the Cantor set that have positive Lebesgue measure, while still being nowhere dense. See Smith-Volterra-Cantor set for an example.

Historical remarks

Cantor himself defined the set in a general, abstract way, and mentioned the ternary construction only in passing, as an example of a more general idea, that of a perfect set that is nowhere dense. The original paper provides several different constructions of the abstract concept.

This set would have been considered abstract at the time when Cantor devised it. Cantor himself was led to it by practical concerns about the set of points where a trigonometric series might fail to converge. The discovery did much to set him on the course for developing an abstract, general theory of infinite sets.

Cantor dust

Cantor dust is a multi-dimensional version of the Cantor set. It can be formed by taking a finite cartesian product of the Cantor set with itself, making it a Cantor space. Like the Cantor set, Cantor dust has zero measure.

A different 2D analogue of the Cantor set is the Sierpinski carpet, where a square is divided up into nine smaller squares, and the middle one removed. The remaining squares are then further divided into nine each and the middle removed, and so on ad infinitum. The 3D analogue of this is the Menger sponge.

*Cantor function
*Cantor cube
*Sierpinski carpet
*Koch snowflake
*Menger sponge
*List of fractals by Hausdorff dimension

References

* [http://www.rose-hulman.edu/mathjournal/v7n1.php the first generalized explicit formula of Cantor set]
* [http://www.ajur.uni.edu/v5n2/Sep2006.htm the second generalized explicit formula of Cantor set]
* | year=1995 "(See example 29)".
* Gary L. Wise and Eric B. Hall, "Counterexamples in Probability and Real Analysis". Oxford University Press, New York 1993. ISBN 0-19-507068-2. "(See chapter 1)".
* [http://www.cut-the-knot.org/do_you_know/Cantor2.shtml Cantor Sets] at cut-the-knot
* [http://www.cut-the-knot.org/do_you_know/cantor.shtml Cantor Set and Function] at cut-the-knot

Wikimedia Foundation. 2010.

### Look at other dictionaries:

• Cantor set — Math. the set obtained from the closed interval from 0 to 1 by removing the middle third from the interval, then the middle third from each of the two remaining sets, and continuing the process indefinitely. Also called Cantor ternary set. [after …   Universalium

• Cantor set — Math. the set obtained from the closed interval from 0 to 1 by removing the middle third from the interval, then the middle third from each of the two remaining sets, and continuing the process indefinitely. Also called Cantor ternary set. [after …   Useful english dictionary

• Smith-Volterra-Cantor set — In mathematics, the Smith Volterra Cantor set (SVC) or the fat Cantor set is an example of a set of points on the real line R that is nowhere dense (in particular it contains no intervals), yet has positive measure. Construction Similar to the… …   Wikipedia

• Cantor — may refer to:In general* The Latin word for singer, e.g. the main singer of a cantus * Hazzan , in Judaism, the English name for a professional singer who leads prayer services (Kantor is a frequently noted Jewish patronym) * Cantor (church), an… …   Wikipedia

• Cantor function — In mathematics, the Cantor function, named after Georg Cantor, is an example of a function that is continuous, but not absolutely continuous. DefinitionThe Cantor function c : [0,1] → [0,1] is defined as follows:#Express x in base 3. If possible …   Wikipedia

• Cantor space — In mathematics, the term Cantor space is sometimes used to denotethe topological abstraction of the classical Cantor set:A topological space is aCantor space if it is homeomorphic to the Cantor set.The Cantor set itself is of course a Cantor… …   Wikipedia

• Cantor distribution — Probability distribution name =Cantor type =mass pdf cdf parameters =none support =Cantor set pdf =none cdf =Cantor function mean =1/2 median =anywhere in [1/3, 2/3] mode =n/a variance =1/8 skewness =0 kurtosis = 8/5 entropy = mgf =e^{t/2} prod… …   Wikipedia

• Cantor-Menge — Unter der Cantor Menge, auch cantorsches Diskontinuum, Cantor Staub oder Wischmenge genannt, versteht man in der Mathematik eine gewisse Teilmenge der Menge der reellen Zahlen mit besonderen topologischen, maßtheoretischen, geometrischen und… …   Deutsch Wikipedia

• Cantor'sches Diskontinuum — Unter der Cantor Menge, auch cantorsches Diskontinuum, Cantor Staub oder Wischmenge genannt, versteht man in der Mathematik eine gewisse Teilmenge der Menge der reellen Zahlen mit besonderen topologischen, maßtheoretischen, geometrischen und… …   Deutsch Wikipedia

• Cantor-Funktion — Unter der Cantor Menge, auch cantorsches Diskontinuum, Cantor Staub oder Wischmenge genannt, versteht man in der Mathematik eine gewisse Teilmenge der Menge der reellen Zahlen mit besonderen topologischen, maßtheoretischen, geometrischen und… …   Deutsch Wikipedia