Erdős–Rényi model

Erdős–Rényi model

In graph theory, the Erdős-Rényi model, named for Paul Erdős and Alfréd Rényi, is either of two models for generating random graphs, including one that sets an edge between each pair of nodes with equal probability, independently of the other edges. It can be used in the probabilistic method to prove the existence of graphs satisfying various properties, or to provide a rigorous definition of what it means for a property to hold for almost all graphs.

Definition

There are two closely related variants of the Erdős-Rényi random graph model.

*In the "G"("n", "M") model a graph is chosen uniformly at random from the collection of all graphs which have "n" nodes and "M" edges. For example, in the "G"("3","2") model, each of the three possible graphs on three vertices and two edges are included with probability 1/3.
*In the "G"("n", "p") model, a graph is thought to be constructed by connecting nodes randomly. Each edge is included in the graph with probability "p", with the presence or absence of any two distinct edges in the graph being independent. Equivalently, all graphs with "n" nodes and "M" edges have equal probability of p^M (1-p)^n choose 2}-M}. The parameter "p" in this model can be thought of as a weighting function; as "p" increases from "0" to "1", the model becomes more and more likely to include graphs with more edges and less and less likely to include graphs with fewer edges. In particular, the case "p" = 0.5 corresponds to the case where all 2^ binom{n}{2} graphs on "n" vertices are chosen with equal probability.

The behavior of random graphs are often studied in the case where "n", the number of vertices, tends to infinity. Although "p" and "M" can be fixed in this case, they can also be functions depending on "n". For example, the statement "almost every graph in G(n, frac{2 ln n}{n}) is connected" means "as "n" tends to infinity, the probability that a graph on "n" vertices with edge probability frac{2 ln n}{n} is connected tends to 1".

Comparison between the two models

The expected number of edges in "G"("n","p") is binom{n}{2}p, and by the law of large numbers any graph in "G"("n","p") will almost surely have approximately this many edges (provided the expected number of edges tends to infinity). Therefore a rough heuristic is that if pn^2 tends to infinity, then "G"("n","p") should behave similarly to "G"("n","M") with M= binom{n}{2} p as "n" increases.

For many graph properties, this is the case. If "P" is any graph property which is Monotone with respect to the subgraph ordering (meaning that if "A" is a subgraph of "B" and "A" satisfies "P", then "B" will satisfy "P" as well), then the statements " "P" holds for almost all graphs in "G"("n","p") " and "P" holds for almost all graphs in G(n, binom{n}{2} p) are equivalent (provided np^2 tends to infinity). For example, this holds if "P" is the property of being connected, or if "P" is the property of containing a Hamiltonian cycle. However, this will not necessarily hold for non-monotone properties (e.g. the property of having an even number of edges).

In practice, the "G"("n", "p") model is the one more commonly used today, in part due to the ease of analysis allowed by the independence of the edges.

Properties of "G"("n", "p")

As mentioned above, a graph from "G"("n", "p") has on average binom{n}{2} p edges. The distribution of the degree of any particular vertex is binomial:

: P(operatorname{deg}(v) = k) = {n-1choose k}p^k(1-p)^{n-1-k},

where "n" is the total number of vertices in the graph.

In a 1960 paper, Erdős and Rényi described the behavior of "G"("n","p") very precisely for various values of "p". Their results included that:

*If np < 1 , then a graph in "G"("n","p") will almost surely have no connected components of size larger than O(log n)
*If np=1 , then a graph in "G"("n","p") will almost surely have largest component whose size is of order n^{2/3}
*If np tends to a constant "c" > 1, then a graph in "G"("n", "p") will almost surely have a unique "giant" component containing a positive fraction of the vertices. No other component will contain more than O(log n) vertices.

*If p< frac{(1-epsilon)ln n}{n}, then a graph in "G"("n", "p") will almost surely not be connected.
*If p> frac{(1+epsilon) ln n}{n}, then a graph in "G"("n", "p") will almost surely be connected.

Thus frac{ln n}{n} is a sharp threshold for the connectivity of "G"("n", "p").

Other properties of the graph can be described almost precisely as "n" tends to infinity. For example

*There is a "k"("n") (approximately equal to 2 log_2 n) such that the largest clique in "G"("n", 0.5) is almost surely either "k"("n") or "k"("n") + 1.

Thus even though finding the size of the largest clique in a graph is NP-complete, the size of the largest clique in a "typical" graph (according to this model) is very well understood.

Weaknesses

Both of the two major assumptions of the "G"("n", "p") model (that edges are independent and that each edge is equally likely) may be unrealistic in modeling real situations. In particular, an Erdős-Rényi graph will likely not be scale-free like many real networks. The Watts and Strogatz model attempts to correct this limitation.

History

The "G"("n", "p") model was first introduced by Edgar Gilbert in a 1959 paper which studied the connectivity threshold mentioned above. The "G"("n", "M") model was introduced by Erdős and Rényi in their 1959 paper. As with Gilbert, their first investigations were as to the connectivity of "G"("n","M"), with the more detailed analysis following in 1960.

References

*
*
*
*
*
*


Wikimedia Foundation. 2010.

Игры ⚽ Нужен реферат?

Look at other dictionaries:

  • Alfréd Rényi — (20 March 1921 ndash; 1 February 1970) was a Hungarian mathematician who made contributions in combinatorics and graph theory but mostly in probability theory. [citation|title=Obituary: Alfred Renyi|first=David|last=Kendall|journal=Journal of… …   Wikipedia

  • List of things named after Paul Erdős — The following were named after Paul Erdős:* Erdős number * Erdős cardinal * Erdős conjecture a list of numerous conjectures named after Erdős ** Erdős conjecture on arithmetic progressions ** Cameron–Erdős conjecture ** Erdős–Burr conjecture **… …   Wikipedia

  • Watts and Strogatz model — The Watts and Strogatz model is a random graph generation model that produces graphs with small world properties, including short average path lengths and high clustering. It was proposed by Duncan J. Watts and Steven Strogatz in their joint 1998 …   Wikipedia

  • BA model — The Barabási–Albert (BA) model is an algorithm for generating random scale free networks using a preferential attachment mechanism. Scale free networks are widely observed in natural and man made systems, including the Internet, the world wide… …   Wikipedia

  • De Bruijn–Erdős theorem (graph theory) — This article is about coloring infinite graphs. For the number of lines determined by a finite set of points, see De Bruijn–Erdős theorem (incidence geometry). In graph theory, the De Bruijn–Erdős theorem, proved by Nicolaas Govert de Bruijn and… …   Wikipedia

  • Rado graph — The Rado graph, as numbered by Rado (1964). In the mathematical field of graph theory, the Rado graph, also known as the random graph or the Erdős–Renyi graph, is the unique (up to isomorphism) countable graph R such that for any finite graph G… …   Wikipedia

  • Inégalité FKG — L’inégalité FKG, due à Fortuin, Kasteleyn et Ginibre[1] est une version généralisée de l inégalité de Tchebychev pour les sommes. C est une inégalité de corrélation utilisée, par exemple, en théorie de la percolation, et dans l étude du modèle de …   Wikipédia en Français

  • List of mathematics articles (E) — NOTOC E E₇ E (mathematical constant) E function E₈ lattice E₈ manifold E∞ operad E7½ E8 investigation tool Earley parser Early stopping Earnshaw s theorem Earth mover s distance East Journal on Approximations Eastern Arabic numerals Easton s… …   Wikipedia

  • Edge coloring — A 3 edge coloring of the Desargues graph. In graph theory, an edge coloring of a graph is an assignment of “colors” to the edges of the graph so that no two adjacent edges have the same color. For example, the figure to the right shows an edge… …   Wikipedia

  • Clique problem — The brute force algorithm finds a 4 clique in this 7 vertex graph (the complement of the 7 vertex path graph) by systematically checking all C(7,4)=35 4 vertex subgraphs for completeness. In computer science, the clique problem refers to any of… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”