 Clique (graph theory)

In the mathematical area of graph theory, a clique in an undirected graph is a subset of its vertices such that every two vertices in the subset are connected by an edge. Cliques are one of the basic concepts of graph theory and are used in many other mathematical problems and constructions on graphs. Cliques have also been studied in computer science: finding whether there is a clique of a given size in a graph (the clique problem) is NPcomplete, but despite this hardness result many algorithms for finding cliques have been studied.
Although the study of complete subgraphs goes back at least to the graphtheoretic reformulation of Ramsey theory by Erdős & Szekeres (1935),^{[1]} the term "clique" comes from Luce & Perry (1949), who used complete subgraphs in social networks to model cliques of people; that is, groups of people all of whom know each other. Cliques have many other applications in the sciences and particularly in bioinformatics.
Contents
Definitions
A clique in an undirected graph G = (V, E) is a subset of the vertex set C ⊆ V, such that for every two vertices in C, there exists an edge connecting the two. This is equivalent to saying that the subgraph induced by C is complete (in some cases, the term clique may also refer to the subgraph).
A maximal clique is a clique that cannot be extended by including one more adjacent vertex, that is, a clique which does not exist exclusively within the vertex set of a larger clique.
A maximum clique is a clique of the largest possible size in a given graph. The clique number ω(G) of a graph G is the number of vertices in a maximum clique in G. The intersection number of G is the smallest number of cliques that together cover all edges of G.
The opposite of a clique is an independent set, in the sense that every clique corresponds to an independent set in the complement graph. The clique cover problem concerns finding as few cliques as possible that include every vertex in the graph. A related concept is a biclique, a complete bipartite subgraph. The bipartite dimension of a graph is the minimum number of bicliques needed to cover all the edges of the graph.
Mathematics
Mathematical results concerning cliques include the following.
 Turán's theorem (Turán 1941) gives a lower bound on the size of a clique in dense graphs. If a graph has sufficiently many edges, it must contain a large clique. For instance, every graph with more than edges must contain a threevertex clique.
 Ramsey's theorem (Graham, Rothschild & Spencer 1990) states that every graph or its complement graph contains a clique with at least a logarithmic number of vertices.
 According to a result of Moon & Moser (1965), a graph with 3n vertices can have at most 3^{n} maximal cliques. The graphs meeting this bound are the Moon–Moser graphs K_{3,3,...}, a special case of the Turán graphs arising as the extremal cases in Turán's theorem.
 Hadwiger's conjecture, still unproven, relates the size of the largest clique minor in a graph to its chromatic number.
 The Erdős–Faber–Lovász conjecture is another unproven statement relating graph coloring to cliques.
Several important classes of graphs may be defined by their cliques:
 A chordal graph is a graph whose vertices can be ordered into a perfect elimination ordering, an ordering such that the neighbors of each vertex v that come later than v in the ordering form a clique.
 A cograph is a graph all of whose induced subgraphs have the property that any maximal clique intersects any maximal independent set in a single vertex.
 An interval graph is a graph whose maximal cliques can be ordered in such a way that, for each vertex v, the cliques containing v are consecutive in the ordering.
 A line graph is a graph whose edges can be covered by edgedisjoint cliques in such a way that each vertex belongs to exactly two of the cliques in the cover.
 A perfect graph is a graph in which the clique number equals the chromatic number in every induced subgraph.
 A split graph is a graph in which some clique contains at least one endpoint of every edge.
 A trianglefree graph is a graph that has no cliques other than its vertices and edges.
Additionally, many other mathematical constructions involve cliques in graphs. Among them,
 The clique complex of a graph G is an abstract simplicial complex X(G) with a simplex for every clique in G
 A simplex graph is an undirected graph κ(G) with a vertex for every clique in a graph G and an edge connecting two cliques that differ by a single vertex. It is an example of median graph, and is associated with a median algebra on the cliques of a graph: the median m(A,B,C) of three cliques A, B, and C is the clique whose vertices belong to at least two of the cliques A, B, and C.^{[2]}
 The cliquesum is a method for combining two graphs by merging them along a shared clique.
 Cliquewidth is a notion of the complexity of a graph in terms of the minimum number of distinct vertex labels needed to build up the graph from disjoint unions, relabeling operations, and operations that connect all pairs of vertices with given labels. The graphs with cliquewidth one are exactly the disjoint unions of cliques.
 The intersection number of a graph is the minimum number of cliques needed to cover all the graph's edges.
Closely related concepts to complete subgraphs are subdivisions of complete graphs and complete graph minors. In particular, Kuratowski's theorem and Wagner's theorem characterize planar graphs by forbidden complete and complete bipartite subdivisions and minors, respectively.
Computer science
Main article: Clique problemIn computer science, the clique problem is the computational problem of finding a maximum clique, or all cliques, in a given graph. It is NPcomplete, one of Karp's 21 NPcomplete problems (Karp 1972). It is also fixedparameter intractable, and hard to approximate. Nevertheless, many algorithms for computing cliques have been developed, either running in exponential time (such as the Bron–Kerbosch algorithm) or specialized to graph families such as planar graphs or perfect graphs for which the problem can be solved in polynomial time.
Applications
The word "clique", in its graphtheoretic usage, arose from the work of Luce & Perry (1949), who used complete subgraphs to model cliques (groups of people who all know each other) in social networks. For continued efforts to model social cliques graphtheoretically, see e.g. Alba (1973), Peay (1974), and Doreian & Woodard (1994).
Many different problems from bioinformatics have been modeled using cliques. For instance, BenDor, Shamir & Yakhini (1999) model the problem of clustering gene expression data as one of finding the minimum number of changes needed to transform a graph describing the data into a graph formed as the disjoint union of cliques; Tanay & Sharan (Shamir) discuss a similar biclustering problem for expression data in which the clusters are required be cliques. Sugihara (1984) uses cliques to model ecological niches in food webs. Day & Sankoff (1986) describe the problem of inferring evolutionary trees as one of finding maximum cliques in a graph that has as its vertices characteristics of the species, where two vertices share an edge if there exists a perfect phylogeny combining those two characters. Samudrala & Moult (1998) model protein structure prediction as a problem of finding cliques in a graph whose vertices represent positions of subunits of the protein. And by searching for cliques in a proteinprotein interaction network, Spirin & Mirny (2003) found clusters of proteins that interact closely with each other and have few interactions with proteins outside the cluster. Power graph analysis is a method for simplifying complex biological networks by finding cliques and related structures in these networks.
In electrical engineering, Prihar (1956) uses cliques to analyze communications networks, and Paull & Unger (1959) use them to design efficient circuits for computing partiallyspecified Boolean functions. Cliques have also been used in automatic test pattern generation: a large clique in an incompatibility graph of possible faults provides a lower bound on the size of a test set.^{[3]} Cong & Smith (1993) describe an application of cliques in finding a hierarchical partition of an electronic circuit into smaller subunits.
In chemistry, Rhodes et al. (2003) use cliques to describe chemicals in a chemical database that have a high degree of similarity with a target structure. Kuhl, Crippen & Friesen (1983) use cliques to model the positions in which two chemicals will bind to each other.
Notes
 ^ The earlier work by Kuratowski (1930) characterizing planar graphs by forbidden complete and complete bipartite subgraphs was originally phrased in topological rather than graphtheoretic terms.
 ^ Barthélemy, Leclerc & Monjardet (1986), page 200.
 ^ Hamzaoglu & Patel (1998).
References
 Alba, Richard D. (1973), "A graphtheoretic definition of a sociometric clique", Journal of Mathematical Sociology 3 (1): 113–126, doi:10.1080/0022250X.1973.9989826, http://aris.ss.uci.edu/~lin/1.pdf.
 Barthélemy, J.P.; Leclerc, B.; Monjardet, B. (1986), "On the use of ordered sets in problems of comparison and consensus of classifications", Journal of Classification 3 (2): 187–224, doi:10.1007/BF01894188.
 BenDor, Amir; Shamir, Ron; Yakhini, Zohar (1999), "Clustering gene expression patterns.", Journal of Computational Biology 6 (3–4): 281–297, doi:10.1089/106652799318274, PMID 10582567.
 J., Cong; M., Smith (1993), "A parallel bottomup clustering algorithm with applications to circuit partitioning in VLSI design", Proc. 30th International Design Automation Conference, pp. 755–760, doi:10.1145/157485.165119.
 Day, William H. E.; Sankoff, David (1986), "Computational complexity of inferring phylogenies by compatibility", Systematic Zoology 35 (2): 224–229, doi:10.2307/2413432, JSTOR 2413432.
 Doreian, Patrick; Woodard, Katherine L. (1994), "Defining and locating cores and boundaries of social networks", Social Networks 16 (4): 267–293, doi:10.1016/03788733(94)900132.
 Erdős, Paul; Szekeres, George (1935), "A combinatorial problem in geometry", Compositio Math. 2: 463–470, http://www.renyi.hu/~p_erdos/193501.pdf.
 Graham, R.; Rothschild, B.; Spencer, J. H. (1990), Ramsey Theory, New York: John Wiley and Sons, ISBN 0471500461.
 Hamzaoglu, I.; Patel, J. H. (1998), "Test set compaction algorithms for combinational circuits", Proc. 1998 IEEE/ACM International Conference on ComputerAided Design, pp. 283–289, doi:10.1145/288548.288615.
 Karp, Richard M. (1972), "Reducibility among combinatorial problems", in Miller, R. E.; Thatcher, J. W., Complexity of Computer Computations, New York: Plenum, pp. 85–103, http://www.cs.berkeley.edu/~luca/cs172/karp.pdf.
 Kuhl, F. S.; Crippen, G. M.; Friesen, D. K. (1983), "A combinatorial algorithm for calculating ligand binding", Journal of Computational Chemistry 5 (1): 24–34, doi:10.1002/jcc.540050105.
 Kuratowski, Kazimierz (1930), "Sur le probléme des courbes gauches en Topologie" (in French), Fundamenta Mathematicae 15: 271–283, http://matwbn.icm.edu.pl/ksiazki/fm/fm15/fm15126.pdf.
 Luce, R. Duncan; Perry, Albert D. (1949), "A method of matrix analysis of group structure", Psychometrika 14 (2): 95–116, doi:10.1007/BF02289146, PMID 18152948.
 Moon, J. W.; Moser, L. (1965), "On cliques in graphs", Israel J. Math. 3: 23–28, doi:10.1007/BF02760024, MR0182577.
 Paull, M. C.; Unger, S. H. (1959), "Minimizing the number of states in incompletely specified sequential switching functions", IRE Trans. on Electronic Computers EC8 (3): 356–367, doi:10.1109/TEC.1959.5222697.
 Peay, Edmund R. (1974), "Hierarchical clique structures", Sociometry 37 (1): 54–65, doi:10.2307/2786466, JSTOR 2786466.
 Prihar, Z. (1956), "Topological properties of telecommunications networks", Proceedings of the IRE 44 (7): 927–933, doi:10.1109/JRPROC.1956.275149.
 Rhodes, Nicholas; Willett, Peter; Calvet, Alain; Dunbar, James B.; Humblet, Christine (2003), "CLIP: similarity searching of 3D databases using clique detection", Journal of Chemical Information and Computer Sciences 43 (2): 443–448, doi:10.1021/ci025605o, PMID 12653507.
 Samudrala, Ram; Moult, John (1998), "A graphtheoretic algorithm for comparative modeling of protein structure", Journal of Molecular Biology 279 (1): 287–302, doi:10.1006/jmbi.1998.1689, PMID 9636717.
 Spirin, Victor; Mirny, Leonid A. (2003), "Protein complexes and functional modules in molecular networks", Proceedings of the National Academy of Sciences 100 (21): 12123–12128, doi:10.1073/pnas.2032324100, PMC 218723, PMID 14517352, http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=218723.
 Sugihara, George (1984), "Graph theory, homology and food webs", in Levin, Simon A., Population Biology, Proc. Symp. Appl. Math., 30, pp. 83–101.
 Tanay, Amos; Sharan, Roded; Shamir, Ron (2002), "Discovering statistically significant biclusters in gene expression data", Bioinformatics 18 (Suppl. 1): S136–S144, doi:10.1093/bioinformatics/18.suppl_1.S136, PMID 12169541.
 Turán, Paul (1941), "On an extremal problem in graph theory" (in Hungarian), Matematikai és Fizikai Lapok 48: 436–452
External links
 Weisstein, Eric W., "Clique" from MathWorld.
 Weisstein, Eric W., "Clique Number" from MathWorld.
Categories: Graph theory objects
Wikimedia Foundation. 2010.
Look at other dictionaries:
Graph theory — In mathematics and computer science, graph theory is the study of graphs : mathematical structures used to model pairwise relations between objects from a certain collection. A graph in this context refers to a collection of vertices or nodes and … Wikipedia
graph theory — A branch of mathematics used to represent relations and networks. A graph consists of a set of points (nodes or vertices) and the pairwise links between them (arcs or lines). In sociological applications, the nodes are typically individuals,… … Dictionary of sociology
Minor (graph theory) — In graph theory, an undirected graph H is called a minor of the graph G if H is isomorphic to a graph that can be obtained by zero or more edge contractions on a subgraph of G. The theory of graph minors began with Wagner s theorem that a graph… … Wikipedia
Topological graph theory — In mathematics topological graph theory is a branch of graph theory. It studies the embedding of graphs in surfaces, and graphs as topological spaces. [J.L. Gross and T.W. Tucker, Topological graph theory, Wiley Interscience, 1987] Embedding a… … Wikipedia
Degree (graph theory) — A graph with vertices labeled by degree In graph theory, the degree (or valency) of a vertex of a graph is the number of edges incident to the vertex, with loops counted twice.[1] The degree of a vertex … Wikipedia
Degeneracy (graph theory) — In graph theory, a k degenerate graph is an undirected graph in which every subgraph has a vertex of degree at most k: that is, some vertex in the subgraph touches k or fewer of the subgraph s edges. The degeneracy of a graph is the smallest… … Wikipedia
Neighbourhood (graph theory) — A graph consisting of 6 vertices and 7 edges For other meanings of neighbourhoods in mathematics, see Neighbourhood (mathematics). For non mathematical neighbourhoods, see Neighbourhood (disambiguation). In graph theory, an adjacent vertex of a… … Wikipedia
Glossary of graph theory — Graph theory is a growing area in mathematical research, and has a large specialized vocabulary. Some authors use the same word with different meanings. Some authors use different words to mean the same thing. This page attempts to keep up with… … Wikipedia
Intersection number (graph theory) — In the mathematical field of graph theory, the intersection number of a graph is the smallest number of elements in a representation of G as an intersection graph of finite sets. Equivalently, it is the smallest number of cliques needed to cover… … Wikipedia
Independent set (graph theory) — The nine blue vertices form a maximum independent set for the Generalized Petersen graph GP(12,4). In graph theory, an independent set or stable set is a set of vertices in a graph, no two of which are adjacent. That is, it is a set I of vertices … Wikipedia