Container method

Template:Short description

The method of (hypergraph) containers is a powerful tool that can help characterize the typical structure and/or answer extremal questions about families of discrete objects with a prescribed set of local constraints. Such questions arise naturally in extremal graph theory, additive combinatorics, discrete geometry, coding theory, and Ramsey theory; they include some of the most classical problems in the associated fields.

These problems can be expressed as questions of the following form: given a hypergraph Template:Math on finite vertex set Template:Math with edge set Template:Math (i.e. a collection of subsets of Template:Math with some size constraints), what can we say about the independent sets of Template:Math (i.e. those subsets of Template:Math that contain no element of Template:Math)? The hypergraph container lemma provides a method for tackling such questions.

History

One of the foundational problems of extremal graph theory, dating to work of Mantel in 1907 and Turán from the 1940s, asks to characterize those graphs that do not contain a copy of some fixed forbidden Template:Math as a subgraph. In a different domain, one of the motivating questions in additive combinatorics is understanding how large a set of integers can be without containing a Template:Math-term arithmetic progression, with upper bounds on this size given by Roth ( $k = 3$ ) and Szemerédi (general Template:Math).

The method of containers (in graphs) was initially pioneered by Kleitman and Winston in 1980, who bounded the number of lattices^[1] and graphs without 4-cycles.^[2] Container-style lemmas were independently developed by multiple mathematicians in different contexts, notably including Sapozhenko, who initially used this approach in 2002-2003 to enumerate independent sets in regular graphs,^[3] sum-free sets in abelian groups,^[4] and study a variety of other enumeration problems^[5]

A generalization of these ideas to a hypergraph container lemma was devised independently by Saxton and Thomason^[6] and Balogh, Morris, and Samotij^[7] in 2015, inspired by a variety of previous related work.

Main idea and informal statement

Many problems in combinatorics can be recast as questions about independent sets in graphs and hypergraphs. For example, suppose we wish to understand subsets of integers Template:Math to Template:Math, which we denote by $[n]$ that lack a Template:Math-term arithmetic progression. These sets are exactly the independent sets in the Template:Math-uniform hypergraph $H = ({1, 2, \dots, n}, E)$ , where Template:Math is the collection of all Template:Math-term arithmetic progressions in ${1, 2, \dots, n}$ .

In the above (and many other) instances, there are usually two natural classes of problems posed about a hypergraph Template:Math:

What is the size of a maximum independent set in Template:Math? What does the collection of maximum-sized independent sets in Template:Math look like?
How many independent sets does Template:Math have? What does a "typical" independent set in Template:Math look like?

These problems are connected by a simple observation. Let $α (H)$ be the size of a largest independent set of Template:Math and suppose $H$ has $i (H)$ independent sets. Then,

2^{α (H)} \leq i (H) \leq \sum_{r = 0}^{α (H)} (\binom{| V (H) |}{r}),

where the lower bound follows by taking all subsets of a maximum independent set. These bounds are relatively far away from each other unless $α (H)$ is very large, close to the number of vertices of the hypergraph. However, in many hypergraphs that naturally arise in combinatorial problems, we have reason to believe that the lower bound is closer to the true value; thus the primary goal is to improve the upper bounds on Template:Math.

The hypergraph container lemma provides a powerful approach to understanding the structure and size of the family of independent sets in a hypergraph. At its core, the hypergraph container method enables us to extract from a hypergraph, a collection of containers, subsets of vertices that satisfy the following properties:

There are not too many containers.
Each container is not much larger than the largest independent set.
Each container has few edges.
Every independent set in the hypergraph is fully included in some container.

The name container alludes to this last condition. Such containers often provide an effective approach to characterizing the family of independent sets (subsets of the containers) and to enumerating the independent sets of a hypergraph (by simply considering all possible subsets of a container).

The hypergraph container lemma achieves the above container decomposition in two pieces. It constructs a deterministic function Template:Math. Then, it provides an algorithm that extracts from each independent set Template:Math in hypergraph Template:Math, a relatively small collection of vertices $S \subset I$ , called a fingerprint, with the property that $S \subset I \subset S \cup f (S)$ . Then, the containers are the collection of sets $S \cup f (S)$ that arise in the above process, and the small size of the fingerprints provides good control on the number of such container sets.

Graph container algorithm

We first describe a method for showing strong upper bounds on the number of independent sets in a graph; this exposition is adapted from a survey of Samotij^[8] about the graph container method, originally employed by Kleitman-Winston and Sapozhenko.

Notation

We use the following notation in the below section.

$G = (V, E)$ is a graph on $| V | = n$ vertices, where the vertex set is equipped with (arbitrary) ordering ${v_{1}, \dots, v_{n}}$ .
Let $ℓ (G)$ be the collection of independent sets of Template:Math with size $i (G) := | ℓ (G) |$ . Let $i (G, r)$ be the number of independent sets of size Template:Math.
The max-degree ordering of a vertex subset $A \subset V$ is the ordering of the vertices in Template:Math by their degree in the induced subgraph $G [A]$ .

Kleitman-Winston algorithm

The following algorithm gives a small "fingerprint" for every independent set in a graph and a deterministic function of the fingerprint to construct a not-too-large subset that contains the entire independent set

Fix graph Template:Math, independent set $I \in ℓ (G)$ and positive integer $q \leq | I |$ .

Initialize: let $A = V (G), S = \emptyset$ .
Iterate for s=1,2,…,q:
- Construct the max-degree ordering of $A, (v_{1}, \dots v_{| A |})$
- Find the minimal index $j_{s}$ such that $v_{j_{s}} \in I$ (i.e. the vertex in Template:Math of largest degree in induced subgraph Template:Math)
- Let $S \leftarrow S \cup {v_{j_{s}}}, A \leftarrow A ∖ ({v_{1}, \dots, v_{j_{s}}} \cup N (v_{j_{s}}))$ , where $N (v)$ is the neighborhood of vertex $v$ .
Output the vector $(j_{1}, \dots, j_{q})$ and the vertex set $A \cap I$ .

Analysis

By construction, the output of the above algorithm has property that ${v_{j_{1}}, \dots, v_{j_{q}}} \subset I \subset {v_{j_{1}}, \dots, v_{j_{q}}} \cup (A \cap I)$ , noting that $A \cap I$ is a vertex subset that is completely determined by ${j_{1}, \dots, j_{q}}$ and not otherwise a function of $I$ . To emphasize this we will write $A = A (j_{1}, \dots, j_{q})$ . We also observe that we can reconstruct the set $S = {v_{j_{1}}, \dots, v_{j_{q}}} = S (j_{1}, \dots j_{q})$ in the above algorithm just from the vector $(j_{1}, \dots, j_{q})$ .

This suggests that $S$ might be a good choice of a fingerprint and $S (j_{1}, \dots j_{q}) \cup A (j_{1}, \dots, j_{q})$ a good choice for a container. More precisely, we can bound the number of independent sets of $G$ of some size $r \geq q$ as a sum over output sequences $(j_{1}, \dots j_{q})$

i (G, r) = \sum_{(j_{s})_{s = 1}^{q}} i (G [A (j_{1}, \dots j_{q})], r - q) \leq \sum_{(j_{s})} (\binom{A (j_{1}, \dots j_{q})}{r - q})

,

where we can sum across $r$ to get a bound on the total number of independent sets of the graph:

i (G) = \sum_{r = 0}^{q - 1} (\binom{n}{r}) + \sum_{(j_{s})_{s = 1}^{q}} i (G [A (j_{1}, \dots j_{q})]) \leq \sum_{r = 0}^{q - 1} (\binom{n}{r}) + \sum_{(j_{s})} 2^{| A (j_{1}, \dots j_{q}) |}

.

When trying to minimize this upper bound, we want to pick $q$ that balances/minimizes these two terms. This result illustrates the value of ordering vertices by maximum degree (to minimize $| A (j_{1}, \dots j_{q}) |$ ).

Lemmas

The above inequalities and observations can be stated in a more general setting, divorced from an explicit sum over vectors $(j_{s})$ .

Lemma 1: Given a graph $G$ with $n$ and assume that integer $q$ and real numbers $R, β \in [0, 1]$ satisfy $R \geq e^{- β q} n$ . Suppose that every induced subgraph on at least $R$ vertices has edge density at least $β$ . Then for every integer $r \geq q$ ,

i (G, r) \leq (\binom{n}{q}) (\binom{R}{r - q}) .

Lemma 2: Let $G$ be a graph on $n$ vertices and assume that an integer $q$ and reals $R, D$ are chosen such that $n \leq R + q D$ . If all subsets $U$ of at least $R$ vertices have at least $D | U | / 2$ edges, then there is a collection $ℱ$ of subsets of $q$ vertices ("fingerprints") and a deterministic function $f : 𝒞 \to 𝒫 (V (G))$ , so that for every independent set $I \subset V (G)$ , there is $S \in ℱ$ such that $S \subset I \subset f (S) \cup S$ .

Hypergraph container lemma

Informally, the hypergraph container lemma tells us that we can assign a small fingerprint $S \subset I$ to each independent set, so that all independent sets with the same fingerprint belong to the same larger set, $C = f (S)$ , the associated container, that has size bounded away from the number of vertices of the hypergraph. Further, these fingerprints are small (and thus there are few containers), and we can upper bound their size in an essentially optimal way using some simple properties of the hypergraph.

We recall the following notation associated to $k$ uniform hypergraph $ℋ$ .

Define $Δ_{l} (ℋ) := \max {d_{H} (A) ∣ A \subset V (ℋ), | A | = l}$ for positive integers $1 \leq l \leq k$ , where $d_{ℋ} (A) = | {e \in E (ℋ) ∣ A \subset e} |$ .
Let $ℐ (ℋ)$ be the collection of independent sets of $ℋ$ . $I$ will denote some such independent set.

Statement

We state the version of this lemma found in a work of Balogh, Morris, Samotij, and Saxton.^[9]

Let $ℋ$ be a $k$ -uniform hypergraph and suppose that for every $l \in {1, 2, \dots, k}$ and some $b, r \in ℕ$ , we have that $Δ_{l} (H) \leq {(\frac{b}{| V (H) |})}^{l - 1} \frac{| E (H) |}{r}$ . Then, there is a collection $𝒞 \subset 𝒫 (V (H))$ and a function $f : 𝒫 (V (H)) \to 𝒞$ such that

for every $I \in ℐ (H)$ there exists $S \subset I$ with $| S | \leq (k - 1) b$ and $I \subset f (S)$ .
$| C | \leq | V (H) | - δ r$ for every $C \in 𝒞$ and $δ = 2^{- k (k + 1)}$ .

Example applications

Regular graphs

Upper bound on the number of independent sets

We will show that there is an absolute constant Template:Math such that every $n$ -vertex $d$ -regular graph $G$ satisfies $i (G) \leq 2^{(1 + C \sqrt{\frac{\log d}{d}}) \frac{n}{2}}$ .

We can bound the number of independent sets of each size $r$ by using the trivial bound $i (G, r) \leq (\binom{n}{r}) \leq (\binom{n}{n / 10}) \leq 2^{0.48 n}$ for $r \leq n / 10$ . For larger $r$ , take $β > 10 / n, q = ⌊ 1 / β ⌋, R = \frac{n}{2} + \frac{β n^{2}}{2 d} .$ With these parameters, Template:Math-regular graph $G$ satisfies the conditions of Lemma 1 and thus,

i (G, r) \leq (\binom{n}{q}) (\binom{R}{r - q}) \leq (\binom{n}{q}) (\binom{\frac{n}{2} + \frac{β n^{2}}{2 d}}{r - q}) \leq {(\frac{e n}{q})}^{q} (\binom{\frac{n}{2} + \frac{β n^{2}}{2 d}}{r - q}) \leq (e β n)^{⌊ 1 / β ⌋} (\binom{\frac{n}{2} + \frac{β n^{2}}{2 d}}{r - q}) .

Summing over all $0 \leq r \leq n$ gives

i (G) \leq 2^{0.49 n} + 2^{\frac{n}{2} + \frac{β n^{2}}{2 d} + ⌊ 1 / β ⌋ \log_{2} (e β n)}

,

which yields the desired result when we plug in $β = \sqrt{d \log d} / n .$

Sum-free sets

A set $A$ of elements of an abelian group is called sum-free if there are no $x, y, z \in A$ satisfying $x + y = z$ . We will show that there are at most $2^{(1 / 2 + o (1)) n}$ sum-free subsets of $[n] := {1, 2, \dots, n}$ .

This will follow from our above bounds on the number of independent sets in a regular graph. To see this, we will need to construct an auxiliary graph. We first observe that up to lower order terms, we can restrict our focus to sum-free sets with at least $n^{2 / 3}$ elements smaller than $n / 2$ (since the number of subsets in the complement of this is at most $(n / 2)^{n^{2 / 3}} 2^{n / 2 + 1}$ ).

Given some subset $S \subset {1, 2, \dots, ⌈ n / 2 ⌉ - 1}$ , we define an auxiliary graph $G_{S}$ with vertex set $[n]$ and edge set ${{x, y} ∣ x + s \equiv y (\mod n) for some s \in S \cup (- S)}$ , and observe that our auxiliary graph is $2 | S |$ regular since each element of Template:Math is smaller than $n / 2$ . Then if $S_{A}$ are the smallest $n^{2 / 3}$ elements of subset $A \subset [n]$ , the set $A ∖ S_{A}$ is an independent set in the graph $G_{S_{A}}$ . Then, by our previous bound, we see that the number of sum-free subsets of $[n]$ is at most

(n / 2)^{n^{2 / 3}} 2^{n / 2 +} + (\binom{n / 2}{n^{2 / 3}}) 2^{(1 + O (n^{- 1 / 3} \sqrt{\log n})) \frac{n}{2}} \leq 2^{(1 / 2 + O (n^{- 1 / 3} \log n)) n} .

Triangle-free graphs

We give an illustration of using the hypergraph container lemma to answer an enumerative question by giving an asymptotically tight upper bound on the number of triangle-free graphs with $n$ vertices.^[10]

Informal statement

Since bipartite graphs are triangle-free, the number of triangle free graphs with $n$ vertices is at least $2^{⌊ n^{2} / 4 ⌋}$ , obtained by enumerating all possible subgraphs of the balanced complete bipartite graph $K_{⌊ n / 2 ⌋, ⌈ n / 2 ⌉}$ .

We can construct an auxiliary Template:Math-uniform hypergraph Template:Math with vertex set $V (H) = E (K_{n})$ and edge set $E (H) = {{e_{1}, e_{2}, e_{3}} \subset E (K_{n}) = V (H) ∣ e_{1}, e_{2}, e_{3} form a triangle}$ . This hypergraph "encodes" triangles in the sense that the family of triangle-free graphs on $n$ vertices is exactly the collection of independent sets of this hypergraph, $ℐ (H)$ .

The above hypergraph has a nice degree distribution: each edge of $K_{n}$ , and thus vertex in $V (H)$ is contained in exactly $n - 2$ triangles and each pair of elements in $V (H)$ is contained in at most 1 triangle. Therefore, applying the hypergraph container lemma (iteratively), we are able to show that there is a family of $n^{O (n^{3 / 2})}$ containers that each contain few triangles that contain every triangle-free graph/independent set of the hypergraph.

Upper bound on the number of triangle-free graphs

We first specialize the generic hypergraph container lemma to 3-uniform hypergraphs as follows:

Lemma: For every $c > 0$ , there exists $δ > 0$ such that the following holds. Let $H$ be a 3-uniform hypergraph with average degree $d \geq 1 / δ$ and suppose that $Δ_{1} (H) \leq c d, Δ_{2} (H) \leq c \sqrt{d}$ . Then there exists a collection $𝒞 \subset 𝒫 (V (H))$ of at most $| 𝒞 | \leq (\binom{| V (H) |}{| V (H) | / \sqrt{d}})$ containers such that

for every $I \in ℐ (H)$ , there exists $I \subset C \in 𝒞$
$| C | \leq (1 - δ) | V (H) |$ for all $C \in 𝒞$

Applying this lemma iteratively will give the following theorem (as proved below):

Theorem: For all $ϵ > 0$ , there exists $C > 0$ such that the following holds. For each positive integer Template:Math, there exists a collection $𝒢$ of graphs on Template:Math vertices with $| 𝒢 | \leq n^{C n^{3 / 2}}$ such that

each $G \in 𝒢$ has fewer than $ϵ n^{3}$ triangles,
each triangle-free graph on $n$ vertices is contained in some $G \in 𝒢$ .

Proof: Consider the hypergraph $H$ defined above. As observed informally earlier, the hypergraph satisfies $| V (H) | = (\binom{n}{2}), Δ_{2} (H) = 1, d (v) = n - 2$ for every $v \in V (H)$ . Therefore, we can apply the above Lemma to $H$ with $c = 1$ to find some collection $𝒞$ of $n^{O (n^{3 / 2})}$ subsets of $E (K_{n})$ (i.e. graphs on $n$ vertices) such that

every triangle free graph is a subgraph of some $C \in 𝒞$ ,
every $C \in 𝒞$ has at most $(1 - δ) (\binom{n}{2})$ edges.

This is not quite as strong as the result we want to show, so we will iteratively apply the container lemma. Suppose we have some container $C \in 𝒞$ with at least $ϵ n^{3}$ triangles. We can apply the container lemma to the induced sub-hypergraph $H [C]$ . The average degree of $H [C]$ is at least $6 ϵ n$ , since every triangle in $C$ is an edge in $H [C]$ , and this induced subgraph has at most $(\binom{n}{2})$ vertices. Thus, we can apply Lemma with parameter $c = 1 / ϵ$ , remove $C$ from our set of containers, replacing it by this set of containers, the containers covering $ℐ (H [C])$ .

We can keep iterating until we have a final collection of containers $𝒞$ that each contain fewer than $ϵ n^{3}$ triangles. We observe that this collection cannot be too big; all of our induced subgraphs have at most $(\binom{n}{2})$ vertices and average degree at least $6 ϵ n$ , meaning that each iteration results in at most $n^{O (n^{3 / 2})}$ new containers. Further, the container size shrinks by a factor of $1 - δ$ each time, so after a bounded (depending on $ϵ$ ) number of iterations, the iterative process will terminate.

References

Template:Reflist

[1] Template:Cite journal

[2] Template:Cite journal

[3] Template:Cite journal

[4] Template:Cite journal

[5] Template:Citation

[6] Template:Cite journal

[7] Template:Cite journal

[8] Template:Cite journal

[9] Template:Cite journal

[10] Template:Cite journal

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

Container method

Contents

History

Main idea and informal statement

Graph container algorithm

Notation

Kleitman-Winston algorithm

Analysis

Lemmas

Hypergraph container lemma

Statement

Example applications

Regular graphs

Upper bound on the number of independent sets

Sum-free sets

Triangle-free graphs

Informal statement

Upper bound on the number of triangle-free graphs

See also

References

Navigation menu

Container method

History

Main idea and informal statement

Graph container algorithm

Notation

Kleitman-Winston algorithm

Analysis

Lemmas

Hypergraph container lemma

Statement

Example applications

Regular graphs

Upper bound on the number of independent sets

Sum-free sets

Triangle-free graphs

Informal statement

Upper bound on the number of triangle-free graphs

See also

References

Navigation menu

Search