Covering number

Template:Distinguish

In mathematics, a covering number is the number of balls of a given size needed to completely cover a given space, with possible overlaps between the balls. The covering number quantifies the size of a set and can be applied to general metric spaces. Two related concepts are the packing number, the number of disjoint balls that fit in a space, and the metric entropy, the number of points that fit in a space when constrained to lie at some fixed minimum distance apart.

Definition

Let (M, d) be a metric space, let K be a subset of M, and let r be a positive real number. Let B_r(x) denote the ball of radius r centered at x. A subset C of M is an r-external covering of K if:

K \subseteq ⋃_{x \in C} B_{r} (x)

.

In other words, for every $y \in K$ there exists $x \in C$ such that $d (x, y) \leq r$ .

If furthermore C is a subset of K, then it is an r-internal covering.

The external covering number of K, denoted $N_{r}^{ext} (K)$ , is the minimum cardinality of any external covering of K. The internal covering number, denoted $N_{r}^{int} (K)$ , is the minimum cardinality of any internal covering.

A subset P of K is a packing if $P \subseteq K$ and the set ${B_{r} (x)}_{x \in P}$ is pairwise disjoint. The packing number of K, denoted $N_{r}^{pack} (K)$ , is the maximum cardinality of any packing of K.

A subset S of K is r-separated if each pair of points x and y in S satisfies d(x, y) ≥ r. The metric entropy of K, denoted $N_{r}^{met} (K)$ , is the maximum cardinality of any r-separated subset of K.

Examples

Template:Ordered list

Properties

Template:Ordered list

The following properties relate to covering numbers in the standard Euclidean space, $ℝ^{m}$ :^[1]Template:Rp Template:Ordered list

Application to machine learning

Let $K$ be a space of real-valued functions, with the l-infinity metric (see example 3 above). Suppose all functions in $K$ are bounded by a real constant $M$ . Then, the covering number can be used to bound the generalization error of learning functions from $K$ , relative to the squared loss:^[2]Template:Rp

Prob [\sup_{h \in K} | GeneralizationError (h) - EmpiricalError (h) | \geq ϵ] \leq N_{r}^{int} (K) 2 \exp \frac{- m ϵ^{2}}{2 M^{4}}

where $r = \frac{ϵ}{8 M}$ and $m$ is the number of samples.

References

Template:Reflist

↑ Cite error: Invalid <ref> tag; no text was provided for refs named book14
↑ Cite error: Invalid <ref> tag; no text was provided for refs named book12

[book14-1] Cite error: Invalid <ref> tag; no text was provided for refs named book14

[book12-2] Cite error: Invalid <ref> tag; no text was provided for refs named book12

[1]

[2]

Covering number

Contents

Definition

Examples

Properties

Application to machine learning

See also

References

Navigation menu

Covering number

Definition

Examples

Properties

Application to machine learning

See also

References

Navigation menu

Search