Ogden's lemma

In the theory of formal languages, Ogden's lemma (named after William F. Ogden)^[1] is a generalization of the pumping lemma for context-free languages.

Despite Ogden's lemma being a strengthening of the pumping lemma, it is insufficient to fully characterize the class of context-free languages.^[2] This is in contrast to the Myhill-Nerode theorem, which unlike the pumping lemma for regular languages is a necessary and sufficient condition for regularity.

Statement

Template:Math theoremWe will use underlines to indicate "marked" positions.

Special cases

Ogden's lemma is often stated in the following form, which can be obtained by "forgetting about" the grammar, and concentrating on the language itself: If a language Template:Mvar is context-free, then there exists some number $p \geq 1$ (where Template:Mvar may or may not be a pumping length) such that for any string Template:Mvar of length at least Template:Mvar in Template:Mvar and every way of "marking" Template:Mvar or more of the positions in Template:Mvar, Template:Mvar can be written as

s = u v w x y

with strings Template:Mvar and Template:Mvar, such that

Template:Mvar has at least one marked position,
Template:Mvar has at most Template:Mvar marked positions, and
$u v^{n} w x^{n} y \in L$ for all $n \geq 0$ .

In the special case where every position is marked, Ogden's lemma is equivalent to the pumping lemma for context-free languages. Ogden's lemma can be used to show that certain languages are not context-free in cases where the pumping lemma is not sufficient. An example is the language ${a^{i} b^{j} c^{k} d^{l} : i = 0 or j = k = l}$ .

Example applications

Non-context-freeness

The special case of Ogden's lemma is often sufficient to prove some languages are not context-free. For example, ${a^{m} b^{n} c^{m} d^{n} | m, n \geq 1}$ is a standard example of non-context-free language,^[3]

Template:Math proof

Similarly, one can prove the "copy twice" language $L = {w^{2} | w \in {a, b}^{*}}$ is not context-free, by using Ogden's lemma on $a^{2 p} \underline{b^{2 p}} a^{2 p} b^{2 p}$ .

And the given example last section ${a^{i} b^{j} c^{k} d^{l} : i = 0 or j = k = l}$ is not context-free by using Ogden's lemma on $a b^{2 p} \underline{c^{2 p}} d^{2 p}$ .

Inherent ambiguity

Ogden's lemma can be used to prove the inherent ambiguity of some languages, which is implied by the title of Ogden's paper.

Example: Let $L_{0} = {a^{n} b^{m} c^{m} | m, n \geq 1}, L_{1} = {a^{m} b^{m} c^{n} | m, n \geq 1}$ . The language $L = L_{0} \cup L_{1}$ is inherently ambiguous. (Example from page 3 of Ogden's paper.)

Template:Math proof

Similarly, $L^{*}$ is inherently ambiguous, and for any CFG of the language, letting $p$ be the constant for Ogden's lemma, we find that $(a^{p! + p} b^{p! + p} c^{p! + p})^{n}$ has at least $2^{n}$ different parses. Thus $L^{*}$ has an unbounded degree of inherent ambiguity.

Undecidability

The proof can be extended to show that deciding whether a CFG is inherently ambiguous is undecidable, by reduction to the Post correspondence problem. It can also show that deciding whether a CFG has an unbounded degree of inherent ambiguity is undecidable. (page 4 of Ogden's paper)

Template:Math proof

Generalized condition

Bader and Moura have generalized the lemma^[4] to allow marking some positions that are not to be included in Template:Mvar. Their dependence of the parameters was later improved by Dömösi and Kudlek.^[5] If we denote the number of such excluded positions by Template:Mvar, then the number Template:Mvar of marked positions of which we want to include some in Template:Mvar must satisfy $d \geq p (e + 1)$ , where Template:Mvar is some constant that depends only on the language. The statement becomes that every Template:Mvar can be written as

s = u v w x y

with strings Template:Mvar and Template:Mvar, such that

Template:Mvar has at least one marked position and no excluded position,
Template:Mvar has at most $p^{(e + 1)}$ marked positions, and
$u v^{n} w x^{n} y \in L$ for all $n \geq 0$ .

Moreover, either each of Template:Mvar has a marked position, or each of $w, x, y$ has a marked position.

References

Template:Reflist

[1] Template:Cite journal

[2] Template:Cite conference

[3] Template:Cite book

[4] Template:Cite journal

[5] Template:Citation

[1]

[2]

[3]

[4]

[5]

Ogden's lemma

Contents

Statement

Special cases

Example applications

Non-context-freeness

Inherent ambiguity

Undecidability

Generalized condition

References

Navigation menu

Ogden's lemma

Statement

Special cases

Example applications

Non-context-freeness

Inherent ambiguity

Undecidability

Generalized condition

References

Navigation menu

Search