Lossless join decomposition

In database design, a lossless join decomposition is a decomposition of a relation $r$ into relations $r_{1}, r_{2}$ such that a natural join of the two smaller relations yields back the original relation. This is central in removing redundancy safely from databases while preserving the original data.^[1] Lossless join can also be called non-additive.^[2]

Definition

A relation $r$ on schema $R$ decomposes losslessly onto schemas $R_{1}$ and $R_{2}$ if $π_{R_{1}} (r) ⋈ π_{R_{2}} (r) = r$ , that is $r$ is the natural join of its projections onto the smaller schemas. A pair $(R_{1}, R_{2})$ is a lossless-join decomposition of $R$ or said to have a lossless join with respect to a set of functional dependencies $F$ if any relation $r (R)$ that satisfies $F$ decomposes losslessly onto $R_{1}$ and $R_{2}$ .^[3]

Decompositions into more than two schemas can be defined in the same way.^[4]

Criteria

A decomposition $R = R_{1} \cup R_{2}$ has a lossless join with respect to $F$ if and only if the closure of $R_{1} \cap R_{2}$ includes $R_{1} ∖ R_{2}$ or $R_{2} ∖ R_{1}$ . In other words, one of the following must hold:^[4]

$(R_{1} \cap R_{2}) \to (R_{1} ∖ R_{2}) \in F^{+}$
$(R_{1} \cap R_{2}) \to (R_{2} ∖ R_{1}) \in F^{+}$

Criteria for multiple sub-schemas

Multiple sub-schemas $R_{1}, R_{2}, . . ., R_{n}$ have a lossless join if there is some way in which we can repeatedly perform lossless joins until all the schemas have been joined into a single schema. Once we have a new sub-schema made from a lossless join, we are not allowed to use any of its isolated sub-schema to join with any of the other schemas. For example, if we can do a lossless join on a pair of schemas $R_{i}, R_{j}$ to form a new schema $R_{i, j}$ , we use this new schema (rather than $R_{i}$ or $R_{j}$ ) to form a lossless join with another schema $R_{k}$ (which may already be joined (e.g., $R_{k, l}$ )).Template:Vague

Example

Let $R = {A, B, C, D}$ be the relation schema, with attributes Template:Mvar, Template:Mvar, Template:Mvar and Template:Mvar.
Let $F = {A \to B C}$ be the set of functional dependencies.
Decomposition into $R_{1} = {A, B, C}$ and $R_{2} = {A, D}$ is lossless under Template:Mvar because $R_{1} \cap R_{2} = A$ and we have a functional dependency $A \to B C$ . In other words, we have proven that $(R_{1} \cap R_{2} \to R_{1} ∖ R_{2}) \in F^{+}$ .^[5]^[6]

References

Template:Reflist

[1] Template:Cite journal

[Elmasri-2] Template:Cite book

[3] Template:Cite book

[Ullman1988-4] 4.0 ^4.1 Template:Cite book

[5] Template:Cite web

[6] Template:Cite web

[1]

[2]

[3]

[4]

[5]

[6]

Lossless join decomposition

Contents

Definition

Criteria

Criteria for multiple sub-schemas

Example

References

Navigation menu

Lossless join decomposition

Definition

Criteria

Criteria for multiple sub-schemas

Example

References

Navigation menu

Search