Soft configuration model



In applied mathematics, the soft configuration model (SCM) is a random graph model subject to the principle of maximum entropy under constraints on the expectation of the degree sequence of sampled graphs.[1] Whereas the configuration model (CM) uniformly samples random graphs with a specific degree sequence, the SCM retains the specified degree sequence only on average over all network realizations; in this sense the SCM has much more relaxed constraints than the CM ("soft" rather than "sharp" constraints[2]). The SCM for graphs of size <math>n</math> assigns nonzero probability to every graph of size <math>n</math>, whereas the CM is restricted to graphs with precisely the prescribed degree sequence.

Model formulation

The SCM is a statistical ensemble of random graphs <math>G</math> having <math>n</math> vertices (<math>n=|V(G)|</math>) labeled <math>\{v_j\}_{j=1}^n = V(G)</math>, producing a probability distribution on <math>\mathcal{G}_n</math> (the set of graphs of size <math>n</math>). Imposed on the ensemble are <math>n</math> constraints, namely that the ensemble average of the degree <math>k_j</math> of vertex <math>v_j</math> is equal to a designated value <math>\hat{k}_j</math>, for all <math>v_j \in V(G)</math>. The model is fully parameterized by its size <math>n</math> and expected degree sequence <math>\{\hat{k}_j\}_{j=1}^n</math>. These constraints are both local (one constraint associated with each vertex) and soft (constraints on the ensemble average of certain observable quantities), and thus yield a canonical ensemble with an extensive number of constraints.[2] The conditions <math>\langle k_j \rangle = \hat{k}_j</math> are imposed on the ensemble by the method of Lagrange multipliers (see Maximum-entropy random graph model).

Derivation of the probability distribution

The probability <math>\mathbb{P}_\text{SCM}(G)</math> of the SCM producing a graph <math>G</math> is determined by maximizing the Gibbs entropy <math>S[G]</math> subject to the constraints <math>\langle k_j \rangle = \hat{k}_j,\; j=1,\ldots,n</math> and normalization <math>\sum_{G\in\mathcal{G}_n}\mathbb{P}_\text{SCM}(G)=1</math>. This amounts to optimizing the multi-constraint Lagrange function below:

<math>\mathcal{L}\left(\alpha,\{\psi_j\}_{j=1}^n\right) = -\sum_{G\in\mathcal{G}_n}\mathbb{P}_\text{SCM}(G)\log\mathbb{P}_\text{SCM}(G) + \alpha\left(1-\sum_{G\in\mathcal{G}_n}\mathbb{P}_\text{SCM}(G)\right) + \sum_{j=1}^n\psi_j\left(\hat{k}_j-\sum_{G\in\mathcal{G}_n}\mathbb{P}_\text{SCM}(G)\,k_j(G)\right),</math>

where <math>\alpha</math> and <math>\{\psi_j\}_{j=1}^n</math> are the <math>n+1</math> multipliers to be fixed by the <math>n+1</math> constraints (normalization and the expected degree sequence). Setting to zero the derivative of the above with respect to <math>\mathbb{P}_\text{SCM}(G)</math> for an arbitrary <math>G\in\mathcal{G}_n</math> yields

<math>0 = \frac{\partial\mathcal{L}\left(\alpha,\{\psi_j\}_{j=1}^n\right)}{\partial\mathbb{P}_\text{SCM}(G)} = -\log\mathbb{P}_\text{SCM}(G) - 1 - \alpha - \sum_{j=1}^n\psi_j k_j(G) \;\;\Rightarrow\;\; \mathbb{P}_\text{SCM}(G) = \frac{1}{Z}\exp\left[-\sum_{j=1}^n\psi_j k_j(G)\right],</math>

the constant <math>Z := e^{\alpha+1} = \sum_{G\in\mathcal{G}_n}\exp\left[-\sum_{j=1}^n\psi_j k_j(G)\right] = \prod_{1\le i<j\le n}\left(1+e^{-(\psi_i+\psi_j)}\right)</math>[3] being the partition function normalizing the distribution; the above exponential expression applies to all <math>G\in\mathcal{G}_n</math>, and thus is the probability distribution. Hence we have an exponential family parameterized by <math>\{\psi_j\}_{j=1}^n</math>, which are related to the expected degree sequence <math>\{\hat{k}_j\}_{j=1}^n</math> by the following equivalent expressions:
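The factorization of <math>Z</math> over vertex pairs can be verified by brute force on a small graph. The following Python sketch (illustrative only; the multiplier values are arbitrary) enumerates all graphs on three labeled vertices, computes the sum over graphs directly, and compares it against the product over pairs:

```python
import itertools
import math

# Arbitrary Lagrange multipliers psi_j (illustrative values, n = 3).
psi = [0.3, -0.1, 0.7]
n = len(psi)
pairs = list(itertools.combinations(range(n), 2))

# Z as a sum over all 2^(n choose 2) graphs: sum_G exp[-sum_j psi_j k_j(G)].
Z_sum = 0.0
for edges in itertools.product([0, 1], repeat=len(pairs)):
    k = [0] * n  # degree k_j(G) of each vertex in this graph
    for a, (i, j) in zip(edges, pairs):
        k[i] += a
        k[j] += a
    Z_sum += math.exp(-sum(p * d for p, d in zip(psi, k)))

# Z as a product over vertex pairs: prod_{i<j} (1 + exp(-(psi_i + psi_j))).
Z_prod = math.prod(1 + math.exp(-(psi[i] + psi[j])) for i, j in pairs)
print(Z_sum, Z_prod)  # the two expressions agree
```

The factorization reflects that in the SCM each edge is an independent Bernoulli variable, present with probability <math>p_{ij} = \left(e^{\psi_i+\psi_j}+1\right)^{-1}</math>, which is the same pairwise term that appears in the expected-degree expression below.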

<math>\langle k_q \rangle = \sum_{G\in\mathcal{G}_n} k_q(G)\,\mathbb{P}_\text{SCM}(G) = -\frac{\partial \log Z}{\partial \psi_q} = \sum_{j\ne q}\frac{1}{e^{\psi_q+\psi_j}+1} = \hat{k}_q,\quad q=1,\ldots,n.</math>

References

Template:Reflist