Soft configuration model

From testwiki
Jump to navigation Jump to search

Template:Short description Template:Network science

In applied mathematics, the soft configuration model (SCM) is a random graph model subject to the principle of maximum entropy under constraints on the expectation of the degree sequence of sampled graphs.[1] Whereas the configuration model (CM) uniformly samples random graphs of a specific degree sequence, the SCM only retains the specified degree sequence on average over all network realizations; in this sense the SCM has very relaxed constraints relative to those of the CM ("soft" rather than "sharp" constraints[2]). The SCM for graphs of size n has a nonzero probability of sampling any graph of size n, whereas the CM is restricted to only graphs having precisely the prescribed connectivity structure.

Model formulation

The SCM is a statistical ensemble of random graphs G having n vertices (n=|V(G)|) labeled {vj}j=1n=V(G), producing a probability distribution on 𝒒n (the set of graphs of size n). Imposed on the ensemble are n constraints, namely that the ensemble average of the degree kj of vertex vj is equal to a designated value k^j, for all vj∈V(G). The model is fully parameterized by its size n and expected degree sequence {k^j}j=1n. These constraints are both local (one constraint associated with each vertex) and soft (constraints on the ensemble average of certain observable quantities), and thus yields a canonical ensemble with an extensive number of constraints.[2] The conditions ⟨kj⟩=k^j are imposed on the ensemble by the method of Lagrange multipliers (see Maximum-entropy random graph model).

Derivation of the probability distribution

The probability β„™SCM(G) of the SCM producing a graph G is determined by maximizing the Gibbs entropy S[G] subject to constraints ⟨kj⟩=k^j, j=1,…,n and normalization βˆ‘Gβˆˆπ’’nβ„™SCM(G)=1. This amounts to optimizing the multi-constraint Lagrange function below:

β„’(Ξ±,{ψj}j=1n)=βˆ’βˆ‘Gβˆˆπ’’nβ„™SCM(G)logβ„™SCM(G)+Ξ±(1βˆ’βˆ‘Gβˆˆπ’’nβ„™SCM(G))+βˆ‘j=1nψj(k^jβˆ’βˆ‘Gβˆˆπ’’nβ„™SCM(G)kj(G)),

where Ξ± and {ψj}j=1n are the n+1 multipliers to be fixed by the n+1 constraints (normalization and the expected degree sequence). Setting to zero the derivative of the above with respect to β„™SCM(G) for an arbitrary Gβˆˆπ’’n yields

0=βˆ‚β„’(Ξ±,{ψj}j=1n)βˆ‚β„™SCM(G)=βˆ’logβ„™SCM(G)βˆ’1βˆ’Ξ±βˆ’βˆ‘j=1nψjkj(G) β‡’ β„™SCM(G)=1Zexp[βˆ’βˆ‘j=1nψjkj(G)],

the constant Z:=eΞ±+1=βˆ‘Gβˆˆπ’’nexp[βˆ’βˆ‘j=1nψjkj(G)]=∏1≀i<j≀n(1+eβˆ’(ψi+ψj))[3] being the partition function normalizing the distribution; the above exponential expression applies to all Gβˆˆπ’’n, and thus is the probability distribution. Hence we have an exponential family parameterized by {ψj}j=1n, which are related to the expected degree sequence {k^j}j=1n by the following equivalent expressions:

⟨kq⟩=βˆ‘Gβˆˆπ’’nkq(G)β„™SCM(G)=βˆ’βˆ‚logZβˆ‚Οˆq=βˆ‘jβ‰ q1eψq+ψj+1=k^q, q=1,…,n.

References

Template:Reflist