Redundancy principle (biology)

From testwiki
Jump to navigation Jump to search

Template:Short description

The redundancy principle in biology[1][2][3][4][5][6][7][8][9] expresses the need of many copies of the same entity (cells, molecules, ions) to fulfill a biological function. Examples are numerous: disproportionate numbers of spermatozoa during fertilization compared to one egg, large number of neurotransmitters released during neuronal communication compared to the number of receptors, large numbers of released calcium ions during transient in cells, and many more in molecular and cellular transduction or gene activation and cell signaling. This redundancy is particularly relevant when the sites of activation are physically separated from the initial position of the molecular messengers. The redundancy is often generated for the purpose of resolving the time constraint of fast-activating pathways. It can be expressed in terms of the theory of extreme statistics to determine its laws and quantify how the shortest paths are selected. The main goal is to estimate these large numbers from physical principles and mathematical derivations.

When a large distance separates the source and the target (a small activation site), the redundancy principle explains that this geometrical gap can be compensated by large number. Had nature used less copies than normal, activation would have taken a much longer time, as finding a small target by chance is a rare event and falls into narrow escape problems.[10]

Molecular rate

The time for the fastest particles to reach a target in the context of redundancy depends on the numbers and the local geometry of the target. In most of the time, it is the rate of activation. This rate should be used instead of the classical Smoluchowski's rate describing the mean arrival time, but not the fastest. The statistics of the minimal time to activation set kinetic laws in biology, which can be quite different from the ones associated to average times.

Physical models

Stochastic process

The motion of a particle located at position Xt can be described by the Smoluchowski's limit of the Langevin equation:[11][12]

dXt=2DdBt+1γF(x)dt,

where D is the diffusion coefficient of the particle, γ is the friction coefficient per unit of mass, F(x) the force per unit of mass, and Bt is a Brownian motion. This model is classically used in molecular dynamics simulations.

Jump processes

xn+1={xna,with probability l(xn)xn+b, with probability r(xn), which is for example a model of telomere length dynamics. Here r(x)=11+βx, , with r(x)+l(x)=1.[13]

Directed motion process

Xห™=v0๐ฎ, where ๐ฎ is a unit vector chosen from a uniform distribution. Upon hitting an obstacle at a boundary point X0Ω, the velocity changes to Xห™=v0๐ฏ, where ๐ฏ is chosen on the unit sphere in the supporting half space at X0 from a uniform distribution, independently of ๐ฎ. This rectilinear with constant velocity is a simplified model of spermatozoon motion in a bounded domain Ω. Other models can be diffusion on graph, active graph motion.[14]

Mathematical formulation: Computing the rate of arrival time for the fastest

The mathematical analysis of large numbers of molecules, which are obviously redundant in the traditional activation theory, is used to compute the in vivo time scale of stochastic chemical reactions. The computation relies on asymptotics or probabilistic approaches to estimate the mean time of the fastest to reach a small target in various geometries.[15][16][17]

With N non-interacting i.i.d. Brownian trajectories (ions) in a bounded domain ฮฉ that bind at a site, the shortest arrival time is by definition

τ1=min(t1,,tN), where ti are the independent arrival times of the N ions in the medium. The survival distribution of arrival time of the fastest Pr(τ1>t) is expressed in terms of a single particle, Pr(τ1>t)=PrN(t1>t). Here Pr{t1>t} is the survival probability of a single particle prior to binding at the target.This probability is computed from the solution of the diffusion equation in a domain Ω:

p(x,t)t=DΔp(x,t) for xΩ,t>0

p(x,0)=p0(x) for xΩpn(x,t)=0 for xΩrp(x,t)=0 for xΩa,

where the boundary Ω contains NR binding sites ΩiΩ (Ωa=i=1NRΩi, Ωr=ΩΩa). The single particle survival probability is

Pr{t1>t}=Ωp(x,t)dx, so that Pr{τ1=t}=ddtPr{τ1<t}=N(Pr{t1>t})N1Pr\limits {t1=t},where

Pr{t1=t}=Ωap(x,t)ndSxand Pr{t1=t}=NRΩ1p(x,t)ndSx.

The probability density function (pdf) of the arrival time is

Pr{τ1=t}=NNR[Ωp(x,t)dx]N1Ω1p(x,t)ndSx, which gives the MFPT

τยฏ1=\limits 0Pr{τ1>t}dt=0[Pr{t1>t}]Ndt. The probability Pr{t1>t} can be computed using short-time asymptotics of the diffusion equation as shown in the next sections.

Explicit computation in dimension 1

The short-time asymptotic of the diffusion equation is based on the ray method approximation. For an semi-interval [0,[, the survival pdf is solution of

(x,t)t=D2p(x,t)x2 for x>0, t>0p(x,0)=δ(xa) for  x>0,p(0,t)=0 for t>0,

that isp(x,t)=14Dπt[exp{(xa)24Dt}exp{(x+a)24Dt}].

The survival probability with D=1 is Pr{t1>t}=\limits 0p(x,t)dx=12π\limits a/4teu2du. To compute the MFPT, we expand the complementary error function

2π\limits xeu2du=ex2xπ(112x2+O(x4))for x1, which givesτยฏ1=\limits 0[Pr{t1>t}]Ndt\limits 0exp{Nln(1e(a/4t)2(a/4t)π)}dta24\limits 0exp{Nue1uπ}du,

leading (the main contribution of the integral is near 0) to τยฏ1a24DlnNπfor N1.

This result is reminiscent of using the Gumbel's law. Similarly, escape from the interval [0,a] is computed from the infinite sum

p(x,t|y)=14Dπtn=[exp{(xy+2na)24t}exp{(x+y+2na)24t}].The conditional survival probability is approximated by[1]

Pr{t1>t|y}=\limits 0ap(x,t|y)dxds1max2tπ[ey2/4ty,e(ay)2/4tay]as t0, where the maximum occurs at δ= min[y,a-y] for 0<y<a (the shortest ray from y to the boundary). All other integrals can be computed explicitly, leading to

τยฏ1=\limits 0[Pr{t1>t}]Ndt\limits 0exp{Nln(18tδπeδ2/16t)}dtδ216Dln2Nπfor N1.

Arrival times of the fastest in higher dimensions

The arrival times of the fastest among many Brownian motions are expressed in terms of the shortest distance from the source S to the absorbing window A, measured by the distance δmin=d(S,A),where d is the associated Euclidean distance. Interestingly, trajectories followed by the fastest are as close as possible from the optimal trajectories. In technical language, the associated trajectories of the fastest among N, concentrate near the optimal trajectory (shortest path) when the number N of particles increases. For a diffusion coefficient D and a window of size a, the expected first arrival times of N identically independent distributed Brownian particles initially positioned at the source S are expressed in the following asymptotic formulas :

τยฏd1δmin24Dln(Nπ),in dim 1, valid forN1,

τยฏd2δmin24Dlog(π2N8log(1a)), in dim 2 for Nlog(1ϵ)1,

τยฏd3δmin24Dlog(N4a2π1/2δmin2), in dim 3, for Na2δmin21.

These formulas show that the expected arrival time of the fastest particle is in dimension 1 and 2, O(1/\log(N)). They should be used instead of the classical forward rate in models of activation in biochemical reactions. The method to derive formulas is based on short-time asymptotic and the Green's function representation of the Helmholtz equation. Note that other distributions could lead to other decays with respect N.

Optimal Paths

Minimizing The optimal path in large N

The optimal paths for the fastest can be found using the Wencell-Freidlin functional in the Large-deviation theory. These paths correspond to the short-time asymptotics of the diffusion equation from a source to a target. In general, the exact solution is hard to find, especially for a space containing various distribution of obstacles.

The Wiener integral representation of the pdf for a pure Brownian motion is obtained for a zero drift and diffusion tensor σ=D constant, so that it is given by the probability of a sampled path until it exits at the small window Ωaat the random time T

Pr{xN(t1,M)Ω,xN(t2,M)Ω,,xM(t)=x,tTt+Δt|x(0)=y}

=[Ω\limits Ωj=1Mdyj(2πΔt)ndetσ(x)(tj1,M))exp{12Δt[yjx(tj1,N)a(x(tj1,N))Δt]Tσ1(x(tj1,N))[yjx(tj1,N)a(x(tj1,N))Δt]}

where

Δt=t/M,tj,N=jΔt, x(t0,N)=y and yj=x(tj,N) in the product and T is the exit time in the narrow absorbing window Ωa. Finally,

τ(n)=\limits 0exp{nlogΩp(x,t|y)dx}dt=0τσPr{ Path σSn(y),τσ=t}dt,

where Sn(y) is the ensemble of shortest paths selected among n Brownian trajectories, starting at point y and exiting between time t and t+dt from the domain Ω. The probabilityPr{ Path σSn} is used to show that the empirical stochastic trajectories of Sn concentrate near the shortest paths starting from y and ending at the small absorbing window Ωa, under the condition that ϵ=|Ωa||Ω|1.  The paths of Sn(y) can be approximated using discrete broken lines among a finite number of points and we denote the associated ensemble by S~n(y).  Bayes' rule leads toPr{ Path σS~n(y)|t<τσ<t+dt}=m=0Pr{ Path σS~n(y)|m,t<τσ<t+dt}Pr{m steps} where Pr{m steps}=Pr{the paths of S~n(y)exit in m steps} is the probability that a path of S~n(y)  exits in m-discrete time steps. A path made of broken lines (random walk with a time stepΔt) can be expressed using Wiener path-integral.  The probability of a Brownian path x(s) can be expressed in the limit of a path-integral with the functional:

Pr{x(s)|s[0,t]}exp(0t|xห™|2ds).

The Survival probability conditioned on starting at y is given by the Wiener representation:

S(t|x0)=xΩdxx(0)x(t)=x๐’Ÿ(x)exp(0t|xห™|2ds),

where ๐’Ÿ(x) is the limit Wiener measure: the exterior integral is taken over all end points x and the path integral is over all paths starting from x(0). When we consider n-independent paths (σ1,..σn) (made of points with a time step Δt that exit in m-steps, the probability of such an event is

Pr{σ1,..σnSn(y)|m,τσ=mΔt}=(y0=yyjΩynΩa1(4DΔt)dm/2j=1mexp{14DΔt[|yjyj1)|2]})n(1(4DΔt)dm/2)nx๐’Ÿ(x)exp{n0mΔtxห™2ds}.Indeed, when there are n paths of m steps, and the fastest one escapes in m-steps, they should all exit in m steps. Using the limit of path integral, we get heuristically the representation

Pr{ Path σS~n(y)|m,τσ=mΔt}=(y0=yyjΩynΩa1(4DΔt)dm/2j=1mexp(14DΔt[|yjyj1)|2]))n

xΩdxx(0)=yx(t)=xD(x)exp(n0mΔtxห™2ds),

where the integral is taken over all paths starting at y(0) and exiting at time mΔt. This formula suggests that when n is large, only the paths that minimize the integrant will contribute. For large n, this formula suggests that paths that will contribute the most are the ones that will minimize the exponent, which allows selecting the paths for which the energy functional is minimal, that is

E=minX๐’ซt0Txห™2ds,

where the integration is taken over the ensemble of regular paths๐’ซt inside Ω starting at y and exiting in Ωa, defined as

๐’ซT={P(0)=y,P(T)Ωa and P(s)Ω and 0sT}.

This formal argument shows that the random paths associated to the fastest exit time are concentrated near the shortest paths. Indeed, the Euler-Lagrange equations for the extremal problem are the classical geodesics between y and a point in the narrow window Ωa.

Fastest escape from a cusp in two dimensions

The formula for the fastest escape can generalize to the case where the absorbing window is located in funnel cusp and the initial particles are distributed outside the cusp. The cusp has a size ϵ in the opening and a curvature R. The diffusion coefficient is D. The shortest arrival time, valid for large n is given by τ(n)π2R34ϵD(1cos(cϵ~)ϵ~)2log(2nπ). Hereϵ~=ϵRand c is a constant that depends on the diameter of the domain. The time taken by the first arrivers is proportional to the reciprocal of the size of the narrow target ϵ . This formula is derived for fixed geometry and large n and not in the opposite limit of large n and small epsilon.[18]

Concluding remarks

How nature sets the disproportionate numbers of particles remain unclear, but can be found using the theory of diffusion. One example is the number of neurotransmitters around 2000 to 3000 released during synaptic transmission, that are set to compensate the low copy number of receptors, so the probability of activation is restored to one.[19][20]

In natural processes these large numbers should not be considered wasteful, but are necessary for generating the fastest possible response and make possible rare events that otherwise would never happen. This property is universal, ranging from the molecular scale to the population level.[21]

Nature's strategy for optimizing the response time is not necessarily defined by the physics of the motion of an individual particle, but rather by the extreme statistics, that select the shortest paths. In addition, the search for a small activation site selects the particle to arrive first: although these trajectories are rare, they are the ones that set the time scale. We may need to reconsider our estimation toward numbers when punctioning nature in agreement with the redundant principle that quantifies the request to achieve the biological function.[21]

References

Template:Reflist

  1. โ†‘ 1.0 1.1 Template:Cite journal
  2. โ†‘ Template:Cite journal
  3. โ†‘ Template:Cite journal
  4. โ†‘ Template:Cite journal
  5. โ†‘ Template:Cite journal
  6. โ†‘ Template:Cite journal
  7. โ†‘ Template:Cite journal
  8. โ†‘ Template:Cite journal
  9. โ†‘ Template:Cite journal
  10. โ†‘ Template:Cite book
  11. โ†‘ Template:Cite bookTemplate:Page needed
  12. โ†‘ Template:Cite bookTemplate:Page needed
  13. โ†‘ Template:Cite journal
  14. โ†‘ Template:Cite journal
  15. โ†‘ Template:Cite journal
  16. โ†‘ Template:Cite journal
  17. โ†‘ Template:Cite journal
  18. โ†‘ Template:Cite journal
  19. โ†‘ Template:Cite journal
  20. โ†‘ Template:Cite bookTemplate:Page needed
  21. โ†‘ 21.0 21.1 Template:Cite journal