Home Random Page


CATEGORIES:

BiologyChemistryConstructionCultureEcologyEconomyElectronicsFinanceGeographyHistoryInformaticsLawMathematicsMechanicsMedicineOtherPedagogyPhilosophyPhysicsPolicyPsychologySociologySportTourism






When 8is sufficiently small, the first term is the smaller of the two, and it is well approximated by the expression 3 page



Appendix

PROOFS OF SELECTED THEOREMS

lemma 3.1 Freidlin and Wentzell, 1984). Let P be an irreducible Markov process defined î i a finite state space Z. The stationary distribution p ofP has the property that the probability of each state z is proportional to the sum of the likelihoods of its z-trees, that is,

p(z) = v(z)/ v(w), where v(z) = ^ P(T). (A.l)

:ceZ TeT;

PROOF. Let us view the states as vertices of a graph. For any two distinct states z and z' denote the edge directed from z to z' by the ordered pair (z, z'). Fix a particular state z. A z-cycle is a subset Ñ of directed edges such that: (i) Ñ contains a single directed cycle, which involves z, and (ii) for every state w not in the cycle there is a unique directed path in Ñ from w to the cycle. See Figure A.l for an illustration of this concept. Let Cz denote the set of all z-cycles. Recall that a w-tree T is a set of directed edges such that from every vertex w' Ô w there is a unique directed path from w' to w. Observe that every z-cycle Ñ can be written in the form C = TU {(zi>. z)), where T is a w-tree and w ô z. Similarly, every z-cycle can be written in the form Ñ = TU((z, w)), where T is a z-tree and w ô z. We shall exploit these equivalent representations below.

Let P be an irreducible Markov process on Z, where P:Z' denotes the probability of transiting from z to z' in one period. For any subset S of directed edges, define

P(S)= Ï

(z.z')eS

For each z e Z let

v(z) = £ P(T).

TeTt

Note that v(z) > 0 because the process is irreducible. By virtue of the connection between z-cycles and äà-trees remarked on above, we have


Figure A.l. Illustration of a z-cycle.


 

 


the identities


 

 


(A.2) (A.3)

£ P(C) = Y v(w)P„.

CeC- w.u&z

P(C) = v(z) ( P*


 

 


It follows that


 

 


J2 V(w)Pm = V(2) ( J2 P-~w)

(A.4)

w-.w^tz \u>:u&z J


 

 


We also have the identity


 

 


( £ Pzu) =

1>(Z)(1 - Pzz).
(A.5)

\w:w^z /


 

 


Combining (A.4) and (A.5) we deduce that

(A.6)

Y v(w)pwz = v(z).

WeZ

In other words, v satisfies the stationarity equation for P. Hence its normalization ä does also, so ä is the stationary distribution of P. This concludes the proof of lemma 3.1.

Theorem 3.1. Let P' be a regular perturbed Markov process and let p' be the unique stationary distribution of P' for each e > 0. Then limf _0 pe = p{) exists, and is a stationary distribution of P°. The stochastically stable states are precisely those states that are contained in the recurrent class(es) of P" having minimum stochastic potential.

proof. Let Pe be a regular, perturbed Markov process defined for all e ˆ [0. £*], and denote the recurrence classes of P° by E], E2,..., E*.

Construct a graph Ã0 having vertices 1.2............... K. where the /th vertex

corresponds to the class E;. Given two vertices 1 < / ô j < K, let ri; denote the resistance of the least resistant path in Z that begins in E, and ends in Ej. Recall that, for each j, the stochastic potential yj of the class Ej is defined to be the resistance of the least resistant j-tree in ð. To prove the theorem, we need to show that lim,_0 p' = p° exists, and that m"(z) > 0 if and only if z 6 E, where j minimizes yj.



We begin by characterizing the stochastically stable classes (and states) in terms of rooted trees whose vertex set is the entire space Z. Since P' is a regular perturbation, for every pair of distinct states z and z' there exists a number r(z, z') such that P£_. converges to at a rate on the order of £r{z-z ). (See expression 3.12.) We adopt the convention that r(z. z') = oo if P'zz. is identically zero on the interval [0. e*]. Let Ã* be the complete directed graph whose vertex set is Z such that each directed edge (z. z') has weight r(z. z'). Let T' denote the set of all z-trees on Z. Define the function y: Z —> R as follows:

y(z) = min J2 r(2'-z")- (A-7>

Note that, since P' is irreducible for each e > 0, there exists a positive probability path from each vertex z' to the vertex z. Hence there exists at least one z-tree T" for which the sum of the resistances is finite, hence y(z) is finite. We are going to show that the stochastically stable states minimize y(z). Then we shall show that ó is constant on each recurrence class Ej, and that its value is —the stochastic potential of class /defined above.

Lemma 3.2. Let P' be a regular perturbed Markov process, and for each e ˆ [0. e*] let p f be the unique stationary distribution of P'. Then lim£^o pr = p° exists, p° is a stationary distribution of P", and n°(z) > 0 if and only if ó achieves its minimum at z.

Proof. For each state 2 e Z let

= £ n Ú*-

T'eT.* <1'.2")áÒ'

By lemma 3.1, the stationary distribution // is just the normalization of v' that satisfies £ Mf (2) = 1. Let = min- y(2). We are going to show that lim,(2) > 0 if and only if ó(2) = ó. For each 2-tree Ã* e T*, the resistance of T* is

ã(Ã)= £ r(2',2"). (A.8)

(z,.z")eT

Given 2 e Z, there exists (by the definition of y) a 2-tree Ã* whose resistance equals )/(z). Consider the identity

Ï = Ï å-'^Ïã- (A.9)

tf.z")eT' (2'.ã")áÒ*

Since ó (2) is finite, it follows from the definition of the resistance function r that

0 < lim < 00 for every (2', 2") 6 T. (A.10)

If 2 does not minimize y, then r(T*) = y(z) > y, and it follows from (A.9) and (A.10) that

Bm."' Ï ^"=0.

V.neT-

By assumption, r(T) > y(z) > ó for every T 6 T*, hence

lim e~*ve(z) = 0. (A.ll)

Similarly, if ã(Ã*) = y(z) = y, we obtain

lim e->V(2) > 0. (A.12)

From lemma 3.1 we know that

Me(z) = e~fv°(z)/ £e"V(z). (A.13)

zzZ


From (A.11), (A.12), and (A.13) it follows that

lim pe(z) = 0 if y(z) > y.

e-*0*

and

lim (z) > 0 if v(z) = y.

c-> o*

In particular, we have shown that linv_»0+ Mf = exists, and that its support is precisely the set of states z that minimize y(z). Since satisfies the equation nFPe = öå for every s > 0, and by assumption Pr -> P°, we conclude that öàа = In other words, is a stationary distribution of P°. This completes the proof of lemma 3.2.

Since/i° is a sta tionary distribution of а,ä°(ã) = 0 for every transient state z. Thus to find the stochastically stable states, it suffices to compute the function ó only on the recurrent states. We claim that in fact ó is constant on each recurrence class of P°. To see this, suppose that z is in the recurrence class £; and let T* be a z-tree on Z with resistance ó (2). Let 1/ be some other vertex in E;. Choose a path of zero resistance from 2 to 1/. The edges of this path, together with the edges of T\ contain a y-tree T' that has the same resistance as T*. From this it follows that ó(y) < y(z). A symmetric argument shows that y(y) > y(2). Hence ó is constant on Ej.

To establish the theorem, it remains only to show that the value of ó on Ej is precisely the stochastic potential ó that was defined with respect to the graph Ã0.

Lemma 3.3. For every recurrence class Ej of P°, ó (2) = ó, for all z e Ej.

PROOF. Fix a particular state z, in each recurrent class Ej. We shall show first that y(Zj) < yj, and then we shall show the reverse inequality.

Fix a class Ej and a j-tree T, in Ã0 whose resistance r(T,) equals yj. For every i Ô j, there exists exactly one outgoing edge (i, /') e Tj, and its weight is r,?. Recall that Ã* is the complete directed graph with vertex set Z, and the weight on each edge (2,2') is the resistance r(2, z'). Choose a directed path in Ã* that begins at 2, and ends at zand that has total resistance r,-,>. Call this path . By construction, <*. is a least resistant path from £, to £,-. For each i Ô j, choose a set of directed edges T* in Ã* that form a z,-tree with respect to the subset of vertices £,. (Thus T* contains |E,| - 1 edges.) Since E, is a recurrence class of P°, the resistance of T* equals zero. Finally, for each transient state z £ LIE,, let be a directed path from z to UE, having zero resistance.

Let E be the union of all of the edges in all of the trees T', i ô j, together with all of the edges in the directed paths such that (;', /') e T,, and all of the edges in the directed paths for z UE,. By construction, E contains at least one directed path from every vertex in Z to the fixed vertex Zj. Therefore it contains a subset of edges that form a z;-tree T* in Ã*. The resistance of T is clearly less than or equal to the sum of the resistances of the paths C,*,, so it is less than or equal to r(T;). Thus y(Zj) < yj = r(Jj) as claimed.

To show that y(Zj) > yj, fix a class E, and a least resistant z,-tree T* among all z,-trees in the graph Ã*. Label each of the specially chosen vertices z, by the index "/" of the class E, to which it belongs. These will be called special vertices. A junction in T* is any vertex i/ with at least two incoming edges in T*. If the junction 1/ is not a special vertex, label it "i" if there exists a path of zero resistance from 1/ to E,. (There exists at least one such class because these are the recurrence classes of P°; if there are several such classes choose any one of them as label.) Every labeled vertex is either a special vertex or a junction (or both), and the label identifies a recurrence class to which there is a path of zero resistance from that point. (Since a special vertex is already in a recurrence class, the path of zero resistance in this case is trivial.)

Define the special predecessors of a state z 6 Z to be the special vertices z, that strictly precede z in the fixed tree T* (i.e., such that there is a path from z, to z in Ã-) and such that there is no other special vertex z,-- on the path from z, to z. We claim that

If Zj is a special predecessor of the labeled vertex z, then the unique path in the tree from z, to z has resistance at least rlk, where ê is the label of z. (A.14)

Property (ÀË4) clearly holds for the tree T* because any path from the special vertex z, to a vertex labeled "k" can be extended by a zero resistance path to the class £*, so the total path must have resistance at least rlk. We shall now perform certain operations on the tree T* that preserve property (A.14), and bring it into a form that can be mapped onto a j-tree in F°. We shall do this by successively eliminating all junctions that are not special vertices.

Suppose that T* contains a junction ó that is not a special vertex, and


 

 


 

let its label be "k." We distinguish two cases, depending on whether the special vertex zê is or is not a predecessor of t/ in the tree.

Case 1. If 2/t is a predecessor of 1/ (see Figures A.2 and A.3), cut off the subtree consisting of all edges and vertices that precede ó (except for the path from z* to ó and all of its predecessors) and paste them onto z*.

CASE 2. If zê is not a predecessor of ó in the tree (see Figures A.4 and



 

Î <0

Figure À.4. Case 2 surgery: before.


 

 


óò Î Î

©

ΗÎ

Figure À.5. Case 2 surgery: after.


 

 


A.5), cut off the subtree consisting of all edges and vertices that precede ó and paste them onto the tree at the vertex z*.

After the operation, remove y's label, since it is no longer a junction. Both of the above operations preserve property (A.14) because z* and ó had the same label. Each operation reduces by one the number of junctions that are not special vertices. Thus we eventually obtain a z;- tree T" in which every junction is a special vertex, property (A.14) is satisfied, and Ã** has the same resistance as the original tree T*.

Now construct a j-tree T° on the reduced graph Ã0 as follows. For every two classes /' and /' put the directed edge (i, /') in T° if and only if z, is a special predecessor of z,< in T**. By construction, T° forms a j-tree.


Let be the unique path in T" from z, to zf. By property (A.14) its resistance is at least r¹. The paths are edge-disjoint because every junction is one of the special vertices. Since T" contains their union, the resistance of T" is at least as large as the resistance of T°, that is,

y(z,) = r(TJ') > Y rif = r(T°). </.f)er°

Obviously r(T°) > Yj, since the latter is the resistance of the least resistant j-tree in Ã0. Thus y(Zj) > yj as claimed. This completes the proof of lemma 3.3, which, together with lemma 3.2 establishes theorem 3.1.

Theorem 6.2. Let Ñ be a symmetric two-person coordination game with payoff matrix

A Â

Assume that equilibrium (A, A) is strictly risk dominant, that is, a - d > b-c > 0. Let r* = (b - c)/((a - d) + (b - c)) < 1/2, and let Q be a class of graphs that are (r, k)-close-knit for some fixed r > r* and some fixed ê > 1. Given any p e (0.1), there exists a Pp such that for each fixed p > /),„ the p-inertia of the process P[fi is bounded above for all à e Q, and in particular it is bounded independently of the number of vertices in Ã.

À Â
a. a c, d
d.c b.b

proof. Let Q be as hypothesized in the theorem, and let à be a graph in Q with vertex set V and edge set £. Choose an r-close-knit set S of size k. Consider the discrete time process in which each player in S updates according to the probability distribution in (6.3) with response rate p > 0, while each player in S = V - S always chooses action  and does not update. States of this restricted process Pr s<P will be denoted by y, and states of the unrestricted process Pr fl will be denoted by x. Let Es denote the set of restricted states, and S the set of all states.

For every two nonempty subsets of vertices S', S" ñ V, let E(S'. S") be the set of all undirected edges {/, /) such that i e S' and j e S". Let e(S', S") be the number of such edges. The 2x2 game has the following potential function p:

p(A, A) = a — d p(A, Â) = 0 p(B.A) = 0 p(B, B) = b — ñ

APPENDIX

The corresponding potential function for the spatial game is

I'./leE

(Recall that all edges are assumed to have unit weight.) It is straight­forward to show (using an argument similar to that in section (6.1)) that the stationary distribution nr-s-fi(y) of Pr S /f satisfies Mr S /i(y) a for all ó 6 3s-

Let As denote the state in 3S such that everyone in S chooses action A, and everyone in S chooses action B. We claim that As uniquely maximizes p*(y) among all restricted states y. To see this, consider any restricted state ó and let S' = {/' e S:y, - B}. Substituting the values of p we obtain

p*(y) = (a- d)e(S -S\S- S') + (b - c)[«?<S\ S') + e(S', S) + e(S. S)]. and

p*(As) = (a - d)e(S, S) + (b - c)e(S, S).

It follows that

P*(As) — p*(y) = (a-d)e(S',S)-(b-c)[e(S',S') + e(S',S)l

— (a — d)e(S', S)-(b - c)le(S\ S') + e(S', V) - e(S', S)].

Thus p'(As) — p*(y) > 0 if and only if

[(a — d) + (b — c)]e(S\ S) > (b - c)[e(S', V) + e(S\ S')].

The latter holds because by assumption

e(S\ S)/[e(S', V) + e(S', S')] > r* = (b - c)/[(a - d) - (b - c)].

(Note that e(5'. V) + e(S'. S') > 0, because there are no isolated vertices.) Thus ó = As uniquely maximizes p'(y) as claimed. It follows that ^r.s.fl puts arbitrarily high probability on the state As whenever fi is sufficiently large.

Now fix p e (0,1). It follows from the preceding discussion that there exists a finite value fi(V. S. p) such that /yr-S /f(As) > 1 - p2/2 for all fi > fi(Ã. S, p). Fix such a value fi. Consider the continuous time process Pr S /f in which the agents in S update according to independent Poisson processes with unit expectation. Beginning in an initial state y", let the random variable yr denote the state of this process at time r. As ò goes to infinity, the number of transitions of the corresponding finite process Pr S /( up to time r also goes to infinity almost surely. Since P1 s /f is irreducible and aperiodic, it follows (see (3.10)) that limr-^ Pr[yr = As] = iirsp(As). Since the number of initial states is finite, there is a finite time ã(Ã. S, p, fi) such that, from any initial state y°,

Vfi > fi(Ã. S. p), Vr > ã(Ã. S, p. fi), Pr[yr = As] > 1 - p2.

Observe now that the continuous process P1 S /f and the embedded fi­nite process Prsp depend on à and S only through the configuration of internal edges that link vertices of S to other vertices of S, and on the con­figuration of external edges that link vertices of S to vertices outside of S. Since S is of size k, there are a finite number of internal edges and a finite number of ways in which they can be configured. Since S is of size ê and r-close-knit, there are a finite number of external edges, and a finite num­ber of ways in which they can be configured vis-^-vis vertices outside of S. Thus for a given r and ê there are a finite number of distinct pro­cesses Pr-S-P up to isomorphism. In particular, we can find fi(r, k. p) and r(r, k, p, fi) such that, among all graphs à in Q and all r-close-knit subsets S with ê vertices, the following holds independently of the initial state:

V/J > fi(r, k, p), Vr > r(r, k, p, fi), Pr[yr = As] > 1 - p2. (A.15)

For the remainder of the discussion, we shall fix r, k, and p as in the theorem. Let us also fix fi* > fi(r, k, p) and r* = ã(r. k. p. fi*).


Let à be a graph in Q with m vertices, and let S be an r-close-knit subset in à of size k. We denote the unrestricted process by Pr ft' and the restricted process by Pr s f. We may couple these two processes as follows. (For general results on coupled processes see Liggett (1985).) Create two disjoint isomorphic copies of the graph Ã, say Ã] and Ã2, where the /th vertex in Ï corresponds to the /'th vertex in Ã2. We will define a single continuous time process that mimics Pr /i' on Ã], and mimics Pr-S-P' on Ã2. For each state x of the unrestricted process Pr /r, let q,(A | x) denote the probability that ichooses A when /' updates, given that the current state is x. Similarly, for each state ó of the restricted pro­cess Pr S li', let q\(A | ó) denote the probability that /'chooses A when /' up­dates, given that the current state is y. Note that q\ (A | ó) = 0 for all/' e s.

The coupled process operates as follows. The state at time ã is a pair (õã, ór) where x- is the choice (Ë or B) at the /th vertex in Ï, and y] is the choice (A or Â) at the /'th vertex in Ã2. Each matched pair of vertices in the two graphs is governed by a Poisson process with unit expectation, and these processes are independent among the /// matched pairs. Thus whenever the /th agent in Ã1 updates the /'th agent in Ã2 updates, and vice versa. Let U be a random variable that is distributed uniformly on the interval [0, lj. Suppose that the /'th pair of individuals updates at time r. Draw a value of U at random, and denote it by //. The /'th indi­vidual in Ã1 chooses A if // < q,(A | xr) and chooses  if // > q,(A | xr). Similarly, the /'th individual in Ã2 chooses A if 1/ < q't(A \ y') and chooses  if 1/ > q'{A I yT).

For every two states x and ó on Ï and Ã2 respectively, write x >A ó if y, = A implies x, = A for all /'. In other words, if A appears at the /'th vertex in ó then A appears at the /'th vertex in x. It is evident that x >/t ó implies qt(A | x) > q'j(A | y) for all /'. Thus, if x >A ó and /' chooses A in the restricted process, then /' chooses A in the unrestricted process. It follows that if xr >a yr at some time r, then õã >a y' at all subsequent times r' > r.

Now let the coupled process begin in the initial state x° on Ã[ and y° on Ã2, where xf = yf for all /' e S, and ó? = Â for all /' ˆ S. Obviously we have x° >A y" initially, hence we have xr >A y' for all r > 0. From (A.15) and the choice of r* we know that

Vr > r*,Pr[yr = As] > 1 -p2,

hence

Vr > r*.Pr[xr = As] > 1 -p2.

This holds for every r-close-knit set S in Ã. Since every vertex /' is, by hypothesis, contained in such a set, it follows that

Vr > ã*. Pr[x,T = A] > 1 - p2.

Letting o?r be the proportion of individuals in Ã1 playing action A at time r, it follows that

Vr > r*,E[ar] > 1 -p2. (A.16)


We claim that this implies

Vr > r*,Pr[ar > 1 -p] > 1 -p. (A.17)

If this were false, the probability would be greater than p that more than p of the individuals at time r were playing B. But this would imply that E\a'] < 1 ~ P2' contradicting (A.16). Thus (A.17) holds. It follows that the expected waiting time in Pr p' until at least 1 -p of the individuals are playing action A is bounded above by r'/O - p). Moreover, once such a state is reached, the probability is at least 1 - p that the process is in such a state at all subsequent times. Since the time r* holds for all graphs à in the family Q, the p-inertia of the family of processes |Pr /,'lreg is bounded as claimed. This concludes the proof of theorem 6.2.

We remark that the assumption of strict risk-dominance in the theorem is important. Suppose, on the contrary, that G is a 2 x 2 coordination game in which both equilibria are weakly risk-dominant, that is, r* = 1/2. Given any graph à and any subset of vertices S, the number of internal edges in S divided by the sum of degrees of members of S is at most 1/2. Hence the conditions of the theorem cannot be satisfied, because there exists no (r, k)-close-knit graph for any value of ê such that r > r*. Suppose, however, that we consider graphs that are exactly (1 /2, fc)-close-knit for some value of k. For example, let Q be the family of graphs that consist of a number of disjoint, complete subgraphs (called cliques), each of size ê > 2. These graphs are obviously (1/2, /c)-close- knit, but the theorem fails to hold for this situation. Indeed, at any given time it is likely that about half of the cliques will be playing action A while the other half will be playing action B; moreover, the waiting time until, say, 99% of them are playing action A becomes arbitrarily large as the number of cliques goes to infinity.

Theorem 7.1. Let G be a weakly acyclic n-person game, and let P"'se be adaptive piny. If s/m is sufficiently small, the unperturbed process Pm s 0 converges with probability one to a convention from any initial state. If, in addition, e is sufficiently small, the perturbed process puts arbitrarily high probability on the convention(s) that minimize stochastic potential.

Proof. By theorem 3.1, it suffices to show that the recurrent classes of pm.s.o correSponcj one to one with the conventions whenever s/m is sufficiently small. Consider any initial state h°. There is a positive probability that all n agents in the first period sample from the most recent s entries in h°. There is also a positive probability that for the next s periods, the agents who play always draw precisely these fixed samples. (This assumes that s/m < 1 /2.) Since e = 0, the agents always choose best replies to their samples. Since all best replies have positive probability, there is a positive probability that the same strategy-tuple .v* is played for these s periods. In other words, after s periods have elapsed, the process arrives with positive probability at a state in which the last s entries equal x*. Call this state hl.

Since G is weakly acyclic, there exists a best-reply path x* = x°. x1, ..., xk ending in some strict Nash equilibrium xk = x**. If s/m is suf­ficiently small, there is a positive probability that, beginning after the history /;', x1 will be chosen for the next s periods. Then there is a posi­tive probability that x2 will be chosen for the next s periods, and so forth. Continuing in this fashion, we eventually obtain a history hk in which the last s entries consist exclusively of repetitions of x* = x*\ (All of this assumes that s/m < 1 /(ê + 1).) From hk there is a positive probability that in m — s more periods the process reaches a state /;** consisting of m repetitions of x**. Since x" is a strict Nash equilibrium, /i" is a conven­tion, that is, an absorbing state of P"' s 0. We have therefore proved that there is a positive probability of reaching a convention within a number of periods that is bounded independently of the initial state. It follows that the unperturbed process converges with probability one to a con­vention for any s, provided that s/m is sufficiently small. This proves that the recurrent classes are precisely the conventions, and theorem 7.1 follows at once from theorem 3.1.

Theorem 7.2. Let G be a generic n-person game on the finite strategy space X, and let P"'s-e be adaptive plai/. If s/m is sufficiently small and s is sufficiently large, the unperturbed process pm s 0 converges with probability one to a min­imal curb configuration. If, in addition, e is sufficiently small, the perturbed process puts arbitrarily high probability on the minimal curb configuration(s) that minimize stochastic potential.


Date: 2016-04-22; view: 562


<== previous page | next page ==>
When 8is sufficiently small, the first term is the smaller of the two, and it is well approximated by the expression 2 page | When 8is sufficiently small, the first term is the smaller of the two, and it is well approximated by the expression 4 page
doclecture.net - lectures - 2014-2024 year. Copyright infringement or personal data (0.016 sec.)