On the Monotonicity and Rate of Convergence of the Markovian Persuasion Value

Published Online:https://doi.org/10.1287/moor.2023.0296

We study a dynamic Bayesian persuasion model called Markovian persuasion, illustrated here with two players: the sender (he) and the receiver (she). In such a model, the belief of the receiver regarding the current state of a Markov chain (Xn)n1, over a finite state space K, is controlled through signals she obtains from a sender, who observes (Xn)n1 in real time. At each stage n1, the receiver takes an action based on his current belief, which, together with the realized state of Xn, determines the n-th-stage payoff of the sender. The sender’s goal in a Markovian persuasion game is to find a signaling policy that maximizes her expected δ-discounted sum of stage payoffs for a discount factor δ[0,1). We show that starting from any invariant distribution (Xn)n1, the trajectory of the δ-discounted value is monotone decreasing in δ. By combining this result with the opposite increasing monotone trajectories found in Lehrer and Shaiderman [Lehrer E, Shaiderman D (2025) Markovian persuasion with stochastic revelations. Games Econom. Behav. 154:411–439], we are able to derive an upper bound on the rate of convergence of the δ-discounted values (as δ1) in the case where (Xn)n1 is ergodic. The results for the Markovian persuasion model are then extended to the Markov chain games model of Renault (2006).

INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.