On the Monotonicity and Rate of Convergence of the Markovian Persuasion Value
Abstract
We study a dynamic Bayesian persuasion model called Markovian persuasion, illustrated here with two players: the sender (he) and the receiver (she). In such a model, the belief of the receiver regarding the current state of a Markov chain , over a finite state space K, is controlled through signals she obtains from a sender, who observes in real time. At each stage , the receiver takes an action based on his current belief, which, together with the realized state of , determines the n-th-stage payoff of the sender. The sender’s goal in a Markovian persuasion game is to find a signaling policy that maximizes her expected -discounted sum of stage payoffs for a discount factor . We show that starting from any invariant distribution , the trajectory of the -discounted value is monotone decreasing in . By combining this result with the opposite increasing monotone trajectories found in Lehrer and Shaiderman [Lehrer E, Shaiderman D (2025) Markovian persuasion with stochastic revelations. Games Econom. Behav. 154:411–439], we are able to derive an upper bound on the rate of convergence of the -discounted values (as ) in the case where is ergodic. The results for the Markovian persuasion model are then extended to the Markov chain games model of Renault (2006).

