SUVA: A Probabilistic Framework for Auditing LLMs with an Application to Social Preferences
Published Online:23 Feb 2026https://doi.org/10.1287/isre.2024.0857
References
- (2023) How AI-based systems can induce reflections: The case of AI-augmented diagnostic work. MIS Quart. 47(4):1395–1424.Crossref, Google Scholar
- (2023) Human vs. automated sales agents: How and why customer responses shift across sales stages. Inform. Systems Res. 34(3):1148–1168.Link, Google Scholar
- (2023) Using large language models to simulate multiple humans and replicate human subject studies. Internat. Conf. Machine Learn. (PMLR, New York), 337–371.Google Scholar
- (2025) Playing repeated games with large language models. Nature Human Behav., 1–11.Google Scholar
- (2022) Language models as agent models. Findings of the Association for Computational Linguistics: EMNLP 2022, 5769–5779.Google Scholar
- (1990) Impure altruism and donations to public goods: A theory of warm-glow giving. The Econom. J. 100(401):464–477.Google Scholar
- (2024) Can generative AI improve social science? Proc. Natl. Acad. Sci. USA 121(21):e2314021121.Crossref, Google Scholar
- (2021) The next generation of research on IS use: A theoretical framework of delegation to and from agentic is artifacts. MIS Quart. 45(1):315–341.Crossref, Google Scholar
- (2021) Managing artificial intelligence. MIS Quart. 45(3):1433–1450.Crossref, Google Scholar
- (2023) Using cognitive psychology to understand GPT-3. Proc. Natl. Acad. Sci. USA 120(6):e2218523120.Crossref, Google Scholar
- (2000) ERC: A theory of equity, reciprocity, and competition. Amer. Econom. Rev. 91(1):166–193.Crossref, Google Scholar
- (2023) Using GPT for market research. Preprint, submitted March 30, https://doi.org/10.2139/ssrn.4395751.Google Scholar
- (1987) Intention, plans, and practical reason (Harvard University Press, Cambridge, MA).Google Scholar
- (2023) Playing games with GPT: What can we learn about a large language model from canonical strategic games? Preprint, submitted July 10, https://doi.org/10.2139/ssrn.4493398.Google Scholar
- (2020) Language models are few-shot learners. Preprint, submitted May 28, https://arxiv.org/abs/2005.14165.Google Scholar
- (2023) Could a large language model be conscious? Preprint, submitted March 4, https://arxiv.org/abs/2303.07103.Google Scholar
- (2023) Harms from increasingly agentic algorithmic systems. Proc. 2023 ACM Conf. Fairness Accountability Transparency (Association for Computing Machinery, New York), 651–666.Google Scholar
- (2002) Understanding social preferences with simple tests. Quart. J. Econom. 117(3):817–869.Crossref, Google Scholar
- (2009) Group identity and social preferences. Amer. Econom. Rev. 99(1):431–457.Crossref, Google Scholar
- (2023) The emergence of economic rationality of GPT. Proc. Natl. Acad. Sci. USA 120(51):e2316205120.Crossref, Google Scholar
- (2025) A manager and an AI walk into a bar: Does ChatGPT make biased decisions like we do? Manufacturing Service Oper. Management 27(2):354–368.Link, Google Scholar
- (2023) LLM-assisted content analysis: Using large language models to support deductive coding. Preprint, submitted June 23, https://arxiv.org/abs/2306.14924.Google Scholar
- (2023) Can large language models be an alternative to human evaluations? Proc. 61st Annual Meeting Assoc. Comput. Linguistics (Vol. 1 Long Papers), 15607–15631.Google Scholar
- (2017) Deep reinforcement learning from human preferences. Adv. Neural Inform. Processing Systems, vol. 30 (Curran Associates Inc., Red Hook, NY).Google Scholar
- (2024) Decision-making delegation in banks. Management Sci. 70(5):3281–3301.Link, Google Scholar
- (2006) A theory of reciprocity. Games Econom. Behav. 54(2):293–315.Crossref, Google Scholar
- (2000) Fairness and retaliation: The economics of reciprocity. J. Econom. Perspect. 14(3):159–182.Crossref, Google Scholar
- (1999) A theory of fairness, competition, and cooperation. Quart. J. Econom. 114(3):817–868.Crossref, Google Scholar
- (2020) GPT-3: Its nature, scope, limits, and consequences. Minds Machines 30:681–694.Crossref, Google Scholar
- (2002) Time discounting and time preference: A critical review. J. Econom. Literature 40(2):351–401.Crossref, Google Scholar
- (2022) Cognitive challenges in human–artificial intelligence collaboration: Investigating the path toward productive delegation. Inform. Systems Res. 33(2):678–696.Link, Google Scholar
- (2020) Artificial intelligence, values, and alignment. Minds Machines 30(3):411–437.Crossref, Google Scholar
- (2023) S3: Social-network simulation system with large language model-empowered agents. Preprint, submitted July 27, https://arxiv.org/abs/2307.14984.Google Scholar
- (1999) The belief-desire-intention model of agency. Intelligent Agents V Agents Theories Architectures Languages Fifth Internat. Workshop Proc., vol. 5 (Springer, Berlin, Heidelberg), 1–10.Google Scholar
- (2023) More than a bot? The impact of disclosing human involvement on customer interactions with hybrid service agents. Inform. Systems Res. 35(3):936–955.Google Scholar
- (2024) Can LLMs capture human preferences? Marketing Sci. 43(4):709–722.Link, Google Scholar
- (2023) Bots with feelings: Should AI agents express positive emotion in customer service? Inform. Systems Res. 34(3):1296–1311.Link, Google Scholar
- (2023) Delegation decisions in finance. Management Sci. 69(8):4828–4844.Link, Google Scholar
- (2023) Faithful question answering with Monte-Carlo planning. Preprint, submitted May 4, https://arxiv.org/abs/2305.02556.Google Scholar
- (2023) Large language models as simulated economic agents: What can we learn from homo silicus? NBER Working Paper No. 31122, National Bureau of Economic Research, Cambridge, MA.Google Scholar
- (2024) How interpretable are reasoning explanations from prompting large language models? Findings Assoc. Comput. Linguistics, 2148–2164.Google Scholar
- (2021) Augmenting medical diagnosis decisions? An investigation into physicians’ decision-making process with artificial intelligence. Inform. Systems Res. 32(3):713–735.Link, Google Scholar
- (2023) Measuring faithfulness in chain-of-thought reasoning. Preprint, submitted July 17, https://arxiv.org/abs/2307.13702.Google Scholar
- (2024) Can LLMs mimic human-like mental accounting and behavioral biases? Proc. 25th ACM Conf. Econom. Comput., 581.Google Scholar
- (2025) Latent neural coupling of risk and time preferences in LLMs mirrors human biases. Proc. 26th ACM Conf. Econom. Comput., vol. 542 (ACM, New York).Google Scholar
- (2024) Reduce disparity between LLMs and humans: Optimal LLM sample calibration. Preprint, submitted April 23, https://doi.org/10.2139/ssrn.4802019.Google Scholar
- Li P, Castelo N, Katona Z, Sarvary M (2024) Frontiers: Determining the validity of large language models for automated perceptual analysis. Marketing Sci. 43(2):254–266.Google Scholar
- (2019) Coding qualitative data: A synthesis guiding the novice. Qualitative Res. J. 19(3):259–270.Crossref, Google Scholar
- (2023) Faithful chain-of-thought reasoning. Preprint, submitted January 31, https://arxiv.org/abs/2301.13379.Google Scholar
- (2024) A Turing test of whether AI chatbots are behaviorally similar to humans. Proc. Natl. Acad. Sci. USA 121(9):e2313925121.Crossref, Google Scholar
- (2022) Who is GPT-3? An exploration of personality, values and demographics. Preprint, submitted September 28, https://arxiv.org/abs/2209.14338.Google Scholar
- (2023) A comprehensive overview of large language models. Preprint, submitted July 12, https://arxiv.org/abs/2307.06435.Google Scholar
- (2024) Influencers: The power of comments. Marketing Sci. 43(6):1153–1167.Link, Google Scholar
- (2005) Evolution of indirect reciprocity. Nature 437(7063):1291–1298.Crossref, Google Scholar
- (2022) Training language models to follow instructions with human feedback. Adv. Neural Inform. Processing Systems, vol. 35 (Curran Associates Inc., Red Hook, NY), 27730–27744.Google Scholar
- (2023) Generative agents: Interactive simulacra of human behavior. Proc. 36th Annual ACM Sympos. User Interface Software Tech. (ACM, New York), 1–22.Google Scholar
- (2023) AI psychometrics: Using psychometric inventories to obtain psychological profiles of large language models. OSF preprint.Google Scholar
- (2023) AI knowledge: Improving AI delegation through human enablement. Proc. 2023 CHI Conf. Human Factors Comput. Systems (ACM, New York), 1–17.Google Scholar
- (2004) Toward an integration of agent-and activity-centric approaches in organizational process modeling: Incorporating incentive mechanisms. Inform. Systems Res. 15(4):316–335.Link, Google Scholar
- (2019) Machine behaviour. Nature 568(7753):477–486.Crossref, Google Scholar
- (2025) TRiSM for agentic AI: A review of trust, risk, and security management in LLM-based agentic multi-agent systems. Preprint, submitted June 4, https://arxiv.org/abs/2506.04133.Google Scholar
- (2019) Human Compatible: AI and the Problem of Control (Penguin UK, London).Google Scholar
- (2021) Estimating the impact of “humanizing” customer service chatbots. Inform. Systems Res. 32(3):736–751.Link, Google Scholar
- (2024) Explainable generative AI (GenXAI): A survey, conceptualization, and research agenda. Artificial Intelligence Rev. 57(11):289.Crossref, Google Scholar
- (2024) Less artificial, more intelligent: Understanding affinity, trustworthiness, and preference for digital humans. Inform. Systems Res. 36(2):1096–1128.Link, Google Scholar
- (2023) Probing the psychology of AI models. Proc. Natl. Acad. Sci. USA 120(10):e2300963120.Crossref, Google Scholar
- (2025) The illusion of thinking: Understanding the strengths and limitations of reasoning models via the lens of problem complexity. Preprint, submitted June 16, https://arxiv.org/pdf/2506.09250.Google Scholar
- (2021) Value alignment: A formal approach. Preprint, submitted October 18, https://arxiv.org/abs/2110.09240.Google Scholar
- (2024) Rethinking interpretability in the era of large language models. Preprint, submitted January 30, https://arxiv.org/abs/2402.01761.Google Scholar
- (2024) To cot or not to cot? Chain-of-thought helps mainly on math and symbolic reasoning. Preprint, submitted September 18, https://arxiv.org/abs/2409.12183.Google Scholar
- (2024) Testing theory of mind in large language models and humans. Nature Human Behav. 8(7):1285–1295.Crossref, Google Scholar
- (2024) An examination of the use of large language models to aid analysis of textual data. Internat. J. Qualitative Methods 23:16094069241231168.Crossref, Google Scholar
- (2021) Understanding the capabilities, limitations, and societal impact of large language models. Preprint, submitted February 4, https://arxiv.org/abs/2102.02503.Google Scholar
- (2017) Direct and indirect information system use: A multimethod exploration of social power antecedents in healthcare. Inform. Systems Res. 28(4):690–710.Link, Google Scholar
- (1991) Loss aversion in riskless choice: A reference-dependent model. Quart. J. Econom. 106(4):1039–1061.Crossref, Google Scholar
- (1992) Advances in prospect theory: Cumulative representation of uncertainty. J. Risk Uncertainty 5:297–323.Crossref, Google Scholar
- (2023) Friend or foe? Teaming between artificial intelligence and workers with variation in experience. Management Sci. 70(9):5753–5775.Link, Google Scholar
- (2024) A survey on large language model based autonomous agents. Frontiers Comput. Sci. 18(6):186345.Crossref, Google Scholar
- (2023) Emergent analogical reasoning in large language models. Nature Human Behav. 7(9):1526–1541.Crossref, Google Scholar
- (2022) Chain-of-thought prompting elicits reasoning in large language models. Adv. Neural Inform. Processing Systems, vol. 35 (Curran Associates Inc., Red Hook, NY), 24824–24837.Google Scholar
- (2024) Can large language model agents simulate human trust behaviors? Preprint, submitted February 7, https://arxiv.org/abs/2402.04559.Google Scholar

