Assessing Large Language Models' Ability To Predict How Humans Balance Self-interest And The Interest Of Others · The Large Language Model Bible Contribute to LLM-Bible

Assessing Large Language Models' Ability To Predict How Humans Balance Self-interest And The Interest Of Others

Capraro Valerio, Di Paolo Roberto, Pizziol Veronica. Arxiv 2023

[Paper]    
Ethics And Bias Fairness GPT Model Architecture RAG Reinforcement Learning

Generative artificial intelligence (AI) holds enormous potential to revolutionize decision-making processes, from everyday to high-stake scenarios. By leveraging generative AI, humans can benefit from data-driven insights and predictions, enhancing their ability to make informed decisions that consider a wide array of factors and potential outcomes. However, as many decisions carry social implications, for AI to be a reliable assistant for decision-making it is crucial that it is able to capture the balance between self-interest and the interest of others. We investigate the ability of three of the most advanced chatbots to predict dictator game decisions across 108 experiments with human participants from 12 countries. We find that only GPT-4 (not Bard nor Bing) correctly captures qualitative behavioral patterns, identifying three major classes of behavior: self-interested, inequity-averse, and fully altruistic. Nonetheless, GPT-4 consistently underestimates self-interest and inequity-aversion, while overestimating altruistic behavior. This bias has significant implications for AI developers and users, as overly optimistic expectations about human altruism may lead to disappointment, frustration, suboptimal decisions in public policy or business contexts, and even social conflict.

Similar Work