Skip to content

A curated list of awesome papers, datasets, and models relevant to machine psychology.

Notifications You must be signed in to change notification settings

phoeniiix1203/awesome-machine-psychology

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 

Repository files navigation

Awesome Machine Psychology

Awesome

🌟 A curated collection of standout papers, datasets, and models in machine psychology—the fascinating study of artificial intelligence (AI) systems, especially large language models (LLMs), using experimental and theoretical methods traditionally applied in human psychology.

🎓 This project was created at OMNILab, Shanghai Jiao Tong University, by Xiangtiange Li, Qiyuan Gu, Siyu Pan, and Xinyue Zhang, under the guidance of Professor Yaohui Jin and Dr. Binglei Zhao. OMNILab is now a part of the BaiYuLan Open AI community.

💡 We welcome contributions to this collection! Please review the Contribution Guidelines to make sure your entries fit the criteria.

Table of Contents

Papers

Note: To keep paragraphs concise, we only include essential details when sorting papers by topic. Full information is provided when papers are sorted by year.

By Year

2024

LLMs achieve adult human performance on higher-order theory of mind tasks

  • PDF: https://arxiv.org/abs/2405.18870
  • Authors: Winnie Street, John Oliver Siy, Geoff Keeling, Adrien Baranes, Benjamin Barnett, Michael McKibben, Tatenda Kanyere, Alison Lentz, Blaise Aguera y Arcas, Robin I. M. Dunbar
  • Grouped by topic

Machine Psychology: Investigating Emergent Capabilities and Behavior in Large Language Models Using Psychological Methods

Testing theory of mind in large language models and humans

  • Published in: Nature Human Behavior
  • PDF: https://www.nature.com/articles/s41562-024-01882-z
  • Authors: James W. A. Strachan, Dalila Albergo, Giulia Borghini, Oriana Pansardi, Eugenio Scaliti, Saurabh Gupta, Krati Saxena, Alessandro Rufo, Stefano Panzeri, Guido Manzi, Michael S. A. Graziano & Cristina Becchio
  • Grouped by topic

Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View

InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews

PsySafe: A Comprehensive Framework for Psychological-based Attack, Defense, and Evaluation of Multi-agent System Safety

PsychoGAT: A Novel Psychological Measurement Paradigm through Interactive Fiction Games with LLM Agents

HealMe: Harnessing Cognitive Reframing in Large Language Models for Psychotherapy

CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling

Using Artificial Populations to Study Psychological Phenomena in Neural Models

Working Memory Capacity of ChatGPT: An Empirical Study

PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals

2023

Playing repeated games with Large Language Models

Inductive reasoning in humans and large language models

Large Language Models as Simulated Economic Agents: What Can We Learn from Homo Silicus?

Deception abilities emerged in large language models

Using cognitive psychology to understand GPT-3

Inducing anxiety in large language models increases exploration and bias

Human-like intuitive behavior and reasoning biases emerged in large language models but disappeared in ChatGPT

A Manager and an AI Walk into a Bar: Does ChatGPT Make Biased Decisions Like We Do?

Large Language Models Fail on Trivial Alterations to Theory-of-Mind Tasks

Sparks of Artificial General Intelligence: Early experiments with GPT-4

  • PDF: https://arxiv.org/abs/2303.12712
  • Authors: Sébastien Bubeck, Varun Chandrasekaran, Ronen Eldan, Johannes Gehrke, Eric Horvitz, Ece Kamar, Peter Lee, Yin Tat Lee, Yuanzhi Li, Scott Lundberg, Harsha Nori, Hamid Palangi, Marco Tulio Ribeiro, Yi Zhang
  • Grouped by topic

Evaluating the Moral Beliefs Encoded in LLMs

2022

Using Large Language Models to Simulate Multiple Humans and Replicate Human Subject Studies

Towards Reasoning in Large Language Models: A Survey

Evaluating Psychological Safety of Large Language Models

Language models show human-like content effects on reasoning tasks

  • PDF: https://arxiv.org/abs/2207.07051
  • Authors: Ishita Dasgupta, Andrew K. Lampinen, Stephanie C. Y. Chan, Hannah R. Sheahan, Antonia Creswell, Dharshan Kumaran, James L. McClelland, Felix Hill
  • Grouped by topic

Capturing Failures of Large Language Models via Human Cognitive Biases

Neural Theory-of-Mind? On the Limits of Social Intelligence in Large LMs

Do Large Language Models know what humans know?

Who is GPT-3? An Exploration of Personality, Values and Demographics

Emergent Analogical Reasoning in Large Language Models

Putting GPT-3's Creativity to the (Alternative Uses) Test

When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment

  • Published in: NeurIPS 2022
  • PDF: https://arxiv.org/abs/2210.01478
  • Authors: Zhijing Jin, Sydney Levine, Fernando Gonzalez, Ojasv Kamal, Maarten Sap, Mrinmaya Sachan, Rada Mihalcea, Josh Tenenbaum, Bernhard Schölkopf
  • Grouped by topic

Clinical Psychology

HealMe: Harnessing Cognitive Reframing in Large Language Models for Psychotherapy

CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling

PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals

PsychoGAT: A Novel Psychological Measurement Paradigm through Interactive Fiction Games with LLM Agents

Inducing anxiety in large language models increases exploration and bias

Evaluating Psychological Safety of Large Language Models

Cognitive Psychology

Using cognitive psychology to understand GPT-3

Towards Reasoning in Large Language Models: A Survey

Capturing Failures of Large Language Models via Human Cognitive Biases

Language models show human-like content effects on reasoning tasks

Working Memory Capacity of ChatGPT: An Empirical Study

Inductive reasoning in humans and large language models

Human-like intuitive behavior and reasoning biases emerged in large language models but disappeared in ChatGPT

A Manager and an AI Walk into a Bar: Does ChatGPT Make Biased Decisions Like We Do?

Developmental Psychology

LLMs achieve adult human performance on higher-order theory of mind tasks

Testing theory of mind in large language models and humans

Neural Theory-of-Mind? On the Limits of Social Intelligence in Large LMs

Large Language Models Fail on Trivial Alterations to Theory-of-Mind Tasks

Do Large Language Models know what humans know?

Sparks of Artificial General Intelligence: Early experiments with GPT-4

Group Psychology

Playing repeated games with Large Language Models

Intelligence Assessment

Emergent Analogical Reasoning in Large Language Models

Moral Psychology

Evaluating the Moral Beliefs Encoded in LLMs

When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment

Psychology of Creativity

Putting GPT-3's Creativity to the (Alternative Uses) Test

Psychology of Personality

InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews

Who is GPT-3? An Exploration of Personality, Values and Demographics

Social Psychology

Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View

Datasets

Note: Most of the datasets listed below are free, however, some are not.

HealMe: Please refer to HealMe: Harnessing Cognitive Reframing in Large Language Models for Psychotherapy

InCharacter: Please refer to InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews

PsySafe: Please refer to PsySafe: A Comprehensive Framework for Psychological-based Attack, Defense, and Evaluation of Multi-agent System Safety

CPsyCoun: Please refer to CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling

ChatGPT-WM: Please refer to Working Memory Capacity of ChatGPT: An Empirical Study

Using Artificial Populations to Study Psychological Phenomena in Language Models

GPT3goesPsychology: Please refer to Using cognitive psychology to understand GPT-3

Human-like intuitive behavior and reasoning biases emerged in large language models but disappeared in ChatGPT

Do Large Language Models Know What They Don’t Know?

Models

HealMe: Please refer to HealMe: Harnessing Cognitive Reframing in Large Language Models for Psychotherapy

Patient Psi: Please refer to PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals

MachineSoM: Please refer to Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View

InCharacter: Please refer to InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews

CPsyCoun: Please refer to CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling

Using Artificial Populations to Study Psychological Phenomena in Language Models

About

A curated list of awesome papers, datasets, and models relevant to machine psychology.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •