Habermas Machine dataset

This repository contains the Habermas Machine dataset, which was used to train and evaluate the Habermas Machine in: Tessler, M. H.*, Bakker, M. A.*, Jarret, D., Sheahan, H., Chadwick, M. J., Koster, R., Evans, G., Campbell-Gillingham, L., Collins, T., Parkes, D. C., Botvinick, M., & Summerfield, C. (2024). "AI can help humans find common ground in democratic deliberation". Science.

Download data files from Google Cloud Storage

You can download the data from Google Cloud Storage (total size ~450 MB):

wget https://storage.googleapis.com/habermas_machine/datasets/hm_all_candidate_comparisons.parquet
wget https://storage.googleapis.com/habermas_machine/datasets/hm_all_final_preference_rankings.parquet
wget https://storage.googleapis.com/habermas_machine/datasets/hm_all_position_statement_ratings.parquet
wget https://storage.googleapis.com/habermas_machine/datasets/hm_all_round_survey_responses.parquet
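If wget is not available, the same four files can be fetched with the Python standard library. The following is a minimal sketch, not part of this repository: the helper names dataset_urls and download_all and the default data/ target directory are assumptions made for illustration.

```python
import pathlib
import urllib.request

BASE_URL = 'https://storage.googleapis.com/habermas_machine/datasets'
FILE_NAMES = (
    'hm_all_candidate_comparisons.parquet',
    'hm_all_final_preference_rankings.parquet',
    'hm_all_position_statement_ratings.parquet',
    'hm_all_round_survey_responses.parquet',
)


def dataset_urls():
  """Maps each dataset file name to its download URL."""
  return {name: f'{BASE_URL}/{name}' for name in FILE_NAMES}


def download_all(target_dir='data'):
  """Downloads every dataset file into target_dir (hypothetical helper).

  Files that already exist locally are skipped, so the function can be
  re-run after an interrupted download without refetching everything.
  """
  target = pathlib.Path(target_dir)
  target.mkdir(parents=True, exist_ok=True)
  paths = []
  for name, url in dataset_urls().items():
    destination = target / name
    if not destination.exists():
      urllib.request.urlretrieve(url, str(destination))
    paths.append(destination)
  return paths

# Example (downloads ~450 MB in total): paths = download_all('data')
```

Note that the skip check only tests whether a file exists, so a partially written file from an aborted run should be deleted by hand before retrying.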

Load into Python

You can load the data directly into a Python Colab notebook with the following:

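import io

import pandas as pd
import requests

file_location = (
    'https://storage.googleapis.com/habermas_machine/datasets/hm_all_candidate_comparisons.parquet'
)
# Fetch the whole file into memory and fail loudly on an HTTP error.
response = requests.get(file_location)
response.raise_for_status()
# Parse the downloaded bytes as Parquet (requires pyarrow or fastparquet).
with io.BytesIO(response.content) as f:
  df = pd.read_parquet(f)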
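If the files were already downloaded with wget, they can also be read from disk instead of being fetched again. This is a minimal sketch under that assumption: the helpers table_key and load_tables are hypothetical names, and the dictionary keys are simply the file names with the hm_all_ prefix and the .parquet extension removed.

```python
import pathlib

import pandas as pd

PREFIX = 'hm_all_'


def table_key(file_name):
  """Turns 'hm_all_<name>.parquet' into '<name>' (hypothetical helper)."""
  stem = pathlib.Path(file_name).stem
  return stem[len(PREFIX):] if stem.startswith(PREFIX) else stem


def load_tables(data_dir='.'):
  """Loads every hm_all_*.parquet file in data_dir into a dict of DataFrames."""
  tables = {}
  for path in sorted(pathlib.Path(data_dir).glob(f'{PREFIX}*.parquet')):
    tables[table_key(path.name)] = pd.read_parquet(path)
  return tables

# Example: tables = load_tables('.'); print({k: v.shape for k, v in tables.items()})
```

Printing each table's shape and columns is a quick way to get oriented before starting an analysis.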

Prompted Habermas Machine

In habermas_machine/ we provide a prompted version of the Habermas Machine. The package can be installed using the following command:

pip install --upgrade git+https://github.com/google-deepmind/habermas_machine.git

For a demo of the prompted Habermas Machine, see the following colab:

There are a few key differences between the prompted version of the Habermas Machine and the fine-tuned Habermas Machine in the paper:

The prompted version of the Habermas Machine is not fine-tuned on the dataset that we provide but instead uses a more elaborate prompt and chain-of-thought to generate the candidate statements and preferences.
The prompted version of the Habermas Machine currently uses AI Studio with a Gemini backend. However, other LLM backends can be used and added to habermas_machine/llm_client/.
The reward model in the prompted version of the Habermas Machine generates a full ranking over statements given an opinion (and critique), rather than a score for each statement as in the paper. However, other reward models can be added to habermas_machine/reward_model/.
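To make the last difference concrete, the sketch below shows how per-statement scores relate to a ranking. It is illustrative only and does not use the package's API: ranking_from_scores is a hypothetical function, and the scores are made-up numbers rather than model outputs.

```python
def ranking_from_scores(scores):
  """Orders statement indices from most to least preferred.

  A score-based reward model yields one number per statement. A
  ranking-based model yields only the order, so the size of the gaps
  between statements is not available. Ties keep their original order.
  """
  return sorted(range(len(scores)), key=lambda i: -scores[i])


# Made-up scores for three candidate statements (illustration only).
example_scores = [0.2, 0.9, 0.5]
example_ranking = ranking_from_scores(example_scores)  # [1, 2, 0]
```

A ranking can always be derived from scores in this way, but scores cannot be recovered from a ranking, which matters if downstream code expects numeric rewards.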

License and disclaimer

All software is licensed under the Apache License, Version 2.0 (Apache 2.0); you may not use this file except in compliance with the Apache 2.0 license. You may obtain a copy of the Apache 2.0 license at: https://www.apache.org/licenses/LICENSE-2.0

All other materials are licensed under the Creative Commons Attribution 4.0 International License (CC-BY). You may obtain a copy of the CC-BY license at: https://creativecommons.org/licenses/by/4.0/legalcode

Unless required by applicable law or agreed to in writing, all software and materials distributed here under the Apache 2.0 or CC-BY licenses are distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the licenses for the specific language governing permissions and limitations under those licenses.

This is not an official Google product.
