Dialogue Commonsense Graph in Japanese

We propose a commonsense knowledge graph (CSKG) grounded in dialogues. This repository contains the graph built in Japanese.

Data Format

We define eight relations along three axes, following CICERO: category (event or mental state), time (before or after the utterance), and target person (speaker or listener).

Relation	Category	Time	Person
Cause	Event	Before	Speaker
Prerequisite	Event	Before	Listener
Subsequent event	Event	After	Speaker
Effect	Event	After	Listener
Motivation	Mental state	Before	Speaker
Guess	Mental state	Before	Listener
Reflection	Mental state	After	Speaker
Reaction	Mental state	After	Listener

Each utterance is annotated with inferences for each of the eight relations, and each dialogue is stored as a list of utterances. An example of an annotated utterance is shown below:

{
    "status_id": 1557164592392183808,
    "cause": [
        "暑い暑い"
    ],
    "prerequisite": [
        "確かに全然外に出てない"
    ],
    "subsequent_event": [
        "宿題が終わらない"
    ],
    "effect": [
        "じゃあ家で宿題すればちょうどいいじゃん",
        "外に出ないでエアコンが効いた部屋で過ごす"
    ],
    "motivation": [
        "暑すぎてキツイ",
        "夏が終わらないと困る"
    ],
    "guess": [
        "暑くない",
        "夏がすぐ終わると宿題が終わらなくて困る"
    ],
    "reflection": [
        "暑さには何も勝てないよ"
    ],
    "reaction": [
        "そうかもね",
        "その通りだ"
    ]
}

Note that utterance texts are not included; each utterance is identified only by its Twitter ID (the "status_id" field).
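The format described above can be read with a few lines of Python. The following is a minimal sketch, not part of the repository: it assumes that each line of graph.jsonl holds one dialogue as a JSON array of utterance objects shaped like the example, and the helper names iter_dialogues and iter_triples are hypothetical.

```python
import json
from typing import Iterable, Iterator, List, Tuple

# Relation keys as they appear in the example utterance above.
RELATIONS = [
    "cause", "prerequisite", "subsequent_event", "effect",
    "motivation", "guess", "reflection", "reaction",
]


def iter_dialogues(lines: Iterable[str]) -> Iterator[List[dict]]:
    """Yield one dialogue (a list of utterance dicts) per non-empty line.

    Assumption: every JSONL line is a JSON array of utterance objects.
    """
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


def iter_triples(dialogue: List[dict]) -> Iterator[Tuple[int, str, str]]:
    """Yield (status_id, relation, inference) for each inference in a dialogue."""
    for utt in dialogue:
        for rel in RELATIONS:
            # A relation may be absent or empty for a given utterance.
            for inference in utt.get(rel, []):
                yield utt["status_id"], rel, inference


# In-memory sample standing in for one line of the file.
sample_line = '[{"status_id": 1557164592392183808, "cause": ["暑い暑い"]}]'
for dialogue in iter_dialogues([sample_line]):
    for triple in iter_triples(dialogue):
        print(triple)
```

To read the real file, pass an open file handle instead of the sample list, for example open("graph.jsonl", encoding="utf-8"). Python parses the 19-digit status_id values exactly; in languages whose JSON parser maps numbers to 64-bit floats, the IDs exceed 2^53 and would need to be read as strings or big integers.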

We annotated 352 dialogues, comprising 2,121 utterances, with inferences. The statistics of the graph are shown below:

Relation	# triples	Avg. # inferences / utterance
Cause	3,060	1.44
Prerequisite	2,728	1.29
Subsequent event	3,001	1.41
Effect	3,276	1.54
Motivation	3,567	1.68
Guess	1,679	0.79
Reflection	1,591	0.75
Reaction	3,564	1.68
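The per-relation figures can be recomputed from the loaded data. The sketch below assumes that one triple is counted per inference and that the average divides by the total number of utterances, which is consistent with the table (for example, 3,060 / 2,121 ≈ 1.44 for Cause). The function name relation_stats is hypothetical, and the input is assumed to be an iterable of dialogues, each a list of utterance dicts.

```python
from collections import Counter
from typing import Dict, Iterable, List, Tuple

RELATIONS = [
    "cause", "prerequisite", "subsequent_event", "effect",
    "motivation", "guess", "reflection", "reaction",
]


def relation_stats(
    dialogues: Iterable[List[dict]],
) -> Tuple[Dict[str, int], Dict[str, float]]:
    """Return (triples per relation, average inferences per utterance)."""
    counts: Counter = Counter()
    n_utterances = 0
    for dialogue in dialogues:
        for utt in dialogue:
            n_utterances += 1
            for rel in RELATIONS:
                counts[rel] += len(utt.get(rel, []))
    totals = {rel: counts[rel] for rel in RELATIONS}
    # Guard against an empty input to avoid dividing by zero.
    averages = {
        rel: (totals[rel] / n_utterances if n_utterances else 0.0)
        for rel in RELATIONS
    }
    return totals, averages


# Tiny in-memory example: one dialogue with two utterances.
toy = [[
    {"status_id": 1, "cause": ["a", "b"], "guess": []},
    {"status_id": 2, "cause": ["c"], "reaction": ["d"]},
]]
totals, averages = relation_stats(toy)
print(totals["cause"], averages["cause"])
```

Keeping the counts keyed by the JSON field names (for example "subsequent_event") avoids a separate mapping to the display names used in the table.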

Reference

@InProceedings{ide_nlp2023_dcsg,
    author =    "井手竜也 and 榮田亮真 and 河原大輔 and 山崎天 and 李聖哲 and 新里顕大 and 佐藤敏紀",
    title =     "対話に基づく常識知識グラフの構築と対話応答生成に対する適用",
    booktitle = "言語処理学会第29回年次大会",
    year =      "2023",
    url =       "https://www.anlp.jp/proceedings/annual_meeting/2023/pdf_dir/H1-4.pdf",
    note =      "in Japanese"
}

License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
