The Portal Dialogue Corpus
1 UC Berkeley, 2 NYU, 3 Saarland University; * Equal contribution
We collected a dataset of spoken human dialogues in Portal 2's cooperative mode, comprising 11.5 hours of gameplay from 36 participants. We release transcripts of these dialogues, along with gameplay videos and linguistic annotations, to support research on collaborative language use.
Background
Portal is a first-person puzzle platformer in which players place portals to navigate a series of increasingly difficult puzzles. Its sequel, Portal 2, introduces a cooperative mode in which players must collaborate to solve puzzles. We collected recordings from 18 pairs of players (36 participants in total), with each pair playing for up to one hour.
The Corpus
On Hugging Face, we share transcripts of the dialogues along with demo files, which can be used to recover the underlying game state. We also share video recordings of each game on our YouTube channel. We cannot share the audio recordings directly due to an IRB agreement, but you can request access through this form.
Example Clips
Select a clip below to view the corresponding video and transcript. These clips highlight key examples of collaborative dialogue from our paper. You can also explore the full dataset on the Data Explorer.