Text this: How social reinforcement learning can lead to metastable polarisation and the voter model.