Data matters: towards a data-centric theory of generalisation