Scalable Learning in Latent State Sequence Models