Scene segmentation using similarity, motion and depth based cues