On the Feature Alignment of Deep Vision Models: Explainability and Robustness Connected At Hip