Towards Zero-Shot Human Action Understanding and Generation