Preference-Driven Demonstrations Ranking For Inverse Reinforcement Learning