Mitigating Semantic and Distributional Discrepancies in Natural Language Processing