How to train your self-supervised NLP model: Investigating pre-training objectives, data, and scale