Transformer neural networks for natural language processing