Exploring Lexical Sensitivities in Word Prediction Models: A case study on BERT