Robust techniques for generating talking faces from speech