Beyond Maximum-Likelihood Training: Analysis And Methods For Building Robust Language Generation Models