# optimizer_utils

## Classes

- `LossHistory`
- `TransformerOptimize`: Collects the standard steps to train a transformer. Call `step_loss` after computing each loss.
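
The calling pattern described above (compute a loss, then hand it to `step_loss`) can be sketched with a toy stand-in. Everything in this block is an assumption made for illustration: `ToyLossHistory`, `ToyOptimize`, their constructor arguments, and the extra `grad` parameter are hypothetical and are not the signatures of `LossHistory` or `TransformerOptimize`. The sketch uses plain Python and a one-weight model so that it runs without a deep learning framework.

```python
class ToyLossHistory:
    """Hypothetical stand-in: stores every reported loss."""

    def __init__(self):
        self.losses = []

    def note_loss(self, loss):
        self.losses.append(loss)

    def mean(self):
        # Average of all losses seen so far (0.0 before the first one).
        return sum(self.losses) / len(self.losses) if self.losses else 0.0


class ToyOptimize:
    """Hypothetical stand-in for a train-step helper (not the real class).

    It owns the bookkeeping that follows each loss: record the loss,
    accumulate the gradient, and update the weight every
    `accumulation_steps` calls.
    """

    def __init__(self, weight, learning_rate=0.05, accumulation_steps=2):
        self.weight = weight
        self.learning_rate = learning_rate
        self.accumulation_steps = accumulation_steps
        self.history = ToyLossHistory()
        self.updates = 0
        self._grad_sum = 0.0
        self._pending = 0

    def step_loss(self, loss, grad):
        """Call once after computing each loss (and, in this toy, its gradient)."""
        self.history.note_loss(loss)
        self._grad_sum += grad
        self._pending += 1
        if self._pending == self.accumulation_steps:
            # Apply the averaged gradient, then reset the accumulator.
            self.weight -= self.learning_rate * self._grad_sum / self._pending
            self._grad_sum = 0.0
            self._pending = 0
            self.updates += 1


def train(data, epochs=50):
    """Fit y = weight * x by squared error, calling step_loss per example."""
    opt = ToyOptimize(weight=0.0)
    for _ in range(epochs):
        for x, y in data:
            error = opt.weight * x - y
            loss = error ** 2
            grad = 2 * error * x  # d(loss)/d(weight), computed by hand
            opt.step_loss(loss, grad)
    return opt


optimizer = train([(1.0, 2.0), (2.0, 4.0)])
print(round(optimizer.weight, 3), optimizer.updates, optimizer.history.mean())
```

The training loop only computes the loss. The helper decides when an update actually happens, which keeps gradient accumulation and loss tracking out of the loop body. A real helper would typically take a loss tensor and run backpropagation itself rather than receive a hand-computed gradient.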