transformer_optimize# Classes LossHistory TransformerOptimize Collects standard steps to train transformer call step_loss after computing each loss