Running 1 Transformer Training Visualized ๐ Visualize GPT training with weights, gradients, and attention