Article
Training Design for Text-to-Image Models: Lessons from Ablations
•
49
But higher LR makes the path more volatile/unstable ? @nroggendorff
Re-LAION-Caption19M[3].aesthetic_score > 5.6 and pwatermark < 0.2 and LaMa [2] mask generation.We are starting from the "fine-mask" / "Ours (scratch)" path.
Table below from LBM paper : https://huggingface.co/papers/2503.07535