metadata
license: mit
datasets:
- WensongSong/AnyInsertion
language:
- en
base_model:
- black-forest-labs/FLUX.1-Fill-dev
Insert Anything
Wensong Song · Hong Jiang · Zongxing Yang · Ruijie Quan · Yi Yang
Zhejiang University | Harvard University | Nanyang Technological University
News
- [2025.4.25] Release AnyInsertion dataset on HuggingFace.
- [2025.4.22] Release inference and demo code on GitHub, along with the mask-prompt pretrained checkpoint.
Model Introduction
The currently released checkpoint, 20250321_steps5000_pytorch_lora_weights.safetensors, is for mask-prompt image insertion. Further checkpoints will be released in future updates.
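As a rough illustration, the released LoRA file could be attached to the base model listed in the metadata using the diffusers library. This is a minimal sketch, not the authors' inference code: it assumes a diffusers version that ships `FluxFillPipeline`, that the checkpoint loads through the standard `load_lora_weights` call, and that the file has already been downloaded. The `lora_dir` argument and the `load_insert_anything_pipeline` helper are hypothetical names introduced here. The full insertion workflow (reference image, mask, prompt construction) is in the official GitHub code.

```python
# Sketch only: attach the released LoRA checkpoint to FLUX.1-Fill-dev.
# Assumption: the checkpoint is compatible with diffusers' standard LoRA loader.

BASE_MODEL = "black-forest-labs/FLUX.1-Fill-dev"  # base model from the card metadata
WEIGHT_NAME = "20250321_steps5000_pytorch_lora_weights.safetensors"  # released checkpoint


def load_insert_anything_pipeline(lora_dir, device="cuda"):
    """Build a fill pipeline with the mask-prompt LoRA applied.

    lora_dir (hypothetical): local directory containing WEIGHT_NAME.
    """
    # Imported lazily so this module can be inspected without heavy dependencies.
    import torch
    from diffusers import FluxFillPipeline

    pipe = FluxFillPipeline.from_pretrained(BASE_MODEL, torch_dtype=torch.bfloat16)
    pipe.load_lora_weights(lora_dir, weight_name=WEIGHT_NAME)
    return pipe.to(device)
```

Calling `load_insert_anything_pipeline("path/to/checkpoint_dir")` downloads the base model on first use, which requires accepting its license on Hugging Face and a GPU with enough memory for bfloat16 inference.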
Citation
@article{song2025insert,
title={Insert Anything: Image Insertion via In-Context Editing in DiT},
author={Song, Wensong and Jiang, Hong and Yang, Zongxing and Quan, Ruijie and Yang, Yi},
journal={arXiv preprint arXiv:2504.15009},
year={2025}
}