Spaces:
Runtime error
Runtime error
Upload folder using huggingface_hub
Browse files
README.md
CHANGED
|
@@ -16,6 +16,25 @@ app_port: 7860
|
|
| 16 |
Iterative prompt refinement for text-to-image models.
|
| 17 |
Given a target image, WeavePrompt automatically generates and refines text prompts to make a model's output resemble the target image, using vision-language models and perceptual metrics.
|
| 18 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 19 |
## Features
|
| 20 |
- Upload a target image
|
| 21 |
- Step-by-step prompt optimization
|
|
|
|
| 16 |
Iterative prompt refinement for text-to-image models.
|
| 17 |
Given a target image, WeavePrompt automatically generates and refines text prompts to make a model's output resemble the target image, using vision-language models and perceptual metrics.
|
| 18 |
|
| 19 |
+
## Introduction
|
| 20 |
+
|
| 21 |
+
**WeavePrompt** is a research and development project designed to evaluate and refine text-to-image generation prompts across multiple state-of-the-art image generation models. The primary goal is to optimize prompts such that the generated images align closely with a given reference image, improving both fidelity and semantic consistency.
|
| 22 |
+
|
| 23 |
+
The process involves generating images from identical prompts using various image generation models, comparing the results to a reference image through a recognition and similarity evaluation pipeline, and iteratively adjusting the prompt to minimize perceptual differences. This feedback loop continues for a set number of iterations, progressively enhancing prompt effectiveness.
|
| 24 |
+
|
| 25 |
+
To achieve this, WeavePrompt integrates advanced tools:
|
| 26 |
+
|
| 27 |
+
Image recognition is powered by meta-llama/Llama-4-Scout-17B-16E-Instruct.
|
| 28 |
+
|
| 29 |
+
Similarity evaluation uses the LPIPS (alex) metric for perceptual comparison.
|
| 30 |
+
|
| 31 |
+
Image generation models under evaluation include:
|
| 32 |
+
- FLUX family: FLUX.1 [pro], [dev], [schnell], and FLUX.1 with LoRAs
|
| 33 |
+
- Google models: Imagen 4, Imagen 4 Ultra, and Gemini 2.5 Flash Image
|
| 34 |
+
- Other models: Stable Diffusion 3.5 Large and Qwen Image
|
| 35 |
+
|
| 36 |
+
By systematically combining prompt optimization with multi-model evaluation, WeavePrompt aims to advance the understanding of cross-model prompt effectiveness and improve controllability in image generation tasks.
|
| 37 |
+
|
| 38 |
## Features
|
| 39 |
- Upload a target image
|
| 40 |
- Step-by-step prompt optimization
|