text-generation-webui/modules/api/images.py

"""
OpenAI-compatible image generation using local diffusion models.
"""

import base64
import io
import json
import time

from PIL.PngImagePlugin import PngInfo

from .errors import ServiceUnavailableError
from modules import shared


def generations(request):
    """
    Generate images using the loaded diffusion model.
    Returns dict with 'created' timestamp and 'data' list of images.
    """
    from modules.ui_image_generation import build_generation_metadata, generate

    if shared.image_model is None:
        raise ServiceUnavailableError("No image model loaded. Load a model via the UI first.")

    width, height = request.get_width_height()

    # Build state dict: GenerationOptions fields + image-specific keys
    state = request.model_dump()
    state.update({
        'image_model_menu': shared.image_model_name,
        'image_prompt': request.prompt,
        'image_neg_prompt': request.negative_prompt,
        'image_width': width,
        'image_height': height,
        'image_steps': request.steps,
        'image_seed': request.image_seed,
        'image_batch_size': request.batch_size,
        'image_batch_count': request.batch_count,
        'image_cfg_scale': request.cfg_scale,
        'image_llm_variations': False,
    })

    # Exhaust generator, keep final result
    images = []
    for images, _ in generate(state, save_images=False):
        pass

    if not images:
        raise ServiceUnavailableError("Image generation failed or produced no images.")

    # Build response with per-batch metadata (seed increments per batch)
    base_seed = state.get('image_seed_resolved', state['image_seed'])
    batch_size = int(state['image_batch_size'])

    resp = {'created': int(time.time()), 'data': []}
    for idx, img in enumerate(images):
        batch_seed = base_seed + idx // batch_size
        metadata = build_generation_metadata(state, batch_seed)
        metadata_json = json.dumps(metadata, ensure_ascii=False)
        png_info = PngInfo()
        png_info.add_text("image_gen_settings", metadata_json)
        b64 = _image_to_base64(img, png_info)

        image_obj = {'revised_prompt': request.prompt}

        if request.response_format == 'b64_json':
            image_obj['b64_json'] = b64
        else:
            image_obj['url'] = f'data:image/png;base64,{b64}'

        resp['data'].append(image_obj)

    return resp


def _image_to_base64(image, png_info=None) -> str:
    buffered = io.BytesIO()
    image.save(buffered, format="PNG", pnginfo=png_info)
    return base64.b64encode(buffered.getvalue()).decode('utf-8')
Add an API endpoint for generating images 2025-12-03 11:50:35 -08:00			`"""`
			`OpenAI-compatible image generation using local diffusion models.`
			`"""`

			`import base64`
			`import io`
Image generation: Embed generation metadata in API image responses 2026-04-04 23:15:14 -07:00			`import json`
extensions/openai: Major openai extension updates & fixes (#3049) * many openai updates * total reorg & cleanup. * fixups * missing import os for images * +moderations, custom_stopping_strings, more fixes * fix bugs in completion streaming * moderation fix (flagged) * updated moderation categories --------- Co-authored-by: Matthew Ashton <mashton-gitlab@zhero.org> 2023-07-11 17:50:08 -04:00			`import time`
Lint the openai extension 2023-09-15 20:11:16 -07:00
Image generation: Embed generation metadata in API image responses 2026-04-04 23:15:14 -07:00			`from PIL.PngImagePlugin import PngInfo`

API: Move OpenAI-compatible API from extensions/openai to modules/api 2026-03-20 14:46:00 -03:00			`from .errors import ServiceUnavailableError`
Add an API endpoint for generating images 2025-12-03 11:50:35 -08:00			`from modules import shared`
extensions/openai: Major openai extension updates & fixes (#3049) * many openai updates * total reorg & cleanup. * fixups * missing import os for images * +moderations, custom_stopping_strings, more fixes * fix bugs in completion streaming * moderation fix (flagged) * updated moderation categories --------- Co-authored-by: Matthew Ashton <mashton-gitlab@zhero.org> 2023-07-11 17:50:08 -04:00
lint 2023-07-12 11:33:25 -07:00
Image: Simplify the API code, add the llm_variations option 2025-12-04 10:23:00 -08:00			`def generations(request):`
Add an API endpoint for generating images 2025-12-03 11:50:35 -08:00			`"""`
			`Generate images using the loaded diffusion model.`
Image: Simplify the API code, add the llm_variations option 2025-12-04 10:23:00 -08:00			`Returns dict with 'created' timestamp and 'data' list of images.`
Add an API endpoint for generating images 2025-12-03 11:50:35 -08:00			`"""`
Image generation: Embed generation metadata in API image responses 2026-04-04 23:15:14 -07:00			`from modules.ui_image_generation import build_generation_metadata, generate`
Add an API endpoint for generating images 2025-12-03 11:50:35 -08:00
			`if shared.image_model is None:`
			`raise ServiceUnavailableError("No image model loaded. Load a model via the UI first.")`

Image: Simplify the API code, add the llm_variations option 2025-12-04 10:23:00 -08:00			`width, height = request.get_width_height()`
Add an API endpoint for generating images 2025-12-03 11:50:35 -08:00
Image: Simplify the API code, add the llm_variations option 2025-12-04 10:23:00 -08:00			`# Build state dict: GenerationOptions fields + image-specific keys`
			`state = request.model_dump()`
			`state.update({`
			`'image_model_menu': shared.image_model_name,`
			`'image_prompt': request.prompt,`
			`'image_neg_prompt': request.negative_prompt,`
			`'image_width': width,`
			`'image_height': height,`
			`'image_steps': request.steps,`
			`'image_seed': request.image_seed,`
			`'image_batch_size': request.batch_size,`
			`'image_batch_count': request.batch_count,`
			`'image_cfg_scale': request.cfg_scale,`
Image: Remove llm_variations from the API 2025-12-04 17:34:17 -08:00			`'image_llm_variations': False,`
Image: Simplify the API code, add the llm_variations option 2025-12-04 10:23:00 -08:00			`})`

			`# Exhaust generator, keep final result`
			`images = []`
			`for images, _ in generate(state, save_images=False):`
			`pass`
Add an API endpoint for generating images 2025-12-03 11:50:35 -08:00
Image: Several fixes 2025-12-05 05:53:22 -08:00			`if not images:`
			`raise ServiceUnavailableError("Image generation failed or produced no images.")`

Image generation: Embed generation metadata in API image responses 2026-04-04 23:15:14 -07:00			`# Build response with per-batch metadata (seed increments per batch)`
			`base_seed = state.get('image_seed_resolved', state['image_seed'])`
			`batch_size = int(state['image_batch_size'])`

Image: Simplify the API code, add the llm_variations option 2025-12-04 10:23:00 -08:00			`resp = {'created': int(time.time()), 'data': []}`
Image generation: Embed generation metadata in API image responses 2026-04-04 23:15:14 -07:00			`for idx, img in enumerate(images):`
			`batch_seed = base_seed + idx // batch_size`
			`metadata = build_generation_metadata(state, batch_seed)`
			`metadata_json = json.dumps(metadata, ensure_ascii=False)`
			`png_info = PngInfo()`
			`png_info.add_text("image_gen_settings", metadata_json)`
			`b64 = _image_to_base64(img, png_info)`
Image: Add a revised_prompt field to API results for OpenAI compatibility 2025-12-04 17:41:09 -08:00
			`image_obj = {'revised_prompt': request.prompt}`

Image: Simplify the API code, add the llm_variations option 2025-12-04 10:23:00 -08:00			`if request.response_format == 'b64_json':`
Image: Add a revised_prompt field to API results for OpenAI compatibility 2025-12-04 17:41:09 -08:00			`image_obj['b64_json'] = b64`
extensions/openai: Major openai extension updates & fixes (#3049) * many openai updates * total reorg & cleanup. * fixups * missing import os for images * +moderations, custom_stopping_strings, more fixes * fix bugs in completion streaming * moderation fix (flagged) * updated moderation categories --------- Co-authored-by: Matthew Ashton <mashton-gitlab@zhero.org> 2023-07-11 17:50:08 -04:00			`else:`
Image: Add a revised_prompt field to API results for OpenAI compatibility 2025-12-04 17:41:09 -08:00			`image_obj['url'] = f'data:image/png;base64,{b64}'`

			`resp['data'].append(image_obj)`
extensions/openai: Major openai extension updates & fixes (#3049) * many openai updates * total reorg & cleanup. * fixups * missing import os for images * +moderations, custom_stopping_strings, more fixes * fix bugs in completion streaming * moderation fix (flagged) * updated moderation categories --------- Co-authored-by: Matthew Ashton <mashton-gitlab@zhero.org> 2023-07-11 17:50:08 -04:00
lint 2023-07-12 11:33:25 -07:00			`return resp`
Add an API endpoint for generating images 2025-12-03 11:50:35 -08:00

Image generation: Embed generation metadata in API image responses 2026-04-04 23:15:14 -07:00			`def _image_to_base64(image, png_info=None) -> str:`
Add an API endpoint for generating images 2025-12-03 11:50:35 -08:00			`buffered = io.BytesIO()`
Image generation: Embed generation metadata in API image responses 2026-04-04 23:15:14 -07:00			`image.save(buffered, format="PNG", pnginfo=png_info)`
Add an API endpoint for generating images 2025-12-03 11:50:35 -08:00			`return base64.b64encode(buffered.getvalue()).decode('utf-8')`