Content Creation · Marketing Professionals

AI Agent for Generating Product Photos and Marketing Videos from a Reference Image

Automates the end-to-end workflow—from receiving a product photo and vision prompt to delivering an AI-generated image and a marketing video via email.

How it works
1 Step
Ingest & Backup
2 Step
Prompt Engineering & Image Creation
3 Step
Publish & Deliver
When a user submits the form, the agent saves the original photo to Google Drive as a backup and records the metadata.

Overview

End-to-end automation from input to delivery.

From a submitted photo and vision prompt, the agent converts the prompt into a production-ready instruction and generates a new AI-generated product image. It uploads the image to ImgBB to obtain a public URL and passes the URL to RunwayML to render a product marketing video. The agent monitors the rendering process and emails the customer with links to both assets, while backing up the original photo to Google Drive.


Capabilities

What Product Photo and Video AI Agent does

Ingests a reference image and vision prompt, then outputs a new AI-generated image and a marketing video, delivered by email.

01

Save the original photo to Google Drive as a backup.

02

Rewrite the user’s vision prompt into a professional image-generation prompt.

03

Generate a new AI-generated product image using OpenAI’s image editor.

04

Upload the generated image to ImgBB to obtain a public URL.

05

Render a product marketing video in RunwayML.

06

Email the user with links to the image and video via Gmail.

Why you should use AI Agent for Generating Product Photos and Marketing Videos

This AI agent eliminates manual image and video production, delivering consistent visuals quickly. It adds reliable backups and automated delivery, reducing risk and delays.

Before
Manual, slow image creation with inconsistent results.
Dependence on designers to translate prompts into visuals.
No automatic backup of the original photo.
Converting an image into a short marketing video is time-consuming.
Delays in delivering assets to marketing teams.
After
AI-generated product image consistent with brand guidelines.
Public URL available for asset use in campaigns.
Marketing video rendered and ready for distribution.
Automated asset delivery via email to customers.
Original photo backed up automatically on Google Drive.
Process

How it works

Three-step workflow that non-technical users can follow.

Step 01

Ingest & Backup

When a user submits the form, the agent saves the original photo to Google Drive as a backup and records the metadata.

Step 02

Prompt Engineering & Image Creation

GPT-4.1 rewrites the vision prompt into a production-ready image-generation prompt and uses gpt-image-1 to generate the new image.

Step 03

Publish & Deliver

Upload the generated image to ImgBB for a public URL, render the marketing video in RunwayML, poll every 30 seconds for completion, then email the user with links via Gmail.


Example

Example workflow

One realistic scenario that shows timing and outcomes.

A retailer uploads a product photo and vision prompt. Within about 12 minutes, the agent delivers a brand-new AI-generated product image and a 15-second marketing video, and emails the retailer with links to both assets and a Google Drive backup of the original photo.

Content Creation OpenAIGoogle DriveImgBBRunwayML AI Agent flow

Audience

Who can benefit

Each role gains faster access to usable visuals.

✍️ Marketing Manager

Needs rapid, on-brand visuals for campaigns.

💼 E-commerce Merchant

Updates product listings with fresh images and videos.

🧠 Creative Team Lead

Prototyping new visuals without design briefs.

Brand Manager

Maintains consistency across product visuals.

🎯 Social Media Producer

Requires short-form videos for platforms.

📋 Freelancer/Agency

Delivers complete assets to clients quickly.

Integrations

Key tools used inside the AI agent workflow.

OpenAI

Rewrite prompts and perform image editing (gpt-image-1) inside the agent.

Google Drive

Back up the original photo automatically.

ImgBB

Host the generated image and provide a public URL.

RunwayML

Render the marketing video from the generated image.

Gmail

Deliver assets to the user via email.

Applications

Best use cases

Practical scenarios where this AI agent adds value.

Launching a new product with fresh visuals in days instead of weeks.
Running seasonal campaigns with updated imagery and video.
Mass updating product catalogs with consistent visuals.
Producing short-form videos for social media ads.
Delivering client-ready assets for pitches or demos.
Automating consented asset backups and delivery workflows.

FAQ

FAQ

Common questions about how this AI agent works and its outputs.

A user must submit an existing product photo, a name, a vision prompt, and the user’s email. The agent then processes these inputs through a secure flow that backs up the original photo, rewrites the prompt, generates a new image, creates a video, and delivers the assets by email.

The agent outputs an AI-generated product image and a marketing video. The image is hosted on ImgBB and the video is delivered as a shareable asset. Both are linked in the final email sent to the user.

Video rendering can take several minutes depending on length and complexity. The system polls RunwayML every 30 seconds and continues until the video is ready, after which the email is sent. In most cases the process completes in a short session, but longer videos may extend total time.

The workflow uses API keys and OAuth2 credentials for OpenAI, Google Drive, ImgBB, RunwayML, and Gmail. Credentials are stored securely and accessed only by the agent during processing. Access is scoped to the assets being created and delivered.

Yes. The vision prompt guides image and video style, and you can adjust the input prompt to target desired aesthetics. The agent uses RunwayML capabilities to align output with the provided image and messaging, and you can iterate on prompts between submissions.

The agent requires image-generation access via the OpenAI account. If access is temporarily unavailable, the workflow can fall back to alternative prompts or pause until access is restored. In all cases, the original photo backup remains intact, and the notification email will explain any delays.

Only the backup copy stored in Google Drive is kept for archival purposes. The generation process uses the reference image to create a new product image, but the original file remains unmodified in Drive. All assets delivered to the user are links to generated content, not the original photo.


AI Agent for Generating Product Photos and Marketing Videos from a Reference Image

Automates the end-to-end workflow—from receiving a product photo and vision prompt to delivering an AI-generated image and a marketing video via email.

Use this template → Read the docs