JiwaAI
Blog
โ†All posts
prompt-engineering
ai
product-design
southeast-asia

The Industry Template Approach: Why Food Needs Different Prompts Than Fitness

Jiwa AI Teamยท

The Problem with Generic Prompts

Ask a generic AI image generator to show a product in a "lifestyle setting" and you will usually get the same thing: a clean surface, soft natural light, maybe a plant. The image is fine. It is inoffensive. And it looks exactly like every other brand's content.

The setting a product appears in is not decorative. It carries meaning. A protein bar photographed on a marble kitchen counter signals something entirely different from the same bar resting against a padel racket on a sports court. Same product. Completely different desire.

This is the problem we set out to solve with Jiwa AI's scene template system.

What a Scene Template Actually Does

When a business onboards to Jiwa AI, the system identifies the industry โ€” food and beverage, fitness, beauty, tech, and so on. That industry classification then drives a curated set of scene templates that determine where products appear in generated images.

Each template is not just a background. It is a bundle of three things: the physical setting, the way the product is being used or held, and the supporting props that fill out the world. A fitness brand's template might describe a gym floor with chalk-dusted hands reaching for a supplement bottle. A food brand's template describes a cafรฉ counter with steam rising from a drink and warm ceramic textures nearby.

These details feed directly into the image generation pipeline. The scene sets the mood before any product even appears in frame.

Why Industry Context Changes Everything

Consider two categories and what "aspirational" means in each.

For a food brand, aspiration looks like craft, warmth, and sensory pleasure. Kitchen counters, rustic wooden boards, the golden light of a morning pour. Customers want to imagine the experience of consuming the product, so the setting needs to invite that fantasy. A cold, minimal studio shot communicates efficiency โ€” exactly the wrong message for an artisan coffee or a homemade sauce.

For a fitness brand, aspiration looks like effort, transformation, and community. Gym floors, outdoor tracks, recovery moments between sets. Customers want to see themselves working toward something. A warm kitchen aesthetic would make a protein supplement look out of place โ€” as if the brand does not understand its own customer.

The scene is not just context. It is the first argument a product makes about who it is for.

How the Mapping Works in Practice

Jiwa AI's scene templates are matched at the start of every content cycle, not on a post-by-post basis. When we analyze a business โ€” reading its website, its Instagram captions, its product descriptions โ€” we extract an industry classification alongside content keywords from its real social presence.

Those keywords matter as much as the industry label. A fitness brand that posts heavily about padel gets different scenes than one focused on weightlifting. The padel content signals outdoor courts, recovery drinks at the net, sports towels and rackets as props. The weightlifting content signals gym equipment, chalk, locker rooms. Two fitness brands, two distinct sets of templates.

This means the system is not applying a coarse filter. It is reading what a brand actually does in the world and building scene logic around that reality.

When a Business Spans Multiple Categories

Some businesses do not sit neatly in one industry. A wellness brand might sell both protein supplements and healthy snacks. A lifestyle brand might carry activewear alongside a superfood line. Category overlap is common, especially for Southeast Asian small businesses that serve broad audiences.

We handle this at the post level, not the brand level. Each post in a two-week content calendar is assigned to a specific product. The scene template is selected based on that product's own keyword signals, not the brand's top-level category. So a wellness brand's protein post gets gym-adjacent scenes while its snack post gets cafรฉ textures โ€” within the same calendar, producing variety that still feels coherent.

The brand voice and visual style stay consistent across posts. What changes is the scene logic that makes each product feel native to its own context.

Scene Templates and the Composite Pipeline

Scene templates also interact with our image compositing approach. We do not ask AI image generators to render the product itself โ€” this leads to hallucinated packaging, wrong colors, and invented brand details. Instead, the AI generates the scene and we layer the real product photo on top.

This means the scene prompt has to do real work. It needs to generate a background and lighting environment that will actually look right when a real product is placed into it. A scene described as "rustic cafรฉ counter, warm afternoon light, shallow depth of field" produces a background that makes a physical product look like it belongs there. A generic "modern interior" prompt produces a background where the composited product floats awkwardly.

Getting the scene right is getting the composite right.

The Bigger Principle

Generic prompts produce generic results not because AI tools are limited, but because generic prompts contain no information about what desirability means for a specific product in a specific context. Feeding a food product into a fitness scene, or vice versa, does not just look wrong aesthetically โ€” it communicates the wrong message to the right audience, or the right message to the wrong one.

We built the scene template system because good content marketing has always known this. Location, setting, and context are not afterthoughts. They are part of the product story.

As we expand Jiwa AI's industry coverage โ€” adding templates for beauty, tech, fashion, and more โ€” the underlying logic stays the same: understand the business, understand its products, and build image prompts that know where those products actually belong.