What is Shopify Catalog Mapping and what does it actually control?

Catalog Mapping is a tool in the Shopify admin (Settings > Shopify Catalog Mapping) that lets merchants remap the source of three input fields: product title, product description, and product category. Each can be pointed to a metafield or metaobject reference instead of the default product attribute. It also allows custom variant grouping configuration. Catalog Mapping controls what data enters the ML inference pipeline, not what the pipeline produces. Changes are processed with a delay.

Does Shopify's product taxonomy affect how product data appears in the Catalog API?

Probably, but Shopify has not officially confirmed the mechanism. When a merchant assigns a taxonomy category to a product, Shopify unlocks category-specific metafield definitions with structured attributes. The Catalog API's inferred attributes field returns name-value pairs that match the taxonomy's attribute names exactly, which suggests a connection. Third-party analysis indicates that proper taxonomy assignment and populated category metafields improve inference accuracy, but the causal link is not officially documented.

What is the most effective action a merchant can take to improve how their data appears in the Catalog API?

Write the most complete product description you can. The Catalog API's inferred fields (uniqueSellingPoint, topFeatures, techSpecs) are generated by Shopify's ML models from your product description. A description that explicitly states features, specs, materials, use cases, and what distinguishes the product gives the model the most to work from. After that, assign the most specific available Shopify Standard Product Taxonomy category to unlock category-specific metafield definitions, and populate those metafields with structured values rather than prose. No official Shopify documentation confirms a direct input-to-output relationship for any specific inferred field.

← All Articles

The Shopify Agentic Plan: How to Get Your Product Data Ready and the Ways Shopify Machine Learning Transforms It Beyond Your Control

Q: Can merchants edit or override inferred fields in Shopify's Catalog API?

No. Merchants have no direct mechanism to edit, preview, or override the seven fields Shopify's ML generates for the Catalog API. The inferred fields are: description, uniqueSellingPoint, topFeatures, techSpecs, options, attributes, and secondhand. A Shopify staff member confirmed in February 2026 that these fields come from the catalog inference pipeline, not from merchant-provided mappings or metafields. The ML models read product descriptions to generate them, so using Shopify's full description character allowance with explicit features, specs, and use cases gives the model the most signal to work from.

Q: Is there any way to see what Shopify has inferred for my products?

Not through a merchant-facing admin tool. Developers with Catalog API access can query the Search and Lookup endpoints directly and inspect the inferred field values in the response. The Catalog Mapping tool includes an input preview, but this shows what data enters the pipeline, not the inferred output. Shopify's SimGym simulates AI shopper behavior but does not expose raw inferred field values. The Storefront MCP does not include the inferred fields that appear in the global Catalog MCP.

Tony Castillo

Published: March 25, 2026

A chrome robot octopus receives a stream of ASCII characters and pours ML-transformed product data into robotic AI model dogs — representing how Shopify's Catalog API ingests merchant product descriptions and outputs inferred fields for AI agents

Shopify’s Agentic Plan pushes your data to their Catalog API for agentic surface ingestion. Any brand, on any ecommerce platform, can now submit their products to Shopify’s agentic commerce infrastructure and get discovered by AI agents through the Catalog MCP.

What actually determines how those agents understand and recommend your products is a machine learning pipeline that transforms your product data before it ever reaches an agent. Most brands don’t know this is happening.

This article covers how the transformation works, what you can control going in, and what Shopify’s ML decides on your behalf.

The single most important input to that ML pipeline is your product description. Shopify reads it and generates seven fields that AI agents use when matching your product to buyer queries. The quality of those fields determines whether your product shows up in agent search results at all. And the description field supports up to 512 KB – roughly 67,000 words, more words than The Great Gatsby! Most product descriptions are a few hundred words. That gap is the opportunity. Here is what gets built from your description:

Shopify Admin

Product
Description

up to 512 KB

Shopify ML

►

description

Plain-text used in recommendations

uniqueSellingPoint

Value proposition for comparisons

topFeatures

Features mapped to buyer intent

techSpecs

Specs used for attribute filtering

options

Product options and variants

attributes

Name-value pairs for filtering

secondhand

Whether the variant is secondhand

These fields power the Catalog API Search step. Agents rank your product on them before ever reaching your storefront.

How your Product Data Gets Changed by Shopify Machine Learning for the Catalog API

When an agentic surface receives a buyer query, the first thing it uses for semantic matching is not your storefront copy. It is a set of machine-learning fields Shopify generates from your product description. These fields are what agents score and rank your product on during the Catalog API Search step. Whether your product surfaces at all depends on the quality of these seven fields:

Field	Endpoint	What it represents
description	Search + Lookup	Compressed plain-text product description used in recommendations
uniqueSellingPoint	Search + Lookup	The distinctive value proposition surfaced in agent comparisons
topFeatures	Search + Lookup	Array of top product features mapped to buyer intent queries
techSpecs	Search + Lookup	Array of technical specifications used for attribute filtering
options	Search + Lookup	Array of product options and variants
attributes	Search only	Name-value pairs used for structured filtering
secondhand	Search + Lookup	Whether the variant is secondhand

Here is what that transformation looks like with Glossier’s Full Orbit eye cream. We pulled the data directly from the Catalog API endpoints and here’s what we found:

Glossier Full Orbit Eye Cream

Shopify Catalog ML Transformation — Real Data from API Endpoints

Input: productDescription (merchant-written)

A 360° reset for every, single eye. A multi-benefit, 360° eye cream that tackles the concerns you care about most—immediately hydrating for up to 24 hours and depuffing while visibly brightening dark circles and smoothing the look of fine lines. Our unique lightweight, gel-cream texture (less likely to cause milia!) is fit for under eyes, on eyelids and along the orbital bone for overall fresher, brighter-looking eyes.

Key ingredients: Our lightweight, gel-cream formula is packed with powerful ingredients that do it all—hydrate, brighten, smooth, and de-puff the entire eye area for a full circle reset.

Polyglutamic Acid and Hyaluronic Acid: An ultra-hydrating Glossier-exclusive complex draws moisture into the skin for intense hydration while simultaneously helping to minimize moisture loss throughout the day.
White Hawthorn & Jasmine Flower: Together, these extracts deliver an overall brightening effect and reduce the appearance of dark circles related to two common causes: hyperpigmentation and poor microcirculation.
Niacinamide and Arctic Microalgae: Niacinamide (an active form of Vitamin B3 that helps smooth the look of the skin's surface) paired with Arctic Microalgae (an extract sourced from pure glacial waters) reduce the appearance of fine lines and help minimize under eye puffiness.

Clinical studies: In a 2 week consumer study, Full Orbit made eyes look well rested and gave the effect of 8 hours of sleep. The Full Orbit clinical and consumer study was conducted by a third-party testing facility and included 33 people between the ages of 25 and 55. They used the product in plain packaging for two weeks and completed a questionnaire based on their experience. An evaluator measured the skin immediately after application and at eight, 12 and 24 hours post-application.

100% immediately agreed Full Orbit is lightweight, easy to apply and absorbs instantly into skin.
100% immediately agreed skin looks and feels less puffy, smooth, hydrated and refreshed.
97% agreed after two weeks, fine lines and under eye dark circles look reduced.
97% agreed after two weeks, skin in the eye area looks brighter and has a more even tone.

Product details: Cruelty-free. Vegan. Dermatologist and Ophthalmologist tested. Suitable for contact lens wearers, sensitive skin and use on the upper lid.

↓ Shopify Catalog ML ↓

Output: 7 Inferred Fields (ML-generated, no merchant preview or override)

descriptionSearch + Lookup

"A hydrating, brightening eye cream that smooths fine lines and reduces dark circles and puffiness."

98 characters, compressed from ~1,200 words

uniqueSellingPointSearch + Lookup

"Combines brightening, hydrating, and smoothing effects in one tube-packaged eye cream."

topFeaturesSearch + Lookup

Brightening formula reduces the appearance of dark circles for a more awake look
Hydrating ingredients keep the under-eye area moisturized and comfortable
Smoothing action helps minimize fine lines for a youthful appearance
Depuffing effect soothes and refreshes tired eyes
Tube dispenser allows for easy, mess-free application

techSpecsSearch + Lookup

Dispenser Type: Tube

Product Form: Cream

Skin Care Effects: Brightening, Hydrating, Smoothing, Anti-dark circle, Anti-puffiness

Brand: Glossier

optionsSearch + Lookup

[ ] Single SKU, no variant options

attributesSearch only

[ ] Empty for this product

secondhandSearch + Lookup

false

The structured techSpecs (Dispenser Type, Product Form, Skin Care Effects) likely come from Shopify's Standard Product Taxonomy and category metafields, not the description text. The causal relationship is not officially confirmed.

Glossier’s description is rich and detailed. That gave the ML enough signal to produce coherent output. The risk is with products where the source material is thinner:

New product with an undeveloped description. The ML has almost nothing to work from. The inferred fields will be sparse, generic, or wrong. Agents will rank other products ahead of yours.
Reformulated product with outdated copy. The description still reflects the old formula. The ML generates inferred fields from that stale copy and agents surface the wrong version of your product to buyers.
Description written as marketing language with few specifics. Phrases like “luxurious formula” and “transformative results” give the ML no extractable features, specs, or use cases. The topFeatures and techSpecs fields come back thin or empty.
No way to review or correct before agents use it. There is no merchant-facing preview of inferred fields. By the time an agent surfaces your product with a bad uniqueSellingPoint or missing topFeatures, it has already happened.

Every one of these fields is derived from your product description. A thin description produces thin inferred fields. The GraphQL API for Agentic Plan data ingestion supports up to 512 KB, confirmed to accept and store data at that size. For context: 512 KB is roughly 67,000 words, more words than The Great Gatsby. How much of it the Catalog API reads when generating inferred fields is still unknown. Use as much of the allowance as you can. Write for the customer and the model.

Here’s what you control. Here’s what you don’t.

Most of what AI agents receive from the Catalog API is your product data as-is: title, pricing, images, variants, availability, and checkout URL. That is a direct mirror of your Shopify admin.

The seven inferred fields are the exception. They are generated by Shopify’s ML from your product description and other inputs, and they power the initial search matching step. When an agent receives a query like “brightening eye cream for dark circles,” the Catalog MCP runs a Search call that returns products ranked on these inferred fields, not on your raw product description. The Lookup step retrieves full product detail for matched results.

The clearest official confirmation of this came from the developer forum thread “UCP readiness questions” (January through February 2026):

February 3, 2026 — Liam-Shopify (Shopify Staff)

"Properties like uniqueSellingPoint, topFeatures, techSpecs are coming from the catalog, not from the merchant's mapping, or metafields."

Follow-up question — Developer

"How would merchants improve their products attributes for better discovery?"

No reply from Shopify. Here’s the full thread for reference: community.shopify.dev/t/ucp-readiness-questions/28684

The thread also reveals that dev store products are not currently included in the global catalog, which limits merchants’ ability to test inference on new products before they go live.

The actual merchant control surface has three tiers.

Tier 1 (full control): Product-level inputs feed the inference pipeline; Knowledge Base content affects brand and policy responses without going through inference at all.

Product title, description, images, options, pricing, availability
Taxonomy category and category metafields
Combined Listings configuration
Knowledge Base App content

Tier 2 (partial control): Catalog Mapping lets merchants redirect input sources to metafields or metaobjects. This affects what data enters the ML inference pipeline, not what the pipeline produces.

Product title source
Product description source
Product category source

Tier 3 (no control): Generated by Shopify’s specialized LLMs and served directly to AI agents with no merchant preview, approval, or override.

uniqueSellingPoint
topFeatures
techSpecs
attributes

The most practical starting point:

Assign the most specific taxonomy category available
Populate all category-specific metafields with structured values
Write descriptions that state features and specifications explicitly in parseable formats
Populate the Knowledge Base App with accurate FAQs and brand voice content

The merchant control gap over inferred fields is real. The unanswered community question says it plainly, and it’s the biggest gap in Shopify’s agentic commerce tooling right now.

Need a strategy for writing product descriptions that improve visibility in AI surfaces?

WISLR works with brands to audit catalog readiness for agentic commerce, including product description structure, inferred field quality, and Catalog API positioning.

Frequently Asked Questions

Can merchants edit or override inferred fields in Shopify’s Catalog API?

No. There is no direct mechanism.

uniqueSellingPoint, topFeatures, techSpecs, attributes, and description are all ML-generated
Shopify staff confirmed in February 2026 that these come from the catalog inference pipeline, not merchant mappings or metafields
The unanswered follow-up question in the community thread (“how would merchants improve their products attributes for better discovery?”) reflects the current state of documentation on this topic

Is there any way to see what Shopify has inferred for my products?

Not through a standard merchant tool.

Developers with Catalog API access can query the Search and Lookup endpoints and inspect inferred field values directly
The Catalog Mapping preview shows input data, not inferred output
SimGym tests agent behavior end-to-end but does not surface raw inferred field values
The Storefront MCP does not include the uniqueSellingPoint, topFeatures, or techSpecs fields from the global Catalog MCP

LLM Traffic Is a Blind Spot in Your Analytics. Here's Why.