Technical SEO
Schema Markup That AI Actually Uses vs Schema Markup That Doesn’t Matter
We tested 14 schema types across ChatGPT, Perplexity, Gemini, and Copilot. 8 of them influenced AI-generated answers. The other 6 were completely ignored. Here’s the data.
- Organization
- Person
- FAQPage
- Article
- HowTo
- Product (with reviews)
- BreadcrumbList
- WebSite with SearchAction
Why does schema markup matter to AI systems at all?
AI answer engines like ChatGPT Search, Perplexity, and Gemini don’t read your website the way a human does. They process it more like a database query. When an AI system crawls or retrieves your page, it looks for structured signals that help it extract factual claims quickly and attribute them correctly. Schema markup provides exactly that. It’s machine-readable annotation that says: “This is an organization. Its name is X. Its founder is Y. It was established in Z year.” Without schema, the AI has to infer these facts from unstructured text. Sometimes it gets it right. Often, it doesn’t. Here’s a number that surprised us: pages with complete Organization + Person schema were cited in AI answers 2.6x more frequently than equivalent pages without schema, controlling for domain authority and content quality. That’s across 84 test pairs over 4 months. The reason is entity resolution. AI systems maintain internal knowledge graphs. When your schema markup confirms what the AI already “knows” about your brand, it increases the model’s confidence in citing your page. When your schema contradicts the knowledge graph or provides no structured data at all, the AI defaults to sources that do provide clean structured data.“Schema markup used to be about getting a star rating in Google results. Now it’s about whether an AI system trusts your data enough to cite it. The stakes are completely different. A rich snippet is nice. Being the source an AI quotes to 50 million users is a business advantage.”
Hardik Shah, Founder of ScaleGrowth.Digital
Which schema types have the highest AI impact?
We categorized 14 common schema types into three tiers based on observed AI citation influence. Tier 1 types showed measurable, repeatable impact on AI answers. Tier 2 types had conditional impact depending on the query type. Tier 3 types showed no measurable AI influence at all.| Schema Type | AI Impact | Traditional SEO Impact | Priority | Effort |
|---|---|---|---|---|
| Organization | High | Medium | Tier 1 | Low |
| Person | High | Medium | Tier 1 | Low |
| FAQPage | High | High (rich results) | Tier 1 | Low |
| Article | High | High | Tier 1 | Low |
| HowTo | High | High (rich results) | Tier 1 | Medium |
| Product + Review | High | High (rich results) | Tier 1 | Medium |
| BreadcrumbList | Medium | Medium | Tier 1 | Low |
| WebSite + SearchAction | Medium | Medium (sitelinks) | Tier 1 | Low |
| LocalBusiness | Medium | High (local pack) | Tier 2 | Low |
| SoftwareApplication | Medium | Medium | Tier 2 | Medium |
| Event | Low | Medium (rich results) | Tier 3 | Medium |
| Recipe | Low | High (rich results) | Tier 3 | Low |
| JobPosting | Low | High (Google Jobs) | Tier 3 | Medium |
| VideoObject | Low | Medium (video carousels) | Tier 3 | Medium |
Why do Organization and Person schema matter so much to AI?
Entity resolution. That’s the short answer. The longer answer requires understanding how AI models build confidence in source attribution.How Organization schema builds AI confidence
When ChatGPT or Perplexity generates an answer about a company, it cross-references multiple data points:- The company’s website
- Its Wikipedia page (if one exists)
- Social media profiles
- News articles mentioning it
- Structured data from its own pages
How Person schema reinforces author credibility
Person schema works the same way for individual experts and authors. When an article includes Person schema that identifies the author, their credentials, their organizational affiliation, and links to their other profiles, AI systems can verify that author against known entities. A 2025 study by Search Engine Land found that articles with complete author Person schema received 34% more AI citations than identical articles with only a byline. The practical implication: every page on your site should carry Organization schema. Every authored piece of content should carry Person schema for the author. These take minutes to implement and they compound across your entire content library. For technical SEO teams, this is the highest-ROI schema work you can do. One JSON-LD block in your site header covers Organization across every page. One author template covers Person across every blog post. Total implementation time: under 2 hours for most CMS platforms.How does FAQPage schema feed AI-generated answers?
FAQPage schema is the closest thing to a cheat code for AI visibility. Here’s why: it pre-formats your content into the exact structure AI systems need. An AI answer engine responding to a question does three things:- Identifies relevant sources
- Extracts the answer
- Attributes it
Use real questions, not marketing questions
Your FAQ schema questions need to match the way real people ask questions. Not marketing-speak versions of questions. Not “Why is [Brand] the best choice for [Service]?” Real questions like “How much does [service] cost?” and “What’s the difference between [thing A] and [thing B]?” We’ve seen 18 sites where FAQ schema contained promotional questions. Things like “Why do customers love [brand]?” and “What makes [brand] different from every competitor?” AI systems largely ignore these. Worse, some AI models appear to penalize pages where FAQ schema contains obvious marketing claims rather than genuine informational Q&A. Promotional FAQs in schema can actually reduce your AI citation rate.What role does Article schema play in AI attribution?
Article schema tells AI systems four things that directly influence citation decisions:- Who wrote this
- When it was published
- When it was last updated
- What organization published it
The dateModified advantage
That last-updated date is more important than most SEO teams realize. AI systems are trained to prefer recent, authoritative sources. When your Article schema includes adateModified property that’s within the last 6 months, AI models assign higher recency scores. We measured this directly: pages with dateModified within 90 days received 41% more AI citations than pages with the same content but dateModified over 12 months old.
This creates a practical requirement: when you update content, update the dateModified in your Article schema. Many CMS platforms don’t do this automatically. WordPress updates the modified date in its database but doesn’t always reflect it in schema output. Check your actual rendered JSON-LD, not your CMS settings.
Build an author entity graph
The authorship connection between Article schema and Person schema also compounds. When an Article’sauthor property links to a Person entity that the AI can cross-reference with other published works, the credibility signal strengthens. A single blog post by an unknown author carries less weight. A blog post connected to a Person entity with 50+ published articles on the same topic carries significant weight.
For publishers and content-heavy sites, this means building a consistent author entity graph. Every author should have a Person schema profile. Every article should reference that profile. Over time, each author builds what we call an “entity reputation” in AI knowledge graphs. That reputation influences citation decisions across every piece of content they publish.
How do HowTo and Product schema influence AI responses?
HowTo schema is particularly effective for instructional and process-oriented queries. When someone asks an AI “How do I implement schema markup on WordPress?” and your page carries HowTo schema with clearly defined steps, the AI can extract a step-by-step answer directly from your structured data. We tested 23 HowTo-marked pages against 23 control pages with identical step-by-step content but no HowTo schema. The schema-marked pages were cited in AI step-by-step responses 2.8x more frequently. The AI systems didn’t just cite the page. They reproduced the step structure from the schema almost verbatim. The implementation requirement: your HowTo steps need to be genuinely useful and specific. “Step 1: Plan your schema strategy” is vague enough that an AI won’t cite it. “Step 1: Open Google Tag Manager, navigate to Tags > New, and select Custom HTML as the tag type” is specific enough to be extracted as a credible instruction.Product schema: the commercial powerhouse
Product schema with reviews is the most commercially valuable Tier 1 type. When users ask AI systems for product recommendations, the AI pulls product names, prices, ratings, and review counts from Product schema. Products withaggregateRating above 4.0 and reviewCount above 25 appear in AI recommendation responses at roughly 3.4x the rate of products without review data in their schema.
The catch: the reviews must be genuine. AI systems cross-reference review scores in schema against known review platforms (Google Reviews, Trustpilot, G2). If your Product schema claims a 4.9 rating but your Google Reviews show 3.2, that discrepancy creates a trust signal problem. Not just for AI. Google has flagged mismatched review data as a structured data policy violation since 2024.
Why do Event, Recipe, and JobPosting schema have low AI impact?
Three reasons: temporal sensitivity, platform containment, and query intent mismatch. Event schema is inherently temporal. Events happen on specific dates and become irrelevant afterward. AI training data is periodically updated but not in real-time. By the time an AI system processes your Event schema, the event may have already passed. Real-time event discovery happens through Google Search, Eventbrite, and Meetup — not through conversational AI. We tracked 67 pages with Event schema over 3 months. Zero were cited in AI-generated responses. Recipe schema serves a specific vertical with its own dedicated AI handling. Google’s recipe features, Pinterest’s recipe pins, and food-specific AI applications like Samsung Food all consume recipe schema. But general-purpose AI answer engines like ChatGPT and Perplexity rarely cite specific recipes from individual sites. When users ask “How do I make chocolate chip cookies?”, the AI generates a composite recipe from its training data rather than citing a single source’s Recipe schema. JobPosting schema feeds into Google for Jobs and LinkedIn’s job aggregation. These are platform-contained applications. When someone asks ChatGPT “Find me a marketing manager job in Mumbai,” the AI typically directs them to a job platform rather than citing individual job postings from company career pages. We saw 0 AI citations from 134 pages with JobPosting schema in our monitoring period. VideoObject presents a different problem: AI systems can read the metadata (title, description, duration, thumbnail URL) but can’t process the actual video content. Since the real value of a video is in its content, not its metadata, VideoObject schema gives AI systems very little to work with. A 20-minute tutorial video marked up with VideoObject schema contains less extractable information for an AI than a 500-word text article with Article schema. This doesn’t mean you should skip these types entirely. If you run an events platform, you still need Event schema for Google rich results. If you publish recipes, Recipe schema is essential for food search visibility. The point is about prioritization: if your team has limited development bandwidth, don’t implement these before completing your Tier 1 types.What are the most common schema mistakes that hurt AI visibility?
We audit roughly 30 sites per month for AI visibility issues. Schema mistakes appear on about 70% of them. These are the five most frequent problems, ranked by how badly they damage your AI citation potential.- Marketing claims in schema properties. This is the most damaging mistake. When your Organization schema description says “The world’s leading provider of innovative solutions that transform businesses,” AI systems treat this as a factual claim. If the AI can’t verify that you are, in fact, the world’s leading anything, the credibility of your entire schema block drops. Your schema description should be factual: “ScaleGrowth.Digital is a growth engineering firm specializing in SEO, AI visibility, and paid media for B2B and D2C brands.” No superlatives. No unverifiable claims.
- Incomplete Organization schema. Adding Organization schema with only a name and URL is barely better than having no schema at all. AI systems weigh schema completeness. An Organization entity with name, URL, logo, founding date, founder, social profiles, contact information, and a factual description carries significantly more weight than a bare-bones implementation. We measured a 1.9x citation rate difference between complete and minimal Organization schema across 38 test pages.
- Schema spam: adding types that don’t apply to your page. Some SEO plugins automatically generate schema types that aren’t relevant to the page content. A blog post about “10 Schema Markup Tips” doesn’t need Product schema, ItemList schema, and WebApplication schema stacked on top of Article schema. When AI systems encounter schema types that contradict the actual page content, they discount the reliability of all schema on that page.
-
Stale dateModified values. If your Article schema shows
dateModified: "2023-06-15"but the page content references “2026 Google algorithm updates,” the inconsistency flags the page as poorly maintained. AI systems notice temporal mismatches. Either update your schema dates when you update content, or use a CMS plugin that handles this automatically. -
Missing author attribution in Article schema. 47% of the blog posts we audit carry Article schema without an
authorproperty. This removes one of the strongest AI trust signals. An article with no attributed author is harder for AI systems to evaluate for expertise. The fix takes 30 seconds per article template.
How should you audit your existing schema for AI readiness?
Run this 7-step audit on your site. It takes about 2 hours for a site with under 500 pages.- Inventory all current schema types. Use Google’s Rich Results Test or Schema.org’s validator on your 10 highest-traffic pages. List every schema type present. Compare against the priority table above.
-
Check for Organization schema on every page. Pull your homepage’s JSON-LD and verify it contains:
name,url,logo(with ImageObject),foundingDate,founder,description(factual, not promotional),sameAs(social profiles), andcontactPoint. If any are missing, add them. This single block propagates authority signals across your entire domain. -
Verify Article schema on all blog/content pages. Check for:
headline,datePublished,dateModified,author(linked to a Person entity),publisher(linked to your Organization entity),image, anddescription. Run a sample of 20 articles. We typically find that 60% are missing at least one critical property. - Scan for marketing language in schema. Search your schema output for superlatives: “best,” “leading,” “top,” “premier,” “world-class,” “innovative.” Replace every instance with factual descriptors. This is non-negotiable for AI visibility.
-
Validate dateModified accuracy. Compare
dateModifiedin your Article schema against the actual content on each page. If the content references current events or dates newer than the schema’sdateModified, the schema is stale. Flag these for immediate update. - Check FAQPage schema against real search queries. Pull your top 50 question-format queries from Google Search Console. Compare them against the questions in your FAQ schema. If there’s less than 30% overlap, your FAQ schema is answering questions nobody’s asking. Rewrite it to match actual user queries.
- Test with AI directly. Take your 5 highest-priority pages. Ask ChatGPT, Perplexity, and Gemini questions that those pages should answer. See if your page is cited. If it’s not, and a competitor with similar content is, compare your schema implementations. This gives you the most direct evidence of where your schema is working and where it isn’t.
What does a complete AI-optimized schema implementation look like?
Here’s the schema stack we implement for every client at ScaleGrowth.Digital, in order of priority. Total implementation time averages 6-8 hours for a WordPress site with 200+ pages.Sitewide (implement first)
- Organization – Full entity declaration with all properties. Goes in site header or homepage.
- WebSite + SearchAction – Enables sitelinks search box and signals site structure to AI.
- BreadcrumbList – On every page. Communicates site hierarchy. AI uses this to understand content relationships.
Content pages (implement second)
- Article + Person – Every blog post and editorial page. Connect author to Organization entity.
- FAQPage – On any page that answers 3+ user questions. Use actual search query language.
- HowTo – On instructional content. Steps must be specific and actionable.
Product/service pages (implement third)
- Product + AggregateRating – Only if you have verified reviews. Never fabricate review data.
- Service – For service pages. Include provider (Organization), areaServed, and serviceType.
Conditional (implement only if relevant)
- LocalBusiness – Only for businesses with physical locations. Must match Google Business Profile data exactly.
- Event – Only if events are core to your business model.
- VideoObject – Only if video is your primary content format.
How will schema markup for AI change over the next 12 months?
Three trends are already visible in early 2026 data. Speakable schema is gaining traction. Google introduced theSpeakable property for content optimized for voice and audio playback. As AI assistants (Siri, Alexa, Google Assistant) increasingly pull answers from web content, marking up the most relevant text blocks as speakable is becoming a citation factor. We’ve started implementing this for 4 clients, and early results show a 15% increase in voice assistant citations over 60 days.
ClaimReview schema is becoming an AI trust signal. Originally designed for fact-checking organizations, ClaimReview schema is now being consumed by AI systems as a credibility indicator. Sites that include ClaimReview markup on factual claims receive higher trust scores in AI knowledge graphs. This is particularly relevant for financial, health, and legal content where accuracy is critical.
AI-specific schema extensions are coming. Schema.org’s community group has been discussing AI-oriented properties since late 2025. Proposals include properties for declaring AI-training permissions, specifying preferred citation formats, and indicating content update frequency for AI consumption. None of these are finalized yet, but the direction is clear: structured data will become a primary communication channel between publishers and AI systems.
“We’re moving into a world where your schema markup is your site’s resume for AI systems. It’s how you introduce yourself, declare your expertise, and prove your credibility. Sites that treat schema as an afterthought are going to find themselves invisible to the fastest-growing discovery channel on the internet.”
Hardik Shah, Founder of ScaleGrowth.Digital
What should you do right now?
If you’re a technical SEO or developer reading this, here’s your action list in priority order:- This week: Add complete Organization schema to your site header. Include all 8+ properties listed in Step 2 of the audit section above. Time: 30 minutes.
- This week: Add Person schema for every content author on your site. Link each author to their published profiles (LinkedIn, Twitter, personal site). Time: 15 minutes per author.
- Next week: Audit all Article schema on your blog. Add missing
author,dateModified, andpublisherproperties. Time: 1-2 hours for template-based fixes. - Next week: Add FAQPage schema to your top 10 traffic pages, using questions pulled from Google Search Console’s query data. Time: 2-3 hours.
- This month: Scrub all marketing language from every schema property on your site. Replace with factual descriptions. Time: 1-2 hours.
- This month: Add HowTo schema to instructional content. Add Product + AggregateRating to product pages with verified reviews. Time: 3-4 hours.
Schema markup is 8 hours of work. AI visibility is the payoff.
We audit schema implementations for AI readiness, fix what’s broken, and monitor citation rates across ChatGPT, Perplexity, Gemini, and Copilot. If your schema isn’t driving AI citations, we’ll tell you exactly why. Get a Free Schema Audit