Schema.org FAQ & HowTo for Google AI Overviews: 2026 Guide
FAQPage schema, HowTo schema, Article: which Schema.org types to implement to appear in Google AI Overviews in 2026. A complete technical guide with copy-ready JSON-LD examples.
In 2023, Google quietly removed FAQPage rich results from nearly every website — only government and health domains still get accordion-style FAQ displays in the SERP. Many SEO teams concluded that the FAQPage schema was dead and buried. The opposite is true.
In 2026, Google AI Overviews appear on more than 31% of search results pages. These AI-generated summary blocks draw heavily on structured data to identify, extract, and cite precise passages from web pages. A properly implemented FAQPage or HowTo schema no longer produces a fancy rich result beneath your blue link — but it directly feeds the models that decide whether your content deserves to be cited at the top of the SERP.
This guide is a complete technical tutorial. You will find Google's official rules, copy-ready validated JSON-LD blocks, a decision table covering which schema fits which content type, common pitfalls to avoid, and a method for implementing structured data at scale. If you run AI chatbots on your website, the final section shows how to connect your knowledge base to your schema strategy.
AI Overviews are the production evolution of what Google called SGE (Search Generative Experience) during its 2023 beta. Rolled out broadly in May 2024 in the United States, then progressively across the UK and Europe, they are now the first visible element on a significant proportion of SERPs — roughly 31% according to Semrush and BrightEdge estimates from early 2026.
The mechanism is fundamentally different from a classic search engine. When a user asks a complex informational query, Google no longer just ranks pages: it generates a real-time synthesis drawn from several selected sources, cited below the AI block. This is Retrieval-Augmented Generation (RAG) applied directly to search.
What types of queries trigger AI Overviews?
AI Overviews appear primarily on informational queries: "how to do X", "what is Y", "what's the difference between X and Z", "best way to...". Pure transactional queries ("buy", "pricing", "quote") generate them far less often. Local queries ("pizza near me") and navigational queries ("Gmail login") almost never trigger them.
This is precisely where FAQPage and HowTo schemas become strategic: your pages that answer questions are exactly the ones AI Overviews are trying to cite.
How Google selects its sources for AI Overviews
Google does not disclose its selection algorithm, but analysis of observed citations since 2024 reveals clear patterns:
Topical authority: sites that cover a specific domain in depth are favored over generalist publications.
Extractable content: short, direct, clearly delimited passages are cited more often than dense wall-of-text content.
Structured data: pages with FAQPage, HowTo, or Article schemas provide Google with semantic anchors that make extraction easier and more reliable.
Freshness: the dateModified property in Article schemas is factored in as a freshness signal.
E-E-A-T: identified author, cited sources, domain-level coherence and depth.
Why Schema.org Still Matters Despite What Google Says
John Mueller (Google Search Relations) has stated multiple times that structured data is not a direct ranking factor. That statement is technically accurate — and strategically misleading.
Schema.org is not a classic SEO ranking lever. Adding JSON-LD will not move you from position 8 to position 1. However, structured data serves three concrete functions for AI Overviews:
Function 1 — Eliminating semantic ambiguity
A language model processing your HTML page must infer what each block of text means. With a FAQPage schema, you tell it explicitly: "this text is a question, this text is its answer." The AI does not have to guess — you hand it the structure. This reduces the risk of incorrect extraction and increases the probability of being cited accurately.
Function 2 — Strengthening E-E-A-T trust signals
An Article schema with complete author, datePublished, dateModified, and publisher fields tells Google that your content is produced by an identifiable entity and kept up to date. It is a machine-readable E-E-A-T signal, complementing the human-readable signals (author bio, external mentions, inbound links).
Function 3 — Pre-formatting content for citation
AI Overviews look for short, self-contained passages. A Q&A pair structured in JSON-LD is exactly that format: a complete unit of information with a well-formed question and a direct answer. You are pre-cutting your content into the format the AI wants to cite.
To understand the broader logic of optimizing for generative AI search engines, our article on GEO (Generative Engine Optimization) details the six criteria that ChatGPT, Perplexity, and Gemini evaluate when selecting sources.
FAQPage Schema: Complete Guide
What is the FAQPage schema and what does it actually do?
The FAQPage type from schema.org describes a page whose primary content is a list of questions and answers. It is implemented as JSON-LD inside a <script type="application/ld+json"> tag, placed in the <head> or at the end of <body>. Google reads it without executing JavaScript — JSON-LD is rendered immediately visible to the crawler.
When to use the FAQPage schema
Use FAQPage in these situations:
Dedicated FAQ pages: a standalone "/faq" or "/help" page with answers written by your team.
Blog articles with a Q&A section: if your article contains a "Frequently Asked Questions" section with complete answers.
Landing pages addressing common objections: sales or pricing pages that answer prospect questions directly.
Technical documentation pages: only if you are the sole author of both the questions and the answers.
Product support pages: provided that answers are written by your team, not submitted by users.
Promotional content disguised as FAQ: questions engineered solely to promote a product without genuinely answering user intent.
Questions whose answers are not visible on the page: the JSON-LD content must exactly match the content visible to users in the HTML.
Critical context for 2026: classic FAQPage rich results (the accordion display beneath your link in the SERP) have been restricted since September 2023 to government and health domains. The vast majority of websites will no longer see this format in search results. However, the schema remains active as a signal for AI Overviews — which is exactly why it is still worth implementing correctly.
Complete FAQPage JSON-LD — copy-ready example
Here is a validated example with required and recommended properties, using English-language questions relevant to an AI chatbot SaaS context:
{
"@context": "https://schema.org",
"@type": "FAQPage",
"mainEntity": [
{
"@type": "Question",
"name": "What is a RAG chatbot and how is it different from a standard AI chatbot?",
"acceptedAnswer": {
"@type": "Answer",
"text": "A RAG (Retrieval-Augmented Generation) chatbot combines a large language model with a custom knowledge base. Unlike a standard rule-based chatbot or a generic LLM, a RAG chatbot searches your internal documents before generating a response — which allows it to provide accurate, grounded answers about your products, services, or processes, without hallucinating information it was never trained on."
}
},
{
"@type": "Question",
"name": "How long does it take to deploy an AI chatbot on a website?",
"acceptedAnswer": {
"@type": "Answer",
"text": "With a platform like Heeya, the initial deployment of an AI chatbot takes between 5 and 30 minutes: create the agent, upload your knowledge base documents, customize the widget appearance, and embed it on your site with a single script tag. Advanced configuration — tone, behavior, form capture tools — requires an additional 1 to 2 hours."
}
},
{
"@type": "Question",
"name": "Does the FAQPage schema improve Google rankings?",
"acceptedAnswer": {
"@type": "Answer",
"text": "Not directly. According to Google, structured data is not an algorithmic ranking factor. However, a correctly implemented FAQPage schema increases the probability of appearing in Google AI Overviews, which synthesize answers from well-structured sources. It is an AEO (Answer Engine Optimization) lever rather than a classic ranking lever."
}
}
]
}
Content rules you must follow
The value of name must be the complete question, phrased as a genuine user question.
The value of text inside acceptedAnswer must match word-for-word (or be a direct subset of) the visible text on the page.
Limited HTML is allowed inside text: <h1>–<h6>, <br>, <ol>, <ul>, <li>, <a>, <p>, <div>, <b>, <strong>, <i>, <em>.
Only one FAQPage instance per URL — no duplicate FAQPage schemas on the same page.
Do not mark up the same FAQ content across multiple different URLs on your site.
Schema Markup Validator (validator.schema.org): conformance validation against schema.org specs — stricter than the Google tool.
After validation, submit your URL in Google Search Console (URL Inspection → Request Indexing) to accelerate crawl pickup.
HowTo Schema: Complete Guide
What is the HowTo schema and where does it stand in 2026?
The HowTo schema describes a sequential step-by-step procedure: a tutorial, a recipe, an installation guide, assembly instructions. Like FAQPage, it has seen a reduction on the rich results side. Since 2023, Google removed HowTo rich results from mobile. In 2026, HowTo steps no longer appear as enriched results for the majority of queries.
The logic is identical to FAQPage: the HowTo schema remains a valuable semantic signal for AI Overviews. When a user asks "how to set up X", AI Overviews look for sources with clearly structured steps. A HowTo schema ensures your steps are identified as steps — without any ambiguity of interpretation.
Legitimate use cases for the HowTo schema
Technical tutorials: installing software, configuring an API, setting up a developer tool.
Recipes and manufacturing instructions: the most classic pairing (time, ingredients, steps).
Business procedures: "how to write an invoice", "how to submit an insurance claim", "how to create a project proposal".
Do NOT use HowTo for opinion pieces, comparisons, or content without a clear sequential dependency between steps.
Complete HowTo JSON-LD — copy-ready example
{
"@context": "https://schema.org",
"@type": "HowTo",
"name": "How to Deploy an AI Chatbot on Your Website in 5 Steps",
"description": "A practical guide to building and embedding a RAG chatbot on your website using Heeya, without requiring advanced technical skills.",
"totalTime": "PT30M",
"estimatedCost": {
"@type": "MonetaryAmount",
"currency": "USD",
"value": "0"
},
"tool": [
{
"@type": "HowToTool",
"name": "Heeya account (free to start)"
},
{
"@type": "HowToTool",
"name": "Your knowledge base documents (PDF, DOCX, TXT)"
}
],
"step": [
{
"@type": "HowToStep",
"name": "Create your agent",
"text": "Log in to your Heeya dashboard and click 'New Agent'. Give it a name, a description, and write your 'System Guidance' — the instructions that define your chatbot's personality, scope, and boundaries.",
"url": "https://heeya.fr/en/solutions/chatbot#step-1",
"image": "https://heeya.fr/static/img/blog/howto-step1-create-agent.webp"
},
{
"@type": "HowToStep",
"name": "Upload your knowledge base",
"text": "In the 'Files' tab, import your documents (PDF, DOCX, PPTX, TXT). Heeya automatically splits them into chunks, generates vector embeddings, and stores them in your dedicated Qdrant collection. You can also scrape URLs from your existing website.",
"url": "https://heeya.fr/en/solutions/chatbot#step-2",
"image": "https://heeya.fr/static/img/blog/howto-step2-upload-docs.webp"
},
{
"@type": "HowToStep",
"name": "Test your chatbot",
"text": "Use the preview chat inside the dashboard to check the quality of responses. Ask the questions your typical visitors ask. Adjust the System Guidance if the answers drift or miss the mark.",
"url": "https://heeya.fr/en/solutions/chatbot#step-3",
"image": "https://heeya.fr/static/img/blog/howto-step3-test-chatbot.webp"
},
{
"@type": "HowToStep",
"name": "Customize the widget",
"text": "In the 'Integration' tab, configure the widget appearance: primary color, welcome message, avatar, and position. Match it to your brand identity.",
"url": "https://heeya.fr/en/solutions/chatbot#step-4"
},
{
"@type": "HowToStep",
"name": "Embed on your website",
"text": "Copy the generated embed script and paste it before the closing
tag on your site. Compatible with all major CMS platforms: WordPress, Webflow, Shopify, Wix, or custom-built sites. The chatbot appears immediately without a page reload.",
"url": "https://heeya.fr/en/solutions/chatbot#step-5"
}
]
}
Key HowTo properties to know
name: title of the procedure (required).
description: short summary of the procedure (recommended).
totalTime: total duration in ISO 8601 format (PT30M = 30 minutes, PT2H = 2 hours).
step: array of HowToStep objects (required, at least 2 steps).
Within each HowToStep: name (step title), text (instructions), url (anchor link to the section), image (optional but recommended).
tool and supply: lists of required tools and materials — especially useful for recipes and technical tutorials.
The HowTo trap to avoid at all costs
The most common mistake: marking up content as HowTo when the steps are not sequential and mandatory. If the user can skip step 2 without blocking step 3, this is not a HowTo — it is a list of recommendations. Use an HTML list with <ul>/<li>, or an Article schema with structured sections instead.
The other critical trap: not matching the JSON-LD to the visible content. If your HowTo schema describes 5 steps but your page only displays 3, Google may penalize the page for misleading structured data.
Article and BlogPosting — the non-negotiable baseline
Every blog article should carry an Article schema (or its subtype BlogPosting) with at minimum:
{
"@context": "https://schema.org",
"@type": "BlogPosting",
"headline": "Schema.org FAQ & HowTo for Google AI Overviews: 2026 Guide",
"datePublished": "2026-05-05T09:00:00+00:00",
"dateModified": "2026-05-05T09:00:00+00:00",
"author": {
"@type": "Person",
"name": "Anas R.",
"url": "https://heeya.fr/en/blog"
},
"publisher": {
"@type": "Organization",
"name": "Heeya",
"url": "https://heeya.fr",
"logo": {
"@type": "ImageObject",
"url": "https://heeya.fr/static/img/logo.webp"
}
},
"image": "https://heeya.fr/static/img/blog/schema-faq-howto-ai-overviews.svg",
"description": "FAQPage schema, HowTo schema, Article: which Schema.org types to implement to appear in Google AI Overviews in 2026. A complete technical guide with copy-ready JSON-LD examples.",
"mainEntityOfPage": {
"@type": "WebPage",
"@id": "https://heeya.fr/en/blog/schema-org-faq-howto-google-ai-overviews"
}
}
The dateModified property is especially important for AI Overviews: it signals content freshness to Google without requiring you to republish or restructure the article.
BreadcrumbList — machine-readable site architecture
The BreadcrumbList schema helps Google understand where your page sits within the site architecture. It contributes indirectly to topical authority — a signal that AI Overviews use to decide whether your domain is a legitimate source on a given subject.
Organization — your machine-readable identity card
A complete Organization schema on your homepage or About page allows AI search engines to identify you as a trustworthy entity. Include: name, url, logo, description, foundingDate, sameAs (your LinkedIn, Twitter/X, GitHub profiles). The sameAs field is particularly powerful for Google's Knowledge Graph — it anchors your entity across the web.
Product and Review — for SaaS and e-commerce pages
If your page describes a product or service with pricing, the Product schema combined with AggregateRating can generate star rating rich results in the SERP — one of the few formats still performing very well in 2026. For Heeya, this applies to the chatbot solution page and feature-specific landing pages.
Person — reinforcing author E-E-A-T signals
Every author should have a Person schema on their profile page, with name, url, jobTitle, sameAs (LinkedIn, Twitter/X), and worksFor. This schema is directly used by Google to evaluate author credibility — an E-E-A-T criterion that influences source selection in AI Overviews.
To understand how AEO and classic SEO work together around these trust signals, see our article on AEO vs SEO in 2026.
Practical Implementation: Jinja2, PHP, JavaScript, and Scale
Jinja2 implementation (Python / FastAPI)
In a Jinja2 project, the cleanest approach is to define the JSON-LD schema in the article template, then render it in the head block of the base layout. Here is the recommended pattern:
Note the use of @graph to group multiple schemas in a single <script> block — this is the recommended approach to avoid conflicts and keep the code lightweight.
PHP implementation
In PHP, the most robust method is to build the associative array and serialize it with json_encode. Use the JSON_UNESCAPED_UNICODE flag to preserve any special characters (accented letters, em dashes, etc.) in your content:
For JavaScript frameworks, inject JSON-LD server-side (SSR) rather than client-side to guarantee the Google crawler reads it without executing JavaScript. In Next.js, place the JSON-LD directly in the <Head> via dangerouslySetInnerHTML, or use the <Script strategy="beforeInteractive"> component. In Nuxt.js, use useHead() with script: [{ type: 'application/ld+json', children: JSON.stringify(schema) }].
Large-scale management: dynamic generation
If you manage dozens or hundreds of pages, building schemas manually is not viable. Adopt a systematic approach:
Centralize the logic in a dedicated service (schema_service.py or schema.js) that generates schemas from each page's metadata.
Store FAQ questions in your database, not in templates — so you can update them without touching code.
Validate in CI/CD: integrate an automated test that checks JSON-LD conformance for every page before deployment (the Python library extruct can parse and validate schemas directly from HTML).
Monitor via Search Console: configure email alerts on the "Rich results" report to be notified the moment an error appears in production.
This centralized management logic is exactly what we apply at Heeya for generating schemas for every blog article — the frontmatter variables (TITLE, SLUG, DATE_ISO, AUTHOR) automatically feed the schema templates. To understand how RAG knowledge bases can feed this type of content pipeline, see our complete guide to RAG for businesses.
Validation and Monitoring
The 4 tools you need to know
1. Rich Results Test (Google)
Google's official tool (search.google.com/test/rich-results) analyzes a URL or a code snippet and tells you whether the schema is eligible for rich results, which properties are missing or incorrect, and what errors are blocking pickup. Essential before every production deployment.
2. Schema Markup Validator
Available at validator.schema.org, this tool validates conformance to the schema.org specifications — stricter than the Google tool. It flags deprecated properties and incorrect types. Particularly useful for complex schemas (@graph, nested schemas).
3. Google Search Console — "Rich results" report
In Search Console, the "Rich results" report (under "Experience") lists all pages with schema errors detected during crawl. Set up email alerts to be notified immediately of any regression. This report distinguishes blocking errors (schema ignored entirely) from warnings (schema partially applied).
4. Screaming Frog + JSON-LD extraction
For large-scale audits, Screaming Frog (from version 19 onwards) can extract and validate JSON-LD schemas across an entire site. Combined with a CSV export, it makes it fast to identify pages with missing schemas or incomplete structured data.
Recommended validation frequency
Every deployment: automated CI/CD test on modified templates.
After every major CMS update: verify that plugins or theme updates have not overwritten your schemas.
Monthly: review the Search Console report to catch new errors before they compound.
After a Google Search Central update: validation rules evolve — what was valid in 2024 may be deprecated in 2026.
Common Errors and How to Avoid Them
Error 1 — Fabricated questions in your FAQPage schema
Building questions that no one actually asks, designed purely to insert target keywords. A textbook bad example: "Why is Heeya the best AI chatbot on the market?" This type of content deceives users and search engines alike. Every question in your FAQPage schema must correspond to a genuine user question. The ideal source: questions actually asked to your AI support chatbot or your support ticket system.
Error 2 — Mismatch between JSON-LD and visible content
This is the most serious violation in Google's eyes. If your FAQPage JSON-LD contains a three-paragraph answer but the page only shows a "Learn more" button without the full answer text, Google may deindex the page or ignore the schema. The JSON-LD content must be an exact subset of the HTML content visible to users.
Error 3 — HowTo without logical sequence
Marking up content titled "5 tips for..." as HowTo when the tips are independent and interchangeable. A genuine HowTo implies a mandatory sequence: step 2 requires completing step 1. If that dependency does not exist, use an ItemList schema or a plain Article with structured lists.
Error 4 — Duplicate schemas across multiple pages
Copy-pasting the same FAQPage block with identical questions across multiple pages on your site. Google cannot determine which page is the canonical reference and may ignore both. Every FAQPage schema must be unique and match the specific content of the page it lives on.
Error 5 — Forgetting to update dateModified
Updating the content of an article without updating the dateModified property in the Article schema. Google does not know the content was refreshed and continues evaluating it as stale. Automate the update of this date in your CMS or deployment pipeline.
Error 6 — Using FAQPage on forums or UGC content
Applying the FAQPage schema to pages where answers are submitted by users — community Slack threads, product forums, comment sections. Google's rules are explicit: FAQPage applies only when your organization is the sole author of both the questions and the answers. For UGC content, use DiscussionForumPosting.
Error 7 — Ignoring Search Console warnings
Know the difference between errors and warnings in the Search Console report. A warning (recommended property missing) does not block the schema from being applied, but it reduces its relevance. Address them within 30 days of appearing. Blocking errors (required property missing, incorrect type) must be fixed immediately. For a broader strategy on AI search visibility, see our complete checklist for getting cited in ChatGPT Search.
FAQ — Schema.org, AI Overviews, and Structured Data
Does the FAQPage schema improve Google rankings in 2026?
Not directly. Google has confirmed via John Mueller that structured data is not an algorithmic ranking factor. However, a correctly implemented FAQPage schema increases the probability of your content being extracted and cited in Google AI Overviews, which appear on approximately 31% of SERPs in 2026. It is an AEO (Answer Engine Optimization) lever, distinct from classic ranking signals.
Why is my FAQPage schema no longer generating rich results?
Since September 2023, Google restricted FAQPage rich results (the accordion display in the SERP) to government and health domains only. The vast majority of websites no longer see this format in search results. This does not mean the schema is useless: it continues to feed AI Overviews as a semantic signal. Continue implementing it correctly to maximize your citation chances inside those AI blocks.
What is the difference between FAQPage and HowTo schema?
The FAQPage schema structures a list of independent question-and-answer pairs, with no mandatory order between them. The HowTo schema structures a sequential procedure where each step depends on the previous one. FAQPage suits dedicated FAQ pages and Q&A sections within articles. HowTo suits tutorials, step-by-step guides, recipes, and installation instructions. Both can coexist on a single page using @graph if the content justifies it.
How many questions can you include in a FAQPage schema?
There is no technical limit imposed by Google or schema.org. In practice, Google's guidelines state that each question must have a single accepted answer, visible on the page. For AI Overviews optimization, 5 to 10 well-chosen, precisely answered questions outperform a long list of superficial ones. Prioritize quality and alignment with real user questions over quantity.
Can you combine multiple schemas on the same page?
Yes, and it is recommended. The clean method is to use the @graph property inside a single script tag of type application/ld+json, listing all schemas for the page. For example, a blog article can simultaneously carry a BlogPosting schema, a BreadcrumbList, and a FAQPage schema if the page contains a question-and-answer section. Grouping schemas in a @graph avoids conflicts and reduces HTML payload.
Is the HowTo schema still useful in 2026?
Yes, but for a different reason than before 2023. HowTo rich results (steps displayed directly in the SERP) were removed from mobile in 2023. In 2026, they no longer generate classic enriched displays for most sites. However, the HowTo schema remains a powerful semantic signal for AI Overviews: it segments your procedure into extractable units that AI models use to synthesize answers to "how to" questions.
How do you validate a JSON-LD schema before publishing?
Use two tools in this order: (1) Google's Rich Results Test (search.google.com/test/rich-results) to verify conformance with rich results rules and detect blocking errors; (2) the Schema Markup Validator (validator.schema.org) for stricter conformance against schema.org specifications. After publishing, submit the URL in Google Search Console (URL Inspection) and monitor the "Rich results" report to catch crawl errors.
Can you use FAQPage schema when answers are hidden behind an accordion?
Yes, explicitly permitted by Google. Questions whose answers are revealed after a user click — accordion, dropdown, disclosure widget — are eligible for the FAQPage schema, provided the content is genuinely present in the HTML and accessible without complex JavaScript. What Google prohibits is including content in the JSON-LD that does not exist anywhere in the page HTML — not hiding content behind a click interaction.
Do AMP pages require a different schema?
Google AMP requires a valid Article schema (with headline, image, datePublished, author, publisher) to be eligible for the Top Stories carousel. For other schemas (FAQPage, HowTo), the JSON-LD format is identical in AMP and standard HTML. In 2026, AMP has lost most of its importance for classic ranking. For most SaaS and B2B blogs, AMP is not a priority.
Your structured data is in place — what about your AI chatbot?
The questions your visitors ask your chatbot are your best source of real FAQ content. Deploy Heeya in 5 minutes and start collecting those questions today — then turn them into schema-optimized FAQ sections that AI Overviews actually cite.