koira
speakable schemavoice searchstructured data

Why Speakable Schema Is the Missing Piece in Your Voice Search Strategy

KOIRA Team8 min read1,549 words
Diagram showing a webpage with highlighted speakable sections and a voice assistant reading those sections aloud
Intro
Breakdown
Solution
FAQ
◆ Key takeaways
  • Speakable schema (schema.org/speakable) tells voice assistants precisely which sentences on a page to read aloud — without it, they guess.
  • Most SMB websites have zero speakable markup, making this one of the lowest-competition structured data wins available right now.
  • Speakable works best on short, declarative paragraphs — the same writing style that wins featured snippets and AI answer boxes.
  • You can implement speakable via JSON-LD without touching your HTML, making it accessible even without a developer.
  • Pairing speakable schema with FAQ and HowTo schema creates a layered structured data strategy that covers text, voice, and AI search surfaces simultaneously.
  • Voice queries skew local and transactional — speakable markup on your service pages and FAQs directly targets the queries most likely to convert.

What Speakable Schema Actually Is

Speakable schema is a schema.org vocabulary type that lets you mark specific sections of a webpage as suitable for audio playback by voice assistants. When Google Assistant, Alexa, or an AI search engine reads your page, it doesn't know which sentences are the important ones — it has to guess from context. Speakable markup eliminates that guesswork.

The markup uses either CSS selectors or XPath expressions to point at specific DOM elements and say: these paragraphs are the answer. A voice assistant that supports speakable will prioritize those flagged sections when constructing a spoken response.

Google announced support for speakable in 2019 for news publishers, then quietly expanded the signals it pays attention to as voice and AI-driven search matured. In 2026, with AI Overviews pulling from structured sources and smart speakers handling a growing share of local queries, speakable is no longer a niche publisher tool — it's a practical lever for any business that wants to show up in voice results.


Why Most SMBs Have Never Heard of It (And Why That's an Opportunity)

Run a structured data audit on a typical SMB website. You'll find, at best, LocalBusiness and maybe some FAQ schema. Speakable? Almost never.

There are two reasons for this gap. First, speakable was originally scoped to news and media — Google's documentation launched under "news publisher" guidelines, and the SEO community filed it under "not for us." Second, voice search optimization has historically been treated as a vague future-state thing rather than a tactical priority.

Both assumptions are outdated. Voice and AI search in 2026 aren't niche. When someone asks their phone "what's the best [service] near me" or tells a smart speaker "read me about [topic]," the engine needs a clean, machine-readable signal for what to say. Speakable gives you that signal. Your competitors almost certainly haven't set it up.

The adoption gap is your competitive window. Structured data has a well-documented history: early adopters of FAQ schema, then HowTo schema, captured disproportionate SERP real estate before those markup types became mainstream. Speakable is at an earlier stage of that same curve.


The Technical Anatomy of a Speakable Block

Speakable schema is implemented in JSON-LD (the same format Google recommends for all structured data) and lives in a <script type="application/ld+json"> tag in your page's <head> or <body>.

A minimal working example looks like this:

{
  "@context": "https://schema.org/",
  "@type": "WebPage",
  "name": "How Our Plumbing Service Works",
  "speakable": {
    "@type": "SpeakableSpecification",
    "cssSelector": [".summary-block", ".faq-answer"]
  }
}

The cssSelector property points at CSS class names in your HTML. Whatever text lives inside those elements is what the voice assistant will read. You can also use xpath instead of cssSelector if your CMS generates predictable XPath structures.

Three rules for what to flag as speakable:

  1. Keep it short. Voice assistants read aloud — nobody wants to hear a 400-word paragraph. Flag sections that are 2–4 sentences maximum.
  2. Make it self-contained. The flagged text should answer a question on its own, without needing surrounding context. "We offer same-day plumbing repairs in Austin, TX. Call before noon for same-day service." is perfect. "As mentioned above, our process involves…" is not.
  3. Lead with the answer. Voice search is an answer engine. Structure your speakable sections so the first sentence is the direct answer, followed by one sentence of supporting detail.

Speakable, AEO, and GEO: How They Fit Together

Answer Engine Optimization (AEO) is the practice of structuring content so AI systems — not just Google's ten blue links — can surface it as a direct answer. Generative Engine Optimization (GEO) is the parallel effort for large-language-model-based search (Perplexity, ChatGPT Search, Google AI Overviews).

Speakable schema sits at the intersection of both. It's simultaneously a technical signal for voice assistants and a semantic clarity signal for AI-driven engines trying to understand which sentences on your page are the most authoritative and quotable.

Think of it this way:

  • FAQ schema tells engines you have a Q&A structure
  • HowTo schema tells engines you have a step-by-step process
  • Speakable schema tells engines these are the sentences worth quoting out loud

Together, these three schema types create a layered structured data strategy that covers every major AI and voice surface. Implementing all three on a single service page takes about 30 minutes and has no downside.


Where to Use Speakable on an SMB Website

Not every page is a candidate. Speakable performs best on pages that already answer specific questions — which is most of your site, if you've structured it well.

High-priority pages for speakable markup:

  • FAQ pages and FAQ sections — Each answer block is a natural speakable unit. Flag the first 2 sentences of each answer.
  • Service pages — The opening summary paragraph that explains what you do, who you serve, and where you operate is ideal speakable content.
  • About/Contact pages — "We're a family-owned HVAC company serving the Denver metro since 2008. Our office is open Monday through Saturday, 7am–6pm." That's exactly what voice search surfaces for "who is [business name]."
  • Blog posts and guides — The TL;DR or key-points section. Flag the summary, not the full body.
  • Location pages — For multi-location businesses, the opening paragraph of each location page is a prime speakable target.

What to avoid flagging: Navigation text, calls-to-action ("Click here to book"), legal disclaimers, or any sentence that only makes sense in visual context ("See the chart below").


Speakable Schema and Local Voice Search

Voice queries skew heavily local. "Plumber open near me," "best Italian restaurant downtown," "what time does [business] close" — these are the queries smart speakers and voice assistants field dozens of times a day in every market.

If your LocalBusiness schema already covers your name, address, phone, and hours, you've handled the structured data baseline. Speakable fills the gap that LocalBusiness doesn't: it tells the assistant what to say about you when the query is informational, not just directional.

A real example: a pest control company in Phoenix marks up its service page with a speakable block that reads: "We handle termite inspections and treatments for Phoenix homeowners, with same-week appointments available year-round. Our licensed technicians are state-certified and carry full liability insurance."

When someone asks "what pest control companies in Phoenix offer same-week service," that speakable block is exactly what a voice assistant needs to construct a spoken answer — and it positions that company ahead of competitors whose pages have the same information buried in unstructured prose.


Common Implementation Mistakes

1. Flagging too much text. If your speakable selector captures a 600-word section, you haven't given the assistant a useful answer — you've given it a wall of text. Be surgical.

2. Using speakable on thin or duplicate content. Speakable amplifies whatever it points at. If the content is generic ("We offer high-quality services at competitive prices"), marking it speakable won't help — and it signals low quality to the engine.

3. Inconsistency between speakable content and structured facts. If your speakable text says "open Monday through Friday" but your LocalBusiness schema says you're open weekends too, that's a conflict. Engines notice. Keep your facts consistent across all structured data types.

4. Forgetting to validate. Use Google's Rich Results Test and the Schema Markup Validator after every implementation. A syntax error in your JSON-LD silently nullifies the entire block.


Does Google Actually Use Speakable in 2026?

The honest answer: Google hasn't published granular confirmation of how heavily speakable influences Google Assistant responses, and it's never been a ranking factor for traditional web search. What we do know:

  • Google's structured data documentation still actively references speakable and describes it as relevant for voice and audio interfaces.
  • Third-party testing consistently shows that pages with speakable markup appear in spoken responses at higher rates than equivalent pages without it.
  • AI-driven search engines (including Google's AI Overviews) have demonstrated a preference for pages with dense, unambiguous structured data signals — and speakable is part of that signal set.

The practical stance: There is no credible downside to implementing speakable correctly. The upside — even if it's incremental in 2026 — compounds as voice and AI search traffic grows. It takes one hour to implement on a typical SMB website. That math works.


The Bigger Picture: Structured Data as a Competitive Moat

The businesses winning AI and voice search in 2026 aren't necessarily the ones with the best content. They're the ones whose content is the most machine-readable. Structured data is the translation layer between what you've written and what an AI engine can confidently surface as an answer.

Speakable schema is one piece of that layer — alongside LocalBusiness, FAQ, HowTo, Product, and Review schema. Each type covers a different surface and a different query type. None of them are hard to implement once you understand what they do. Together, they turn a well-written page into a page that machines can confidently cite.

The window for early-mover advantage on speakable is open right now. Most SMBs haven't touched it. The ones that implement it cleanly over the next six months will be the ones their voice assistant recommends when a customer asks the right question.

The businesses winning AI and voice search in 2026 aren't necessarily the ones with the best content — they're the ones whose content is the most machine-readable.

Save this for later
Get a PDF copy of this post →
Drop your email, we’ll send you the full piece as a clean PDF. Plus the weekly KOIRA roundup.
Title: Speakable Schema: What It Is and Why Voice Search Needs It
Speakable Schema
A schema.org structured data type that uses CSS selectors or XPath to flag specific webpage sections as ideal for text-to-speech playback by voice assistants and AI search engines.
SpeakableSpecification
The schema.org sub-type used inside a speakable property to define exactly which page elements — via cssSelector or xpath — should be treated as speakable content.
Answer Engine Optimization (AEO)
The practice of structuring and marking up content so AI-driven answer engines and voice assistants can surface it directly as a spoken or displayed response, rather than as a traditional blue link.
JSON-LD
JavaScript Object Notation for Linked Data — Google's recommended format for embedding structured data in a self-contained script block that doesn't require changes to visible HTML.
Generative Engine Optimization (GEO)
The discipline of optimizing content for large-language-model-based search engines such as Perplexity, ChatGPT Search, and Google AI Overviews, which generate answers rather than returning ranked lists of links.
Traditional SEO vs. Voice-Ready SEO: How Speakable Schema Changes the Game
AreaTraditional SEO approachVoice-ready approach with speakable schema
Content targetingOptimized for ten blue links and visual SERP featuresOptimized for spoken responses and AI answer boxes alongside visual results
Structured data usageLocalBusiness and maybe FAQ schema — no speakableLayered schema: LocalBusiness + FAQ + HowTo + Speakable covering every surface
Page copy styleLong-form paragraphs written for human reading flowShort, self-contained declarative sections flagged explicitly for audio delivery
Voice search visibilityVoice assistant guesses which sentences to read — inconsistent resultsVoice assistant reads exactly the flagged speakable blocks — predictable, accurate responses
Competitive landscapeMost competitors have similar or better structured data coverageSpeakable adoption is near zero among SMBs — early movers capture disproportionate voice real estate
MeasurementTracks rankings and clicks via Search ConsoleValidates markup via Rich Results Test and monitors voice query responses directly on devices

How to implement speakable schema on your SMB website

  1. 01
    Identify your best speakable content. Go through your FAQ page, key service pages, and location pages. Find every section that answers a specific question in 2–4 short, self-contained sentences — these are your speakable candidates.
  2. 02
    Note the CSS class or ID of each target element. In your browser, right-click the paragraph or section you want to flag, choose 'Inspect,' and find its CSS class name (e.g., '.service-summary' or '#faq-answer-1'). Write these down — you'll use them in your JSON-LD.
  3. 03
    Write your speakable JSON-LD block. Create a script block with type 'application/ld+json' containing a WebPage or NewsArticle object with a speakable property pointing to a SpeakableSpecification that lists your cssSelector values in an array.
  4. 04
    Paste the script into your page's head section. Add the JSON-LD block to the <head> of each target page. In WordPress, you can use a plugin like 'Insert Headers and Footers' or add it directly to your theme's header template. In other CMS platforms, use the custom code or header injection setting.
  5. 05
    Validate your implementation. Paste the page URL into Google's Rich Results Test (search.google.com/test/rich-results) and the Schema Markup Validator (validator.schema.org). Fix any syntax errors flagged — a single missing comma can break the entire block.
  6. 06
    Test the spoken output on a voice device. Ask Google Assistant or another voice assistant a question your speakable page is designed to answer. Listen to whether the response aligns with your flagged content. Refine the selector scope or rewrite the flagged text if the response is garbled or off-target.
  7. 07
    Roll out to remaining priority pages and monitor. Once the first page validates and performs correctly, repeat the process for your other high-priority pages. Add a quarterly reminder to re-test voice responses, especially after any significant content or CMS updates that may have changed your CSS class structure.
FAQ
What is speakable schema and what does it do?
Speakable schema is a structured data type from schema.org that marks specific sections of a webpage as suitable for text-to-speech playback by voice assistants. You implement it via JSON-LD, pointing CSS selectors or XPath expressions at the text elements you want voice assistants to read aloud. When a voice assistant fields a relevant query, it uses those flagged sections to construct its spoken response rather than guessing which sentences matter most.
Does Google still support speakable schema in 2026?
Yes. Google's structured data documentation continues to reference speakable as a supported type relevant to voice and audio interfaces. While Google hasn't disclosed the exact weight speakable carries in Google Assistant responses, the schema remains in active use and third-party tests consistently show that speakable-marked pages outperform unmarked equivalents in spoken search results. There is no known downside to correct implementation.
Which pages on my website should I add speakable schema to first?
Start with your FAQ page, key service pages, and any location pages if you have multiple locations. These pages already contain the short, declarative, question-answering content that speakable is designed to surface. Flag only the summary paragraphs or concise answer blocks — not full-length content sections — to give voice assistants a clean, speakable unit that stands on its own.
How is speakable schema different from FAQ schema?
FAQ schema signals that a page contains a question-and-answer structure and can generate an FAQ rich result in traditional SERPs. Speakable schema specifically flags content as appropriate for audio delivery by voice assistants — it doesn't generate a visual rich result but influences which text is read aloud. The two types are complementary: FAQ schema helps you win visual SERP features while speakable schema covers the voice and AI-spoken response layer.
Can I implement speakable schema without a developer?
Yes. Speakable is implemented in JSON-LD, a self-contained script block you paste into your page's HTML — it doesn't require changes to your existing markup or code. Most CMS platforms (WordPress, Squarespace, Wix) allow you to inject custom scripts into page headers. If you're using a plugin like Yoast or RankMath for schema, you can also add speakable via custom schema fields. The main requirement is knowing the CSS class names of the elements you want to flag.
How long does it take to see results from speakable schema?
Structured data changes are typically picked up within one to two weeks of Google re-crawling your page. However, visibility in voice search results isn't tracked in standard tools like Google Search Console, so the impact is harder to measure directly than traditional ranking changes. The most practical way to verify it's working is to use Google's Rich Results Test to confirm valid implementation, then periodically test your target queries on voice devices. Sustained voice visibility builds over months as the engine gains confidence in your structured data.
Written with AI assistance and reviewed by the KOIRA team before publishing.
Find KOIRA on
LinkedInCrunchbaseWellfoundF6S
Keep reading
Updates
Local SEO in May 2026: What Actually Changed
8 min read
Product
Why AI Content Doesn't Sound Like You (And How to Fix It)
8 min read
Updates
AI Search Engines Changed This Quarter: What to Do
8 min read
Guides
SEO vs GEO vs AEO: Why You Need All Three in 2026
9 min read
Stay in the loop
New posts, straight to your inbox.
Marketing and sales insights from the KOIRA team. No filler.
Speakable Schema: What It Is and Why Voice Search Needs It
Get KOIRA