Skip to main content

Does a Standalone Context Layer Make Sense in the Era of Genie Ontology?

Samuel Nagy
Samuel Nagy
VP of Strategic Growth

At the Data + AI Summit 2026, Databricks put context at the center of its agentic strategy with Genie Ontology, a context layer of its own. The obvious question for anyone building on the context layer idea is whether a standalone one still makes sense. It does, for two reasons: governed business definitions that someone owns and validates, and context that has to reach across more than one platform.

What Databricks Announced at Summit 2026

At the Databricks Data + AI Summit on June 16, 2026, the headline was not a faster engine or a cheaper warehouse. It was context. Databricks introduced Genie One, an agentic AI coworker that is now generally available and produces real artifacts (documents, reports, analyses) rather than just answering questions, alongside Genie Agents and a developer-facing Genie Code. Underneath all of them sits the announcement that matters most for this discussion: Genie Ontology.

Databricks framed the whole launch around a single idea. As CEO Ali Ghodsi put it, if AI cannot tell a CFO why margins changed or find a sales leader's next upsell, "that's not an AI problem, that's a context problem." Genie Ontology is Databricks' answer to that problem: a context layer built into the platform that grounds Genie in the business meaning behind the data, governed by Unity Catalog beneath it.

For anyone who has spent the last year arguing that enterprise AI is bottlenecked by governed context rather than by the model, this is a strong validation. One of the largest data platforms just built its own context layer and put it at the heart of its agentic story. Which raises a fair question: if Databricks now ships a context layer, does a standalone one still make sense?

Genie Ontology Is a Real Context Layer

Start by giving it credit, because the honest answer depends on being precise about what Genie Ontology is. Databricks describes it as "an automatic context layer" that extracts knowledge from tables, queries, dashboards, pipelines, and connected apps, and organizes it into "a living graph of how a company works." It learns continuously from Databricks data and 50+ connected applications, and it ranks definitions by authority so Genie retrieves trusted meaning instead of re-deriving it from raw schemas every time.

It is worth clearing up a common confusion here, because the name does not quite match the thing. Genie Ontology is broader than a semantic layer and narrower than a formal ontology. A semantic layer translates technical data into consistent business definitions and metrics; in the Databricks stack, that role is played by the new Unity Catalog Metrics, which defines KPIs once and serves them consistently. Genie Ontology consumes that semantic layer and adds relationships across assets, signals from real usage, and context pulled from unstructured apps. It behaves like a learned knowledge graph and context layer, not a strict logical ontology. So the semantic layer is one input that feeds it, not a synonym for it. (For the deeper version of that distinction, see context layer vs semantic layer.)

Genie Ontology is fed by a governed foundation modeled in Unity Catalog: a Glossary of authoritative terms (entering preview), Domains that scope assets into business areas, and Metrics for governed KPIs. The architecture is exactly what a context layer should have: a governed source of meaning feeding an interface that AI reasons over. None of what follows is a claim that Genie Ontology lacks context. It clearly has it. The two reasons a standalone layer still matters are about something else: who governs that context, and how far it reaches.

So Why Would You Still Run a Standalone One?

If Genie Ontology is a real context layer, the case for a standalone one cannot be "Databricks doesn't have context." It has to be sharper than that. Two gaps survive the launch, and both are structural rather than temporary. The first is about the difference between context that is generated and context that is governed. The second is about the difference between one platform and the estate most companies actually run.

Reason 1: Auto-Generated Context Is Not Governed Context

Genie Ontology's strength is that it builds itself. It learns meaning automatically and ranks it by authority, so it improves with use and needs little manual upkeep. That is the right design for grounding an agent quickly. It is not the same thing as governed business meaning.

A self-learning graph optimizes for what is most used and most credible in the data. Governance optimizes for something different: a definition that a named person owns, validates, and signs off on, that is versioned so you can see what changed and when, and that someone is accountable for when a regulator or an auditor asks. For the metric on a board slide, the figure in a regulatory report, or the definition of "active customer" that drives revenue recognition, "the model inferred it from usage" is not an answer you want to give. You need a human owner and an approval trail.

"An auto-learned graph tells you what the data probably means. Governance tells you what the business has agreed it means, and who is accountable for that."

Databricks does provide the governed inputs (the Unity Catalog Glossary is co-curated by people, Metrics are defined once), so the raw material for governance exists. But the governed business glossary is what makes a definition an owned, validated asset rather than a learned snippet, and that discipline is exactly what a context layer built for governance does first. A standalone layer treats ownership, validation, and approval as the starting point, not a byproduct of usage.

A governed business term in the Dawiso Business Glossary: an approved definition with a named owner and steward, status, synonyms, and a knowledge graph of related terms, illustrating context that is owned and validated rather than auto-inferred
Click to enlarge
Auto-generated context vs governed context TWO KINDS OF CONTEXT, BOTH USEFUL Auto-generated context e.g. Genie Ontology Learns automatically Ranked by usage and authority Optimized for grounding agents Improves with use, low upkeep No formal owner or sign-off Governed context e.g. a governed business glossary Authored by a named owner Validated and approved Versioned and auditable Accountable to people Survives the next AI tool Complementary, not opposed: generated context grounds the agent, governed context makes the meaning accountable.
Click to enlarge

Reason 2: Most Enterprises Run More Than Databricks

The second reason is simpler and, for most organizations, decisive. Genie Ontology learns from Databricks data and the apps connected to it. That is the right scope for the share of your business that lives in Databricks. It is not the whole business.

A typical estate also runs Snowflake, dbt, BI tools, CRMs, and operational systems. A single concept like "active customer" and its lineage usually span several of them: defined in dbt, materialized in a warehouse, reported in BI, acted on in a CRM. Context modeled inside one platform describes the part of that journey that platform can see. The most accurate agent is the one grounded in all of it.

There is a delivery side to this too. Genie Ontology grounds Genie. The copilots, custom agents, and assistants your teams build on other stacks need the same governed meaning, delivered through an open standard rather than locked to one vendor's agents. That standard is the Model Context Protocol (MCP). Without a shared, governed layer served over MCP, each platform builds an excellent context layer for itself and the organization ends up with several disconnected ones: context islands, the AI-era version of data silos, a pattern we cover in context silos.

One governed context layer across the estate GOVERN MEANING ONCE, SERVE IT OVER MCP Databricks Snowflake dbt BI tools CRM GOVERNED CONTEXT LAYER one owned definition, cross-platform lineage and classification, served over MCP MCP GenieDatabricks-native Copilotsother stacks Custom agentsyour apps
Click to enlarge

What This Looks Like in Practice

None of this is an argument against Genie Ontology. The practical setup is the two working together, with Databricks as a first-class source inside a cross-platform layer rather than a rival to it. Dawiso connects to Databricks alongside more than 40 other platforms and governs the business meaning an agent needs to act correctly across your platforms.

The business glossary defines each term once, with an owner and an approval workflow, so "active customer" means the same thing whether the data sits in Databricks, Snowflake, or a CRM. Interactive data lineage and classification follow data across systems, complementing Unity Catalog's native lineage rather than duplicating it. And the metadata layer for AI serves that governed context to any MCP-compatible agent through the MCP Server, so the meaning Genie relies on inside Databricks is also available to the other assistants across the organization.

The division of labor is clean. Unity Catalog and Genie Ontology keep grounding Genie natively in Databricks meaning. A standalone context layer makes that meaning owned, validated, and consistent across the estate, then serves it to any MCP-compatible agent. That is the difference between an agent that understands Databricks and one that understands your business.

Dawiso interactive data lineage tracing how data flows across systems into an AI model, so an answer grounded in cross-platform context can be traced back to its origin
Click to enlarge

The Verdict

So, does a standalone context layer still make sense in the era of Genie Ontology? Yes, for two reasons that the launch does not close.

First, governance is not the same as generation. Genie Ontology generates context automatically and grounds Genie well, but business-critical definitions need an owner, validation, versioning, and accountability that a self-learning graph does not provide on its own. Second, your business runs on more than one platform. Context and lineage span Snowflake, dbt, BI, and operational systems, and your agents are not all Databricks-native, so meaning has to be governed once and served across the estate through open MCP.

Genie Ontology is a strong signal that the industry now agrees on the destination: context is the constraint, not the model. Build context in Databricks with Genie Ontology, and then make sure that context is owned, governed across the estate, and reachable by any MCP-compatible agent. That is what a standalone context layer is for.

FAQ

Is Genie Ontology a semantic layer or a context layer?
It is broader than a semantic layer. A semantic layer translates technical data into consistent business definitions and metrics, which in the Databricks stack is what Unity Catalog Metrics provides. Genie Ontology consumes that and goes further: Databricks describes it as an automatic context layer that learns a living graph of how the company works from tables, queries, dashboards, pipelines, and connected apps. So the semantic layer is one input that feeds it, not the whole thing. It is not a formal ontology either, since it is a learned knowledge graph rather than a strict logical model.
Does Genie Ontology replace the need for a data governance tool?
No. Genie Ontology generates context automatically to ground Genie, and Unity Catalog enforces permissions and lineage beneath it. What an auto-learned graph does not give you is ownership and validation: a named steward, an approval workflow, versioning, and accountability for each business definition. For regulated or business-critical metrics you still need a governed layer where definitions are owned and signed off by people, not inferred from usage.
We only use Databricks. Do we still need a separate context layer?
Yes, and even a Databricks-only estate benefits, for two forward-looking reasons. First, vendor lock-in: if your business meaning lives only inside one platform, you are tied to that platform's roadmap and pricing, and the context layer is the single hardest thing to rebuild if you ever migrate. Keeping definitions in a layer you own and that is portable protects you. Second, single-platform rarely stays that way: the moment you add a BI tool, a CRM, a second warehouse, or an agent built on another stack, context governed only in Databricks stops covering the whole business. A standalone context layer means your business meaning is owned, portable, and ready the day a second tool arrives, rather than something you scramble to build later.
How does Dawiso work alongside Genie Ontology?
Dawiso treats Databricks as a first-class source, not a rival. It defines each business term once in a governed business glossary, traces cross-platform lineage and classification across the whole estate, and serves that governed context to any MCP-compatible agent through an open MCP Server. Genie Ontology keeps grounding Genie inside Databricks; Dawiso makes sure the same governed meaning is owned, validated, and available to the other tools and agents across the organization.

See it in action

Dawiso Context Layer

Govern business meaning once, then serve it to Genie and every other AI tool through an open protocol.