In Response to Jason Cui & Jennifer Li, Andreessen Horowitz

The Context Layer
Is the Stack.

a16z mapped the infrastructure gap blocking enterprise AI. Here is why what they identify is an architectural requirement, and what it demands in practice.

Alexander Braun · March 2026 · 10 min read
"Some of the most important context is implicit, conditional, and historically contingent, and only exists as tribal knowledge inside teams. Human input provides the final crucial links that enable true agent automation."
Jason Cui & Jennifer Li, Andreessen Horowitz. March 2026
Most.
Enterprise AI agent deployments fail — not from model weakness, but from a lack of business context. MIT’s 2025 State of AI in Business report found “most fail due to brittle workflows, lack of contextual learning, and misalignment with day-to-day operations.”
MIT State of AI in Business 2025 · via Andreessen Horowitz, March 2026
5 steps
Required for a modern context layer per a16z: access the right data, automate initial context construction, layer in human refinement for tribal knowledge, connect agents via API or MCP, keep the context self-updating as definitions change.
Andreessen Horowitz · "Your Data Agents Need Context" · March 2026
3 paths
Market categories a16z identifies: data gravity platforms adding lightweight context, AI data analyst companies pivoting toward context construction, and dedicated context layer companies building from the ground up. The third category is early, forming, and has the clearest mandate.
Andreessen Horowitz · Market Map · March 2026
1×
How many times business context should be processed: once, at indexing. Every subsequent agent query runs directly from governed code — not from organizational data re-tokenized on demand, not re-sent to the model on each call.
LazyFox architecture · Core design principle
Executive Summary
Key Finding
Enterprise AI agent deployments are failing at scale, not from model weakness, but from a lack of proper business context. Agents cannot answer even basic questions like "What was revenue growth last quarter?" because business definitions, fiscal conventions, and source-of-truth rules do not exist in the data layer. They exist in people’s heads.
Root Cause
Traditional semantic layers are hand-built, brittle, and scoped to a single BI tool. They cannot capture tribal knowledge, the implicit, conditional, historically contingent rules that only exist inside teams. When agents hit this gap, they guess. The failure is usually silent until it surfaces in a board meeting where two dashboards disagree on the same quarter.
Market Recommendation
A new category of dedicated context layer is emerging to solve what data platforms and legacy semantic tools cannot. a16z identifies five requirements: access the right data, automate context construction, capture tribal knowledge through human refinement, connect agents via API or MCP, and keep context self-updating as definitions evolve. "Not every enterprise can (or should) build this in-house."
How LazyFox Delivers on This
Governed Metric Definitions
LazyFox’s logical layer captures revenue, ARR, fiscal quarters, and every other business-critical term as versioned, governed code. Agents read from a single authoritative definition, not from three conflicting warehouse tables. Semantic drift detection flags automatically when a definition diverges across connected systems.
Tribal Knowledge as Governed Infrastructure
The implicit, conditional rules a16z says only exist inside teams , "use Affinity for new USCAN deals from 2025, Salesforce for everything before", are captured in LazyFox’s contextual layer in natural language alongside metric code. Indexed once, served to every agent at runtime. No re-processing per call.
Cross-System Reconciliation
Most context solutions scope to one system at a time. LazyFox reconciles meaning across all connected systems simultaneously at runtime, so an agent querying your CRM, warehouse, and finance system receives one governed answer, not three conflicting ones that require human adjudication.
Token Efficiency & Vendor Independence
Business context is indexed once. Every subsequent agent query runs directly from governed code, no re-tokenization per call, no organizational knowledge migrating into a model provider’s weights. The semantic layer is a company asset, not a side effect of which model you use today.
Read the full analysis below
The Article

What a16z Is Actually Saying

An executive summary of the research thesis, and why it describes an infrastructure requirement, not a product category.

In March 2026, Andreessen Horowitz partners Jason Cui and Jennifer Li published a detailed analysis of why enterprise AI data agent deployments are failing. Their conclusion is crisp: agents aren’t failing because models are weak. They are failing because enterprises deploy agents against data they cannot actually interpret, data without governed definitions, without tribal knowledge, without a reliable source of truth.

The article traces a decade of data infrastructure evolution. The modern data stack transformed how enterprises ingest, clean, and warehouse data. Then in 2024–2025, as LLM capabilities grew, organizations rushed to deploy agents on top of that stack. The agents hit a wall. Not a model wall. A context wall.

The canonical example they use is devastating in its simplicity: an agent asked "What was revenue growth last quarter?" fails, not because it cannot write SQL, but because it does not know what revenue means in this company, which fiscal quarter is meant, or which of three tables named fct_revenue, mv_revenue_monthly, and mv_customer_mrr is the authoritative source.

"The agent isn’t given the proper business context to answer even the most basic questions. There needs to be up-to-date and maintained context that not only understands how an enterprise works and how the data systems are structured, but also maintains the tribal knowledge to tie everything together."

Jason Cui & Jennifer Li, Andreessen Horowitz, March 2026 , Read the full article →

a16z maps what a modern context layer must include that a traditional semantic layer does not: canonical entity resolution, tribal knowledge in natural language, governance rules for agents, and a self-updating architecture that keeps context current as systems evolve. What they describe is not a product category. It is an infrastructure requirement. And it points directly to what LazyFox is built to close.

Finding 1

The Revenue Question Cannot Be Answered Without Governance

Agents don’t fail on SQL generation. They fail on meaning, and meaning cannot be retrieved from a schema. It has to be governed.

The a16z example is worth examining closely because it is not exotic. "What was revenue growth last quarter?" is a question every enterprise answers dozens of times a week. What makes it hard for an agent is exactly what makes it easy for someone who has been at the company for three years: they know that "revenue" means ARR, not run rate; that the fiscal quarter ends in March; that fct_revenue is the source of truth after the Q4 migration, but that mv_revenue_monthly is what finance still uses for historical comparisons pre-2024.

That knowledge does not live in a schema. It lives in a person, or in a stale YAML file from a data team member who left two years ago. When the agent looks at the warehouse, it finds three tables with similar names and no signal about which one is authoritative, no definition of the fiscal boundary, and no mapping from the business term "revenue" to the engineering artifact fct_revenue.

a16z is explicit that the fix is not a better model. It is not a more capable text-to-SQL system. It is a governed layer that captures metric definitions as maintained, versioned code that agents can query before they act. Traditional semantic layers cover some of this, but they are hand-built by data teams in BI-specific syntax, connected to a single tool, and cannot capture the implicit business rules that make definitions actually correct.

"Just like developers can set up .cursorrules files to guide agents and control output behavior, data practitioners can maintain rules and guidelines, the context layer becomes a multi-dimensional corpus where code lives alongside natural language."

Jason Cui & Jennifer Li, Andreessen Horowitz, March 2026

This is precisely the logical layer of LazyFox’s three-tier semantic architecture. Metric definitions are captured as governed, versioned code. Semantic drift detection flags when a definition diverges across connected systems. When an agent asks "what is revenue?" it does not receive three conflicting answers from three different tables. It receives the single, current, governed definition, and the source of truth to query against it.

Where does the definition live?

Without a governed context layer, agents guess. With one, they execute from definitions your team controls, indexed once, served from code on every subsequent query.

Without a Context Layer
Query: "What was revenue growth last quarter?"
Natural language question. Deceptively simple.
Agent scans warehouse
fct_revenue? mv_revenue_monthly? mv_customer_mrr?
ARR or run rate? Fiscal or calendar quarter?
No governed definition
Agent guesses or halts
Definition last updated by engineer who left in 2023. Does not include Product Line C launched Jan 2025.
Agent either halts, guesses, or returns a plausible-sounding number that is wrong, with no way for the system to know which.
With LazyFox Semantic Layer
Query: "What was revenue growth last quarter?"
Same question. Governed context available.
LazyFox Logical Layer
Revenue = ARR · Fiscal Q ends March 31 · Source of truth: fct_revenue (post-Q4 migration) · Includes Product Lines A, B, C
Versioned & governed
Agent executes from governed code
Context indexed once. Query runs directly, no re-tokenization, no model guessing, no latency per call.
Accurate answer grounded in governed definitions. Same speed regardless of which AI model is connected.
The failure mode is silent. Agents that guess on metric definitions return plausible-sounding numbers. Most organizations discover the error weeks later, in a board meeting, when two dashboards disagree on the same quarter.
The fix is architectural. Governing definitions outside the model means they persist across model upgrades, are auditable, and degrade gracefully, escalating to a human when the layer has a gap rather than hallucinating an answer.
Finding 2

Tribal Knowledge Is Not a Nice-to-Have. It Is the Critical Path.

a16z identifies the one thing automated context construction cannot supply, and why it is the reason most context layers fail before they start.

a16z is specific about where automated context construction reaches its limit: "It is tempting to set agents loose and have them collect all internal knowledge, but some of the most important context is implicit, conditional, and historically contingent, and only exists as tribal knowledge inside teams."

The example they give is surgical in its precision: "For CRM data, look at Affinity for all new USCAN deals from 2025 onwards but Salesforce for all global leads before that." No automated system will derive this instruction from schema inspection. It exists because a migration happened on a specific date, because one team adopted a new tool and another did not, because a commercial decision was made at a specific moment in time that is now embedded in the data architecture but documented nowhere.

The key insight from a16z is that these rules are not exceptions to the context layer, they are the point of the context layer. Metric definitions can be partially automated. Source-of-truth selection for historical edge cases cannot. This is the step that separates a context layer that works in demos from one that works in production at 2 a.m. on a quarter close.

"The context layer becomes a multi-dimensional corpus where code lives alongside natural language, capturing any context an agent might need. Just like developers can set up .cursorrules files to guide agents, data practitioners can maintain rules and guidelines."

Jason Cui & Jennifer Li, Andreessen Horowitz, March 2026

LazyFox’s contextual layer is built for exactly this. Business rules , "use the lakehouse for post-migration revenue, the legacy warehouse for pre-2024 comparisons", are captured in natural language alongside metric definitions. They are indexed once. Every agent query reads from governed code at runtime; no model re-processes that context on each call, adding neither latency nor cost per query.

And when a rule changes, when the migration completes, when a new product line launches, when the fiscal year boundary shifts, the change propagates across every connected agent automatically. The context layer is a living document, not a snapshot.

Four systems. One governed definition.

Most context solutions reconcile one system at a time. LazyFox reconciles meaning across all connected systems simultaneously at runtime, so agents always receive a single, current answer regardless of how many sources conflict.

CRM. Salesforce
"Revenue"
Bookings at contract signing. Includes multi-year deals at full contract value.
Conflicts with warehouse
Data Warehouse. Databricks
"Revenue"
fct_revenue: recognized by month. mv_revenue_monthly: legacy view, pre-2024 data only.
Two tables, one label
Finance System. NetSuite
"Revenue"
GAAP-recognized per fiscal quarter. Q ends March 31. Excludes deferred SaaS revenue.
Fiscal ≠ calendar Q
Data Catalog. Alation
"Revenue"
Definition last updated Q3 2023. Does not include Product Line C (launched Jan 2025).
Stale definition
Semantic Governance Layer
Indexed once
Reconciled at runtime
All systems simultaneously
Governed Definition. Served to Every Agent
Revenue (ARR, GAAP-recognized)
Monthly recognized SaaS revenue per fiscal quarter (Q ends March 31). Source of truth: fct_revenue in Databricks post-Q4 2023 migration; mv_revenue_monthly for historical comparisons pre-2024. Includes Product Lines A, B, C. Excludes multi-year bookings until recognized.
↗ Conflict resolved across 4 systems · Auto-updated on Product Line C launch (Jan 2025) · Version 7
Finding 3

Not Every Enterprise Should Build This In-House

a16z names a new market category. Here is what separates a genuine context layer from the alternatives, and what the architectural requirements actually demand.

a16z closes their analysis with a market observation that is straightforward but important: "Realistically not every enterprise can (or should) build this in house." They map three categories of solutions: data gravity platforms adding lightweight context features (Databricks, Snowflake), AI data analyst companies that have pivoted toward context construction as a core competency, and a third category, dedicated context layer companies, building from the ground up. They are explicit that the third is early, but equally explicit that the mandate is clear.

LazyFox sits in that third category. And the architectural requirements that fall out of the a16z analysis define exactly what a genuine context layer must do:

  • No data migration. A context layer that requires moving data is a rearchitecting project, not a product. It must sit above existing systems as a governance and interpretation layer, not replace them.
  • Cross-system reconciliation, not single-system scope. An agent querying across a CRM, warehouse, and finance system cannot receive three independently scoped answers that it then has to adjudicate. Reconciliation must happen across all connected systems simultaneously at runtime.
  • Token efficiency by design. The context injection a16z describes (via API or MCP) fails at scale if context is re-processed per query. Business context must be indexed once; every subsequent interaction runs from governed code, not from the model re-ingesting organizational knowledge on each call.
  • Self-updating without manual maintenance. When data systems change, context must propagate automatically. A layer that requires a data engineer to update YAML files after every schema change is the old semantic layer problem with a new name.

LazyFox’s three-layer architecture, structural (schema mapping), logical (metric definitions and governance), contextual (tribal knowledge and business rules), is designed around all four constraints. It is vendor-agnostic because context lives in code that your team owns, not in a model provider’s weights. It reconciles across all connected systems simultaneously. It indexes once and serves from governed code. And it propagates changes automatically when upstream definitions shift.

The 167% net revenue retention LazyFox has achieved with an early enterprise customer reflects exactly the compounding dynamic a16z describes: each definition captured, each tribal knowledge rule encoded, each drift event detected makes the context layer more accurate and more comprehensive over time. The value does not reset on a contract cycle. It compounds.

"We are at an interesting point in time of market development, where the problem of a lack of context has become apparent, but we are still in the early innings of building solutions. The future is exciting, perhaps the vision of truly self-serve analytics can be fully realized."
Jason Cui & Jennifer Li, Andreessen Horowitz. March 2026

See the context layer in practice.

Book a technical walkthrough. We’ll connect LazyFox to your data stack and show exactly what your agents are missing.