a16z mapped the infrastructure gap blocking enterprise AI. Here is why what they identify is an architectural requirement, and what it demands in practice.
"Some of the most important context is implicit, conditional, and historically contingent, and only exists as tribal knowledge inside teams. Human input provides the final crucial links that enable true agent automation."Jason Cui & Jennifer Li, Andreessen Horowitz. March 2026
An executive summary of the research thesis, and why it describes an infrastructure requirement, not a product category.
In March 2026, Andreessen Horowitz partners Jason Cui and Jennifer Li published a detailed analysis of why enterprise AI data agent deployments are failing. Their conclusion is crisp: agents aren’t failing because models are weak. They are failing because enterprises deploy agents against data they cannot actually interpret, data without governed definitions, without tribal knowledge, without a reliable source of truth.
The article traces a decade of data infrastructure evolution. The modern data stack transformed how enterprises ingest, clean, and warehouse data. Then in 2024–2025, as LLM capabilities grew, organizations rushed to deploy agents on top of that stack. The agents hit a wall. Not a model wall. A context wall.
The canonical example they use is devastating in its simplicity: an agent asked "What was revenue growth last quarter?" fails, not because it cannot write SQL, but because it does not know what revenue means in this company, which fiscal quarter is meant, or which of three tables named fct_revenue, mv_revenue_monthly, and mv_customer_mrr is the authoritative source.
"The agent isn’t given the proper business context to answer even the most basic questions. There needs to be up-to-date and maintained context that not only understands how an enterprise works and how the data systems are structured, but also maintains the tribal knowledge to tie everything together."
Jason Cui & Jennifer Li, Andreessen Horowitz, March 2026 , Read the full article →a16z maps what a modern context layer must include that a traditional semantic layer does not: canonical entity resolution, tribal knowledge in natural language, governance rules for agents, and a self-updating architecture that keeps context current as systems evolve. What they describe is not a product category. It is an infrastructure requirement. And it points directly to what LazyFox is built to close.
Agents don’t fail on SQL generation. They fail on meaning, and meaning cannot be retrieved from a schema. It has to be governed.
The a16z example is worth examining closely because it is not exotic. "What was revenue growth last quarter?" is a question every enterprise answers dozens of times a week. What makes it hard for an agent is exactly what makes it easy for someone who has been at the company for three years: they know that "revenue" means ARR, not run rate; that the fiscal quarter ends in March; that fct_revenue is the source of truth after the Q4 migration, but that mv_revenue_monthly is what finance still uses for historical comparisons pre-2024.
That knowledge does not live in a schema. It lives in a person, or in a stale YAML file from a data team member who left two years ago. When the agent looks at the warehouse, it finds three tables with similar names and no signal about which one is authoritative, no definition of the fiscal boundary, and no mapping from the business term "revenue" to the engineering artifact fct_revenue.
a16z is explicit that the fix is not a better model. It is not a more capable text-to-SQL system. It is a governed layer that captures metric definitions as maintained, versioned code that agents can query before they act. Traditional semantic layers cover some of this, but they are hand-built by data teams in BI-specific syntax, connected to a single tool, and cannot capture the implicit business rules that make definitions actually correct.
"Just like developers can set up .cursorrules files to guide agents and control output behavior, data practitioners can maintain rules and guidelines, the context layer becomes a multi-dimensional corpus where code lives alongside natural language."
Jason Cui & Jennifer Li, Andreessen Horowitz, March 2026This is precisely the logical layer of LazyFox’s three-tier semantic architecture. Metric definitions are captured as governed, versioned code. Semantic drift detection flags when a definition diverges across connected systems. When an agent asks "what is revenue?" it does not receive three conflicting answers from three different tables. It receives the single, current, governed definition, and the source of truth to query against it.
Without a governed context layer, agents guess. With one, they execute from definitions your team controls, indexed once, served from code on every subsequent query.
a16z identifies the one thing automated context construction cannot supply, and why it is the reason most context layers fail before they start.
a16z is specific about where automated context construction reaches its limit: "It is tempting to set agents loose and have them collect all internal knowledge, but some of the most important context is implicit, conditional, and historically contingent, and only exists as tribal knowledge inside teams."
The example they give is surgical in its precision: "For CRM data, look at Affinity for all new USCAN deals from 2025 onwards but Salesforce for all global leads before that." No automated system will derive this instruction from schema inspection. It exists because a migration happened on a specific date, because one team adopted a new tool and another did not, because a commercial decision was made at a specific moment in time that is now embedded in the data architecture but documented nowhere.
The key insight from a16z is that these rules are not exceptions to the context layer, they are the point of the context layer. Metric definitions can be partially automated. Source-of-truth selection for historical edge cases cannot. This is the step that separates a context layer that works in demos from one that works in production at 2 a.m. on a quarter close.
"The context layer becomes a multi-dimensional corpus where code lives alongside natural language, capturing any context an agent might need. Just like developers can set up .cursorrules files to guide agents, data practitioners can maintain rules and guidelines."
Jason Cui & Jennifer Li, Andreessen Horowitz, March 2026LazyFox’s contextual layer is built for exactly this. Business rules , "use the lakehouse for post-migration revenue, the legacy warehouse for pre-2024 comparisons", are captured in natural language alongside metric definitions. They are indexed once. Every agent query reads from governed code at runtime; no model re-processes that context on each call, adding neither latency nor cost per query.
And when a rule changes, when the migration completes, when a new product line launches, when the fiscal year boundary shifts, the change propagates across every connected agent automatically. The context layer is a living document, not a snapshot.
Most context solutions reconcile one system at a time. LazyFox reconciles meaning across all connected systems simultaneously at runtime, so agents always receive a single, current answer regardless of how many sources conflict.
a16z names a new market category. Here is what separates a genuine context layer from the alternatives, and what the architectural requirements actually demand.
a16z closes their analysis with a market observation that is straightforward but important: "Realistically not every enterprise can (or should) build this in house." They map three categories of solutions: data gravity platforms adding lightweight context features (Databricks, Snowflake), AI data analyst companies that have pivoted toward context construction as a core competency, and a third category, dedicated context layer companies, building from the ground up. They are explicit that the third is early, but equally explicit that the mandate is clear.
LazyFox sits in that third category. And the architectural requirements that fall out of the a16z analysis define exactly what a genuine context layer must do:
LazyFox’s three-layer architecture, structural (schema mapping), logical (metric definitions and governance), contextual (tribal knowledge and business rules), is designed around all four constraints. It is vendor-agnostic because context lives in code that your team owns, not in a model provider’s weights. It reconciles across all connected systems simultaneously. It indexes once and serves from governed code. And it propagates changes automatically when upstream definitions shift.
The 167% net revenue retention LazyFox has achieved with an early enterprise customer reflects exactly the compounding dynamic a16z describes: each definition captured, each tribal knowledge rule encoded, each drift event detected makes the context layer more accurate and more comprehensive over time. The value does not reset on a contract cycle. It compounds.
"We are at an interesting point in time of market development, where the problem of a lack of context has become apparent, but we are still in the early innings of building solutions. The future is exciting, perhaps the vision of truly self-serve analytics can be fully realized."Jason Cui & Jennifer Li, Andreessen Horowitz. March 2026
Book a technical walkthrough. We’ll connect LazyFox to your data stack and show exactly what your agents are missing.