TechBooky

DataHub Turns SQL Query History into Context Layer to Cut AI Data Errors

by Paul Balo
May 29, 2026
in Artificial Intelligence, Software

DataHub is introducing a new context intelligence layer that mines years of SQL query logs to help AI agents stop making basic mistakes when working with enterprise data.

The company says the approach is designed to solve a problem many teams are already running into: large language model (LLM) agents hitting data warehouses directly and returning wrong answers because they lack the context humans take for granted.

One example comes from Miro’s data team. When they pointed AI agents straight at their Snowflake environment, the agents produced incorrect results more than 65% of the time. The underlying problem was not the model itself: the agents were dropped into an environment with more than 10,000 tables and no semantic layer to indicate which tables or joins to use for a given business question.

In that kind of sprawl, agents have no reliable way to map a natural-language request to the right data assets. They often guess, hallucinating joins and table combinations that look plausible in SQL but don’t reflect how analysts actually work with the data.

DataHub’s new capability, called Context Intelligence, aims to fix that by treating past analyst behaviour as the ground truth for how data should be used. Instead of relying only on raw schemas and table names, it analyses existing SQL query history to build a semantic index of which tables, joins and patterns have successfully answered real business questions.
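To make the idea concrete, here is a minimal sketch of how a query log could be reduced to a table co-occurrence index. It is not DataHub's implementation: the sample queries, the `TABLE_REF` regex, and the `tables_in` and `build_join_index` helpers are all hypothetical, and a production system would use a real SQL parser rather than a regular expression.

```python
import re
from collections import Counter

# Hypothetical sample of historical analyst queries (illustrative, not real log data).
QUERY_LOG = [
    "SELECT o.id, c.name FROM orders o JOIN customers c ON o.customer_id = c.id",
    "SELECT c.region, SUM(o.total) FROM orders o "
    "JOIN customers c ON o.customer_id = c.id GROUP BY c.region",
    "SELECT o.id, p.sku FROM orders o "
    "JOIN order_items i ON o.id = i.order_id "
    "JOIN products p ON i.product_id = p.id",
]

# Matches the table name after FROM or JOIN. This is a crude stand-in for a
# SQL parser and will misread constructs such as EXTRACT(YEAR FROM col).
TABLE_REF = re.compile(r"\b(?:FROM|JOIN)\s+([A-Za-z_][\w.]*)", re.IGNORECASE)


def tables_in(query):
    """Return the sorted, lower-cased table names referenced in one query."""
    return sorted({name.lower() for name in TABLE_REF.findall(query)})


def build_join_index(queries):
    """Count how often each unordered pair of tables appears in the same query."""
    index = Counter()
    for query in queries:
        tables = tables_in(query)
        for i, left in enumerate(tables):
            for right in tables[i + 1:]:
                index[(left, right)] += 1
    return index


JOIN_INDEX = build_join_index(QUERY_LOG)

# Most frequently co-queried table pairs first.
for pair, count in JOIN_INDEX.most_common():
    print(pair, count)
```

Pairs that analysts have queried together many times rank first, while a pair that never appears in the log has a count of zero, which is the kind of signal an agent could use to avoid inventing a join.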

That index is then exposed directly to AI agents through several popular agent frameworks and protocols, including the Model Context Protocol (MCP), LangChain, Google’s Agent Development Kit and CrewAI. In practice, this means an agent can look up how human analysts have historically queried a given metric or domain, and reuse those patterns rather than inventing joins from scratch.
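The lookup step might resemble the sketch below, in which an agent retrieves the closest past query before writing any SQL of its own. Everything here is an assumption made for illustration: the `HISTORY` entries, their plain-language descriptions, and the `lookup` function are hypothetical, and simple token overlap (Jaccard similarity) stands in for whatever semantic retrieval the actual product uses.

```python
import re

# Hypothetical (description, SQL) pairs standing in for an index built from
# query history. Real logs would not come with descriptions attached.
HISTORY = [
    ("monthly revenue by region",
     "SELECT c.region, SUM(o.total) FROM orders o "
     "JOIN customers c ON o.customer_id = c.id GROUP BY c.region"),
    ("top selling products",
     "SELECT p.sku, COUNT(*) FROM order_items i "
     "JOIN products p ON i.product_id = p.id GROUP BY p.sku"),
    ("active users last week",
     "SELECT COUNT(DISTINCT user_id) FROM sessions "
     "WHERE started_at >= CURRENT_DATE - 7"),
]


def tokens(text):
    """Split text into a set of lower-case alphabetic tokens."""
    return set(re.findall(r"[a-z]+", text.lower()))


def lookup(question, history, k=1):
    """Return up to k (score, description, sql) tuples ranked by Jaccard overlap."""
    q = tokens(question)
    scored = []
    for description, sql in history:
        d = tokens(description)
        union = q | d
        score = len(q & d) / len(union) if union else 0.0
        if score > 0:  # skip entries that share no words with the question
            scored.append((score, description, sql))
    scored.sort(key=lambda item: item[0], reverse=True)
    return scored[:k]


for score, description, sql in lookup("revenue by region last quarter", HISTORY):
    print(f"{score:.2f}  {description}\n      {sql}")
```

An empty result is a useful outcome in its own right: it tells the agent that no analyst precedent exists for the question, so any SQL it generates should be treated with more caution.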

According to co-founder and CTO Shirshanka Das, the goal is to let enterprises turn “years of analyst query history into a living, retrievable knowledge base where agents stop hallucinating joins because they have access to the joins that have worked before, validated by the people who ran them.”

Context Intelligence is built on the same query-log infrastructure DataHub has already used for lineage tracking in production deployments worldwide. That lineage work focuses on understanding how data flows from operational systems, through streaming infrastructure, into warehouses and on to downstream business tools. The new layer effectively repurposes that foundation to serve LLM-based agents.

DataHub itself began life inside LinkedIn as a metadata management project. It was created to tackle two challenges at once: making data across the organization easier to discover and use, while ensuring that data was used only for appropriate purposes.

Das, who led data infrastructure at LinkedIn for nearly 11 years, helped drive that effort before open-sourcing DataHub in early 2020, after nearly six years of internal development. Since then, the open source project has grown significantly, with more than 15,000 contributors and 3,000 production deployments around the world.

Over the years, lineage has been a primary use case for DataHub users: tracking how data moves and transforms across complex stacks, and supporting needs like regulatory compliance audits and operational triage. By layering Context Intelligence on top of this lineage-aware foundation, DataHub is now positioning its platform as a bridge between traditional data cataloguing and the new generation of AI agents that need reliable, enterprise-specific context to operate safely.

The company’s bet is that query history provides a far richer, more practical signal for agent routing than schemas alone. Where schemas describe what data exists, query logs capture how experts actually use that data, information that can be turned into a guidebook for AI systems navigating large warehouse environments.

For organizations experimenting with LLM agents on top of platforms like Snowflake, the message is clear: without a way to encode and surface hard-won institutional knowledge about tables, joins and trusted patterns, even sophisticated models can fail badly. DataHub’s Context Intelligence is an attempt to close that gap by elevating SQL query history into a first-class source of truth for AI-driven analytics.

Tags: ai agent, database, datahub, sql
Paul Balo

Paul Balo is the founder of TechBooky and a highly skilled wireless communications professional with a strong background in cloud computing, offering extensive experience in designing, implementing, and managing wireless communication systems.
