Step 1: Make Data Products Findable for Data Users and AI Agents

Session 1 Valuable, Managed, Usable (28).png

Managing data is only the starting point. The ultimate goal is to make data usable—and that begins with making it easy to find. Data products must be discoverable, understandable, consumable, and interoperable. If they cannot be located or understood by users and AI systems, their potential value is lost. Findability is a fundamental condition for data to generate impact.

Avoiding the Catalog Dumping Ground

Traditional data catalogs have often failed to serve their intended purpose. Over time, many have become cluttered repositories where all available data is dumped without structure or quality control. As a result, users face overwhelming search results where high-value data is buried under irrelevant or outdated content. A modern approach must shift from indiscriminate data aggregation to curated, purposeful design.

Key points

1. Findability is a product experience: If search results are noisy, users stop trusting the catalog.

2. Curate before you scale: Put the few high-value, reusable products first.

3. Make trust visible: Certification, ownership, and definitions should be obvious at a glance.

Focused Curation of High-Value Data

To make “findable” scale, prioritize reusable data products:

Curate the high-value products that serve multiple use cases.
Make semantic definitions visible so people searching for business terms land on the right product.
Use the catalog to reduce duplication by steering teams to the already-trusted products.

Focused Curation of High-Value Data

Success starts with prioritization. Organizations should begin by identifying the most critical data sets—such as the top five in each domain—and ensure these are clean, certified, and well-documented. These prioritized assets should be clearly labeled, searchable through consistent naming, and accompanied by business-friendly descriptions and relevant use cases. Minimal standards such as basic data lineage and certification status must be visibly maintained.

This focus on quality over quantity transforms the experience of data discovery. When a user searches for common business terms—like "customer data" or "product categories"—the results should present the most relevant and trusted data products at the top, clearly differentiated from lower-value or outdated content.

Fixing the User Experience for People and Machines

Too often, the results from data catalog or analytics tool searches include low-quality entries: poorly documented, outdated, or abandoned datasets. Worse still, these often appear indistinguishable from trustworthy assets. As users unknowingly build on faulty data, inconsistencies multiply—leading to broken metrics, reporting errors, and loss of trust.

The solution is to simplify and certify. Data platforms should highlight the verified assets while removing or hiding irrelevant or untrusted entries. For AI agents and tools, the impact of this curation is profound. Clean and well-described top data sets enable AI systems to function with greater accuracy and confidence, as demonstrated through successful deployments of analytics assistants.

Avoiding the Catalog Dumping Ground

Focused Curation of High-Value Data

Focused Curation of High-Value Data

Fixing the User Experience for People and Machines

Conclusion