February 25, 2025 in Executive Edge

Mastering the Future: Integrating AI into MDM Strategies

Thomas Wyrick

SHARE: PRINT ARTICLE:

https://doi.org/10.1287/LYTX.2025.01.12

Is your data doing what it should? For many organizations, the answer is likely no. According to Harvard Business Review, “on average, 47% of newly-created data records have at least one critical (e.g., work-impacting) error.”

This is where master data management (MDM) plays a vital role. By ensuring the accuracy and integration of enterprise data, MDM lays the groundwork for reliable operations. With the introduction of artificial intelligence (AI), businesses now have unprecedented opportunities to overcome data challenges and unlock new levels of efficiency.

There are three main challenges with MDM today:

Data Quality Validation: Traditional data validation relies on straightforward business rules, which can miss anomalies such as new data formats or sources. When data deviates from these rules, it’s often ignored – leading to missed insights or errors that can ripple across systems.
Cleansing and Enrichment: This process enhances business rules and APIs, but often lacks flexibility, requires specialized expertise (e.g., regex configurations) and can introduce high costs. Solutions from providers, such as Dun & Bradstreet and Experian, are helpful but not without constraints.
Entity Resolution (Matching): This often relies on rules-based methods, which struggle to account for every possible data variation. Probabilistic matching, although useful for structured data, frequently falls short when managing edge cases or complex datasets.

The good news is that AI has the potential to address and overcome all three of these challenges.

Practical Strategies to Incorporate AI into MDM

To dive into this further, here are some common enterprise scenarios and how AI can help.

Scenario #1: Unplanned Data Quality Issues. Data quality issues can arise unexpectedly – such as instances in which Chinese characters replace fields typically reserved for English, or when sales staff repurpose columns for irrelevant data. These issues can disrupt reporting and decision-making.

AI Solution: Outlier Detection. AI models can analyze the structure of your data, flagging outliers that deviate from expected patterns. These anomalies can then be routed for human review or addressed with refined validation rules, ultimately strengthening accuracy and reliability.

Scenario #2: Rule- and API-Based Enrichers Are Complex and Costly. Implementing rule-based enrichment tools can be tedious, whereas traditional web services often introduce high costs, making them impractical for many enterprises.

AI Solution: Leverage LLMs for Data Enrichment. Large language models (LLMs) can simplify data enrichment by generating precise values through API calls. Hosting the LLM within your own cloud environment ensures faster response times and optimized computing performance. With significant advancements in open-source LLMs over the past year, this approach has become not only viable but also highly cost-effective for businesses.

Scenario #3: Limitations of Traditional Entity Resolution Systems. Traditional entity resolution systems rely on either business rules or probabilistic matching techniques. Both approaches share a critical limitation: Managing the complexity of all possible permutations of record matches is an overwhelming challenge. A common method involves creating three datasets: matches, non-matches and potential matches requiring manual review. However, these manual reviews are often labor-intensive, slowing down processes and rendering the system difficult to scale.

AI Solution: Leveraging Supervised Learning for Entity Resolution. Supervised learning models offer a transformative solution by analyzing labeled historical data to predict outcomes for unlabeled data. In the context of entity resolution, as users label the dataset of potential matches, these models learn from their decisions. Over time, the models can accurately predict human responses, thereby reducing or even eliminating the need for manual intervention. This approach significantly enhances efficiency and scalability while maintaining high accuracy in matching.

Future Trends in AI and MDM

AI is already rapidly transforming MDM, with additional advancements on the horizon. In the near term, AI copilots will simplify complex tools, reducing the complexity for end users, and vector-based natural language querying will make interacting with data more intuitive through LLMs.

Long-term innovations such as LLM-based entity resolution and the convergence of MDM, data catalogs and AI promise to automate key processes – identifying master data sources, building pipelines and creating “golden records” with minimal manual effort.

Many MDM tools already incorporate AI, often leveraging models like OpenAI, Claude or LLAMA. However, companies must ensure these integrations align with information technology risk policies and long-term objectives. As AI continues to transform MDM, organizations that adopt these advancements will streamline operations, enhance efficiency, reduce complexity and gain a powerful competitive edge in their industries.

Thomas Wyrick

Thomas Wyrick is director, U.S. Professional Services at Semarchy.

Keywords: