Waterline Data’s AI-Driven Data Catalog 5.0 is First to Connect, Govern and Rationalize Data Across Cloud and On-Premises Platforms
Mountain View, Calif. — February 5, 2019 — Waterline Data, a global leader in data cataloging solutions and applications, today announced it has released the latest version of the Waterline Data AI-driven Data Catalog (AIDC), a platform that continues to set the pace for innovation while helping global organizations such as Fannie Mae and Nordea accelerate the analytics process. Leveraging Waterline’s newly-patented Fingerprinting ™ technology, AIDC 5.0 adds to Waterline’s ability to catalog individual data items, automatically and at scale, by now allowing organizations to work with related data sets across the enterprise for far deeper and cleaner insights for analytics than any other catalog can provide.
“Fannie Mae began working with Waterline Data, because we needed to provide our analysts with fast access to growing volumes of high quality, governed data in order to remain competitive in today’s data economy,” said Prakash Jaganathan, Data Management Leader, Fannie Mae. “With Waterline Data’s AI-driven Data Catalog, we were able to completely modernize our cloud data environment, fully automating and accelerating the cataloging and searchability of data to deliver game changing value to the business and our customers.”
“Since it debuted its AI-driven Data Catalog five years ago, Waterline has been differentiated by its data fingerprinting approach, supporting automated tagging of data and metadata to drive competitive data-driven decision making,” said Matt Aslett, Research VP, 451 Research. “The company is well-placed to take advantage of growing interest in data cataloging, and in particular in enterprise data catalogs that support the automated cataloging of data across the data estate—including on-prem and cloud environments.”
AIDC 5.0’s new features include:
AIDC 5.0 allows business analysts to create a combined data asset by pulling data from different systems and formats and publishing those as reusable Data Objects. AIDC 5.0 enables users to find joinable assets that contain the data they are interested in, instantly test possible join options even for data from different systems that has never been joined before, and analyze the resulting joins—all without the usual guesswork, added time, and happenstance discovery of useful data combinations. This is a big first for a data catalog industry that otherwise only offers single SQL source queries for simpler findings and faster time to insight.
Data Rationalization Dashboard
The new rationalization dashboard allows organizations to eliminate clutter in cloud migrations and identify potential security/privacy risks for cleaner searches and a healthier data estate. The dashboard identifies redundant data to help guide users to the approved master version. It also identifies copies that are no longer identical. These insights effectively root out redundant data to aid in reducing escalating licensing and storage costs while identifying bad copies in order to only surface the data that matters.
Business Rules Engine
AIDC 5.0’s patented Fingerprinting technology creates a secure and consistent logical view of enterprise data assets. Business rules leverage this view to help data stewards create consistent data governance rules that can be applied to all data assets regardless of platform, format or schema naming conventions. The rules combine data and metadata predicates to perform data quality checks as well as identify regulatory compliance targets, metadata quality checks, and programmatic tagging. The new rules engine is designed to create an automated high quality, compliant data estate to support data driven decision making and self-service analytics.
The power of the Waterline Data AI-driven Data Catalog and its latest 5.0 features is in its Fingerprinting method, driven by AI-based, machine-learning technology that analyzes thousands of points per field in each data set. Combined with human oversight, Waterline Data’s Fingerprinting technology reduces manual tagging of data by over 80%, allowing organizations to derive value from highly trusted data in days instead of weeks or months.
“Waterline Data’s AI-driven Data Catalog 5.0 is the latest in a long line of data cataloging innovations,” said Waterline Data CEO Kailash Ambwani. “After debuting the first data catalog in the market, we continued to innovate with our AI and ML capabilities, leveraging big data and cloud technology to be the first to break the data catalog ceiling from being able to process a million rows of data in thousands of data sets to a billion rows in millions of data sets. We are now continuing our tradition of innovation by being the first to introduce modern day rationalization, data objects and the rule-based engine for compliance, data quality and tagging that bring the power of data cataloging to even greater heights.”
About Waterline Data
Waterline Data automates data discovery, compliance and the ability to take action on data by using a powerful combination of artificial intelligence, machine learning, ratings and reviews, and tribal knowledge to deliver an AI-driven Data Catalog. Our customers spend less time searching for data and more time using it to derive value while complying with data governance mandates such as GDPR. The company is funded by Menlo Ventures, Jackson Square Ventures, Partech Ventures, and Infosys, and implemented in large enterprises around the globe. Founded in 2013, the company is headquartered in Mountain View, California. For more, visit us via www.waterlinedata.com, Twitter or LinkedIn.
Enterprise data leaders interested in seeing Waterline Data in action can register for a personalized live demo here.