How Artificial Intelligence enhances data catalogs

June 5, 2019

Can machines think? We are talking about artificial intelligence, “the biggest myth of our time”!

A simple definition of AI could be: “a set of applied theories and techniques to create machines capable of simulating intelligence.” One of these techniques is deep learning, a machine learning method that uses multi-layer neural networks to learn from data.

For these techniques to be useful, data must be understood and accessible. With the help of an intelligent data catalog, data users such as data scientists can easily search for and efficiently choose the right datasets for their machine learning algorithms.

Let’s see how.

Search engine: facilitating dataset search

By connecting to all of an enterprise’s data sources, a data catalog can efficiently collect as much documentation (otherwise known as metadata) as possible from its storage systems.

This information, indexed and filterable in Zeenea’s search engine, lets data users quickly find the datasets they need for their information systems.
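
To make the idea of indexed, filterable metadata more concrete, here is a minimal sketch in Python. It is not Zeenea’s implementation: the `CATALOG` records, the `build_index` and `search` functions, and the field names are all hypothetical, and a real catalog would harvest its metadata through connectors rather than from a hard-coded list.

```python
from collections import defaultdict

# Hypothetical metadata records (assumption: a real catalog harvests these from its sources).
CATALOG = [
    {"name": "customer_orders", "source": "postgres", "tags": ["sales"],
     "description": "Orders placed by customers"},
    {"name": "web_sessions", "source": "s3", "tags": ["marketing"],
     "description": "Clickstream sessions from the website"},
    {"name": "customer_profiles", "source": "postgres", "tags": ["crm"],
     "description": "Customer master data"},
]

def tokenize(text):
    """Lowercase the text and split it into words, treating underscores as spaces."""
    return text.lower().replace("_", " ").split()

def build_index(records):
    """Map each token found in a name, description, or tag to the positions of matching records."""
    index = defaultdict(set)
    for pos, rec in enumerate(records):
        text = " ".join([rec["name"], rec["description"], *rec["tags"]])
        for token in tokenize(text):
            index[token].add(pos)
    return index

def search(records, index, query, source=None):
    """Return names of datasets matching every query token, optionally filtered by source."""
    tokens = tokenize(query)
    if not tokens:
        return []
    hits = set.intersection(*(index.get(t, set()) for t in tokens))
    return [records[i]["name"] for i in sorted(hits)
            if source is None or records[i]["source"] == source]

INDEX = build_index(CATALOG)
print(search(CATALOG, INDEX, "customer"))
print(search(CATALOG, INDEX, "sessions", source="s3"))
```

The index is built once, so each search only intersects a few small sets instead of rescanning every record. The `source` argument stands in for the kind of filter a user might apply on top of a keyword query.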

Recommendation system

Guiding Data Scientists in their choices

An intelligent data catalog relies on “fingerprinting” technology. This intelligent feature recommends to data users the datasets most relevant to their projects, based on, among other things:

  • How the data is used,
  • The quality and scoring of the documentation,
  • Their previous searches,
  • What other users search for.
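
The criteria listed above can be combined into a single relevance score. The sketch below is one possible way to do it, not a description of any vendor’s recommender: the `DatasetSignals` fields, the `WEIGHTS` values, and the sample candidates are all assumptions chosen for illustration.

```python
from dataclasses import dataclass

@dataclass
class DatasetSignals:
    name: str
    usage_count: int        # how often the dataset is used
    doc_score: float        # documentation quality, from 0.0 to 1.0
    own_search_hits: int    # matches with the user's previous searches
    peer_search_hits: int   # matches with what other users search for

# Hypothetical weights (assumption: a real system would tune or learn them).
WEIGHTS = {"usage": 0.3, "doc": 0.3, "own": 0.25, "peer": 0.15}

def _ratio(value, maximum):
    """Scale a count to the 0..1 range, guarding against an all-zero column."""
    return value / maximum if maximum else 0.0

def recommend(candidates, top_n=3):
    """Rank candidate datasets by a weighted sum of the four signals."""
    max_usage = max((c.usage_count for c in candidates), default=0)
    max_own = max((c.own_search_hits for c in candidates), default=0)
    max_peer = max((c.peer_search_hits for c in candidates), default=0)
    scored = []
    for c in candidates:
        score = (
            WEIGHTS["usage"] * _ratio(c.usage_count, max_usage)
            + WEIGHTS["doc"] * c.doc_score
            + WEIGHTS["own"] * _ratio(c.own_search_hits, max_own)
            + WEIGHTS["peer"] * _ratio(c.peer_search_hits, max_peer)
        )
        scored.append((score, c.name))
    # Highest score first; ties are broken alphabetically for a stable order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [name for _, name in scored[:top_n]]

CANDIDATES = [
    DatasetSignals("sales_2019", usage_count=100, doc_score=0.9, own_search_hits=2, peer_search_hits=10),
    DatasetSignals("sales_raw", usage_count=50, doc_score=0.2, own_search_hits=4, peer_search_hits=5),
    DatasetSignals("hr_payroll", usage_count=10, doc_score=0.8, own_search_hits=0, peer_search_hits=1),
]
print(recommend(CANDIDATES))
```

With these sample values, the heavily used and well-documented `sales_2019` ranks first, even though `sales_raw` matches the user’s own searches more often.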

Give more meaning to their datasets

This feature offers data users who are responsible for a particular dataset suggestions for its documentation. These recommendations can, for example, include tags, contacts, or even business terms taken from other datasets, based on:

  • An analysis of the data itself (statistical analysis),
  • Similarities between its schema and those of other datasets,
  • Links to the fields of other datasets.

Automatically contextualizing datasets in a data catalog lets any data user work with data that is understood and appropriate for their use cases.
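
As an illustration of the schema-based criterion, the sketch below compares a new dataset’s schema with already documented ones and proposes their tags when the schemas overlap enough. This is only one plausible reading of “fingerprinting”, using Jaccard similarity as a stand-in. The function names, the `threshold` value, and the sample datasets are hypothetical.

```python
def schema_fingerprint(fields):
    """Reduce a schema to a set of (normalized field name, type) pairs."""
    return {(name.strip().lower(), dtype.lower()) for name, dtype in fields}

def jaccard(a, b):
    """Share of elements common to both sets, from 0.0 (disjoint) to 1.0 (identical)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def suggest_tags(new_fields, documented, threshold=0.5):
    """Collect tags from documented datasets whose schema resembles the new one."""
    target = schema_fingerprint(new_fields)
    suggestions = set()
    for dataset in documented:
        if jaccard(target, schema_fingerprint(dataset["fields"])) >= threshold:
            suggestions.update(dataset["tags"])
    return sorted(suggestions)

# Hypothetical datasets that already carry documentation.
DOCUMENTED = [
    {"name": "customers_eu",
     "fields": [("customer_id", "int"), ("email", "string"), ("country", "string")],
     "tags": ["crm", "personal-data"]},
    {"name": "sensor_readings",
     "fields": [("sensor_id", "int"), ("value", "float"), ("ts", "timestamp")],
     "tags": ["iot"]},
]

# A new, undocumented dataset with a similar schema.
NEW_FIELDS = [("Customer_ID", "INT"), ("email", "string"),
              ("country", "string"), ("signup_date", "date")]
print(suggest_tags(NEW_FIELDS, DOCUMENTED))
```

Here the new dataset shares three of four distinct fields with `customers_eu`, so it inherits that dataset’s tags as suggestions. A data steward would still review them before they are applied.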

Automatic dataset linking: visualizing your data’s life cycle

As mentioned above, with fingerprinting technology, a data catalog can recognize related datasets and link them to one another. This is data lineage: a visual representation of the data’s life cycle.
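
Once links between datasets are known, lineage can be treated as a directed graph. The following sketch walks that graph upstream to list everything a dataset depends on. The `LINKS` pairs and dataset names are invented for the example, and how a given catalog detects the links is outside its scope.

```python
from collections import defaultdict, deque

# Hypothetical links: (upstream dataset, downstream dataset).
LINKS = [
    ("crm_export", "customers_clean"),
    ("web_logs", "sessions"),
    ("customers_clean", "customer_360"),
    ("sessions", "customer_360"),
    ("customer_360", "churn_features"),
]

def build_upstream_map(links):
    """Map each dataset to the set of datasets it is directly derived from."""
    upstream = defaultdict(set)
    for source, target in links:
        upstream[target].add(source)
    return upstream

def lineage(dataset, upstream):
    """Return every ancestor of `dataset`, found with a breadth-first traversal."""
    seen, queue = set(), deque([dataset])
    while queue:
        current = queue.popleft()
        for parent in upstream.get(current, ()):
            if parent not in seen:  # also protects against cycles
                seen.add(parent)
                queue.append(parent)
    return sorted(seen)

UPSTREAM = build_upstream_map(LINKS)
print(lineage("churn_features", UPSTREAM))
```

The same edge list could feed a graph-drawing tool to produce the visual representation; the traversal only answers the question “where does this data come from?”.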

Automatic error detection: be aware of errors in datasets

To avoid data interpretation problems, an intelligent data catalog must be able to automatically detect errors or inconsistencies in the quality and documentation of any data.

This key feature, based on the analysis of the data or its documentation, must alert data users to any integrity issues.
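
A few simple checks already cover both sides of this analysis, the data and its documentation. The sketch below flags columns with too many missing values, mixed value types, or no description. The `check_dataset` function, its `max_null_ratio` threshold, and the sample rows are assumptions made for illustration; real catalogs may rely on far richer profiling.

```python
def check_dataset(rows, documentation, max_null_ratio=0.2):
    """Return a list of human-readable alerts for one dataset."""
    alerts = []
    columns = sorted({key for row in rows for key in row})
    for column in columns:
        values = [row.get(column) for row in rows]
        # Data check 1: share of missing values.
        null_ratio = sum(v is None for v in values) / len(values)
        if null_ratio > max_null_ratio:
            alerts.append(f"{column}: {null_ratio:.0%} missing values")
        # Data check 2: more than one Python type among the non-missing values.
        kinds = {type(v).__name__ for v in values if v is not None}
        if len(kinds) > 1:
            alerts.append(f"{column}: mixed types {sorted(kinds)}")
        # Documentation check: the column has no description.
        if not documentation.get(column, "").strip():
            alerts.append(f"{column}: no description")
    return alerts

# Hypothetical sample rows and column descriptions.
ROWS = [
    {"id": 1, "amount": 10.5, "country": "FR"},
    {"id": 2, "amount": "12", "country": None},
    {"id": 3, "amount": 7.0, "country": None},
]
DOCS = {"id": "Order identifier", "amount": "Order total in euros"}

for alert in check_dataset(ROWS, DOCS):
    print(alert)
```

Returning plain messages keeps the example short. In practice, alerts would more likely be structured records that the catalog can display next to the affected column.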

GDPR notification: be notified of sensitive information

An intelligent data catalog must be able to detect personal or private data in any given dataset and flag it in its interface. This feature helps enterprises meet the GDPR requirements that took effect in May 2018, and alerts potential users to the sensitivity level of the data and to how it may be used.
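
A crude way to spot personal data is to combine hints from column names with pattern matching on sample values. The sketch below does exactly that and nothing more. The regular expressions, the `NAME_HINTS` list, and the `min_match_ratio` threshold are illustrative assumptions, and a heuristic like this is no substitute for a proper GDPR compliance process.

```python
import re

# Deliberately simple patterns (assumption: real detectors are far more thorough).
EMAIL = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")
PHONE = re.compile(r"^\+?[\d\s().-]{7,}$")
NAME_HINTS = ("email", "phone", "name", "address", "birth", "ssn")

def detect_personal_columns(rows, min_match_ratio=0.8):
    """Return {column: [reasons]} for columns that look like personal data."""
    flagged = {}
    columns = sorted({key for row in rows for key in row})
    for column in columns:
        reasons = []
        # Hint 1: the column name itself suggests personal data.
        if any(hint in column.lower() for hint in NAME_HINTS):
            reasons.append("column name")
        # Hint 2: most non-missing values match a known pattern.
        values = [str(row[column]) for row in rows if row.get(column) is not None]
        for label, pattern in (("email values", EMAIL), ("phone values", PHONE)):
            if values:
                matches = sum(bool(pattern.match(v)) for v in values)
                if matches / len(values) >= min_match_ratio:
                    reasons.append(label)
        if reasons:
            flagged[column] = reasons
    return flagged

# Hypothetical sample rows.
SAMPLE = [
    {"user_id": 101, "contact": "ana@example.com", "phone_number": "+33 1 23 45 67 89", "plan": "pro"},
    {"user_id": 102, "contact": "li@example.org", "phone_number": "01 98 76 54 32", "plan": "free"},
]
print(detect_personal_columns(SAMPLE))
```

Note that the `contact` column is caught by its values alone, which is the case a name-based rule would miss. The phone pattern can also produce false positives on long numeric identifiers, so flagged columns should be confirmed by a person.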


At Zeenea, we work hard to create a data fluent world by providing our customers with the tools and services that allow enterprises to be data driven.
