Logo Zeenea 2020
Actian Logo Rgb Horizontal
  • Product
        • TECHNOLOGY

        • Data Discovery Platform
        • Connectivity
        • Knowledge Graph
        • SOLUTIONS

        • Data Catalog
        • Enterprise Data Marketplace
        • CAPABILITIES

        • Business Glossary
        • Data Compliance
        • Data Discovery
        • Data Governance
        • Data Lineage
        • Data Quality
        • Data Shopping
        • Data Stewardship
        • Metadata Management
        • APPLICATIONS

        • Zeenea Studio
        • Zeenea Explorer
        • Unlock the Live Product Tour
  • Use Cases
        • INDUSTRIES

        • Financial Services
        • Manufacturing
        • Retail
        • Pharmaceuticals
        • DATA LEADERS

        • Data Managers
        • Data Producers
        • Data Consumers
        • The Practical Guide To Data Mesh Banner En
  • Resources
        • KNOWLEDGE HUB

        • Data Library
        • Customer Stories
        • Podcast
        • Blog
        • PRODUCT HUB

        • Product Updates
        • Zeenea Explained
        • The Data Democracy Podcast
  • Company
        • ABOUT

        • Our Story
        • Trust Center
        • GET IN TOUCH

        • Contact
        • Partner Network
        • SERVICES

        • Professional Services
        • Support
        • BELIEFS

        • Data Democracy
        • Data Mesh
        • What is a Data Democracy
  • Try Zeenea
  • Get a demo
  • English
    • Deutsch
    • Français
metacat-netflix
Data Inspiration

Metacat: Netflix makes their Big Data accessible and useful

March 29, 2019
March 29, 2019
29 March 2019

Like many numerous companies, Netflix has a colossal amount of data that come from many different data sources in various formats. As the leading streaming video on demand company (SVOD), data exploitation is, of course, a major strategic asset. Given the diversity of its data sources, the streaming platform wanted a way to federate and interact with these assets using a single tool. This led to the creation of Metacat.

This article explains the motivations behind the creation of Metacat, a metadata solution intended to facilitate the discovery, treatment, and management of Netflix’s data.

Read our previous articles on Google and AirBnB.

 

Netflix’s key figures

Netflix has come a long way since its DVD rental company in the 1990s. Video consumption on Netflix accounts for 15% of global internet traffic. But Netflix today is also:

 

  • 130 million paying subscribers worldwide (400% increase since 2011)
  • $10 billion turnover, including $403 million in profits
  • $100 billion market capitalizations, or the sum of all the leading television groups in Europe
  • $6 billion investment in original creations (TV shows and movies).

Netflix is also a data warehouse of 60 petabytes (60 million billion bytes), which is a real challenge for the firm to exploit and federate these data.

 

Netflix’s Big Data platform architecture

 

netflix metacat architecture

Its basic architecture includes three key services. These are the Execution Service (Genie), the Metadata Service (Metacat), and the Event Service (Microbot).

 

data sources netflix metacat

In order to operate between its different languages and data sources, which are not very compatible with each other, Metacat was born. This tool acts as a data and metadata access layer from Netflix’s data sources. A centralized service accessible by any data user in order to facilitate their discovery, treatment, and management.

 

Metacat & its features

Netflix has data queries, such as Hive, Pig, or Spark, that are not operable together. By introducing a common abstraction layer, Netflix can provide data access to its users, regardless of their storage systems.

In addition, Metacat goes so far as to simplify transferring one dataset to a datastore to another.

 

Business metadata

Hand-written, user-defined, business-oriented metadata, in free format can be added via Metacat. Its main information includes the connections, configurations, metrics, and the life cycles of each dataset. 

Data discovery

By creating Metacat, Netflix makes it easy for consumers to find business datasets. The tool publishes schema and business metadata defined by its users in Elasticsearch, making it easier to find full-text information in its data sources.

Data modification and audit

As a cross-functional tool for all data stores, Metacat registers and notifies all changes made to the metadata and the data itself from its storage systems.

 

Metacat and the future of Netflix

According to Netflix, the current version of Metacat is a step towards the new features they are working on. They still want to improve the visualization of their metadata, as it would be very useful for restoration purposes.

Metacat, according to Netflix, should also be able to have a plug-in architecture. Thus, their tool could validate and maintain all of its metadata. This is because users define metadata in free form. Therefore, Netflix needs to put into place a validation process that can be done before storing the metadata.

As a centralizing tool for multi-source and multi-format data, Netflix’s Metacat has clearly made progress.

The development of this in-house service has adapted to all the tools used by the company, allowing Netflix to become Data Driven.

 

Sources

  • Metacat: Making Big Data Discoverable and Meaningful at Netflix https://netflixtechblog.com/metacat-making-big-data-discoverable-and-meaningful-at-netflix-56fb36a53520
  • La folie Netflix en cinq chiffres https://www.lesechos.fr/tech-medias/medias/la-folie-netflix-en-cinq-chiffres-1132022

 

Learn more about data discovery solutions in our white paper: “Data Discovery through the eyes of Tech Giants”

Discover the various data discovery solutions developed by large Tech companies, some belonging to the famous “Big Five” or “GAFAM”, and how they helped them become data-driven.

data-discovery-mockup-EN-no-shadow
download our white paper
← Previous Next →
← Vorherige Nächste →
← Précédent Suivant →

Zeenea Actian Logo

At Zeenea, we work hard to create a data fluent world by providing our customers with the tools and services that allow enterprises to be data driven.

Zeenea Actian Logo

Chez Zeenea, notre objectif est de créer un monde “data fluent” en proposant à nos clients une plateforme et des services permettant aux entreprises de devenir data-driven.

Zeenea Actian Logo

Das Ziel von Zeenea ist es, unsere Kunden "data-fluent" zu machen, indem wir ihnen eine Plattform und Dienstleistungen bieten, die ihnen datengetriebenes Arbeiten ermöglichen.

Related posts

Articles similaires

Ähnliche Artikel

What is sensitive data discovery?

What is the difference between Data Fabric and Data Mesh?

What are the differences between a Data Analyst and a Business Analyst?

Marquez: the metadata discovery solution at WeWork

Gartner’s top Data & Analytics trends in 2020

Be(come) data fluent

Read the latest trends on big data, data cataloging, data governance and more on Zeenea’s data blog.

Join our community by signing up to our newsletter!

Devenez Data Fluent

Découvrez les dernières tendances en matière de big data, data management, de gouvernance des données et plus encore sur le blog de Zeenea.

Rejoignez notre communauté en vous inscrivant à notre newsletter !

Werden Sie Data Fluent

Entdecken Sie die neuesten Trends rund um die Themen Big Data, Datenmanagement, Data Governance und vieles mehr im Zeenea-Blog.

Melden Sie sich zu unserem Newsletter an und werden Sie Teil unserer Community!

Let's get started

Make data meaningful & discoverable for your teams

Get a free demo
Learn more

Los geht’s!

Geben Sie Ihren Daten einen Sinn

Demo Anfragen

Mehr erfahren >

Zeenea Actian Logo
  • Follow
  • Follow
  • Follow
  • Product
  • Data Discovery Platform
  • Connectivity
  • Knowledge Graph
  • Data Catalog
  • Enterprise Data Marketplace
  • Zeenea Studio
  • Zeenea Explorer
  • Pricing
  • Capabilities
  • Business Glossary
  • Data Compliance
  • Data Discovery
  • Data Governance
  • Data Lineage
  • Data Quality
  • Data Shopping
  • Data Stewardship
  • Metadata Management
  • Use Cases
  • Financial Services
  • Manufacturing
  • Retail
  • Pharmaceuticals
  • Data Managers
  • Data Producers
  • Data Consumers
  • Resources
  • Data library
  • Customer Stories
  • Podcast
  • Blog
  • Product Updates
  • Zeenea Explained
  • Company
  • Our story
  • Trust Center
  • Contact us
  • Partner Network
  • Professional Services
  • Support
  • Data Democracy
  • Data Mesh
Soc 2 Type 2
Iso 27001
Product
  • Data Discovery Platform
  • Connectivity
  • Knowledge Graph
  • Data Catalog
  • Enterprise Data Marketplace
  • Zeenea Studio
  • Zeenea Explorer
  • Pricing
Capabilities
  • Business Glossary
  • Data Compliance
  • Data Discovery
  • Data Governance
  • Data Lineage
  • Data Quality
  • Data Shopping
  • Data Stewardship
  • Metadata Management
Use Cases
  • Financial Services
  • Manufacturing
  • Retail
  • Pharmaceuticals
  • Data Managers
  • Data Producers
  • Data Consumers
Resources
  • Data library
  • Customer Stories
  • Podcast
  • Blog
  • Product Updates
  • Zeenea Explained
Company
  • Our story
  • Trust Center
  • Contact us
  • Partner Network
  • Professional Services
  • Support
  • Data Democracy
  • Data Mesh
Soc 2 Type 2
Iso 27001
© 2025 Zeenea - All Rights Reserved

Privacy policy  -  Legal notice

Zeenea Actian Logo
  • Follow
  • Follow
  • Follow
    • Produkt
    • Data Discovery Platform
    • Konnektivität
    • Knowledge Graph
    • Data Catalog
    • Enterprise Data Marketplace
    • Zeenea Studio
    • Zeenea Explorer
    • Preise
  • Funktionalitäten
  • Business Glossary
  • Data Compliance
  • Data Discovery
  • Data Governance
  • Data Lineage
  • Data Quality
  • Data Shopping
  • Data Stewardship
  • Metadata Management
  • Use Cases
  • Banken & Versicherungen
  • Industrie
  • Retail
  • Pharmaindustrie
  • Data Manager
  • Data Producer
  • Data Consumer
  • Ressourcen
  • Data library
  • Customer Stories
  • Podcast
  • Blog
  • Product Updates
  • Zeenea Explained
  • Unternehmen
  • Unsere Geschichte
  • Trust Center
  • Kontakt
  • Partner Network
  • Professional Services
  • Support
  • Data Democracy
  • Data Mesh
Soc 2 Type 2
Iso 27001
Produkt
  • Data Discovery Platform
  • Konnektivität
  • Knowledge Graph
  • Data Catalog
  • Enterprise Data Marketplace
  • Zeenea Studio
  • Zeenea Explorer
  • Preise
Funktionalitäten
  • Business Glossary
  • Data Compliance
  • Data Discovery
  • Data Governance
  • Data Lineage
  • Data Quality
  • Data Shopping
  • Data Stewardship
  • Metadata Management
Use Cases
  • Banken & Versicherungen
  • Industrie
  • Retail
  • Pharmaindustrie
  • Data Manager
  • Data Producer
  • Data Consumer
Ressourcen
  • Data library
  • Customer Stories
  • Podcast
  • Blog
  • Product Updates
  • Zeenea Explained
Company
  • Unsere Geschichte
  • Trust Center
  • Kontakt
  • Partner Network
  • Professional Services
  • Support
  • Data Democracy
  • Data Mesh
Soc 2 Type 2
Iso 27001
© 2025 Zeenea - All Rights Reserved

Privacy policy  -  Legal notice

Démarrez maintenant

Donnez du sens à votre patrimoine de données

Demandez une démo

En savoir plus

Zeenea Actian Logo
  • Follow
  • Follow
  • Follow
  • Produit
  • Data Discovery Platform
  • Connectivité
  • Knowledge Graph
  • Data Catalog
  • Enterprise Data Marketplace
  • Zeenea Studio
  • Zeenea Explorer
  • Tarifs
  • Capacités
  • Business Glossary
  • Data Compliance
  • Data Discovery
  • Data Governance
  • Data Lineage
  • Data Quality
  • Data Shopping
  • Data Stewardship
  • Metadata Management
  • Cas d'usage
  • Banque & assurance
  • Industrie
  • Retail
  • Industrie pharmaceutique
  • Data Managers
  • Data Producers
  • Data Consumers
  • Ressources
  • Librairie Data
  • Cas Clients
  • Podcast
  • Blog
  • Nouveautés Produit
  • Zeenea Explained
  • Société
  • Notre Histoire
  • Trust Center
  • Contact
  • Partner Network
  • Professional Services
  • Support
  • Data Democracy
  • Data Mesh
Soc 2 Type 2
Iso 27001
Produit
  • Data Discovery Platform
  • Connectivité
  • Knowledge Graph
  • Data Catalog
  • Enterprise Data Marketplace
  • Zeenea Studio
  • Zeenea Explorer
  • Tarifs
Capacités
  • Business Glossary
  • Data Compliance
  • Data Discovery
  • Data Governance
  • Data Lineage
  • Data Quality
  • Data Shopping
  • Data Stewardship
  • Metadata Management
Cas d'usage
  • Banque & assurance
  • Industrie
  • Retail
  • Industrie pharmaceutique
  • Data Managers
  • Data Producers
  • Data Consumers
Ressources
  • Librairie Data
  • Cas Clients
  • Podcast
  • Blog
  • Nouveautés Produit
  • Zeenea Explained
Société
  • Notre Histoire
  • Trust Center
  • Contact
  • Partner Network
  • Professional Services
  • Support
  • Data Democracy
  • Data Mesh
Soc 2 Type 2
Iso 27001
© 2025 Zeenea - Tous droits réservés.

Politique de confidentialité   -  Informations légales