What is the difference between a data steward and a data owner? This question comes up over and over again!
There are many different definitions associated with data management and data governance on the internet. Moreover, depending on the company, their definitions and responsibilities can vary significantly. To try and clarify the situation, we’ve written this article to shed light on these two profiles and establish a potential complementarity.
Above all, we firmly believe that there is no idyllic or standard framework. These definitions are specific to each company because of their organization, culture, and their “legacy”.
Data owners and data stewards: two roles with different maturities
The recent appointment of CDOs was largely driven by the digital transformations undertaken in recent years: mastering the data life cycle from its collection to its value creation. To try to achieve this, a simple – yet complex – objective has emerged: first and foremost, to know the company’s information assets, which are all too often siloed.
Thus, the first step for many CDOs was to reference these assets. Their mission was to document them from a business perspective as well as the processes that have transformed them, and the technical resources to exploit them.
This founding principle of data governance was also evoked by Christina Poirson, CDO of Société Générale during a roundtable discussion at Big Data Paris 2020. She explained the importance of knowing your data environment and the associated risks to ultimately create value. During her presentation, Christina Poirson developed the role of the Data Owner and the challenge of sharing data knowledge. Part of the business roles, they are responsible for defining their datasets as well as their uses and their quality level, without questioning the Data Owner:
“The data in our company belongs either to the customer or to the whole company, but not to a particular BU or department. We manage to create value from the moment the data is shared”.
It is evident that the role of “Data Owner” has been present in organizations longer than the “Data Steward” has. They are stakeholders in the collection, accessibility and quality of datasets. We qualify a Data Owner as being the person in charge of the final data. For example, a marketing manager can undertake this role in the management of customer data. They will thus have the responsibility and duty to control its collection, protection and uses.
More recently, the democratization of data stewards has led to the creation of dedicated positions in organizations. Unlike a Data Owner and manager, the Data Steward is more widely involved in a challenge that has been regaining popularity for some time now: data governance.
In our articles, “Who are data stewards” and “The Data Steward’s multiple facets“, we go further into explaining about this profile, who are involved in the referencing and documenting phases of enterprise assets (we are talking about data of course!) to simplify their comprehension and use.
Data steward and data owners: two complementary roles?
In reality, companies do not always have the means to open new positions for Data Stewards. In an ideal organization, the complementarity of these profiles could tend towards :
A data owner is responsible for the data within their perimeter in terms of its collection, protection and quality. The data steward would then be responsible for referencing and aggregating the information, definitions and any other business needs to simplify the discovery and understanding of these assets.
Let’s take the example of the level of quality of a dataset. If a data quality problem occurs, you would expect the Data Steward to point out the problems encountered by its customers to the Data Owner, who is then responsible for investigating and offering corrective measures.
To illustrate this complementarity, Chafika Chettaoui, CDO at Suez – also present at the Big Data Paris 2020 roundtable – confirms that they added another role in their organization: the Data Steward. According to her and Suez, the Data Steward is the person who makes sure that the data flows work. She explains:
“The Data Steward is the person who will lead the so-called Data Producers (the people who collect the data in the systems), make sure they are well trained and understand the quality and context of the data to create their reporting and analysis dashboards. In short, it’s a business profile, but with real data valence and an understanding of data and its value”.
To conclude, there are two notions regarding the differentiation of the two roles: the Data Owner is “accountable for data” while the Data Steward is “responsible for” the day-to-day data activity.