Using Data Responsibly: The Data Market

People and Data

The Data Market

Technological change and evolving business models

Personal data is generated through an individual's actions (such as making a payment using a credit card), through business processes that digitize analog data (such as medical histories), or through consequent machine response (such as call data records). Such data is now increasingly coming from use of the internet, wireless sensors, and the billions of mobile phones around the world. As the world gets more connected, more people are leaving a digital trail, wherever they go and whatever they do.

This data, which has become more voluminous and granular over time, piqued the interest of various organizations that saw the financial value embedded in it. By the early 1990s, personal data such as telephone numbers and email addresses was widely used for marketing. Companies crunched data to predict how likely people would be to buy a product, and began using that knowledge to come up with targeted marketing messages. As more digital data was collected, organizations began to use increasingly powerful computing tools to manipulate and apply that data. Marketing companies built richer consumer profiles to predict future purchases and manufacturing and services companies to design and model new products.

Companies are now using such data to develop services powered by artificial intelligence (AI), and the bigger the data set, the better the AI. These and other innovations have greatly increased the value of data and its potential for being monetized, or bought and sold as a product in its own right. Data continues to gain value as its potential uses increase. Organizations – including businesses, governments, and others – can derive value from data by applying the insights arising from data's analysis to internal cost and revenue optimization, marketing and advertising, intelligence and surveillance, and automation.

The application of personal data for online advertising has skyrocketed, with the internet now surpassing television as the leading advertising channel. At a forecast US$237 billion in 2018, digital ads are expected to grow from 44 percent of global advertising revenues in 2018 to more than 50 percent by 2020. Facebook and Google accounted for 84 percent of digital advertising revenue in 2017 (excluding China). In 2016, Facebook's advertising revenues were US$27 billion (up more than 1,300 percent since 2010), accounting for more than 97 percent of its total revenues. Google's advertising revenues – US$79 billion (growing 180 percent since 2010) – accounted for 88 percent of its total revenue. Combined, the advertising revenues of these two online platforms were on par with the gross domestic product (GDP) of Morocco.

Data market actors

Table 4.1, complemented by figure 4.1, identifies the main types of actors operating in the markets built around personal data and the relationships among them. Using these categories of participants, it is possible to illustrate a simple model of the data market, as shown in figure 4.2. People produce personal data, the "raw material," which they "sell" (traditionally at zero price) to other market players who then use that data to derive various benefits. Individuals also provide "free labor" on many of these online platforms – by creating content such as posts and reviews and by uploading photos and videos – that data collectors can "scrape" – extracting data from online sources – to infer personal traits and preferences. This personal data, along with the data that individuals generate from their activities, and that might be inferred from their data (such as their political or culinary preferences), is the main source of data for these organizations.

People do not always directly derive value or benefits from this data (until recently, as discussed later). But people have been deriving indirect benefits from their sale of data in services or products that data-using organizations provide. These benefits are discussed in the next section.

On the other side of the market, the "buyers" are the various organizations that collect and use the data. In some cases, these organizations depend on the data as a necessary input for their operating model, as do online social media, search engine websites, and various information and news sites. Using advertising as a source of revenue, they typically compensate people – producers of the data – with free or highly subsidized access to their services. Hence, data has financial value to those organizations, either immediately when it is sold to other organizations (such as marketing companies) or through the services that an organization offers others (such as a search engine selling advertisements tied to search terms).

In other cases, data is an input into an operating model. Health systems or government services are one example. Their processes are traditionally standardized and have relied in the past on highly abstracted models of user preferences. Data is thus an input into these systems and does not have an immediate financial value but has informational value. This implies that such services are often performed for a fee, whether paid immediately or separately (such as through taxes). However, these interactions do generate significant amounts of data, and thought has increasingly gone into creating more specialized services and choices based on that data (such as in e-government services) or improving the quality of those abstract models to design improved services (such as better medicines or treatments). Businesses can unlock financial value by generating more effective insights from data to launch a new product, reduce waste or costs, enable better decisions, and boost innovations.

One may say that the value of data depends on how and for what the data is used and how well it is prepared (cleaned and organized). In either case, however, data can find its way to other parties. The regulation of those data flows is the responsibility of data protectors, which can include rules pertaining to privacy and sharing of specific types of data (such as health or financial data), as well as rules about electronic transactions. Data could also be held as an "asset" by those who collect it directly or via others, and new rule systems have emerged around concepts such as the "right to be forgotten" by such entities. Hence, data regulators can also protect people by defining and enforcing rules around the use of their data.

In this construction of the generic data market, organizations have an opportunity to capture the value from the data people produce, and they can determine how much of this value returns to those people. As noted, this data has significant financial and political value since it contains information on behavior and preferences. Where people do provide such data voluntarily, it is because they expect to gain some of those benefits – whether it is access to online services or better medical care, or merely the chance to win a competition.

Questions then emerge from the perspective of data producers: are people aware of what data they are providing and under what conditions (or at what cost)? Do they understand the value (monetary and otherwise) of the benefits they receive? Are they able to assign value to the data they provide in a manner that explicitly differentiates between their perception of value and the actual value of the benefits that they already have or could receive? And what might ensure that maximum benefits are delivered to those who produce the data? The following section unpacks the benefits and costs that accrue to individuals as they participate in these data markets as data producers.

Table 4.1 Typology of actors in the personal data market

Actor	Description	Examples
Data producers	Personal data is generated by individuals as they fill in forms (either online or offline where the latter is digitized), through sensors (such as fitness trackers and home monitors), through using applications and services on mobile phones and the internet, through using credit cards, and from being captured by security cameras and other sensors.	People generate data anonymously through sensors, security cameras, and the like. Individuals generate data using mobile phones, credit cards, internet search, fitness trackers, and so on. In some cases, civil society organizations can help produce data, especially among poorer communities.
Data collectors	Companies and governments collect data in different ways. Businesses collect personal information from their individual customers. Similarly, governments collect data from citizens for a wide range of purposes.	A bank asking for financial and personal information. An internet service provider recording web sites a user has visited. An information services company soliciting personal information for an individual to open an email or social media account. Citizens providing birth, marriage, and death details to governments for civil registries.
Data aggregators (brokers)	Obtain personal data from public and private parties to combine for resale to businesses. Some add additional value through analytics.	Data mining companies, such as Acxiom or notoriously Cambridge Analytica, that collect information from sources such as public records and consumer surveys to provide insights for clients such as banks, car companies, and retailers.
Data users	Businesses who purchase data aggregators' products. Users of the analyzed data can also play the role of data collectors and aggregators on the market.	Businesses and governments for law enforcement. Alphabet (Google's parent company), Amazon, Facebook, and others are data collectors, data aggregators, and users of the analyzed data. Advertisers are major users of personal data to better target online ads.
Open data providers	Prepare (for example, anonymize) and make relevant personal data open to use and redistribute.	National governments, affiliated agencies, or organizations (such as civil society groups).
Data protectors	Address privacy and control of personal data. Protect the interests of individuals that have generated that data or its derivatives.	National data protection authorities through privacy and computer crime legislation. Companies offer products that provide data security, stronger data protection, or information about personal data that is collected. Many tips are available for protecting personal data. However, the decision to provide personal data in exchange for use of some services is still up to the user.

Figure 4.2 The personal data market

Course Introduction

Course Syllabus

Unit 1: Introduction to Data-Driven Decision-Making

1.1: What is Data-Driven Decision-Making?

Data-Driven Decision-Making

Data-Driven Decisions

Project Lifecycle

More on Data-Driven Decisions

1.1.1: Data-Driven Information

Making Data-Driven Decisions

Data-Driven Decision Support

1.1.2: Data-Driven Learning

Process Indicators

Goal-Setting for Achievement

1.1.3: Data-Driven Science

Theory Driven or Process Driven Predictions?

Learn Data Science

1.2: Using Data-Driven Decision-Making in the Real World

Data-Driven Development

Using Data Responsibly

How Data Informs Business

Decision-Making in Management

The Advantage of Digital Decision-Making

The Effects of using Business Intelligence Systems

Unit 1 Discussion

Unit 1 Study Resources

Unit 1 Review Video

Unit 1 Review Slides

Study Guide: Unit 1

Unit 1 Assessment

Unit 1 Assessment

Unit 2: Transforming to a Data-Driven Decision-Making Enterprise

2.1: DDDM Implementation Continuums

Developing an Analytical Mindset

Data-Driven Decision-Making Change Model

2.1.1: Data/Technology

The Data and Technology Continuum

2.1.2: Organizational/People

The Organization Continuum

2.1.3: Process/Workflow

The Process Continuum

2.2: Critical Success Factors

Critical Success Factors

Unit 2 Discussion

Unit 2 Study Resources

Unit 2 Review Video

Unit 2 Review Slides

Study Guide: Unit 2

Unit 2 Assessment

Unit 2 Assessment

Unit 3: The Role of Leadership

3.1: The Role of Leadership

Leadership Needs in the 21st Century

Becoming a Data Driven PM

Embracing Big Data and Data Analytics

3.2: Leadership versus Other Critical Success Factors

Leadership and Innovation

3.3: What Is Effective Leadership?

Effecitve Leaders

Effective vs. Poor Leadership

Leadership Development Practices

Unit 3 Discussion

Unit 3 Study Resources

Unit 3 Review Video

Unit 3 Review Slides

Study Guide: Unit 3

Unit 3 Assessment

Unit 3 Assessment

Unit 4: Types of Data

4.1: Quanitative Data

Qualitative versus Quantitative Data

Characteristics of Qualitative and Quantitative Data

4.2: Qualitative Data

Qualitative and Quantitative Data

More on Qualitative and Quantitative Data

Quantitative versus Qualitative Data Summary

4.3: Big Data

What is Big Data?

Big Data in New Product Development

4.4: Types of Analytics