

Artificial intelligence, machine learning, data science: are these terms interchangeable?

21 December 2017, updated on 5 May 2023

Many writers talk about AI, machine learning and data science as if these terms were broadly interchangeable. What’s going on exactly?

More and more articles are appearing on artificial intelligence (AI), machine learning and deep learning, often with no distinction drawn between these fields and data science. To see whether that is justified, we need to define each term.

Let us start by describing artificial intelligence as the implementation of intelligent agents. According to Peter Norvig and Stuart Russell, an intelligent agent is an autonomous entity capable of perceiving its environment via sensors and of acting on it using actuators, and capable of learning, analysing, using knowledge, and taking decisions.
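To make this definition more concrete, here is a minimal sketch of such an agent in Python. The thermostat scenario, the class name `ThermostatAgent`, and every method and parameter name are assumptions chosen for illustration; they do not come from Norvig and Russell. The sketch only shows how perceiving, deciding, acting and learning fit together in one loop.

```python
from dataclasses import dataclass, field


@dataclass
class ThermostatAgent:
    """Toy agent (hypothetical): perceives a temperature, decides, and learns a target."""
    target: float = 20.0          # current belief about the preferred temperature
    learning_rate: float = 0.5    # how strongly feedback shifts the target
    history: list = field(default_factory=list)  # knowledge: past percepts

    def perceive(self, sensor_reading: float) -> float:
        # Sensor step: record what the environment looks like right now.
        self.history.append(sensor_reading)
        return sensor_reading

    def decide(self, temperature: float) -> str:
        # Decision step: compare the percept with the current target.
        if temperature < self.target - 0.5:
            return "heat"
        if temperature > self.target + 0.5:
            return "cool"
        return "idle"

    def learn(self, preferred: float) -> None:
        # Learning step: move the target part of the way towards the feedback.
        self.target += self.learning_rate * (preferred - self.target)


def act(action: str, temperature: float) -> float:
    # Actuator step: the chosen action changes the environment by one degree.
    return temperature + {"heat": 1.0, "cool": -1.0, "idle": 0.0}[action]


agent = ThermostatAgent()
room = 17.0
for _ in range(5):
    action = agent.decide(agent.perceive(room))
    room = act(action, room)
```

After the loop the room has been brought up to the initial target. Calling `agent.learn(22.0)` then shifts the target, so the same percept leads to a different decision: this is the "learning" part that a pure rules engine lacks.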

Historically, the first AIs were not actually “learning”. At best they used heuristic functions combined with rules engines. Today, the evolution of technology means that we can no longer conceive of an AI which is not “learning”. In particular this is attributable to recent progress in deep learning algorithms.

Indeed, “teaching” a machine is literally what “machine learning” is all about. It relies on (mainly statistical) algorithms that enable a machine to “learn” from a number of correct responses known beforehand (the training data or learning base). Without this database, which is often very large, learning is not possible.
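The idea of learning from correct responses known beforehand can be sketched in a few lines of Python. The data set, the labels and the function names `fit` and `predict` are hypothetical, and the nearest-average rule used here is only one very simple statistical approach among many.

```python
from statistics import mean

# Hypothetical training data: (hours of product use per week, answer known beforehand).
training_data = [
    (1.0, "churn"), (2.0, "churn"), (3.0, "churn"),
    (8.0, "stay"), (9.0, "stay"), (10.0, "stay"),
]


def fit(examples):
    """Learn one average value (centroid) per label from the known answers."""
    by_label = {}
    for value, label in examples:
        by_label.setdefault(label, []).append(value)
    return {label: mean(values) for label, values in by_label.items()}


def predict(centroids, value):
    """Assign the label whose learned centroid is closest to the new value."""
    return min(centroids, key=lambda label: abs(centroids[label] - value))


model = fit(training_data)
print(predict(model, 4.0))
print(predict(model, 7.5))
```

Note that `fit([])` returns an empty model, and `predict` then has nothing to choose from: without training data, nothing has been learned.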

Machine Learning, a key discipline in Artificial Intelligence

It is immediately clear from these definitions that machine learning is a major discipline in contemporary artificial intelligence. The algorithms which make learning possible have mainly been developed thanks to another, rather older, discipline: statistics.

The simpler the algorithm, the closer it is to the underlying statistics; the more complex it is, the more it calls on combinations of elementary statistical approaches, which thus form the building blocks of contemporary machine learning (as explained very well by the Russian data scientist and mathematician Vladimir Vapnik). In passing, we would note that the more complex the algorithm, the more precise it can be, but also the larger the training base it needs in order to operate.

As a large part of the success of statistics and machine learning depends on the correct preparation and transformation of data, we very quickly see the emergence of a discipline which covers at least data preparation, statistics and machine learning, and which we can safely call “data science”.

The global discipline which allows the development of all sorts of algorithms for AI is thus known as data science, and its practitioners as data scientists or data engineers.

It seems perfectly clear that data science and artificial intelligence have a great deal in common.

To what extent can we think of the two disciplines as the same?

The first objection that can be raised is that it is possible to work in data science without doing artificial intelligence. For example, it would be enough to conduct market research using statistical sampling from a population. Such a study can perfectly reasonably be thought of as data science, without having anything to do with artificial intelligence.
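As an illustration of such a study, the following Python sketch estimates the share of customers who like a product from a random sample. The population, its 30% true share, the sample size and the function name `estimate_share` are all hypothetical, and the margin of error uses the usual normal approximation, which is an assumption that holds reasonably well for samples of this size.

```python
import math
import random


def estimate_share(sample):
    """Return the sample proportion and its approximate 95% margin of error."""
    n = len(sample)
    p_hat = sum(sample) / n
    # Normal approximation to the binomial; 1.96 is the 95% z-value.
    margin = 1.96 * math.sqrt(p_hat * (1 - p_hat) / n)
    return p_hat, margin


# Hypothetical population of 10,000 customers, 30% of whom like the product (1 = yes).
population = [1] * 3000 + [0] * 7000

random.seed(0)  # fixed seed so the survey is reproducible
survey = random.sample(population, 400)  # simple random sample without replacement

share, margin = estimate_share(survey)
print(f"Estimated share: {share:.1%} +/- {margin:.1%}")
```

Nothing here learns, perceives or acts: the study answers one business question once, which is data science without artificial intelligence.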

Indeed, there is an entire explanatory and predictive facet of data science which aims to provide “one shot” responses to business questions but without any desire at all to automate the answer.

This leads us to a first conclusion which is that AI does not include anywhere near all the activities of data science.

So does AI fall wholly within data science?

The aspects of collecting and returning information are quite clearly an integral part of data science. Indeed, one of the major challenges of data science lies in the ability to return information properly and to provide a good explanation of the knowledge acquired from algorithms to the services using it.

If we consider that perceiving one’s environment using sensors forms part of the information collection process, and that enabling an agent to act on it directly using actuators forms part of returning this information or knowledge, all that is left for us is to examine the “intelligence” element of AI to find out whether we can include this activity in data science.

This “intelligent” element is defined, as we have seen, as the capacity of an intelligent agent to learn, analyse and use knowledge, and to take decisions. We have called this activity “machine learning” and accepted that it is an integral (and even major) element of data science.

From this we can conclude unequivocally that AI must logically form part of the broader discipline that constitutes data science, the reverse being false since data science also includes data preparation, statistics and all forms of study performed using all or part of these methods.

Artificial intelligence is the most complex discipline of data science

This leads us to legitimately define data science as the conjunction of four hybrid disciplines:

data preparation
statistics
machine learning
artificial intelligence

We can thus see that these terms are not in any way interchangeable. Practitioners working in one or more of these four disciplines are all data scientists or data engineers.

These four disciplines are, to reiterate, interwoven and interdependent: these days, without machine learning it is impossible to have artificial intelligence, without statistics we cannot have machine learning, and without data transformation, statistical modelling cannot work.
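This chain of dependencies can be sketched as a small Python pipeline in which each stage consumes the output of the previous one. The sensor log, the labels, the vent scenario and all function names are hypothetical; the "model" is deliberately reduced to a single threshold so that the four stages stay visible.

```python
from statistics import mean

# Hypothetical raw sensor log with the usual defects: stray spaces, missing values.
raw_readings = ["12.5", " 13.0", "n/a", "30.0", "", "29.5"]
raw_labels = ["normal", "normal", None, "alert", None, "alert"]


def prepare(values, labels):
    """Data preparation: convert text to numbers and drop unusable rows."""
    rows = []
    for value, label in zip(values, labels):
        try:
            rows.append((float(value), label))
        except ValueError:
            continue  # skip rows that cannot be parsed
    return rows


def summarise(rows):
    """Statistics: compute the mean reading for each label."""
    labels = {label for _, label in rows}
    return {lab: mean(v for v, l in rows if l == lab) for lab in labels}


def train(class_means):
    """Machine learning (toy): learn a threshold halfway between the two means."""
    return (class_means["normal"] + class_means["alert"]) / 2


def decide(threshold, reading):
    """Artificial intelligence (toy): turn a new percept into an action."""
    return "open_vent" if reading > threshold else "do_nothing"


rows = prepare(raw_readings, raw_labels)
threshold = train(summarise(rows))
print(decide(threshold, 25.0))
```

Removing any earlier stage breaks the later ones: without `prepare`, `summarise` would receive text it cannot average, and without `summarise`, `train` has nothing from which to derive a threshold.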

Among the disciplines of data science, AI is the most complex to implement, since it necessarily calls on the other three, from data preparation to machine learning.

However, it is not possible, without seriously misusing the language, to replace the term data science with AI, which is only one of its uses, or perhaps its culmination from a knowledge-based perspective.

Business & Decision

Data Scientist and Director of the Data Science & Customer Intelligence offerings at Business & Decision France. Also teaches Data Mining & Statistics applied to Marketing at EPF School and ESCP-Europe.

