Data science is a multidisciplinary field that employs scientific methods, procedures, algorithms, and systems to extract information and insights from structured and unstructured data, as well as to apply that knowledge and actionable insights across a wide range of application areas. Data mining, machine learning, and big data are all connected to data science.

Data science is a “concept that unifies statistics, data analysis, informatics, and related approaches” in order to use data to “understand and analyse actual occurrences.” Within the framework of mathematics, statistics, computer science, information science, and domain knowledge, it employs techniques and theories from a variety of domains. Jim Gray, the winner of the Turing prize, envisioned data science as a “fourth paradigm” of research (empirical, theoretical, computational, and now data-driven), claiming that “everything about science is data-driven.”