Normal view MARC view ISBD view

Dark data : why what you dont know matters

By: Hand, David J.
Publisher: Princeton : Princeton University Press, 2022Description: xii, 330 p. ; ill., 22 cm.ISBN: 9780691234465.Subject(s): Big data | Missing observations statistics | Confirmation bias | Credit scoring | Identity theft | Information Asymmetry | Insider trading | Missing values | Nuclear fusionDDC classification: 519.5 Summary: A practical guide to making good decisions in a world of missing dataIn the era of big data, it is easy to imagine that we have all the information we need to make good decisions. But in fact the data we have are never complete, and may be only the tip of the iceberg. Just as much of the universe is composed of dark matter, invisible to us but nonetheless present, the universe of information is full of dark data that we overlook at our peril. In Dark Data, data expert David Hand takes us on a fascinating and enlightening journey into the world of the data we don't see.Dark Data explores the many ways in which we can be blind to missing data and how that can lead us to conclusions and actions that are mistaken, dangerous, or even disastrous. Examining a wealth of real-life examples, from the Challenger shuttle explosion to complex financial frauds, Hand gives us a practical taxonomy of the types of dark data that exist and the situations in which they can arise, so that we can learn to recognize and control for them. In doing so, he teaches us not only to be alert to the problems presented by the things we don’t know, but also shows how dark data can be used to our advantage, leading to greater understanding and better decisions.Today, we all make decisions using data. Dark Data shows us all how to reduce the risk of making bad ones. "Data describe and represent the world. However, no matter how big they may be, data sets don't - indeed cannot - capture everything. Data are measurements - and, as such, they represent only what has been measured. They don't necessarily capture all the information that is relevant to the questions we may want to ask. If we do not take into account what may be missing/unknown in the data we have, we may find ourselves unwittingly asking questions that our data cannot actually address, come to mistaken conclusions, and make disastrous decisions. In this book, David Hand looks at the ubiquitous phenomenon of "missing data." He calls this "dark data" (making a comparison to "dark matter" - i.e., matter in the universe that we know is there, but which is invisible to direct measurement). He reveals how we can detect when data is missing, the types of settings in which missing data are likely to be found, and what to do about it. It can arise for many reasons, which themselves may not be obvious - for example, asymmetric information in wars; time delays in financial trading; dropouts in clinical trials; deliberate selection to enhance apparent performance in hospitals, policing, and schools; etc. What becomes clear is that measuring and collecting more and more data (big data) will not necessarily lead us to better understanding or to better decisions. We need to be vigilant to what is missing or unknown in our data, so that we can try to control for it. How do we do that? We can be alert to the causes of dark data, design better data-collection strategies that sidestep some of these causes - and, we can ask better questions of our data, which will lead us to deeper insights and better decisions.
Tags from this library: No tags from this library for this title. Log in to add tags.

Includes bibliographical references and index.

A practical guide to making good decisions in a world of missing dataIn the era of big data, it is easy to imagine that we have all the information we need to make good decisions. But in fact the data we have are never complete, and may be only the tip of the iceberg. Just as much of the universe is composed of dark matter, invisible to us but nonetheless present, the universe of information is full of dark data that we overlook at our peril. In Dark Data, data expert David Hand takes us on a fascinating and enlightening journey into the world of the data we don't see.Dark Data explores the many ways in which we can be blind to missing data and how that can lead us to conclusions and actions that are mistaken, dangerous, or even disastrous. Examining a wealth of real-life examples, from the Challenger shuttle explosion to complex financial frauds, Hand gives us a practical taxonomy of the types of dark data that exist and the situations in which they can arise, so that we can learn to recognize and control for them. In doing so, he teaches us not only to be alert to the problems presented by the things we don’t know, but also shows how dark data can be used to our advantage, leading to greater understanding and better decisions.Today, we all make decisions using data. Dark Data shows us all how to reduce the risk of making bad ones. "Data describe and represent the world. However, no matter how big they may be, data sets don't - indeed cannot - capture everything. Data are measurements - and, as such, they represent only what has been measured. They don't necessarily capture all the information that is relevant to the questions we may want to ask. If we do not take into account what may be missing/unknown in the data we have, we may find ourselves unwittingly asking questions that our data cannot actually address, come to mistaken conclusions, and make disastrous decisions. In this book, David Hand looks at the ubiquitous phenomenon of "missing data." He calls this "dark data" (making a comparison to "dark matter" - i.e., matter in the universe that we know is there, but which is invisible to direct measurement). He reveals how we can detect when data is missing, the types of settings in which missing data are likely to be found, and what to do about it. It can arise for many reasons, which themselves may not be obvious - for example, asymmetric information in wars; time delays in financial trading; dropouts in clinical trials; deliberate selection to enhance apparent performance in hospitals, policing, and schools; etc. What becomes clear is that measuring and collecting more and more data (big data) will not necessarily lead us to better understanding or to better decisions. We need to be vigilant to what is missing or unknown in our data, so that we can try to control for it. How do we do that? We can be alert to the causes of dark data, design better data-collection strategies that sidestep some of these causes - and, we can ask better questions of our data, which will lead us to deeper insights and better decisions.

There are no comments for this item.

Log in to your account to post a comment.

Powered by Koha