Kazan Federal University Digital Repository

Dark data: why what you don't know matters / David J. Hand.

dc.contributor.author Hand D. J. (David J.)
dc.date.accessioned 2024-01-29T21:42:03Z
dc.date.available 2024-01-29T21:42:03Z
dc.date.issued 2020
dc.identifier.citation Hand D. J. Dark data: why what you don't know matters - 1 online resource - URL: https://libweb.kpfu.ru/ebsco/pdf/2218633.pdf
dc.identifier.isbn 0691198853
dc.identifier.isbn 9780691198859
dc.identifier.uri https://dspace.kpfu.ru/xmlui/handle/net/179783
dc.description Includes bibliographical references and index.
dc.description.abstract "Data describe and represent the world. However, no matter how big they may be, data sets don't - indeed cannot - capture everything. Data are measurements - and, as such, they represent only what has been measured. They don't necessarily capture all the information that is relevant to the questions we may want to ask. If we do not take into account what may be missing/unknown in the data we have, we may find ourselves unwittingly asking questions that our data cannot actually address, come to mistaken conclusions, and make disastrous decisions. In this book, David Hand looks at the ubiquitous phenomenon of "missing data." He calls this "dark data" (making a comparison to "dark matter" - i.e., matter in the universe that we know is there, but which is invisible to direct measurement). He reveals how we can detect when data is missing, the types of settings in which missing data are likely to be found, and what to do about it. It can arise for many reasons, which themselves may not be obvious - for example, asymmetric information in wars; time delays in financial trading; dropouts in clinical trials; deliberate selection to enhance apparent performance in hospitals, policing, and schools; etc. What becomes clear is that measuring and collecting more and more data (big data) will not necessarily lead us to better understanding or to better decisions. We need to be vigilant to what is missing or unknown in our data, so that we can try to control for it. How do we do that? We can be alert to the causes of dark data, design better data-collection strategies that sidestep some of these causes - and, we can ask better questions of our data, which will lead us to deeper insights and better decisions"--
dc.description.tableofcontents Cover; Contents; Preface; Part I: Dark Data: Their Origins and Consequences; Chapter 1: Dark Data: What We Don't See Shapes Our World; The Ghost of Data; So You Think You Have All the Data?; Nothing Happened, So We Ignored It; The Power of Dark Data; All around Us; Chapter 2: Discovering Dark Data: What We Collect and What We Don't; Dark Data on All Sides; Data Exhaust, Selection, and Self-Selection; From the Few to the Many; Experimental Data; Beware Human Frailties; Chapter 3: Definitions and Dark Data: What Do You Want to Know?; Different Definitions and Measuring the Wrong Thing
dc.description.tableofcontents You Can't Measure Everything; Screening; Selection on the Basis of Past Performance; Chapter 4: Unintentional Dark Data: Saying One Thing, Doing Another; The Big Picture; Summarizing; Human Error; Instrument Limitations; Linking Data Sets; Chapter 5: Strategic Dark Data: Gaming, Feedback, and Information Asymmetry; Gaming; Feedback; Information Asymmetry; Adverse Selection and Algorithms; Chapter 6: Intentional Dark Data: Fraud and Deception; Fraud; Identity Theft and Internet Fraud; Personal Financial Fraud; Financial Market Fraud and Insider Trading; Insurance Fraud; And More
dc.description.tableofcontents Chapter 7: Science and Dark Data: The Nature of Discovery; The Nature of Science; If Only I'd Known That; Tripping over Dark Data; Dark Data and the Big Picture; Hiding the Facts; Retraction; Provenance and Trustworthiness: Who Told You That?; Part II: Illuminating and Using Dark Data; Chapter 8: Dealing with Dark Data: Shining a Light; Hope!; Linking Observed and Missing Data; Identifying the Missing Data Mechanism; Working with the Data We Have; Going Beyond the Data: What If You Die First?; Going Beyond the Data: Imputation; Iteration; Wrong Number!
dc.description.tableofcontents Chapter 9: Benefiting from Dark Data: Reframing the Question; Hiding Data; Hiding Data from Ourselves: Randomized Controlled Trials; What Might Have Been; Replicated Data; Imaginary Data: The Bayesian Prior; Privacy and Confidentiality Preservation; Collecting Data in the Dark; Chapter 10: Classifying Dark Data: A Route through the Maze; A Taxonomy of Dark Data; Illumination; Notes; Index
dc.language English
dc.language.iso en
dc.subject.other Missing observations (Statistics)
dc.subject.other Big data.
dc.subject.other COMPUTERS / Database Management / Data Mining
dc.subject.other Electronic books.
dc.title Dark data: why what you don't know matters / David J. Hand.
dc.type Book
dc.description.pages 1 online resource
dc.collection Electronic library systems
dc.source.id EN05CEBSCO05C365