Big Data And Data Sharing: Ethical Issues



Author: UK Data Service
Updated: February 2017
Version: 1

We are happy for our materials to be used and copied, but request that users:

- link to our original materials instead of re-mounting our materials on your website
- cite this as an original source as follows: Libby Bishop (2017). Big data and data sharing: Ethical issues. UK Data Service, UK Data Archive.

Contents

What are research ethics?
What makes research ethics for social research with big data different?
What are the principal ethical issues in social research with big data?
  Privacy
  Informed consent
  De-identification
  Inequality – digital divide
  Research integrity
Emerging issues
Example – Informed consent to publish Tweets
  Background to the research question
  Issues, constraints and decisions
  Outcomes
Legal disclaimer
Resources
  Anonymisation
  Genre specific
References
About Libby Bishop

This is a brief introduction to ethical issues arising in social research with big data. It is not comprehensive; instead, it emphasises the ethical issues that are most germane to data curation and data sharing.

The ethical challenges raised by research that uses new and novel data can seem daunting; the risks are both real and substantial. However, the opportunities are also great, and with a growing collection of guidance and examples, it is possible to pursue many such opportunities in an ethical manner.

What are research ethics?

Ethics refers to standards of right and wrong that prescribe what we ought to do, typically guided by duties, rights, costs and benefits. In research ethics, these relationships are among researchers, participants and the public. Many guides exist, such as the ESRC’s 2016 Framework for Research Ethics. There are also more general codes, such as the 1978 Belmont Report, which identifies the core principles of respect for persons, beneficence and justice in human subjects research, and the still more general European Convention on Human Rights (ECHR), ratified in 1953.

What makes research ethics for social research with big data different?

An OECD report (2013) on new data identified the forms of big data most commonly used for social research as administrative data, records of commercial transactions, social media and other internet data, geospatial data and image data.

These data differ from traditional research data (e.g., surveys) in that they have not been generated specifically by researchers for research purposes. As a result, the usual ethical protections applied at several points in the research data life cycle have not taken place:

- The data collection was not subject to any formal ethical review process, e.g., research ethics committees or institutional review boards.
- Protections applied when data are collected (e.g., informed consent) and processed (e.g., de-identification) will not have been implemented.
- Using the data for research may differ substantially from the original purpose for which they were collected (e.g., data gathered to improve direct health care and later used for research), and this was not anticipated when the data were generated.
- Data are less often held as discrete collections; indeed, the value of big data lies in the capacity to accumulate, pool and link many data sources.

The relationship between data curators and data producers is often indirect and variable. A recent OECD (2016) report argues that this relationship is often weaker or non-existent with big data, limiting the capacity of repositories to carry out key activities to safely manage personal or sensitive data.

What are the principal ethical issues in social research with big data?

Privacy

Privacy is recognised as a human right under numerous declarations and treaties. In the UK, the ECHR has been implemented through the Human Rights Act 1998, with protection of personal data provided by the Data Protection Act 1998. The privacy of research subjects can be protected by a combination of approaches: limiting what data are collected, altering data to be less disclosive, and regulating access to data. But big data can challenge these existing procedures:

- The definitions of “private” and “privacy” are ambiguous or contested in many big data research contexts.
- Are social media spaces public or private? Some, such as Twitter, seem more public by default, whereas Facebook is more private.
- Many users believe, and act as if, the setting is more private than it is, at least as specified in the user agreements of many social media platforms. Is compliance with formal agreements sufficient in such cases?
- Some approaches to ethical research depend on being able to unambiguously distinguish public and private users or usages. However, data costs and analytical complexity are driving closer collaborations between public and private organisations, blurring these distinctions.
- There is debate as to whether data science should be classified as human subjects research at all, and hence exempted from concerns, such as privacy, that are grounded in human rights.

Informed consent

The ethical issue of consent arises because, in big data analytics, very little may be known about the intended future uses of data when they are collected. With such uncertainty, neither benefits nor risks can be meaningfully understood. Thus, it is unlikely that one-off consent obtained at the point of data collection would meet a strict definition of “informed consent”. For example, procedures exist for “broad” and “generic” consent to share genomic data, but these are criticised on the grounds that such consent cannot be meaningful in light of the risks of unknown future genetic technologies. O’Neill (2002) noted that this limitation of consent is not new, but the use of data for such different purposes, and the scale of possible harms, make it more problematic with big data.

Even if such conceptual issues are minimised (or ignored), practical challenges remain:

- Obtaining informed consent may be impossible or prohibitively costly due to factors such as scale, or the inability to contact data subjects privately.
- The validity of consent obtained by agreement to terms and conditions is debatable, especially when agreement is mandatory to access a service.

De-identification

Unfortunately, there are no robust, internationally agreed definitions of the terms de-identification, anonymisation and pseudonymisation. Generally, a dataset is said to be de-identified if elements that might immediately identify a person or organisation have been removed or masked (a minimal sketch of this appears after the list below). In part because a number of relevant laws, such as data protection legislation, prescribe different treatments for identifiable and non-identifiable data, much has rested on being able to make this distinction. Despite this legal situation, recognition is growing that such distinctions are becoming less tenable:

- Identifiability is increasingly seen as a continuum, not a binary.
- Disclosure risks increase with dimensionality (i.e., the number of variables), linkage of multiple data sources, and the power of data analytics.
- Disclosure risks can be mitigated, but not eliminated.
- De-identification remains a vital tool for lowering disclosure risk, as part of a broader approach to ensuring safe use of data.
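As a minimal illustration of masking and pseudonymisation, the Python sketch below removes direct identifiers and replaces names with salted one-way hashes, keeping quasi-identifiers such as postcode district and year of birth. The column names, values and salt are assumptions made for this example only; real projects would follow a formal approach such as the anonymisation framework listed under Resources below. The second step shows why this is only a partial protection: linking the masked records to another dataset on the remaining quasi-identifiers can still re-identify people.

```python
import hashlib

import pandas as pd

# Illustrative records: all column names and values are assumptions for this sketch.
survey = pd.DataFrame({
    "name": ["Ann Smith", "Bola Ade", "Chen Wei"],
    "email": ["ann@example.org", "bola@example.org", "chen@example.org"],
    "postcode_district": ["CO4", "CF10", "M13"],
    "birth_year": [1962, 1990, 1985],
    "response": ["agree", "disagree", "agree"],
})

SALT = "project-specific-secret"  # assumed value; kept separately from any shared file


def pseudonymise(value: str) -> str:
    """Replace a direct identifier with a salted one-way hash (pseudonymisation)."""
    return hashlib.sha256((SALT + value).encode("utf-8")).hexdigest()[:12]


# Step 1: mask the direct identifiers (name, email), keeping a pseudonymous key.
deidentified = (
    survey.assign(person_id=survey["name"].map(pseudonymise))
    .drop(columns=["name", "email"])
)

# Step 2: the residual risk. A hypothetical external dataset that shares the same
# quasi-identifiers can still be linked back to the "de-identified" records.
public_register = pd.DataFrame({
    "postcode_district": ["CF10"],
    "birth_year": [1990],
    "name_from_register": ["Bola Ade"],
})

re_identified = deidentified.merge(
    public_register, on=["postcode_district", "birth_year"]
)
print(re_identified)  # one masked record is matched back to a named individual
```

The same logic scales up: as the number of variables and linked sources grows, so does the chance that some combination of quasi-identifiers is unique, which is why de-identification is treated here as one control among several rather than a guarantee.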

Inequality – digital divide

While the benefits of scale in many domains are clear (e.g., medical care), some see risks in the accumulation of data at a new scale, and in the power that this entails, whether the data are held in public or private institutions. For reasons of scale and complexity, a relatively small number of entities have the infrastructure and skills to acquire, hold, process and benefit from big data.

- While the question of who owns data is a legal one, the consequences of inequality pose ethical questions.
- Who can access data? In principle, any researcher can access Twitter via its API, but the costs and skills needed do present access barriers.
- Who governs data access? Increasingly, data with disclosure risks can be safely curated, with access enabled through governance mechanisms, such as committees. Is such access genuinely equally open? How is this documented?

Research integrity

Data repositories play a vital role in supporting research integrity by holding data and making them available to others for validation and replication, as well as for new research opportunities. To do so, data must have clear “provenance”: their sources and processing need to be known, identified and documented. The attenuated relationship between data curators and data producers, who may not be ‘researchers’ per se, makes this challenging for a number of reasons:

- Much data not collected for research, such as administrative data, has different standards (e.g., quality, metadata) from research data.
- For some genres, often with commercial value, such as Twitter data, there are legal restrictions on reproducing data, including providing data to support publications. For a comprehensive treatment of the issues of preserving social media, see Thomson (2016).
- Data repositories face challenges in upholding their commitments to standards of transparency and reproducibility when working with groups of data producers who do not routinely generate data for social research.

Emerging issues

- Alternatives to individual informed consent, such as “social consent”, are being tested, whereby sufficient protections are put in place to permit data use ethically without individual informed consent.
- There is growing recognition of the need to respect the source and provenance of data, and more broadly its “contextual integrity”, when deciding what reuse, if any, is permissible.
- Most research ethics are based on the assumption that the entity at risk is an individual, hence de-identification offers protection. If harms can be inflicted (for example, denial of health care) on the basis of group membership, with no need for individual identification, then the protection of de-identification is no longer adequate.
- If it is no longer possible to neatly divide public and private, then some suggest assessing data use based on outcomes, permitting uses with “public benefit” or in the “public interest”. However, definitions are often vague, and such benefits accrue long after the decision about data use has been made. How can data users be held accountable for delivering the promised public benefit?

Example – Informed consent to publish Tweets

Background to the research question

In 2015 Dan Gray, at Cardiff University, used Twitter to study misogynist speech. He encountered numerous legal and ethical challenges with consent and anonymisation when considering how to fairly represent research participants. He collected some 60,000 Tweets by filtering on keywords of hateful speech, and needed to be able to publish selected quotations of Tweets to support his arguments.
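To make the collection step concrete, here is a minimal sketch of keyword-filtered Tweet collection, assuming the tweepy library (version 4 or later) and a valid API bearer token. The query terms, field choices and file name are placeholders, and this is illustrative only, not a reconstruction of Gray’s 2015 method; the platform’s API and access terms have changed considerably since then.

```python
import json

import tweepy  # assumed dependency; requires API credentials with search access

BEARER_TOKEN = "YOUR_BEARER_TOKEN"  # placeholder; never hard-code real credentials

# Placeholder keywords: a real study would use a carefully justified keyword list.
QUERY = '("keyword one" OR "keyword two") lang:en -is:retweet'

client = tweepy.Client(bearer_token=BEARER_TOKEN, wait_on_rate_limit=True)

collected = []
# Paginate over the recent-search endpoint (covers roughly the last seven days).
for page in tweepy.Paginator(
    client.search_recent_tweets,
    query=QUERY,
    tweet_fields=["id", "text", "created_at", "author_id"],
    max_results=100,
    limit=10,  # cap the number of requests for this sketch
):
    for tweet in page.data or []:
        collected.append(
            {
                "id": tweet.id,
                "author_id": tweet.author_id,
                "created_at": str(tweet.created_at),
                "text": tweet.text,
            }
        )

# Store the raw records so that later ethical decisions (consent, anonymisation,
# quotation) can be reviewed and documented.
with open("tweets_raw.jsonl", "w", encoding="utf-8") as fh:
    for record in collected:
        fh.write(json.dumps(record) + "\n")

print(f"Collected {len(collected)} tweets matching the keyword filter")
```

As the rest of the example makes clear, the ethically difficult work begins after a collection step like this: deciding whether, and on what terms, individual Tweets can be stored, shared or quoted.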

Issues, constraints and decisions

- Twitter’s Terms and Conditions prohibit modifying content, meaning that Tweets could not be anonymised.
- Gray had to decide whether the Tweets could be considered public and, moreover, whether their public status would be sufficient to justify publishing them without consent.
- Survey analysis done at the Social Data Science Lab at Cardiff, where Gray was connected, showed that Tweeters did not want their content used, even for research, if they were identifiable.
- If he did decide to seek consent, there was no way to do so as a private communication to the Tweeter. This would have been possible only if the Tweeters were following him, and they were not.
- Mutual following was not possible as a way of contacting Tweeters because the Research Ethics Committee required that he use an anonymised profile.

Outcomes

- He opted to make contact by direct Tweet, though this risked allowing Tweeters to find him, and also to contact other Tweeters of hateful discourse.
- “Consent by Tweet” severely constrained his ability to explain the risks and benefits of the research.
- Consent was successfully obtained for a number of Tweets, enabling sharing of selected unanonymised Tweets in publications.
- Gray was able to draw upon the UK’s COSMOS Risk Assessment for guidance, but points out that its rigorous attention to harm and privacy can become a barrier, shielding hateful discourse from critical scrutiny.

Legal disclaimer

- Ethical practice has to develop, in part because the law nearly always lags behind what is possible.
- There will always be acts that are legal but not ethical, so the law alone is not enough.
- There are also complex legal questions for big data (e.g., copyright), and researchers should get legal advice through their research support offices.

Below is a selection of the resources and references that have informed the content on these pages.

Resources

Guides and checklists for ethical issues in big data social research

- A recent OECD report includes a Privacy Heuristic in Appendix 4 with key questions to consider when beginning research, such as: what are data subjects’ expectations about how their information might be used?
- The UK Cabinet Office has produced guidance with detailed case examples for UK research by government, but it is relevant to more general research as well.
- The UK Data Service is committed to developing tools and procedures to safely handle data, even when those data have disclosure risks. The Service currently uses a framework called the 5 Safes for selected data, and is adapting the framework for more general application, such as for big data.

Anonymisation

- UK Anonymisation Network, Anonymisation Decision-making Framework.
- ONS disclosure control guidance for microdata produced from social surveys.

Genre specific

- Tweet publication decision flowchart (EthicsSM-SRA-Workshop.pdf).

References

Evans, H., Ginnis, S. and Bartlett, J. (2015) Social Ethics: a guide to embedding ethics in social media research. Ipsos MORI.

Gray, D. (2015) Talking About Women: Misogyny on Twitter. Master of Science Dissertation, Cardiff University, September 2015.

Information Commissioner’s Office (2014) Big data and data protection.

Markham, A. and Buchanan, E. (2012) Ethical Decision-Making and Internet Research: Recommendations from the AoIR Ethics Working Committee (Version 2.0). https://aoir.org/reports/ethics2.pdf

Metcalf, J., Keller, E. F. and boyd, d. (2016) Perspectives on Big Data, Ethics, and Society. Council for Big Data, Ethics, and Society.

Nissenbaum, H. (2009) Privacy in Context: Technology, Policy, and the Integrity of Social Life. Stanford: Stanford University Press.

OECD (2013) New Data for Understanding the Human Condition. Global Science Forum Report.

OECD (2016) Research Ethics and New Forms of Data for Social and Economic Research. OECD Science, Technology and Industry Policy Papers, No. 34. Paris: OECD Publishing.

O’Neill, O. (2002) Autonomy and Trust in Bioethics. Cambridge: Cambridge University Press.

Richards, N. and King, J. (2014) Big Data Ethics. Wake Forest Law Review (49).

Schneier, B. (2015) Data and Goliath. New York: W.W. Norton and Co.

Social Data Science Lab (2016) Lab Online Guide to Social Media Research Ethics. http://socialdatalab.net/ethics-resources

Thomson, S. D. (2016) Preserving Social Media. DPC Technology Watch Report 16-01.

UK Cabinet Office, Data Science Ethical Framework (v1.0).

Weller, K. and Kinder-Kurlanda, K. A Manifesto for Data Sharing in Social Media Research. ACM. http://dl.acm.org/citation.cfm?id=2908172

Zwitter, A. (2014) Big Data Ethics. Big Data & Society. doi: 10.1177/2053951714559253

About Libby Bishop

Libby Bishop (PhD) is Producer Relations Manager at the UK Data Service, based at the UK Data Archive (University of Essex). She specialises in the ethics of data reuse: consent, confidentiality, anonymisation and secure access to data. As a member of the Advisory Panel, she helped to revise the Economic and Social Research Council’s Framework for Research Ethics. She is also a member of the University of Essex Research Ethics Committee. Her recent work has focused on big data, ethics and data sharing. She is a member of the Big Data, Ethics, and Society Network and has a forthcoming chapter, “Ethical challenges of sharing social media research data”, in The Ethics of Internet-mediated Research and Using Social Media for Social Research (ed. K. Woodfield).

20 February 2017
T +44 (0)1206 872143
E help@ukdataservice.ac.uk
W ukdataservice.ac.uk

The UK Data Service delivers quality social and economic data resources for researchers, teachers and policymakers.

Copyright 2017 University of Essex and University of Manchester
