Ethical thinking in Data Science and Artificial Intelligence

On this page you will find relevant information on Ethical thinking in Data Science and Artificial Intelligence including workshops, embedded modules that you can adapt and related publications. For questions or follow ups please contact Dr. Vandana Janeja. This is an important disciplinary discussion and we would love to hear from you.

Ethical Data Life Cycle

Ethics generally works on the principles of do no harm. Although research protocols to protect human beings have been in place for a while now, the pervasiveness of multiple types of data and their use make it less clear where the impact on human beings is in the data life cycle. Thus, harm is not only direct based on exposing identifiable data for individuals, but also indirect resulting from the reuse of easily available data and combining multiple datasets.

In particular for data science there is a need to develop ethical critical thinking while analyzing the data. Throughout the entire lifecycle of the data in the knowledge discovery process there are many opportunities for ethical decision making that a data scientist can evaluate to do no harm.

V. P. Janeja, Do No Harm: An Ethical Data Life Cycle, AAAS Sci on the Fly, April 2019.

Module to embed an Ethical perspective in Data Science and AI classes

Funded by Hrabowski Innovation Fund

Embedded Module

We designed a module which could be embedded in data science and related curricula. This module included the following:

Ethical Data life cycle
Discuss theory of ethics
Introduction to various data science code of conducts
Case studies, readings, and reflection questions in the context of an ethical data life cycle
Jupyter Notebook demonstrating principles of ethics in action in an algorithmic choice (Example of KNN algorithm)

GitHUB for Jupyter Notebook and Module files:

https://github.com/MultiDataLab/EthicalDataScience

Goldstein, A., Alodadi, M., & Janeja, V. (2023). OPPORTUNITIES FOR ETHICAL DECISION MAKING: A CASE STUDY IN K-NN. In INTED2023 Proceedings (pp. 8525-8531). IATED. doi: 10.21125/inted.2023.2362

Evaluation

A series of statements were presented to the students to measure the effect of the course on their attitudes related to the ethical considerations in the analysis of big data. Particularly we included questions around (1) perceptions towards ethical considerations and (2) actions they may need to take as a result of their understanding of ethical issues.

Through our survey-based evaluation we found that overall, the ethics module has had a positive effect on the attitude of the students towards the importance of ethical considerations in the analysis of data.

In general, students in the 2021 offering tended to start the module with a higher tendency towards neutrality and agreement to the action statements than those students in the 2020 offering. Essentially the responses are somewhat right skewed to agreement. A possible explanation for this occurrence could be the effect of the increased exposure to data associated with the pandemic which may have brought in students at a higher level of awareness. However, this is not necessarily the case for the perception statements. It is our hypothesis that perceptions are complex to understand but easier to establish, whereas actions while they take time to establish are concrete and easier to actualize. The goal of the module was to increase the ethical thinking of students when analyzing data in real world projects they would encounter. The survey results demonstrated that the goal was achieved as there was a general increase of the percentage of students that agree with the statements presented about ethical thinking. These findings were presented at the Edulearn 2022 conference[1].

[1] Vandana P. Janeja, Maria Sanchez, “Rethinking Data Science Pedagogy with Embedded Ethical Considerations,” EDULEARN 2022, https://doi.org/10.21125/edulearn.2022.1964

EDULearn Full paper PDF

Testimony in support of AI Bills in Maryland

James Foulds, Vandana P. Janeja, Testimony in support of “State Government – Technology Advisory Commission – Established”, HOUSE BILL 1174, March 5, 2024
Tim Finin and V.P. Janeja, Written Testimony in support of House Bill 999 “Workgroup on Establishing a Science and Technology Best Practices and Innovation Network” March 5, 2024, Passed
Tim Finin and V.P. Janeja, Written Testimony in support of House Bill 1297 “Education – Artificial Intelligence – Guidelines and Pilot Program” February 28, 2024
Vandana P. Janeja, Testimony in support of Senate Bill 955 “State Government – Technology Advisory Commission – Established” February 2024
Vandana P. Janeja, James Foulds, Testimony in support of HOUSE BILL 1323, Algorithmic Decision Systems –Procurement and Discriminatory Acts: role of Responsible AI, Testimony to the Health and Government Operations Committee, March 15, 2021, https://mgaleg.maryland.gov/cmte_testimony/2021/hgo/10DEmLNbK67NWBLQFL2o1_4AByAJ-EK9n.pdf

Workshop : Ethics in Data Science Pedagogy

Funded by NSF

Executive Summary, Including Ethics in Data Science Pedagogy (NSF-EDSP-2019), June 17-18, 2019, Alexandria, VA, UMBC: Shimei Pan, Jimmy Foulds, Susan Sterett, Vandana Janeja, UCB: Cathryn Carson , Georgetown: Lisa Singh, UW: Bill Howe, https://sites.google.com/umbc.edu/edsp19/resources?authuser=0
Workshop Website

Publications

Goldstein, A., Alodadi, M., & Janeja, V. (2023). OPPORTUNITIES FOR ETHICAL DECISION MAKING: A CASE STUDY IN K-NN. In INTED2023 Proceedings (pp. 8525-8531). IATED. doi: 10.21125/inted.2023.2362
Vandana P. Janeja, Maria Sanchez, “Rethinking Data Science Pedagogy with Embedded Ethical Considerations,” EDULEARN 2022, https://doi.org/10.21125/edulearn.2022.1964
Vandana P. Janeja, James Foulds, Testimony in support of Algorithmic Decision Systems –Procurement and Discriminatory Acts: role of Responsible AI, 2021 https://mgaleg.maryland.gov/cmte_testimony/2021/hgo/10DEmLNbK67NWBLQFL2o1_4AByAJ-EK9n.pdf
Executive Summary, Including Ethics in Data Science Pedagogy (NSF-EDSP-2019), June 17-18, 2019, Alexandria, VA, UMBC: Shimei Pan, Jimmy Foulds, Susan Sterett, Vandana Janeja, UCB: Cathryn Carson , Georgetown: Lisa Singh, UW: Bill Howe, https://sites.google.com/umbc.edu/edsp19/resources?authuser=0
V. P. Janeja, S. Pan, J. Foulds, L. Boot, Including Ethics in Data Science Pedagogy: Why, What and How? UMBC- Provost’s Teaching and Learning Symposium, 2019

C. Erickson, C. Carson, J. Aikat, S. Davis, & V. Janeja, (2019, April 1). Ethics Panel Report: 2018 Data Science Leadership Summit. Zenodo. http://doi.org/10.5281/zenodo.3890536
V. P. Janeja, Do No Harm: An Ethical Data Life Cycle, AAAS Sci on the Fly, April 2019.
V. P. Janeja and Susan M. Sterett, Infusing Ethical Considerations in a Data Science Curriculum, UMBC- Provost’s Teaching and Learning Symposium, 2018

MData Lab

College of Engineering and Information Technology

MData Lab

Ethical thinking in Data Science and Artificial Intelligence

Module to embed an Ethical perspective in Data Science and AI classes

Testimony in support of AI Bills in Maryland

MData Lab

Module to embed an Ethical perspective in Data Science and AI classes

Testimony in support of AI Bills in Maryland

Subscribe to UMBC Weekly Top Stories

I am interested in: