Ethical Considerations of Including Personal Demographic Information in Open Knowledge Platforms




metadata, ethics, open knowledge, data privacy, linked data, Wikidata, gender


In recent years, galleries, libraries, archives, and museums (GLAMs) have sought to leverage open knowledge platforms such as Wikidata to highlight or provide more visibility for traditionally marginalized groups and their work, collections, or contributions. Efforts like Art + Feminism, local edit-a-thons, and, more recently, GLAM institution-led projects have promoted open knowledge initiatives to a broader audience of participants. One such open knowledge project, the Program for Cooperative Cataloging (PCC) Wikidata Pilot, has brought together over seventy GLAM organizations to contribute linked open data for individuals associated with their institutions, collections, or archives. However, these projects have brought up ethical concerns around including potentially sensitive personal demographic information, such as gender identity, sexual orientation, race, and ethnicity, in entries in an open knowledge base about living persons. GLAM institutions are thus in a position of balancing open access with ethical cataloging, which should include adhering to the personal preferences of the individuals whose data is being shared. People working in libraries and archives have been increasingly focusing their energies on issues of diversity, equity, and inclusion in their descriptive practices, including remediating legacy data and addressing biased language. Moving this work into a more public sphere and scaling up in volume creates potential risks to the individuals being described. While adding demographic information on living people to open knowledge bases has the potential to enhance, highlight, and celebrate diversity, it could also potentially be used to the detriment of the subjects through surveillance and targeting activities. In this article we seek to investigate the changing role of metadata and open knowledge in addressing, or not addressing, issues of under- and misrepresentation, especially as they pertain to gender identity as described in the sex or gender property in Wikidata. We report findings from a survey investigating how organizations participating in open knowledge projects are addressing ethical concerns around including personal demographic information as part of their projects, including what, if any, policies they have implemented and what implications these activities may have for the living people being described.


Download data is not yet available.


Adolpho, Kalani. 2019. “Who Asked You? Consent, Self-Determination, and the Report of the PCC Ad Hoc Task Group on Gender in Name Authority Records.” In Ethical Questions in Name Authority Control, edited by Jane Sandberg, 111–31. Sacramento, CA: Library Juice Press.

Archives for Black Lives in Philadelphia’s Anti-Racist Description Working Group. 2019. Archives for Black Lives in Philadelphia Anti-Racist Description Resources.

Ayers, Phoebe, Charles Matthews, and Ben Yates. 2008. How Wikipedia Works And How You Can Be a Part of It. No Starch Press.

Baucom, Erin. 2018. “An Exploration into Archival Descriptions of LGBTQ Materials.” The American Archivist 81 (1): 65–83.

Billey, Amber. 2019. “Just Because We Can, Doesn’t Mean We Should: An Argument for Simplicity and Data Privacy with Name Authority Work in the Linked Data Environment.” Journal of Library Metadata 19 (1–2): 1–17.

Billey, Amber, Emily Drabinski, and K. R. Roberto. 2014. “What’s Gender Got to Do with It? A Critique of RDA 9.7.” Cataloging & Classification Quarterly 52 (4): 412–21.

Billey, Amber, Matthew Haugen, John Hostage, Nancy Sack, and Adam L. Schiff. 2016. “Report of the PCC Ad Hoc Task Group on Gender in Name Authority Records.” Program for Cooperative Cataloging. Archived at:

Burgess, John T. F. 2019. “Principles and Concepts in Information Ethics.” In Foundations of Information Ethics, edited by John T. F. Burgess, Emily J. M. Knox, and Robert Hauptman, 1–16. Chicago: ALA NealSchuman.

Campbell, Grant, and Scott R. Cowan. 2016. “The Paradox of Privacy: Revisiting a Core Library Value in an Age of Big Data and Linked Data.” Library Trends 64 (3): 492–511.

Cannan, Judith P., Paul Frank, and Les Hawkins. 2019. “LC/NACO Authority File in the Library of Congress BIBFRAME Pilots.” Journal of Library Metadata 19 (1–2): 39–51.

Caswell, Michelle, and Marika Cifor. 2016. “From Human Rights to Feminist Ethics: Radical Empathy in the Archives.” Archivaria 81 (Spring): 23–43.

Chiam, Zhan, Sandra Duffy, Matilda González Gil, Lara Goodwin, and Nigel Timothy Mpemba Patel. 2020. Trans Legal Mapping Report 2019: Recognition Before the Law. 3rd ed. Geneva: ILGA World. Archived at: “H.R.5 - 117th Congress (2021-2022): Equality Act.” Accessed March 17, 2021.

DeBonis, Mike. 2021. “The Push for LGBTQ Civil Rights Stalls in the Senate as Advocates Search for Republican Support.” Washington Post, June 20, 2021. Archived at:

Diamond, Lisa M. 2020. “Gender Fluidity and Nonbinary Gender Identities Among Children and Adolescents.” Child Development Perspectives 14 (2): 110–15.

“Gender Unicorn.” n.d. Trans Student Education Resources (blog). Accessed June 17, 2021. Archived at:

Hannah, Kaiti, and Liz Scott. 2020. “Language Remediation Project Underway at the Western Development Museum: Answering TRC Calls to Action #43 and #67.” Western Development Museum. Archived at:

Larade, Sharon P., and Johanne M. Pelletier. 1993. “Mediating in a Neutral Environment: Gender-Inclusive or Neutral Language in Archival Descriptions.” Archivaria 35 (Spring): 99–109.

Lellman, Charlotte, Hanna Clutterbuck-Cook, Amber LaFountain, and Jessica Sedgwick. 2020. “Guidelines for Inclusive and Conscientious Description.” Center for the History of Medicine: Policies and Procedures Manual. Boston, MA: Center for the History of Medicine, Francis A. Countway Library of Medicine. Harvard University Wiki. Archived at:

Linked Data for Production: Pathway to Implementation (LD4P2). 2021. Ethics in Linked Data Affinity Group. Accessed November 29, 2021.

Long, Kara, Santi Thompson, Sarah Potvin, and Monica Rivero. 2017. “The ‘Wicked Problem’ of Neutral Description: Toward a Documentation Approach to Metadata Standards.” Cataloging & Classification Quarterly 55 (3): 107–28.

Mizota, Sharon. 2021. “Change Is Good: Navigating Wikidata as a Controlled Descriptive Vocabulary.” Descriptive Notes (blog), March 30, 2021. Archived at:

O’Neill, Shannon, and Rachel Searcy. 2020. “Righting (and Writing) Wrongs: Reparative Description for Japanese American Wartime Incarceration.” The Back Table (blog), New York University Libraries.

Program for Cooperative Cataloging (PCC) Standing Committee on Training. 2020. “NACO Participants’ Manual.” Program for Cooperative Cataloging.

Smith-Yoshimura, Karen. 2020. Transitioning to the Next Generation of Metadata. Dublin, OH: OCLC Research.

Society of American Archivists. 2021. “Reparative Description.” Dictionary of Archives Terminology.,characterize%20archival%20resources%20(View%20Citations).

Tillman, Ruth Kitchin. 2019. “Barriers to Ethical Linked Data Name Authority Modeling.” In Ethical Questions in Name Authority Control, edited by Jane Sandberg, 243–60. Sacramento, CA: Library Juice Press.

Thompson, Kelly J. 2016. “More Than a Name: A Content Analysis of Name Authority Records for Authors Who Self-Identify as Trans.” Library Resources & Technical Services 60 (3): 140–55.

US Department of Labor. n.d. “Guidance on the Protection of Personal Identifiable Information.” Accessed June 21, 2021. Archived at:

VAWnet. n.d. “Violence Against Trans and Non-Binary People.” National Resource Center on Domestic Violence.

Whittaker, Thomas A. 2019. “Demographic Characteristics in Name Authority Records and the Ethics of a Person-Centered Approach to Name Authority Control.” In Ethical Questions in Name Authority Control, edited by Jane Sandberg, 57–68. Sacramento, CA: Library Juice Press.

Wikidata. 2013–20. “Wikidata:Property talk:P21/Archive 1.”

Wikidata. 2020. “Wikidata:Living people.” Last modified August 4, 2020.

Wikidata. 2021a. “Wikidata:Notability.” Last modified June 14, 2021.

Wikidata. 2021b. “Wikidata:Property that may violate privacy.” Last modified June 4, 2021.

Wikidata. 2021c. “sex or gender (P21).” Last modified June 8, 2021.

Wikidata. 2021d. “sexual orientation (P91).” Last modified June 4, 2021.

Wikipedia. 2021. “Wikipedia:Biographies of living persons.” Last modified June 25, 2021.

Winston, Rachel E. 2021. “Praxis for the People: Critical Race Theory and Archival Practice.” In Knowledge Justice: Disrupting Library and Information Studies Through Critical Race Theory, edited by Sofia Y. Leung and Jorge R. Lopez-McKnight, 283–98. Cambridge, MA: MIT Press.

Yarmosky, Jessica. 2019. “‘I Can Exist Here’: On Gender Identity, Some Colleges are Opening Up.” NPR, March 21, 2019. Archived at:



How to Cite

Lindsey, Nerissa, Greta Kuriger Suiter, and Kurt Hanselman. 2022. “Ethical Considerations of Including Personal Demographic Information in Open Knowledge Platforms”. KULA: Knowledge Creation, Dissemination, and Preservation Studies 6 (3):1-15.



Research Articles

Similar Articles

You may also start an advanced similarity search for this article.