Skip to main content

Documentation & accountability

Guidelines for ethical data annotation and release practices without exploiting workers and stealing data.

A colorful slide of what looks like post its. With "Who is who in data labeling?" first and then a bunch of sketches of people. Most of the text in the post its is not legible. But there is a title, "Stakeholders Map"

Our Research

We can build useful AI systems without stealing data and exploiting labor. Instead of attempting to guzzle every available data on the internet, we advocate for curation practices that only collect the data needed to build specific modes. Instead of exploiting labor, we advocate for a collaborative approach that treats data workers as knowledgeable builders of AI systems. Instead of releasing datasets and models into the world without documentation, we advocate for the careful documentation practices that are widespread in other engineering disciplines.

Image: Janet Turra & Cambridge Diversity Fund / https://betterimagesofai.org / https://creativecommons.org/licenses/by/4.0/

Data Feminism

The narratives around big data and data science are overwhelmingly white, male, and techno-heroic. In Data Feminism (MIT Press, 2020), Catherine D'Ignazio and Lauren Klein present a mode of thinking about data science and data ethics that is informed by intersectional feminist thought.