Gateway to Think Tanks
来源类型 | Research Report |
规范类型 | 报告 |
Toward an Open Data Bias Assessment Tool | |
Ajjit Narayanan; Graham MacDonald | |
发表日期 | 2019-03-05 |
出版年 | 2019 |
语种 | 英语 |
概述 | Data are a critical resource for government decisionmaking, and in recent years, local governments, in a bid for transparency, community engagement, and innovation, have released many municipal datasets on publicly accessible open data portals. Advocates, reporters, and others have voiced concerns about the bias of algorithms used to guide public decisions and the data that power them.Although significant |
摘要 | Data are a critical resource for government decisionmaking, and in recent years, local governments, in a bid for transparency, community engagement, and innovation, have released many municipal datasets on publicly accessible open data portals. Advocates, reporters, and others have voiced concerns about the bias of algorithms used to guide public decisions and the data that power them. Although significant progress is being made in developing tools for algorithmic bias and transparency, we could not find any standardized tools available for assessing bias in open data itself. In other words, how can policymakers, analysts, and advocates systematically measure the level of bias in the data that power city decisionmaking, whether an algorithm is used or not? To fill this gap, we present a prototype of an automated bias assessment tool for geographic data. This new tool will allow city officials, concerned residents, and other stakeholders to quickly assess the bias and representativeness of their data. The tool allows users to upload a file with latitude and longitude coordinates and receive simple metrics of spatial and demographic bias across their city. The tool is built on geographic and demographic data from the Census and assumes that the population distribution in a city represents the “ground truth” of the underlying distribution in the data uploaded. To provide an illustrative example of the tool’s use and output, we test our bias assessment on three datasets—bikeshare station locations, 311 service request locations, and Low Income Housing Tax Credit (LIHTC) building locations—across a few, hand-selected example cities. Across the small sample of cities we studied, we consistently find that bikeshare stations are concentrated in downtown areas, overserve neighborhoods with high numbers of non-Hispanic white, non-Hispanic Asian, and college-educated residents, and underserve neighborhoods with large numbers of non-Hispanic Black, Hispanic, unemployed, and poor residents. The results from our analysis of bias in 311 service requests and LIHTC building location data are much more mixed across cities. Of particular note: 311 service requests from Boston and DC overrepresent white and college-educated neighborhoods while 311 service requests from Philadelphia overrepresent non-Hispanic Black and poorer neighborhoods. LIHTC location data from Raleigh demonstrate that buildings tend to be in neighborhoods with higher shares of Black and poor residents and lower shares of white and college-educated residents relative to the city average, in contrast to the other cities we studied, which tended to have much smaller differences. |
主题 | Neighborhoods, Cities, and Metros ; Poverty, Vulnerability, and the Safety Net ; Race and Ethnicity |
URL | https://www.urban.org/research/publication/toward-open-data-bias-assessment-tool |
来源智库 | Urban Institute (United States) |
资源类型 | 智库出版物 |
条目标识符 | http://119.78.100.153/handle/2XGU8XDN/480514 |
推荐引用方式 GB/T 7714 | Ajjit Narayanan,Graham MacDonald. Toward an Open Data Bias Assessment Tool. 2019. |
条目包含的文件 | ||||||
文件名称/大小 | 资源类型 | 版本类型 | 开放类型 | 使用许可 | ||
toward_an_open_data_(4051KB) | 智库出版物 | 限制开放 | CC BY-NC-SA | 浏览 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。