G2TT
来源类型Research Report
规范类型报告
Toward an Open Data Bias Assessment Tool
Ajjit Narayanan; Graham MacDonald
发表日期2019-03-05
出版年2019
语种英语
概述Data are a critical resource for government decisionmaking, and in recent years, local governments, in a bid for transparency, community engagement, and innovation, have released many municipal datasets on publicly accessible open data portals. Advocates, reporters, and others have voiced concerns about the bias of algorithms used to guide public decisions and the data that power them.Although significant
摘要

Data are a critical resource for government decisionmaking, and in recent years, local governments, in a bid for transparency, community engagement, and innovation, have released many municipal datasets on publicly accessible open data portals. Advocates, reporters, and others have voiced concerns about the bias of algorithms used to guide public decisions and the data that power them.

Although significant progress is being made in developing tools for algorithmic bias and transparency, we could not find any standardized tools available for assessing bias in open data itself. In other words, how can policymakers, analysts, and advocates systematically measure the level of bias in the data that power city decisionmaking, whether an algorithm is used or not?

To fill this gap, we present a prototype of an automated bias assessment tool for geographic data. This new tool will allow city officials, concerned residents, and other stakeholders to quickly assess the bias and representativeness of their data. The tool allows users to upload a file with latitude and longitude coordinates and receive simple metrics of spatial and demographic bias across their city.

The tool is built on geographic and demographic data from the Census and assumes that the population distribution in a city represents the “ground truth” of the underlying distribution in the data uploaded. To provide an illustrative example of the tool’s use and output, we test our bias assessment on three datasets—bikeshare station locations, 311 service request locations, and Low Income Housing Tax Credit (LIHTC) building locations—across a few, hand-selected example cities.

Across the small sample of cities we studied, we consistently find that bikeshare stations are concentrated in downtown areas, overserve neighborhoods with high numbers of non-Hispanic white, non-Hispanic Asian, and college-educated residents, and underserve neighborhoods with large numbers of non-Hispanic Black, Hispanic, unemployed, and poor residents. The results from our analysis of bias in 311 service requests and LIHTC building location data are much more mixed across cities. Of particular note: 311 service requests from Boston and DC overrepresent white and college-educated neighborhoods while 311 service requests from Philadelphia overrepresent non-Hispanic Black and poorer neighborhoods. LIHTC location data from Raleigh demonstrate that buildings tend to be in neighborhoods with higher shares of Black and poor residents and lower shares of white and college-educated residents relative to the city average, in contrast to the other cities we studied, which tended to have much smaller differences.

主题Neighborhoods, Cities, and Metros ; Poverty, Vulnerability, and the Safety Net ; Race and Ethnicity
URLhttps://www.urban.org/research/publication/toward-open-data-bias-assessment-tool
来源智库Urban Institute (United States)
资源类型智库出版物
条目标识符http://119.78.100.153/handle/2XGU8XDN/480514
推荐引用方式
GB/T 7714
Ajjit Narayanan,Graham MacDonald. Toward an Open Data Bias Assessment Tool. 2019.
条目包含的文件
文件名称/大小 资源类型 版本类型 开放类型 使用许可
toward_an_open_data_(4051KB)智库出版物 限制开放CC BY-NC-SA浏览
个性服务
推荐该条目
保存到收藏夹
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Ajjit Narayanan]的文章
[Graham MacDonald]的文章
百度学术
百度学术中相似的文章
[Ajjit Narayanan]的文章
[Graham MacDonald]的文章
必应学术
必应学术中相似的文章
[Ajjit Narayanan]的文章
[Graham MacDonald]的文章
相关权益政策
暂无数据
收藏/分享
文件名: toward_an_open_data_bias_assessment_tool_3.pdf
格式: Adobe PDF
此文件暂不支持浏览

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。