Data. Policy. Impact.

The Data Science and Public Policy Lab at Carnegie Mellon University, across the Machine Learning Department and the Heinz College of Public Policy, works to develop and further the use of ML/AI/data science in social good, and public policy research and practice. Our work includes educating current and future policymakers, researchers, and practitioners, working on ML/AI/data science projects with government, nonprofit, academic, and foundation partners, conducting new research, and developing new methods, open-source tools, and guides that support and extend the use of ML/AI/data science for public policy and social impact. Our team consists of data scientists and researchers from computer science, statistics, and social science backgrounds to bring in methods from all of these disciplines, software engineers to make sure our work becomes usable code and implemented, domain and policy experts to provide context and relevance, and project managers who help get things done.

Training and Education Programs
Data Science/ML/AI Projects with Governments and NGOs
Research Areas
Tools. Open Source Code, and Guides

We believe that effective use of data, ML, and AI is critical in making adaptive and personalized policies that improve lives of everyone in a measurable, fair, and equitable manner.

Our Work

Collaborative Projects

We work with governments, non-profits, and other organizations on data science projects across health, criminal justice, public safety, education, economic development, transportation, and more. Most of our projects tackle operational problems that have tangible impact, and result in software that can be used by our partner organizations (and others) for social impact and improved policies. Recent examples of our projects include:

Building Data-Driven Police Early Intervention Systems
Prioritizing Preventative Lead Hazard Inspections
Prioritizing Health and Safety Housing Inspections
Reducing incarcerations by identifying at risk individuals in need of social services

HOW TO WORK WITH US

Research Areas

Our research initiatives are motivated by working on hands-on data science projects with governments, non-profits, and other policy organizations. As we tackle policy problems, we identify open areas where existing methods from computer science, machine learning, artificial intelligence or social sciences are lacking and formulate our research initiatives to fill those gaps. We then push the results of our research back into our data science tools so they can be used across our projects and by our project partners. We are currently working on:

Auditing and Correcting for Bias and Equity Issues in Data Science Systems
Increasing the interpretability and transparency of machine learning models used in policy decisions
Designing experimental validation methodologies for machine learning systems
Developing methods for monitoring and updating deployed data science systems

MORE ABOUT OUR RESEARCH EFFORTS

ML/AI/Data Science Pipelines and Tools

We believe in open and reusable code and tools. All of our (non-confidential) project code is available under an open source license on our github page. All of our internal data science tools are also available for other organizations to use. Examples of such tools include:

Triage: Our data science pipeline platform that’s used in many of our internal projects, which contains components for generating features, building machine learning models, and evaluating those models.
Entity Deduplication Tool (pgdedupe)
Post-Modeling Tools for analyzing the models built, feature importances, and exploring the outputs of those models before deployment.
Bias Audits: To run bias audits on the outputs of machine learning models

MORE ABOUT OUR TOOLS

Trainings

We run training programs, workshops, and tutorials for students, government agencies, non profits, foundations, and corporations. Some of our trainings include:

The value of data driven decision-making (for managers and executives in government agencies and non profits)
How to scope data science projects
Assessing your Data maturity
Hands-on Technical Trainings including the Applied Data Analytics for Public Policy (with Coleridge Initiative)

Our trainings for governments and non-profits are designed for Directors and Executives of organizations as well as Analysts and Policymakers.

MORE ABOUT OUT TRAINING PROGRAMS

Blog

Our Project Partners

The future of Public Policy is open, adaptive, scalable, micro-policies that benefit everyone in a measurable, equitable, and fair manner. We can help get there.

Contact Us

CMU Scopeathon Ties Data Science to Better Communities

Our Work Receives Maryland Association of Counties Innovation Award for Artificial Intelligence Tool That Detects Collapsed Roofs

We're Hiring! Click Here for More Info

The Atlantic discusses our work on predicting police shootings

Pulse Lab Jakarta discussed our collaboration to tackle Traffic Safety in Jakarta

Our projects on Police Early Intervention Systems and Blight Prevention highlighted in "Stories from the World of Municipal Analytics"

GovTech Magazine article uses DSAPP's frameworks to highlight "How to do Data Science in Government"

Our work on bias and fairness audits for machine learning models is featured in Nature

Digital Content Media Outlet Which-50 Features Director Rayid Ghani on Auditing Government AI for Bias

From Government Technology Magazine: Former Associate Director Lauren Haynes on how to Scope Data Projects

Aequitas, our open-source Bias Audit Tool for auditing predictive risk-assessment tools for bias and fairness just launched. Try it out!

DSaPP Director Rayid Ghani, Aequitas Featured in Article on Bias and Data-Driven Policy in Philadelphia

Our Data-Driven Police Early Intervention System is live at Charlotte Mecklenburg Police Department

Data. Policy. Impact.

Health

Criminal Justice

Public Safety

Economic Development

Education

Energy and Environment

Transportation and Infrastructure

Our Work

Collaborative Projects

Building Data-Driven Police Early Intervention Systems

Prioritizing Preventative Lead Hazard Inspections

Prioritizing Health and Safety Housing Inspections

Reducing incarcerations by identifying at risk individuals in need of social services

Research Areas

Auditing and Correcting for Bias and Equity Issues in Data Science Systems

Increasing the interpretability and transparency of machine learning models used in policy decisions

Designing experimental validation methodologies for machine learning systems

Developing methods for monitoring and updating deployed data science systems

ML/AI/Data Science Pipelines and Tools

We believe in open and reusable code and tools. All of our (non-confidential) project code is available under an open source license on our github page. All of our internal data science tools are also available for other organizations to use. Examples of such tools include:

Triage: Our data science pipeline platform that’s used in many of our internal projects, which contains components for generating features, building machine learning models, and evaluating those models.

Entity Deduplication Tool (pgdedupe)

Post-Modeling Tools for analyzing the models built, feature importances, and exploring the outputs of those models before deployment.

Bias Audits: To run bias audits on the outputs of machine learning models

Trainings

We run training programs, workshops, and tutorials for students, government agencies, non profits, foundations, and corporations. Some of our trainings include:

The value of data driven decision-making (for managers and executives in government agencies and non profits)

How to scope data science projects

Assessing your Data maturity

Hands-on Technical Trainings including the Applied Data Analytics for Public Policy (with Coleridge Initiative)

Our trainings for governments and non-profits are designed for Directors and Executives of organizations as well as Analysts and Policymakers.

Blog

Our Project Partners

The future of Public Policy is open, adaptive, scalable, micro-policies that benefit everyone in a measurable, equitable, and fair manner. We can help get there.

We're Hiring!
Click Here for More Info