A Framework to Establish a Data Linkage Program

Creating a standardized framework for developing data linkage programs to enhance decision-making across government
  • Client
    National Center for Science and Engineering Statistics within the U.S. National Science Foundation
  • Dates
    September 2024 – September 2026

Problem

Declining survey response rates and rising costs hinder data quality, coverage, and analytical capabilities.

The National Secure Data Service (NSDS) Demonstration project, led by the U.S. National Science Foundation’s (NSF) National Center for Science and Engineering Statistics (NCSES), aims to inform a government-wide effort on strengthening data linkage and data access infrastructure. This effort facilitates statistical activities in support of increased evidence building for the American public. One goal of the NSDS Demonstration project is to inform efforts for developing a shared services model that would streamline and innovate data sharing and linking to enable decision-making at all levels of government and in all sectors.

NCSES within NSF is the principal source of analytical and statistical reports, data, and related publications that describe and provide insight into the nation’s science and engineering resources. Like other federal surveys, NCSES’s surveys face several challenges, including declining response rates, increased costs, and decreasing resources to support survey operations. Agencies across the federal government are working out how to make the most of their available data resources, including by linking data sources. To further these efforts, this project uses NCSES as a case study to inform the development of a standardized framework that agencies can use to develop linkage programs. The project also supports the NSDS Demonstration effort to inform a shared services model for streamlined data sharing and linking across all levels of government and sectors.

Solution

NORC is building a comprehensive data linkage framework for NCSES to enhance data integration, quality, and usability.

This project supports the NSDS Demonstration effort by identifying and documenting steps for federal agencies to consider when establishing a data linkage program. To that end, NORC will draw on its expertise in advanced record linkage methodologies, data quality assurance, and data protection measures. NORC will first create an inventory of existing NCSES linked data files, identifying data sources, documenting linkage characteristics, and compiling dissemination materials in a user-friendly format. Next, building on best practices, NORC will develop frameworks for data sharing agreements, input data preparation, and record linkage methods. These frameworks will establish clear guidelines for data sharing agreements and set rigorous standards to ensure the integrity and reliability of data preparation and linkage processes. Finally, NORC will design frameworks for disseminating information about linked data and for accessing linked datasets, with attention to both data utility and privacy. This strategy will enhance the quality, consistency, and usability of NCSES data assets and inform other agencies hoping to develop similar frameworks.

Result

NCSES’s enhanced data linkage capabilities will support evidence-based research.

The project will deliver an inventory of current NCSES linked data files and develop a framework to establish a comprehensive data linkage program. Key components of the program will be developed, including frameworks for data sharing agreements, preparation of input data for linkage, record linkage methods, and dissemination of linked data and corresponding documentation. These deliverables will establish a foundation for implementing high-quality linkages, maintaining data privacy and utility, and improving the reliability and accessibility of the linked datasets.

The enhanced data linkage capabilities will support evidence-based research and policy decisions, enabling NCSES to leverage its data assets more effectively. This project will help establish future data linkage initiatives within the federal statistical community, setting best practices in record linkage and secure data sharing methods.

Contact Information

Vice President, Statistics & Data Science

Project Leads

“By establishing a robust data linkage framework, NORC is helping NCSES harness the power of integrated data to drive evidence-based policymaking and improve accessibility to critical federal data assets.”
