Skip to Main Content

Government Information Data Rescue

Data Rescue Activist Tools

These data activist organizations focus on rescuing and preserving data:

  • ArchiveBox has archived datasets from data.gov, CIBP, USCIS, NOAA, NASA, NSIDC
  • Archive Team is focusing on archiving datasets from the U.S. Government
  • GitHub: Awesome Datahoarding provides lists of tools for web harvesting
  • GovDiff shows side-by-side comparisons of government website changes
  • MIT Libraries: Data Management Checklist provides a checklist for curating data rescue efforts
  • r/Data Hoarder is a subreddit of data preservation activists
  • Safeguarding Research Discourse Group is a preservation group hosted outside of the US
  • WebRecorder has archived 8TB+ of government sites