This is very welcome news in the face of Trumpist depredations of US public data.
lil.law.harvard.edu/blog/2025/…

"In recent months the Harvard Law School [@harvard_law] Library Innovation Lab [@harvardlil] has created a data vault to download, sign as authentic, and make available copies of public government data that is most valuable to researchers, scholars, civil society and the public at large across every field. To begin, we have collected major portions of the datasets tracked by data.gov, federal #Github repositories, and #PubMed...."

#DataGov #DefendResearch #Libraries #OpenData #Preservation #Trump #TrumpVResearch #USPol #USPolitics

in reply to petersuber

Related: Also see the End of Term Web Archive, which routinely scrapes and preserves US govt web pages before new presidents take office. Launched in 2008 and still going strong. To supplement its archive, it welcomes URL nominations of sites to save.
eotarchive.org/

#OpenData #Preservation #Trump #USPol #USPolitics

in reply to petersuber

Related: "As Data Goes Off-Line Under #Trump, Environmental Researchers Are Uploading Backups"
insidehighered.com/news/facult…

"In the first few days of Donald Trump’s second term as president, the White House Council on Environmental Quality’s #Climate and Economic Justice Screening Tool (#CEJST, for short) disappeared from government websites. It was an interactive map of U.S. Census tracts that are “marginalized by underinvestment and overburdened by pollution,” as the pre-Trump federal government put it—something researchers and the public could use to quickly locate and zoom in on specific communities and analyze the problems they face. The Internet Archive’s Wayback Machine stored a copy of the webpage, but even there, the map was gone. However, thanks to a team of researchers from multiple universities and other organizations, a new working version was posted online Friday."

#OpenData #Preservation #USPol #USPolitics

in reply to petersuber

Related. Also see the Safeguarding Research project.
safeguarding-research.discours…

"You know of any publicly available material that needs safeguarding? Please post about it here!"

#Preservation #USPol #USPolitics

in reply to petersuber

Related. "Researchers rush to preserve federal health databases before they disappear from government websites"
journalistsresource.org/home/r…

"Tips for preserving websites:
* To find the missing websites, go to Wayback Machine and type in the website’s URL in the search bar.
* If you’re concerned that certain websites or web pages may be removed, you can suggest federal websites and content that end in .gov, .mil and .com to the End of Term Web Archive.
* You can suggest federal climate and environmental databases to Environmental Data and Governance Initiative.
* You can suggest databases to The Data Liberation Project, which is run by MuckRock and Big Local News.
* Tell science journalist Maggie Koerth what CDC data you've downloaded and whether you've made them publicly available...."
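The first tip above (looking up a missing page in the Wayback Machine) can also be done programmatically. The following is a minimal sketch, not part of the quoted article, assuming the Internet Archive's public availability endpoint at `https://archive.org/wayback/available`. The function names (`build_query`, `closest_snapshot`, `lookup`) are hypothetical and chosen only for illustration.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: the Internet Archive's public "availability" endpoint.
AVAILABILITY_ENDPOINT = "https://archive.org/wayback/available"


def build_query(url, timestamp=None):
    """Build the lookup URL for a page, optionally near a YYYYMMDD timestamp."""
    params = {"url": url}
    if timestamp:
        params["timestamp"] = timestamp
    return f"{AVAILABILITY_ENDPOINT}?{urlencode(params)}"


def closest_snapshot(payload):
    """Return the archived snapshot URL from a parsed JSON response, or None."""
    closest = payload.get("archived_snapshots", {}).get("closest")
    if closest and closest.get("available"):
        return closest.get("url")
    return None


def lookup(url, timestamp=None):
    """Query the endpoint over the network and return the closest snapshot URL."""
    with urlopen(build_query(url, timestamp), timeout=30) as resp:
        return closest_snapshot(json.load(resp))
```

For example, `lookup("cdc.gov", "20250120")` would ask for the capture closest to January 20, 2025. A `None` result only means this endpoint reported no capture; it is still worth checking the Wayback Machine's web interface and the other archives listed in this thread.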

#Preservation #Trump #USPol #USPolitics

in reply to petersuber

Related. "Scientists globally are racing to save vital health databases taken down amid Trump chaos."
nature.com/articles/d41586-025…

#CDC #FDA #Medicine #Preservation #Takedowns #Trump #USPol #USPolitics

in reply to petersuber

Update. "Inside the race to archive the US government’s websites"
technologyreview.com/2025/02/0…

Surveying a range of initiatives with good clarity on the obstacles.

"There are questions about whether scraping the data will really be enough. Restoring websites and complex data sets is often not a simple process.…'The repairs and attempts to recover are sometimes insurmountable where we need continuous readings of data.' 'All of this data archiving work is a temporary Band-Aid,' says Gosnell. 'If data sets are removed and are no longer updated, our archived data will become increasingly stale and thus ineffective at informing decisions over time.' "

#Climate #Data #Medicine #OpenData #Preservation #Takedowns #Trump #USPol #USPolitics

in reply to petersuber

Update. "The Public Environmental Data Partners [#PEDP] are committed to preserving and providing public access to federal environmental data. We are a volunteer coalition of several environmental, justice, and policy organizations, researchers across several universities, archivists, and students who rely on federal datasets and tools to support critical research, advocacy, policy, and litigation work. To gather insights on what data to preserve, we reached out to our networks, which consist largely of environmental justice groups and networks, state and local government climate offices, and academic researchers. We compiled a large list of federal databases and tools, and prioritized them based on their relative impact, our confidence that we could archive them, and the relative effort it would take to obtain and archive them."
screening-tools.com/

Continuously updated.

#Climate #Data #Environment #OpenData #Preservation #Takedowns #Trump #USPol #USPolitics

in reply to petersuber

Update. "This is Version 2 of the Climate and Economic Justice Screening Tool, released by the Council on Environmental Quality [#CEQ] in December 2024. Although the tool remains unchanged, public access through the White House was discontinued on January 22, 2025. We re-created Version 2 and made it publicly accessible."
screening-tools.com/climate-ec…

#Climate #Environment #OpenSource #Trump #USPol #USPolitics

in reply to petersuber

Update. "Today we [Harvard Law School @harvard_law Library Innovation Lab @harvardlil] released our archive of data.gov on Source Cooperative. The 16TB collection includes over 311,000 datasets harvested during 2024 and 2025, a complete archive of federal public datasets linked by data.gov. It will be updated daily as new datasets are added to data.gov. This is the first release in our new data vault project to preserve and authenticate vital public datasets for academic research, policymaking, and public use."
lil.law.harvard.edu/blog/2025/…

#DataGov #Libraries #OpenData #Preservation #Trump #USPol #USPolitics

in reply to petersuber

Update. "Federal data is disappearing. On Thursday, meet the teams working to rescue it and learn how you can help. Join the Internet Archive [@internetarchive] and the Library Innovation Lab [@harvardlil] on Feb. 13, 3pm Eastern for a special event exploring the terabytes of data they have already saved and how to access it."
muckrock.com/news/archives/202…

#Censorship #OpenData #Preservation #Takedowns #Trump #USPol #USPolitics

in reply to petersuber

@internetarchive

I've donated to the @internetarchive because retaining factual data is critical to ever restoring a normal, post-fascist world.

in reply to petersuber

Update. If you're following this thread, you should also follow the Data Rescue Project by visiting its web site and subscribing to its email list. It aims "to serve as a clearinghouse for #data rescue-related efforts and data access points for public US governmental data that are currently at risk." And it's #crowdsourced, which gives it a fighting chance to be comprehensive and up to date.
datarescueproject.org/about-da…

If you're on #Bluesky, also follow its account there.
bsky.app/profile/datarescuepro…

I'm very aware that a solo effort, like this Mastodon thread, doesn't scale to the size of this task and I welcome the arrival of a crowdsourced effort. I will use it and refer people to it.

#Censorship #Preservation #Takedowns #Trump #USPol #USPolitics

in reply to petersuber

Thanks for the resources. I'd encourage everyone to think locally, too -- it may not just be Federal data at risk. For instance, at a certain university, I understand there's a footer applied to each email sent out to students, staff, or faculty that includes the words "diversity" and "inclusion". That same university, I understand, also has pages about resources for LGBTQ+ people, and climate change, on their university web site. No doubt that university gets a lot of Federal money for research, etc, and so...
in reply to petersuber

Update. "As the US government removes health websites and data, here’s a list of non-government data alternatives and archives"
journalistsresource.org/home/a…

"There’s no perfect alternative to the government databases, but some non-governmental organizations have their own datasets, which can be useful to journalists. Several #journalism associations have also been downloading government data and making them available to their members. To help journalists with their continued reporting, we have curated a list of non-government websites that have health data, although some use government data to create their reports. We’ll continue to update this list. If you have a suggestion for a database, please email us."

h/t @kdnyhan

#Censorship #DefendResearch #Medicine #OpenData #Preservation #Takedowns #Trump #USPol #USPolitics

in reply to petersuber

Update. "Here’s why and how Public Environmental Data Partners [#PEDP] and others are making sure that the #climate science the public depends on is available forever."
theconversation.com/how-to-fin…

#Censorship #DefendResearch #OpenData #Preservation #Takedowns #Trump #USPol #USPolitics

in reply to petersuber

Update. "A Renewed Call for Preservation of At-Risk Government Data"
icpsr.umich.edu/web/about/cms/…

"The directors of the University of Michigan's Institute for Social Research (#ISR) and the Inter-university Consortium for Political and Social Research (#ICPSR) are emphasizing the critical need for preserving government data that may be at risk due to recent policy shifts....Through #DataLumos, an ICPSR archive for valuable government data resources, ICPSR is helping the data community to preserve, document, and disseminate thousands of files from agencies such as the Centers for Disease Control [#CDC] and the Department of Education [#DOE]."

#Censorship #DefendResearch #Preservation #Takedowns #Trump #USPol #USPolitics

in reply to petersuber

Update. The Data Rescue Project spotlights 13 college and university #libguides on projects to rescue scientific and govt data from #censorship or deletion.
datarescueproject.org/librarie…

#DefendResearch #DRP #Libraries #Takedowns #Trump #USPol #USPolitics

in reply to petersuber

Update. "The cost of losing government webpages and public data"
marketplace.org/shows/marketpl…

"Jack Cushman, director of the Harvard Library Innovation Lab [@harvardlil], has been preserving sites and data that went dark after executive orders from President #Trump. He underlines the importance of keeping digital copies or risk parting with 'our cultural memory.'"

#Censorship #DefendResearch #Podcast #Preservation #Takedowns #USPol #USPolitics

in reply to petersuber

Update. "Federal data is disappearing. Meet the teams working to rescue it and learn how you can help."
youtube.com/watch?v=hiZuKA-o4V…

"Since the start of the new #Trump administration, hundreds of federal data sets and government websites have gone offline without warning, sometimes returning with major changes and sometimes not returning at all. On February 13th, #MuckRock hosted an event with organizations that are helping lead the efforts to preserve the public’s data."

This is a video of the event.

#Censorship #DefendResearch #Preservation #Takedowns #Trump #USPol #USPolitics

in reply to petersuber

Update. "The fight to preserve federal government data."
muckrock.com/news/archives/202…

"Despite the Trump administration restricting access to these government sites and its underlying data, organizations and individuals have been working to preserve this data. Here are just a few of the efforts that [are] part of an ongoing effort to preserve public information."

#Censorship #DefendResearch #Preservation #Takedowns #USPol #USPolitics

in reply to petersuber

Update. "How the Wayback Machine is preserving outdated government websites."
cbsnews.com/video/how-the-wayb…

"The #WaybackMachine is helping preserve the record of government websites before they were changed by the Trump administration. CBS News Confirmed's Rhona Tarrant reports."

#Censorship #DefendResearch #InternetArchive #Preservation #Takedowns #USPol #USPolitics

in reply to petersuber

Apparently it is standard practice for them to do this with every administration change. It's just that this one is so mission-critical.
in reply to petersuber

Update. "Wayback Machine Saves Thousands of Federal Webpages Amid Purge of Government Data Under Trump"
democracynow.org/2025/2/28/int…

"Thousands of informational government webpages have been taken down so far in the second #Trump administration, including on public health, scientific research and LGBTQ rights. Amid this mass erasure of public information, the #InternetArchive is racing to save copies of those deleted resources."

#Censorship #DefendResearch #Preservation #Takedowns #USPol #USPolitics

in reply to petersuber

Interesting: the Wayback Machine was heavily targeted prior to that …
in reply to petersuber

Update. "Archivists Recreate Pre-Trump #CDC Website, Are Hosting It in Europe"
404media.co/archivists-recreat…

"A team of volunteer archivists has recreated the Centers for Disease Control website exactly as it was the day Donald Trump was inaugurated. The site, called RestoredCDC.org, went live Tuesday and is currently being hosted in Europe."

#Censorship #DefendResearch #Medicine #Preservation #Takedowns #Trump #USPol #USPolitics

in reply to petersuber

Update. "SAFE-Track: Secure Anonymous Federal Evidence, Data and Analysis Tracking"
datafoundation.org/pages/safet…

"The Data Foundation's SAFE-Track portal provides a secure, encrypted channel for documenting changes to federal evidence and #data activities. As a trusted, non-partisan authority on government data and evidence policy, the #DataFoundation maintains this platform to systematically understand and analyze impacts on America's evidence infrastructure…SAFE-Track enables confidential reporting through:
* Complete #anonymity for submissions
* End-to-end #encryption of all data
* No requirement for email or personal identification
* Option for secure follow-up communication through anonymous conversation codes
* Protections against collection of personally identifiable information."

#Censorship #DefendResearch #Preservation #Takedowns #Trump #USPol #USPolitics

in reply to petersuber

Update. From @kfitz: "Digital Preservation in a Time of Disorder"
about.hcommons.org/2025/03/19/…

"#KnowledgeCommons has applied for a significant grant from Lever for Change to build, implement, and sustain a digital preservation network that will be free from the US government’s, and any other single government’s, interference…KC is a #nonprofit, community-governed, #OpenAccess platform for creating and sharing knowledge world-wide…But, in the present moment, our US-centeredness is a significant threat to that mission. We propose to establish three linked but independent nonprofit public-benefit companies incorporated in the US, Europe, and South Africa, all dedicated to the social and technological processes of gathering, preserving, and ensuring the public accessibility of academic research."

#Censorship #DefendResearch #Preservation #Takedowns #USPol #USPolitics

in reply to petersuber

I think this new law would apply to the Heritage Foundation and Project 2025 as well, i.e., domestic *terrorist* groups.
in reply to petersuber

Update. "A group of US researchers funded through federal grants who wish to document the damage to our scientific #infrastructure. Our #data #archive is for use by journalists, academics, and the general public."
theresearchwelost.com/

#Censorship #DefendResearch #Preservation #Takedowns #Trump #USPol #USPolitics

in reply to petersuber

Update. "Trump’s ‘climate’ purge deleted a new extreme weather risk tool. We recreated it"
theguardian.com/environment/ng…

"#TheGuardian has recreated a searchable climate future risk tool developed by #FEMA but then deleted."

#Censorship #Climate #DefendResearch #OpenData #Takedowns #Trump #USPol #USPolitics

in reply to petersuber

Update. Speak up to help #AED reduce the odds that valuable public US govt datasets will be taken down.
essentialdata.us/

"Demonstrating the broad real-world value of federal data is the most strategic path to ensuring its continued flow. The goal of America's Essential Data is to make it easy for: … federal agency data stewards and their leadership to better understand the true value of their data, especially as it relates to administration priorities. Do you use a federal dataset that delivers important benefits for the American people? Help us tell the story of that dataset!"

#Censorship #DefendResearch #OpenData #Takedowns #Trump #USPol #USPolitics

in reply to petersuber

Update. "#SciOp is part of Safeguarding Research & Culture (#SRC). The bits must flow: let us resurrect the ancient art of #Bittorrent to ensure that our cultural, intellectual and scientific heritage exists in multiple copies, in multiple places, and that no single entity or group of entities can make it all disappear."
sciop.net/

#Censorship #DefendResearch #OpenSource #Preservation #Takedowns #Trump #USPol #USPolitics

in reply to petersuber

Update. "With the recent announcement that numerous datasets — such as those from #NOAA — are scheduled for decommissioning in May, #PANGAEA has opened its archive to help safeguard these valuable resources. If you become aware of any endangered datasets, please don't hesitate to contact us. PANGAEA data!"
pangaea.de/

#Censorship #DefendResearch #OpenData #Preservation #Takedowns #Trump #USPol #USPolitics

in reply to petersuber

Update. "ERICA is a rescue catalog which preserves over 500,000 Open Access publications originally hosted by the US Department of Education in the #ERIC research repository. ERIC was defunded on the 23 April 2025 and the maintenance contract is set to expire soon, meaning that ERIC is likely to shut down. The PDFs were rescued using the Internet Archive's #WaybackMachine by a volunteer of the #DataRescueProject. When you click on one of the publication ID links, you will be redirected to the archived PDF in the Wayback Machine. Feel free to host your own copy of ERICA by simply cloning this source code repo, which also includes the metadata for the catalog."
erica.datarescueproject.org/

#DefendResearch #Education #Preservation #Takedowns #Trump #USPol #USPolitics

in reply to petersuber

Update. Kudos to the Northwestern University Libraries for training people to take part in the #DataRescueProject. Other #libraries should follow suit.
dailynorthwestern.com/2025/04/…

#DefendResearch #Preservation #Takedowns #Trump #USPol #USPolitics

in reply to petersuber

Northwestern's Storymaps platform gives people without Omeka access or servers and thus without GIS software an opportunity to build important data narration tools. Northwestern's info work seems exceptionally community-minded. Kudos to them.
in reply to petersuber

Update. "Because of #Trump: [The German] Central Library of #Medicine builds alternative to US database."
heise.de/news/Wegen-Trump-Zent…

From Google's English: "The German National Library of Medicine (#ZBMed) has announced its intention to create an "open, reliable, and sustainable alternative" to the #PubMed database, one of the most important and comprehensive resources for biomedical literature worldwide. The meta-database, with references to relevant articles and over 38 million citations, is operated by the National Library of Medicine (#NLM), a division of the National Institutes of Health (#NIH) in the United States. ZB Med is responding to concerns that the US administration under Donald Trump is cutting funding for the NIH. There are also fears that political influences could compromise PubMed's scientific integrity."

#Censorship #DefendResearch #Germany #Preservation #Takedowns #Trump #USPol #USPolitics

in reply to petersuber

Trump's (and America's) xenophobia is going to lead to a new renaissance in European science.

Meanwhile America will slide into a new Dark Ages of superstition and disease.

#ThisIsAmericaNow

in reply to petersuber

at least one piece of news that makes me a proud German today!
in reply to petersuber

@astroPug
It might be possible to set up a Signal account to send these, anonymously encrypted.
in reply to petersuber

I was just thinking, with all of the badly worded edicts that King Don has been firehosing us with, if I were a sysadmin I would engage in malicious compliance just to wake people up and not sugarcoat the awfulness. Let the whole website return errors, take the whole thing down rather than hide info. Make people mad.

"Just doing what you told me to do…"

in reply to petersuber

@harvard_law@bird.makeup

Duplicate the data sets at multiple sites, several of them public.

in reply to petersuber

Thank you so much. I don't pretend to know that much about it all, but that folks are working on this is a comfort.
in reply to petersuber

The most interesting and compelling snippet: "sign as authentic."

Right before the new administration tried to rewrite history as well as ignoring its lessons.