Trending

#datarescue

Latest posts tagged with #datarescue on Bluesky

Latest Top
Trending

Posts tagged #datarescue

Post image

Big DRP shout-out to one of our volunteers who worked on this project to make the data files in AtlasPlus more accessible!
#DataRescue #HealthData #PublicDataisaPublicGood
@altcdc.altgov.info

www.datarescueproject.org/atlasplus-da...

5 4 0 0

Reminder to regularly back-up your @zotero.org data, as their online sync are hosted on Amazon Web Services.

“Nullius in Weba” (have no confidence in any web service)
#ESR #zotero #dataRescue #librarian #academia #digitalScholarship

5 6 1 0
Preview
Reconstructing nineteenth-century Danube river water levels with transformer-based computer vision Abstract. We convert nineteenth-century Bavarian Danube gauge charts (1826–1894) into daily water-level series referenced to gauge zero through a novel semi-automated workflow combining light document...

essd.copernicus.org/articles/18/...

#ecology #hydrology #datarescue #digitalhumanities

1 1 0 0

Everyone in the DRP knows how wonderful and incredible @lyndamk.bsky.social is. We are thrilled for her and honored that she is representing #DataRescue at this international level. Congrats Lynda!!

8 5 3 0
Post image

New Journal Article (Essay): "Things Fall Apart: Lessons From a Defunded Data Repository" (via Data Science Journal, @codata-isc.bsky.social ) #SEDAC #CIESIN #repositories #datarescue @datarescueproject.org

4 1 0 0
bob from bobs burgers saying "i mean, i dont want to oversell it, but it changes you"

bob from bobs burgers saying "i mean, i dont want to oversell it, but it changes you"

Welcome back to #MemeMonday!

Our volunteers are bun-believably great! Interested in joining us? Check out our website for more information on how to get started.

www.datarescueproject.org/faq/

#DataRescue

5 2 0 0

In this paper, I propose the concept of anticipatory maintenance to show that #DataRescue (2016/17) was not just “rescue” but maintenance work: future-oriented care to keep data accessible, acting as if loss had already begun by building redundancy and decentralized preservation arrangements.

1 0 0 0
Post image

My paper is out in @bigdatasoc.bsky.social : “Emergency curation as anticipatory maintenance: Lessons from the 2016/2017 #DataRescue movement.” doi.org/10.1177/2053...

2 2 1 0

All of this. Fantastic keynote from @lyndamk.bsky.social and @mikalarae.bsky.social!

Mikala said it best (check the original post's replies): the DRP community, everyone involved, continuously reminds me that "hope is resistance."

#DataRescue #USFederalData

4 0 0 0
DIY Web Archiving | Zine Bakery Zine Bakery Bakeshop #2, by Quinn Dombrowski, Tessa Walsh, Anna Kijas, Ilya Kreymer, and Amanda Wyatt Visconti

For #LoveData26 week, #ICYMI for 2 free data advocacy zines: "DIY Web Archiving" to protect data (& other online things!) you care about 🔗⬇️, & a #datarescue zine documenting the recently removed Nat'l Park exhibit on the people Washington enslaved zinebakery.com/bakeshop/cen...

12 8 0 0
Banner with the words "Love Data Week" in bold letters. The word "Love" is formed with a heart pattern. Below, a hashtag reads "#LoveData26."

Banner with the words "Love Data Week" in bold letters. The word "Love" is formed with a heart pattern. Below, a hashtag reads "#LoveData26."

Where did the data go?
Since 2025, federal data on health & climate has been disappearing from gov websites. For #LoveData26, librarian Laura Hjerpe explores the legal fight to restore access and how you can help rescue at-risk info:
https://library.virginia.edu/news/2026/wheres-data
#DataRescue

5 2 0 1
Preview
Internet Archive Adds Searchable Access to Archived Pages From the CIA World Factbook Mark Graham at the Internet Archive tells us that searchable access to more than 18,000 archived pages from the CIA World Factbook found in The Wayback Machine’s collection are now available online…

Internet Archive Adds Searchable Access to Archived Pages From the CIA World Factbook - Library Journal infoDOCKET www.infodocket.com/2026/02/08/i...

#DataRescue

2 1 0 1
Preview
Data Rescue Projects receives support from the John D. and Catherine T. MacArthur Foundation to support data rescue efforts FOR IMMEDIATE RELEASE Since launching in February 2025, the Data Rescue Project has grown substantially. At this point, the DRP has enabled the rescue of more than 1,000 datasets from US Federal webs...

#Opendata and #datarescue
www.datarescueproject.org/data-rescue-...

#publicinteresttechnology #publicinteresttech #civictech #datascience

0 0 0 0
Post image

Congrats!!! The Data Rescue Project is the Winner of the 2025 RDAP Work of the Year Award rdapassociation.org/news/13593532 #datarescue @datarescueproject.org @rdapassociation.bsky.social

6 2 0 0
A photo of gritty that says "Me and all my rescued federal data"

A photo of gritty that says "Me and all my rescued federal data"

The DRP Steering Committee is in Philadelphia for a retreat, so here's a Philadelphia-based #DataRescue meme

20 4 1 2

Going to be thinking about more #DataRescue work that can be zineified: both for LOCKSS (lots of copies keeps stuff stafe! free distributed physical copies=definitely part of that) + part of preservation is how many folks stay aware of and read/use a thing

5 1 0 0
Preview
SOS! Help Save Our Signs Today The Save Our Signs Project team needs your help – and soon. The Washington Post reported on Tuesday morning that sites across the country have been targeted for erasure, including Grand Canyon, Glacie...

#censorship #politics #signs #DataRescue

'The Washington Post reported on Tuesday morning that sites across the country have been targeted for erasure...Save Our Signs needs your help to document these educational materials before they disappear.'

www.datarescueproject.org/sos-help/

0 0 0 0
Preview
DRP AMA 2026 Questions This form is to collect questions for the Data Rescue Project's first "Ask Me Anything!" to be held on Bluesky January 30, 6-8pm Eastern. If you can't attend or on Bluesky, don't worry-- you can…

Have any #DataRescue burning questions?? We have just the thing...

We're hosting an #AMA. Join us on January 30, 6-8pm Eastern through bluesky! Please submit your questions through our form.

If you can't make it, there is an option to receive a response via email!

5 3 0 2
Preview
Government’s historic role as trusted information source is under threat The U.S. government for decades has been the world’s leading provider of reliable data. Many researchers wonder if that is still the case.

Posted within minutes of each other: one article about the #DataRescue movement and one about the #SaveOurSigns efforts.

@washingtonpost.com

www.washingtonpost.com/politics/202...

www.washingtonpost.com/entertainmen...

8 1 0 0
Preview
Backing up Spotify We backed up Spotify (metadata and music files). It’s distributed in bulk torrents (~300TB). It’s the world’s first “preservation archive” for music which is fully open (meaning it can easily be mirro...

#Spotify #music #metadata #DataRescue

'It’s the world’s first “preservation archive” for music which is fully open (meaning it can easily be mirrored by anyone with enough disk space), with 86 million music files, representing around 99.6% of listens.'

annas-archive.li/blog/backing...

1 1 0 0
Preview
Federal GIS Data Saved and Archived The following post was written by Frank Donnelly, the Head of GIS and Data Services at the Brown University Library and one of our DRP rescuers! The Data Rescue Project is pleased to announce that we...

New From @datarescueproject.org: Federal #GIS #Data Saved and Archived www.datarescueproject.org/hifld-data-s... #datarescue

0 0 0 0
Preview
Making 10M government PDF documents searchable Government organizations love to distribute documents as PDF files. They are easy to forward and to print. The problem is when you want to find and access them later among millions of other files. …

Making 10M government PDF documents searchable flowingdata.com/2025/11/26/m...

"The code for GovScape is open source and available on GitHub."

#OpenData #OpenGov #OCR #DataRescue #GovDocs

1 1 0 0
Preview
Guest Post: A Day in the Life with Federal Government Data Today, we have the fourth post in the series from Claire McKay Bowen and Aaron R. Williams to help diverse audiences understand and support the federal statistical system. Everyone living in the…

We often discuss how public data influences our everyday lives whether we acknowledge it or not. This week's guest article highlights your daily interactions with public data: www.datarescueproject.org/guest-post-a...
#PublicData #DataRescue

5 2 0 0
Preview
Interest Form: Repository Crisis Scorecards Thanks for your interest in the Repository Crisis Scorecards (RCS) project, funded by the Sloan Foundation. This form is designed to gather general interest from individuals who want to stay informed…

What can you do?
🤝 Sign up to let us know you’re interested: www.esipfed.org/rcs-interest
🔄 Repost to help us spread the word.
🔗 Learn more about the project: www.esipfed.org/repository-r...

#DataCenters #DataStewards #DataManagement #BigData #OpenData #DataRescue #DigitalPreservation

0 0 0 1
Preview
How the Paris-Saclay library supports research software - Software Heritage Paris-Saclay’s ADAC Cédric Mercier details their strategy for research data/code management and Software Heritage sponsorship.

How @univparissaclay.bsky.social library supports research software www.softwareheritage.org/2025/11/14/paris-saclay-... #coderescue #datarescue

1 1 0 0

This project has had great engagement so far with tens of thousands of classifications.

We'd love for even more help to improve our understanding of the past and future climates of Africa. Click on the link in the quoted message to get involved and spread the word! #datarescue

0 1 0 0
Preview
NCES Datalabs Tables: Rescue Complete! If you have been following along with the Data Rescue Project newsletter, you have been receiving occasional updates on our struggle to find and download all of the summary tables that were created by...

New post on our efforts to back up the NCES Datalabs tables. Big thanks to all of the volunteers involved in backing these up! #DataRescue

www.datarescueproject.org/nces-datalab...

10 3 0 1
Data rescue for World Digital Preservation Day 2025 Today, Thursday 6 November 2025 if I actually manage to finish and publish this today, is World Digital Preservation Day so I thought I would try and get a blog post out about some work I’ve been doing to rescue at-risk data. I’ve briefly mentioned this in my post about Library of Congress Subject Headings but not in much detail. The project is Safeguarding Research & Culture and I got involved back in March or April when Henrik reached out on social media looking for someone with library & metadata experience to contribute. I said that I wasn’t a Real Librarian but I’d love to help if I could, and now here we are. The concept is simple: download public datasets that are at risk of being lost, and replicate them as widely as possible to make them hard to destroy, though obviously there’s a lot of complexity buried in that statement. When the Trump administration first took power, there were a lot of people around the world worried about this issue and wanting to help, so while there are a number of institutions & better resourced groups doing similar things, we aim to complement them by mobilising grassroots volunteers. Downloading data isn’t always straightforward. It may be necessary to crawl an entire website, or query a poorly-documented API, or work within the constraints of rate-limiting so as not to overload an under-resourced server. That takes knowledge and skill, so part of the work is guiding and mentoring new contributors and fostering a community that can share what they learn and proactively find and try out new tools. We also need people to be able to find and access the data, and volunteers to be able to contribute their storage to the network. We distribute data via the venerable BitTorrent protocol, which is very good at defeating censorship and getting data out to as many peers as possible as quickly as possible. To make those torrents discoverable, our dev team led by the incredible Jonny have built a catalogue of dataset torrents, playfully named SciOp. That’s built on well-established linked data standards like DCAT, the Data Catalogue Vocabulary, so the metadata is standardised and interoperable, and there’s a public API and a developing commandline client to make it even easier to process and upload datasets. There are even RSS and RDF feeds of datasets by tag, size, threat status or number of seeds (copies) in the network that you can plug into your favourite BitTorrent client to automatically start downloading newly published datasets. There are even exciting plans in the works to make it federated via ActivityPub, to give us a network of catalogues instead of just a single one. We’re accidentally finding ourselves needing to push the state of the art in BitTorrent client implementations. If you’re familiar with the history of BitTorrent as a favoured tool for _ahem_ less-than-legal media sharing, it probably won’t surprise you that most current BitTorrent clients are optimised for working with single audio-visual streams of about 1 to 2½ hours in length. Our scientific & cultural data is much more diverse than that, and the most popular clients can struggle for various reasons. In many cases there are BEPs (BitTorrent Enhancement Proposals) to extend the protocol to improve things, but these are optimal features that most clients don’t implement. The collection of BEPs that make up “BitTorrent v2” is a good example: most clients don’t support v2 well, so most people don’t bother making v2-compatible torrents, but that means there’s no demand to implement v2 in the clients. We are planning to make a scientific-grade BitTorrent client as a test-bed for these and other new ideas. Myself I’m running one of a small number of “super” nodes in the swarm, with much more storage available than the average laptop or desktop, and often much better bandwidth too. That’s good, because some of our datasets run to multiple terabytes, plus to ensure new nodes can get started quickly we need to have some always-on nodes with most of the data available to others. Since BitTorrent is truly peer-to-peer, it doesn’t matter how many people have a copy of a given dataset, if none of them are online no-one else can access it. This is all very technically interesting, but communications, community, governance, policy, documentation, funding are also vitally important, and for us these are all works in progress. We need volunteers to help with all of this, but especially those less-technical aspects. If you’re interested in helping, please drop us a line at contact@safeguar.de, or join our community forum and introduce yourself and your interests. If you want to contribute but don’t feel you have the time or skills, well, to start with we’re more than happy to show you the ropes and help you get started, but as an alternative, I’m running one of those “super” nodes and you can contribute to my storage costs via GoFundMe: even a few quid helps. I currently have 3x 6TB hard drives with no space to mount them, so I’m currently in need of a drive cage to hold them and plug them into my server. Special shout-out also to our sibling project, the Data Rescue Project, who are doing amazing work on this and often send us requests for websites or complex datasets for our community to save. I’ve barely scratched the surface here, but I _really_ want to actually get this post out for WDPD so I’m going to stop here and hopefully continue soon!

I did it! Here's my post on @SafeguardingResearch #DataRescue and distributed #DigiPres for #WDPD2025! https://erambler.co.uk/blog/wdpd2025-data-rescue/

Shoutout to @lavaeolus & @jonny, plus @datarescueproject.org

1 6 1 0

Congrats to the @datarescueproject.org!
#DataRescue

2 0 0 0