this post was submitted on 15 Jun 2023
Technology
A friend of mine wrote about data preservation on the internet in a blog post, which I consider a good read. Sure, a lot gets lost, but as he says in the post, that's mostly gonna be trash content; the good stuff is generally comparatively well archived because people care about it.
That is likely true for a majority of "the good stuff", but making that determination can be tricky. Let's consider spam emails. In our daily lives they are useless, unwanted trash. However, it's hard to know what a future historian might be able to glean from a complete record of all spam in the world over the span of a decade. They could analyze it for social trends, countries of origin, correlation with major global events, the creation and destruction of world governments. Sometimes the garbage of the day becomes a gold mine of source material that new conclusions can be drawn from many decades down the road.
I'm not proposing that we preserve all that junk; it's junk, without a doubt. But it's not always possible for a person today to know what's going to be valuable to society tomorrow.
I wonder if one of the things that tends to get filtered out in preservation is proportion.
When we willfully save things, we tend to pick either representative specimens or rarities chosen explicitly because they're rare or "special". Either way, we end up with a sample whose proportions no longer match the original material.
Coin collections disproportionately contain rare dates. Weird and unsuccessful locomotives clutter railway museums. I expect that historians reading email archives in 2250 will see a far lower spam proportion than actually existed.