ArchiveBox is a powerful, self-hosted internet archiving solution to collect, save, and view sites you want to preserve offline.
You can set it up as a command-line tool, web app, and desktop app (alpha), on Linux, macOS, and Windows.
You can feed it URLs one at a time, or schedule regular imports from browser bookmarks or history, feeds like RSS, bookmark services like Pocket/Pinboard, and more. See input formats for a full list.
It saves snapshots of the URLs you feed it in several formats: HTML, PDF, PNG screenshots, WARC, and more out-of-the-box, with a wide variety of content extracted and preserved automatically (article text, audio/video, git repos, etc.). See output formats for a full list.
The goal is to sleep soundly knowing the part of the internet you care about will be automatically preserved in durable, easily accessable formats for decades after it goes down.
In the age of the 4K smart TV, audio-visual companies are looking forward to ever-greater digital innovations. Few are looking back at the cavalcade of tape formats that came before, which never had the same desirable aesthetic that made film so enduringly popular.
Mr Ficker says it is “very unlikely” that archives around the world could raise the capital to have tape machines manufactured again, even if they worked together.
But he said he “hadn’t written it off”, because “if that’s what it takes, will then we will be pursuing those strategies”.
He also suggests future digital innovations might make it possible to read the data off magnetic tapes in a different way, using software to reconstruct the images.
This week the Berkman Center at Harvard announced they will be closing their blogging platform, the first of its kind that has been around since 2003. While rumors of bloggings death are perhaps greatly exaggerated, Jim and Tim chat about the historical precedence for institutional platforms like WordPress multisite, their role in the greater landscape of digital identity and web publishing, and how that has changed over time.
Archiving and digital legacy [ought] to be in the air.
The bava has been at it; Jim has been cleaning up his pile pf past webs, like an abandoned Known and the OpenVA web site. He’s been writing about Archiving his Digital Past and the concept of an Archive of One’s Own.
I’ve been harping s…