ArchiveBox is a powerful, self-hosted internet archiving solution to collect, save, and view sites you want to preserve offline.
You can set it up as a command-line tool, web app, and desktop app (alpha), on Linux, macOS, and Windows.
You can feed it URLs one at a time, or schedule regular imports from browser bookmarks or history, feeds like RSS, bookmark services like Pocket/Pinboard, and more. See input formats for a full list.
It saves snapshots of the URLs you feed it in several formats: HTML, PDF, PNG screenshots, WARC, and more out-of-the-box, with a wide variety of content extracted and preserved automatically (article text, audio/video, git repos, etc.). See output formats for a full list.
The goal is to sleep soundly knowing the part of the internet you care about will be automatically preserved in durable, easily accessable formats for decades after it goes down.
30 years on, SEO and social media silos have replaced pre-web visions of linking
It’s time we brought back the hyperlink and learned how to really use it. It’s time we used information abundance to our advantage. And it’s time we disentangled our communications from platforms tuned for the spread of disinformation. The health of our democracies just might depend on it.
The oldest and simplest of internet technologies, the hyperlink and the “new” kind of text it affords — hypertext — is the foundational language of the internet, HyperText Markup Language (HTML). Hypertext connects all the disparate pieces of the web together. And it’s Sci-Fi name isn’t an accident. It’s hyperdrive for the internet, bending information space so that any user can travel galaxy-scale information distances with a small movement of a finger. The hyperlink still remains one of the most powerful elements of the web. In fact, I’d argue that the hyperlink is our most potent weapon in the fight against disinformation.
This potential though is being challenged by platforms that keep users trapped within. This is something that Chris Aldrich touched upon in a recent post about Facebook:
The note post type has long since fallen by the wayside and I rarely, if ever, come across people using it anymore in the wild despite the fact that it’s a richer experience than traditional status updates. I suspect the Facebook black box algorithm doesn’t encourage its use. I might posit that it’s not encouraged as unlike most Facebook functionality, hyperlinks in notes on desktop browsers physically take one out of the Facebook experience and into new windows!
Unless it contains sensitive information, publish your work to a public URL that can be referenced by others. This allows ideas to build upon one another in a ‘slow hunch’ fashion. Likewise, with documents and other digital artefacts, publish and then share rather than deal with version control issues by sending the document itself.
Another approach is a federated system, such as Mike Caulfield’s Wikity theme.