Or do you use anything else to archive the mighty www?
I tried a lot of self-hosted read-it later services, but they all have some wired issues when scrapping some specific websites with discussion (like github, stackoverflow…) so I gave up on them.
For bookmarking and archiving I use Linkding.
For text processing and archiving I use singlefile + zotero.
Yep, been self-hosting it locally for a while now. To put simply, I archive anything that is within my personal realm of interest that I believe has a chance to be deleted, and is important to keep a copy of. It could be troubleshooting tips for specific tech issues, things that may be under threat of takedown, or maybe just an article I like and want a local copy of. It’s a wonderful tool.
I have it on my computer, but I dislike that they keep turning it more and more into a service that’s supposed to run 24/7. Liked it better when it was usable as a bunch of HTML files.
It’s great otherwise. I archive unofficial repair guides for stuff I own, news articles that are directly relevant to my life (like something big that happened nearby or something I was a part of), articles that etched in my memory and I would like to see them again.
Wasn’t aware of it, had a brief look at their site - can this share the archive with others, or is it on a roadmap to do so?
I feel like there’s a missed opportunity there…?
ArchiveBox is great.
I’m big into retro computing and general old electronics shit, and I archive everything I come across that’s useful.
I just assume anything and everything on some old dude’s blog about a 30 year old whatever is subject to vanishing at any moment, and if it was useful once, it’ll be useful again later probably so fuck it, make a copy of everything.
Not like storage is expensive, anyway.
I archive blog posts mostly. Nice to have them more than bookmarked and i’ve had many smaller blog just vanish over the years.
Sometimes i use grab-site for full domain captures and a simple wget -p -k for less demanding sites.
I have a project like it. Lots of collective commons, free books, lots of things without copyright. It’s a box anyone can get into in a localized area. On a pi zero w. Fun little project to put together.
Lots of Wikipedia and text to be honest.