Or do you use anything else to archive the mighty www?
Yep, been self-hosting it locally for a while now. To put simply, I archive anything that is within my personal realm of interest that I believe has a chance to be deleted, and is important to keep a copy of. It could be troubleshooting tips for specific tech issues, things that may be under threat of takedown, or maybe just an article I like and want a local copy of. It’s a wonderful tool.
I have a project like it. Lots of collective commons, free books, lots of things without copyright. It’s a box anyone can get into in a localized area. On a pi zero w. Fun little project to put together.
Lots of Wikipedia and text to be honest.
ArchiveBox is great.
I’m big into retro computing and general old electronics shit, and I archive everything I come across that’s useful.
I just assume anything and everything on some old dude’s blog about a 30 year old whatever is subject to vanishing at any moment, and if it was useful once, it’ll be useful again later probably so fuck it, make a copy of everything.
Not like storage is expensive, anyway.
I tried a lot of self-hosted read-it later services, but they all have some wired issues when scrapping some specific websites with discussion (like github, stackoverflow…) so I gave up on them.
For bookmarking and archiving I use Linkding.
For text processing and archiving I use singlefile + zotero.
Yeah, I feel the same way. I also settled on singlefile. I save them to my NAS in a organized directory. My NAS directories are mounted on all my computers so I just have a FireFox bookmark to that local directory and I can seemlessly browse and open them.
I have it on my computer, but I dislike that they keep turning it more and more into a service that’s supposed to run 24/7. Liked it better when it was usable as a bunch of HTML files.
It’s great otherwise. I archive unofficial repair guides for stuff I own, news articles that are directly relevant to my life (like something big that happened nearby or something I was a part of), articles that etched in my memory and I would like to see them again.
Wasn’t aware of it, had a brief look at their site - can this share the archive with others, or is it on a roadmap to do so?
I feel like there’s a missed opportunity there…?
I’m trialing it but I’m also thinking of trying Linkwarden to compare.
I have linkwarden (I mainly save recipes tbh) and I like it a lot. There’s some parts of the ui that could be better, but overall it’s easy to setup and use and pretty intuitive
I archive blog posts mostly. Nice to have them more than bookmarked and i’ve had many smaller blog just vanish over the years.
Sometimes i use grab-site for full domain captures and a simple wget -p -k for less demanding sites.
I didn’t even know this existed. I’m so testing this out! Thanks for the post.
Yeah me neither. It’s going in my homelab this afternoon.
But what are you both thinking you’ll archive?
Every website I bookmark in case it goes away