I found myself wanting to test out the v0.8 release candidate for ArchiveBox. Since setting a custom port is very easy, running multiple instances of ArchiveBox side by side is trivial:
uv tool run archivebox@0.8.5rc50 init
uv tool run archivebox@0.8.5rc50 manage createsuperuser
uv tool run archivebox@0.8.5rc50 server 6789
We are up and running!
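As a minimal sketch of the multiple-instances idea, the snippet below checks whether a port is already taken and builds the same `server` command shown above for a given port. The helper names `port_is_free` and `server_command` are made up for illustration and are not part of ArchiveBox or uv.

```python
import socket

VERSION = "0.8.5rc50"  # the release candidate used in the commands above


def port_is_free(port, host="127.0.0.1"):
    """Return True when nothing accepts TCP connections on host:port."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.settimeout(1.0)
        # connect_ex returns 0 only when a listener accepted the connection.
        return sock.connect_ex((host, port)) != 0


def server_command(port, version=VERSION):
    """Build the `uv tool run archivebox@VERSION server PORT` argument list."""
    return ["uv", "tool", "run", f"archivebox@{version}", "server", str(port)]


# Only prints the command; nothing is launched here.
print(" ".join(server_command(6789)))
```

The resulting list could be handed to `subprocess.run` from inside the data directory of whichever instance should listen on that port.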

Let's set up the rest of the dependencies:
uv tool run archivebox@0.8.5rc50 install
🧪 RSS Parsing
The improvements to RSS parsing are of particular interest to me.
Let's attempt to back up my toots:
uv tool run archivebox@0.8.5rc50 add --parser=rss --depth=1 https://infosec.exchange/@brie.rss
No dice: AttributeError: object has no attribute 'title'.
uv tool run archivebox@0.8.5rc50 add --parser=rss --depth=1 https://brie.dev/rss.xml
The command completed successfully after several minutes, but errors were reported along the way and nothing appears in the UI:
AttributeError: object has no attribute 'updated_parsed'
I was successful in parsing my private and public Pinboard feeds.
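One way to investigate errors like the two above is to check which feed items lack the fields a parser might expect. This is a minimal sketch using only the standard library. The helper `items_missing_fields`, the sample feed, and the guess that a missing `<title>` or `<pubDate>` element is behind the errors are all assumptions, not something confirmed from ArchiveBox's code.

```python
import xml.etree.ElementTree as ET


# Hypothetical helper, not part of ArchiveBox: report which RSS <item>
# elements lack the child tags a parser might expect.
def items_missing_fields(rss_text, required=("title", "pubDate")):
    root = ET.fromstring(rss_text)
    report = []
    for index, item in enumerate(root.iter("item")):
        missing = [tag for tag in required if item.find(tag) is None]
        if missing:
            report.append((index, missing))
    return report


# Made-up sample feed: the second item has no <title> and no <pubDate>.
SAMPLE = """<rss version="2.0"><channel>
<item><title>Post</title><link>https://example.com/a</link><pubDate>Tue, 01 Oct 2024 00:00:00 GMT</pubDate></item>
<item><link>https://example.com/b</link><description>A toot without a title</description></item>
</channel></rss>"""

print(items_missing_fields(SAMPLE))  # [(1, ['title', 'pubDate'])]
```

Running the same check against a downloaded copy of a feed would show whether the untitled or undated items line up with the errors.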

It is interesting to observe some of the flags that are passed to Google Chrome:
--virtual-time-budget=15000
--disable-features=DarkMode
--run-all-compositor-stages-before-draw
--hide-scrollbars
--autoplay-policy=no-user-gesture-required
--no-first-run
--use-fake-ui-for-media-stream
--use-fake-device-for-media-stream
"--simulate-outdated-no-au='Tue, 31 Dec 2099 23:59:59 GMT'"
--screenshot "https://brie.dev/troubleshooting/"
🧪 Webhooks
Let's try adding one via the UI! It works. It looks like the version is not interpolated into the user-agent that is printed in the webhook payload; the {VERSION} placeholder is left as-is:
"--user-agent",
"Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/128.0.0.0 Safari/537.36 ArchiveBox/{VERSION} (+https://github.com/ArchiveBox/ArchiveBox/)"
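A literal `{VERSION}` usually means a template string was emitted without being formatted. The sketch below illustrates that symptom and a simple check for it, assuming the user-agent is an ordinary Python format template. That is a guess on my part, and both function names are hypothetical rather than ArchiveBox APIs.

```python
# The user-agent string as it appeared in the webhook payload.
UA_TEMPLATE = (
    "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 "
    "(KHTML, like Gecko) Chrome/128.0.0.0 Safari/537.36 "
    "ArchiveBox/{VERSION} (+https://github.com/ArchiveBox/ArchiveBox/)"
)


def render_user_agent(template, version):
    """Fill in the {VERSION} placeholder (hypothetical helper)."""
    return template.format(VERSION=version)


def has_unrendered_placeholder(user_agent):
    """Detect a user-agent that still carries the raw placeholder."""
    return "{VERSION}" in user_agent


rendered = render_user_agent(UA_TEMPLATE, "0.8.5rc50")
print(has_unrendered_placeholder(UA_TEMPLATE))  # True
print(has_unrendered_placeholder(rendered))     # False
```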
Next Steps
For my production ArchiveBoxes, I would want to parse the webhook payload and filter things so that…
- I am notified of some failures
- I get a digest of successful ingestions and updates
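The filtering described above could be sketched roughly like this. The payload shape is invented for the example: the field names `event`, `status`, and `url`, and the values `failed` and `succeeded`, are placeholders and not ArchiveBox's documented webhook schema.

```python
from collections import Counter


# All field names below ("event", "status", "url") are hypothetical
# placeholders, not ArchiveBox's documented webhook payload.
def triage(payloads):
    """Split payloads into failures to alert on and a digest of successes."""
    failures = [p for p in payloads if p.get("status") == "failed"]
    successes = [p for p in payloads if p.get("status") == "succeeded"]
    digest = Counter(p.get("event", "unknown") for p in successes)
    return failures, dict(digest)


sample = [
    {"event": "snapshot.created", "status": "succeeded", "url": "https://example.com/a"},
    {"event": "snapshot.updated", "status": "succeeded", "url": "https://example.com/a"},
    {"event": "snapshot.created", "status": "failed", "url": "https://example.com/b"},
]

failures, digest = triage(sample)
print([p["url"] for p in failures])  # ['https://example.com/b']
print(digest)  # {'snapshot.created': 1, 'snapshot.updated': 1}
```

Failures would go out as notifications right away, while the digest could be collected and sent on a schedule.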
Observations
- The UI is a bit prettier and that's nice!
- Things go better when all dependencies have been met.
🕸️ HTTP 500
I can induce an HTTP 500 by…
- attempting to delete a snapshot with any data saved
- looking at any /change URL
- leaving Referenced model blank when adding a webhook
READmore
- v0.8.5-rc: Prettier + faster CLI for InstalledBinaries, Machines/NetworkInterfaces health now audit logged
- PR: Add Webhooks support to new REST API
- docs: class archivebox.api.models.OutboundWebhook(*args: Any, **kwargs: Any)
Congrats on being an enthusiastic internet archiver!
