WACZ (Web Archive Collection Zipped) software

Richard L. Hamilton rlhamil at smart.net
Mon Mar 27 21:25:09 UTC 2023


I gather that not unlike a Java jar file, a .wacz file is a zip file with additional requirements as to layout (and that some components be stored (uncompressed) rather than subjected to "deflate" compression (those being mostly compressed already).

If that's the case, you could probably rename it to have a .zip suffix and then unzip it (preferably in a new empty directory, so it wouldn't leave a mess in a directory that already had other files and directories in it).

What you do with it then (to "replay" it or whatever), I have no idea without actually getting one and looking at it and trying some things. Although I gather you can use the site https://replayweb.page <https://replayweb.page/> to select an already downloaded .wacz file and replay it. That might work better with Chrome than Safari, none of this is anything I've tried.

If you simply want a static representation of what a page seen via the Wayback Machine looks like, you might do better to "print" it (possibly in reader view) to a pdf file, which you could read with Preview, Adobe Reader, etc.

> On Mar 27, 2023, at 17:07, Eric Gallager via macports-users <macports-users at lists.macports.org> wrote:
> 
> So, the Internet Archive has recently added an "Email me a WACZ file
> with the results" option to their "Save Page Now" service in the
> Wayback Machine, so I tried that out and got some WACZ files, although
> now I don't know what to do with them. Is anyone aware of any software
> for handling WACZ files that's available in MacPorts? Or, if there
> isn't any yet, could some be added?
> More info on the format can be found here:
> https://replayweb.page/docs/wacz-format
> There are some python tools for interacting with the format, but I
> couldn't get pypi2port to generate a Portfile for me for them, and
> plus there are kind of too many python things in MacPorts anyways:
> https://github.com/webrecorder/py-wacz
> Anything else?
> Thanks,
> Eric Gallager
> 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.macports.org/pipermail/macports-users/attachments/20230327/6da4e533/attachment.htm>


More information about the macports-users mailing list