Saturday, February 11, 2023

How to Create an Internet Wayback Machine with ArchiveBox Software

ArchiveBox can archive URLs either from the command line or through its web UI.

To archive a single URL from the command line, pass the URL as an argument:

$ archivebox add https://example.com/some/page

Or,

$ echo https://example.com/some/page | archivebox add

Example:

$ archivebox add https://github.com/ArchiveBox/ArchiveBox


To archive a list of URLs from a text file, run:

$ archivebox add < archive_urls.txt

Or,

$ cat archive_urls.txt | archivebox add

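The input does not have to be a plain text file. You can also import a browser bookmarks export, a Pinboard JSON export, or a feed such as Pocket's: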

$ archivebox add ~/Downloads/browser_bookmarks.html

Or,

$ archivebox add ~/Downloads/pinboard_bookmarks.json

Or,

$ curl https://getpocket.com/users/USERNAME/feed/all | archivebox add
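Before piping a hand-maintained list into ArchiveBox, it may help to clean it up first. The following is a minimal sketch, not part of ArchiveBox itself: it assumes a hypothetical raw file (raw_urls.txt) with blank lines, comments, and duplicates, and writes the cleaned result to archive_urls.txt. The filtering step and the sample URLs are assumptions for illustration; only the final `archivebox add < file` call comes from the commands above, and it is skipped when ArchiveBox is not installed.

```shell
#!/bin/sh
# Work in a temporary directory so the sketch does not touch real files.
workdir=$(mktemp -d)

# Hypothetical raw list: placeholder URLs, a blank line, a comment, and a duplicate.
cat > "$workdir/raw_urls.txt" <<'EOF'
https://example.com/some/page

# a comment line
https://github.com/ArchiveBox/ArchiveBox
https://example.com/some/page
EOF

# Keep only lines that start with http:// or https://,
# then drop duplicates while preserving the original order.
grep -E '^https?://' "$workdir/raw_urls.txt" | awk '!seen[$0]++' > "$workdir/archive_urls.txt"

url_count=$(wc -l < "$workdir/archive_urls.txt" | tr -d ' ')
echo "Prepared $url_count URLs"

# Feed the cleaned list to ArchiveBox only if the command is available.
if command -v archivebox >/dev/null 2>&1; then
    archivebox add < "$workdir/archive_urls.txt"
else
    echo "archivebox not found; skipping the add step"
fi
```

With the sample list above, the script reports 2 prepared URLs. Run it from inside your ArchiveBox data directory so that the add step has a collection to write to.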

You can also add --depth=1 to any of the above commands to archive the given URLs along with every URL they link to (one hop away).

$ archivebox add --depth=1 < ~/Downloads/bookmarks_export.html
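If you switch between plain and one-hop imports often, a small wrapper can make the depth explicit. This is only a sketch: archive_with_depth is a hypothetical helper, not an ArchiveBox command, and the check that depth must be 0 or 1 is an assumption about the values the --depth option accepts. When ArchiveBox is not installed, the helper prints the command it would have run instead of executing it.

```shell
#!/bin/sh
# archive_with_depth LIST_FILE [DEPTH]
# Hypothetical helper: runs "archivebox add" on a list file with an explicit depth.
archive_with_depth() {
    list_file=$1
    depth=${2:-0}

    # Assumption: only depth 0 (listed URLs only) and 1 (one hop away) are valid.
    case "$depth" in
        0|1) ;;
        *) echo "unsupported depth: $depth" >&2; return 2 ;;
    esac

    if [ ! -r "$list_file" ]; then
        echo "cannot read $list_file" >&2
        return 1
    fi

    if command -v archivebox >/dev/null 2>&1; then
        archivebox add --depth="$depth" < "$list_file"
    else
        # Dry run: show the command instead of executing it.
        echo "archivebox add --depth=$depth < $list_file"
    fi
}

# Usage with a throwaway list containing one placeholder URL.
sample_list=$(mktemp)
echo "https://example.com/some/page" > "$sample_list"
archive_with_depth "$sample_list" 1
archive_with_depth "$sample_list" 5 2>/dev/null || echo "depth 5 rejected"
```

Keep in mind that a depth of 1 on a large bookmarks export can queue many more pages than the file itself lists, so it is worth trying on a short list first.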
