When you need to pull a static copy of a site, whether for development or archival purposes, it’s useful to have a tool that can resolve links and update them for local browsing.
One such tool is httrack (http://www.httrack.com/). It can pull a local mirrored copy of a website, retaining all the links and relevant web server resources.
Since there is no simple way to install httrack using yum on Amazon Linux, you have to build from source. Don’t be afraid! Follow these steps to ensure the build is done right:
- Download the Linux source from here (http://www.httrack.com/page/2/en/index.html)
- Extract the source
tar xvf <downloaded-file>.tar.gz
- Install some dependencies
yum install openssl-devel zlib-devel
yum groupinstall 'Development Tools'
- Move to the directory containing the source of httrack
- Run ./configure
- Run make
- Run make install
- Verify the installation
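If you'd rather not type each step by hand, the steps above can be collected into one script. This is only a rough sketch, not an official installer: `TARBALL` and `SRC_DIR` are hypothetical placeholders for your downloaded file and the directory it extracts to, the `-y` flag on yum is my addition to skip the confirmation prompts, and `command -v httrack` is just one assumed way to verify the install. By default it only prints the commands (`DRY_RUN=1`) so you can review them first.

```shell
#!/usr/bin/env bash
# Sketch of the build steps above, not an official installer.
# DRY_RUN defaults to 1, so commands are only printed; run as root with
# DRY_RUN=0 to execute them for real.
set -e

DRY_RUN="${DRY_RUN:-1}"
# Hypothetical placeholders: point these at your downloaded tarball and at
# the directory it extracts to.
TARBALL="${TARBALL:-httrack-source.tar.gz}"
SRC_DIR="${SRC_DIR:-httrack-source}"

# Print the command in dry-run mode, otherwise execute it.
run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

build_httrack() {
  run tar xvf "$TARBALL"
  run yum install -y openssl-devel zlib-devel    # -y added to skip prompts
  run yum groupinstall -y 'Development Tools'
  run cd "$SRC_DIR"
  run ./configure
  run make
  run make install
  run command -v httrack    # one way to verify: prints the installed path
}

build_httrack
```

Once the printed commands look right for your machine, re-run it with `DRY_RUN=0` and the real `TARBALL` and `SRC_DIR` values.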
Now you can mirror a site locally for offline browsing or archiving.
To run the httrack Wizard, run httrack with no options:
httrack
The application will walk you through a wizard to configure the fetch operation.
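If you already know what you want to fetch, you can skip the wizard and pass the options on the command line. Here is a minimal sketch, assuming a start URL and an output directory of your choosing (the `example.com` URL and `./example-mirror` path are made-up placeholders). It uses httrack's `-O` option to set the output path and `-v` for verbose output, and like the wizard it needs httrack on your PATH. It prints the command by default rather than running it.

```shell
#!/usr/bin/env bash
# Sketch of a non-interactive mirror run; the URL and directory used here
# are hypothetical placeholders, not values from this post.
DRY_RUN="${DRY_RUN:-1}"

mirror_site() {
  url="$1"    # site to mirror
  dest="$2"   # local directory for the mirrored copy
  if [ "$DRY_RUN" = "1" ]; then
    # Only show what would be run.
    echo "httrack $url -O $dest -v"
  else
    # -O sets the output path, -v turns on verbose output.
    httrack "$url" -O "$dest" -v
  fi
}

mirror_site "http://www.example.com/" "./example-mirror"
```

Set `DRY_RUN=0` to perform the actual fetch once the printed command matches what you intend.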