Building httrack from source on Amazon Linux

When you need to pull a static copy of a site, whether for development or archival purposes, it’s useful to have a tool that can resolve links and update them for local browsing.

One such tool is httrack (http://www.httrack.com/). It can pull a local mirrored copy of a website, retaining all the links and relevant web server resources.

Since there is no easy way to install httrack using yum on Amazon Linux, you have to build from source. Don’t be afraid! Follow these steps to ensure the build is done right:

  1. Download the Linux source from here (http://www.httrack.com/page/2/en/index.html)
  2. Extract the source

    tar xvf <downloaded-file>.tar.gz

  3. Install some dependencies

    sudo yum install openssl-devel zlib-devel
    sudo yum groupinstall 'Development Tools'

  4. Move to the directory containing the source of httrack
  5. Run ./configure
  6. Run make
  7. Run make install (with sudo, unless you are already root)
  8. Verify the installation

    which httrack
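
The steps above can be sketched as a single script. This is a rough outline rather than a tested installer: it assumes the source tarball is already downloaded and unpacks into one top-level directory, and the function names (tarball_topdir, install_build_deps, build_httrack) are made up for illustration.

```shell
#!/usr/bin/env bash
# Sketch only: build httrack from a source tarball that is already downloaded.
# Assumption: the tarball unpacks into one top-level directory.

# Print the name of the top-level directory inside a tarball.
tarball_topdir() {
    tar tf "$1" | head -n 1 | cut -d/ -f1
}

# Install the build dependencies listed in step 3.
install_build_deps() {
    sudo yum install -y openssl-devel zlib-devel &&
    sudo yum groupinstall -y 'Development Tools'
}

# Extract, configure, compile, install, then verify (steps 2 and 4 to 8).
build_httrack() {
    local tarball="$1" srcdir
    srcdir="$(tarball_topdir "$tarball")"
    [ -n "$srcdir" ] || return 1
    tar xf "$tarball" || return 1
    ( cd "$srcdir" && ./configure && make && sudo make install ) || return 1
    command -v httrack
}

# Only run the build when a tarball path is passed as the first argument.
if [ "$#" -ge 1 ]; then
    install_build_deps && build_httrack "$1"
fi
```

Saved under a name of your choosing, for example build-httrack.sh, it could be run as: bash build-httrack.sh <downloaded-file>.tar.gz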

Now you can mirror a site locally for offline browsing or archiving.

To run the httrack Wizard, run httrack with no options:

    httrack

The application will walk you through a wizard to configure the fetch operation.
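
If you would rather skip the wizard, httrack also accepts the site URL and an output path (the -O option) on the command line. The wrapper below is only a sketch: mirror_site and the DRY_RUN variable are hypothetical names, and the URL and output directory are placeholders.

```shell
#!/usr/bin/env bash
# Sketch only: mirror one site into a chosen directory without the wizard.
# mirror_site and DRY_RUN are illustrative names, not part of httrack.

mirror_site() {
    local url="$1" outdir="$2"
    # With DRY_RUN set, print the command instead of running it.
    if [ -n "${DRY_RUN:-}" ]; then
        echo "httrack $url -O $outdir"
        return 0
    fi
    # -O tells httrack where to write the mirror, logs and cache.
    httrack "$url" -O "$outdir"
}
```

For example, DRY_RUN=1 mirror_site "http://www.example.com/" "./mirror" shows the command that would run, and dropping DRY_RUN performs the actual fetch.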
