The current released version of Apache Nutch is 1.4. Since Nutch 1.3, there was no Hadoop distribution integrated with Nutch’s release package. So I have to build a Hadoop cluster seperately first, and then configure Nutch 1.4 work with Hadoop. My server OS is ubuntu 10.04 LTS, I have two server names cluster1 and cluster2. I’ll note the steps here. Read more »
