Web Scale Crawling with Apache Nutch
This talk will give an overview of Apache Nutch. I will describe its main components and how it fits with other Apache projects such as Hadoop, Lucene, Solr, Tika, and HBase. The presentation will include examples of real-world use cases.
The second part of the presentation will focus on the latest developments in Nutch and the changes introduced by the forthcoming version 2.0.