Name: | nutch |
---|---|
Version: | 1.0 |
Release: | 0.16.20081201040121nightly.el6 |
Architecture: | noarch |
Group: | Development/Tools |
Size: | 25547424 |
License: | ASL 2.0 |
RPM: | nutch-1.0-0.16.20081201040121nightly.el6.noarch.rpm |
Source RPM: | nutch-1.0-0.16.20081201040121nightly.el6.src.rpm |
Build Date: | Sat Aug 31 2013 |
Build Host: | ca-build44.us.oracle.com |
Vendor: | Oracle America |
URL: | http://lucene.apache.org/nutch/index.html |
Summary: | Open source web-search software |
Description: | Nutch is open source web-search software. It builds on Lucene Java, adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and other document formats, etc. |
- rebuild nutch from git
- Fixing nutch buildroot symlink issue
- We also need the nutch lib directory to contian the nutch jar
- updating bin directory permissions
- fixing nutch executable permissions
- we need scripts in bin for spacewalk-doc-indexes
- shrinking nutch rpm from 70M to 22M
- Correcting URL of the tarball in Nutch pkg (lzap+git@redhat.com)
- Removing unnecessary files - Erasing empty lines
- dropping unnecessary files
- rebuild
- Rebuild for new build tools.
- initial