Name: | nutch |
---|---|
Version: | 1.0 |
Release: | 0.19.20081201040121nightly.el7 |
Architecture: | noarch |
Group: | Unspecified |
Size: | 25474152 |
License: | ASL 2.0 |
RPM: | nutch-1.0-0.19.20081201040121nightly.el7.noarch.rpm |
Source RPM: | nutch-1.0-0.19.20081201040121nightly.el7.src.rpm |
Build Date: | Fri May 10 2019 |
Build Host: | x86-ol7-builder-03.us.oracle.com |
Vendor: | Oracle America |
URL: | http://lucene.apache.org/nutch/index.html |
Summary: | Open source web-search software |
Description: | Nutch is open source web-search software. It builds on Lucene Java, adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and other document formats, etc. |
- removed %%defattr from specfile
- remove install/clean section initial cleanup
- removed Group from specfile
- removed BuildRoot from specfiles
- 1483503 - move hadoop logs to /var/log
- recompile all packages with the same (latest) version of java
- fixed tito build warning
- replace legacy name of Tagger with new one
- rebuild nutch from git
- Fixing nutch buildroot symlink issue
- We also need the nutch lib directory to contain the nutch jar
- updating bin directory permissions
- fixing nutch executable permissions
- we need scripts in bin for spacewalk-doc-indexes
- shrinking nutch rpm from 70M to 22M