password文件必须设置为只读,否则就会出现如下错误:
F:/heritrix/bin>heritrix --admin=admin:admin
WARNING: It's currently not possible to run Heritrix in background
on Windows. It was just started minimized in a new Window
and will be shut down as soon as you log off.
2011-04-06 星期三 13:00:22.39 Starting heritrix
Heritrix failed to start properly. Possible causes:
- Login and password have not been specified (see --admin switch)
- another program uses the port for the web UI (8080 by default)
(e.g. another Heritrix instance)
- JMX password file is missing or permissions not set correctly
Do you want to try to fix the permissions (Y/N)?y
Restart Heritrix (Y/N)?n
有时候启动一个任务之后,会发现网页抓取的很慢,下载的网页数长时间停留在1个,
但是这并不表示heritrix出问题了或者自己的配置有错误。