public class Ftp extends Object implements Protocol
| Modifier and Type | Field and Description |
|---|---|
static org.slf4j.Logger |
LOG |
CHECK_BLOCKING, CHECK_ROBOTS, X_POINT_ID| Constructor and Description |
|---|
Ftp() |
| Modifier and Type | Method and Description |
|---|---|
protected void |
finalize() |
Configuration |
getConf() |
ProtocolOutput |
getProtocolOutput(Text url,
CrawlDatum datum)
Returns the
Content for a fetchlist entry. |
RobotRules |
getRobotRules(Text url,
CrawlDatum datum)
Retrieve robot rules applicable for this url.
|
static void |
main(String[] args)
For debugging.
|
void |
setConf(Configuration conf) |
void |
setFollowTalk(boolean followTalk)
Set followTalk
|
void |
setKeepConnection(boolean keepConnection)
Set keepConnection
|
void |
setMaxContentLength(int length)
Set the point at which content is truncated.
|
void |
setTimeout(int to)
Set the timeout.
|
public void setTimeout(int to)
public void setMaxContentLength(int length)
public void setFollowTalk(boolean followTalk)
public void setKeepConnection(boolean keepConnection)
public ProtocolOutput getProtocolOutput(Text url, CrawlDatum datum)
ProtocolContent for a fetchlist entry.getProtocolOutput in interface Protocolpublic void setConf(Configuration conf)
setConf in interface Configurablepublic Configuration getConf()
getConf in interface Configurablepublic RobotRules getRobotRules(Text url, CrawlDatum datum)
ProtocolgetRobotRules in interface Protocolurl - url to checkdatum - page datumCopyright © 2012 The Apache Software Foundation