public class CrawlDbFilter extends Object implements Mapper<Text,CrawlDatum,Text,CrawlDatum>
| Modifier and Type | Field and Description |
|---|---|
static org.slf4j.Logger |
LOG |
static String |
URL_FILTERING |
static String |
URL_NORMALIZING |
static String |
URL_NORMALIZING_SCOPE |
| Constructor and Description |
|---|
CrawlDbFilter() |
| Modifier and Type | Method and Description |
|---|---|
void |
close() |
void |
configure(JobConf job) |
void |
map(Text key,
CrawlDatum value,
OutputCollector<Text,CrawlDatum> output,
Reporter reporter) |
public static final String URL_FILTERING
public static final String URL_NORMALIZING
public static final String URL_NORMALIZING_SCOPE
public static final org.slf4j.Logger LOG
public void configure(JobConf job)
configure in interface JobConfigurablepublic void close()
close in interface Closeableclose in interface AutoCloseablepublic void map(Text key, CrawlDatum value, OutputCollector<Text,CrawlDatum> output, Reporter reporter) throws IOException
map in interface Mapper<Text,CrawlDatum,Text,CrawlDatum>IOExceptionCopyright © 2012 The Apache Software Foundation