public class RevisionLinkInputFormat extends WikiRevisionInputFormat<org.apache.hadoop.io.LongWritable,LinkProfile>
Modifier and Type | Class and Description |
---|---|
static class | RevisionLinkInputFormat.LinkExtractor |
static class | RevisionLinkInputFormat.RevisionLinkReader |
Fields inherited from class WikiRevisionInputFormat:
compressionCodecs, END_ID, END_NAMESPACE, END_PAGE, END_PAGE_TAG, END_PARENT_ID, END_REDIRECT, END_REVISION, END_TEXT, END_TIMESTAMP, END_TIMESTAMP_TAG, END_TITLE, KEY_SKIP_FACTOR, LOG, MINOR_TAG, REVISION_BEGIN_TIME, REVISION_END_TIME, SEED_FILE, SKIP_NON_ARTICLES, SKIP_REDIRECT, SPLIT_INDEX_OPTION, SPLIT_MAPFILE_LOC, START_ID, START_NAMESPACE, START_PAGE, START_PAGE_TAG, START_PARENT_ID, START_REDIRECT, START_REVISION, START_TEXT, START_TIMESTAMP, START_TIMESTAMP_TAG, START_TITLE, THRESHOLD, TIME_FORMAT
Constructor and Description |
---|
RevisionLinkInputFormat() |
Modifier and Type | Method and Description |
---|---|
org.apache.hadoop.mapreduce.RecordReader<org.apache.hadoop.io.LongWritable,LinkProfile> | createRecordReader(org.apache.hadoop.mapreduce.InputSplit input, org.apache.hadoop.mapreduce.TaskAttemptContext context) |
Methods inherited from class WikiRevisionInputFormat:
configure, getSplits, getSplits, isSplitable, makeSplit
Methods inherited from class org.apache.hadoop.mapreduce.lib.input.FileInputFormat:
addInputPath, addInputPaths, computeSplitSize, getBlockIndex, getFormatMinSplitSize, getInputPathFilter, getInputPaths, getMaxSplitSize, getMinSplitSize, listStatus, setInputPathFilter, setInputPaths, setInputPaths, setMaxInputSplitSize, setMinInputSplitSize
public org.apache.hadoop.mapreduce.RecordReader<org.apache.hadoop.io.LongWritable,LinkProfile> createRecordReader(org.apache.hadoop.mapreduce.InputSplit input, org.apache.hadoop.mapreduce.TaskAttemptContext context) throws IOException, InterruptedException
Superclass method: createRecordReader in class WikiRevisionInputFormat<org.apache.hadoop.io.LongWritable,LinkProfile>
Throws: IOException, InterruptedException
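The reader returned by createRecordReader is normally consumed by the framework through the RecordReader read loop (nextKeyValue, then getCurrentKey and getCurrentValue). The following is a minimal sketch of that loop only, and it does not use Hadoop or this class: SimpleRecordReader, ListReader and drain are hypothetical stand-ins, the checked exceptions of the real API are omitted, and plain strings take the place of LinkProfile, whose fields are not described on this page.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class RecordReaderLoopSketch {

    /** Hypothetical stand-in mirroring the read methods of Hadoop's RecordReader (no checked exceptions). */
    interface SimpleRecordReader<K, V> extends AutoCloseable {
        boolean nextKeyValue();   // advance to the next record; false when the input is exhausted
        K getCurrentKey();        // key of the record last advanced to
        V getCurrentValue();      // value of the record last advanced to
        @Override
        void close();
    }

    /** Hypothetical in-memory reader: the key is the record index, the value a plain string. */
    static class ListReader implements SimpleRecordReader<Long, String> {
        private final List<String> records;
        private int position = -1;

        ListReader(List<String> records) {
            this.records = records;
        }

        @Override
        public boolean nextKeyValue() {
            if (position + 1 >= records.size()) {
                return false;
            }
            position++;
            return true;
        }

        @Override
        public Long getCurrentKey() {
            return (long) position;
        }

        @Override
        public String getCurrentValue() {
            return records.get(position);
        }

        @Override
        public void close() {
            // nothing to release for an in-memory list
        }
    }

    /** Reads every record from the reader, closes it, and returns "key=value" strings. */
    static <K, V> List<String> drain(SimpleRecordReader<K, V> reader) {
        List<String> seen = new ArrayList<>();
        try (SimpleRecordReader<K, V> r = reader) {
            while (r.nextKeyValue()) {
                seen.add(r.getCurrentKey() + "=" + r.getCurrentValue());
            }
        }
        return seen;
    }

    public static void main(String[] args) {
        // Placeholder values; a real job would receive LinkProfile objects here.
        System.out.println(drain(new ListReader(Arrays.asList("[[Foo]]", "[[Bar]]"))));
    }
}
```

In an actual job the loop is driven by the map task rather than written by hand; the sketch only shows the order in which the reader's methods are called.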
Copyright © 2014. All rights reserved.