java.lang.Object
  org.apache.lucene.benchmark.byTask.feeds.BasicDocMaker
    org.apache.lucene.benchmark.byTask.feeds.LineDocMaker
      org.apache.lucene.benchmark.byTask.feeds.EnwikiDocMaker
public class EnwikiDocMaker extends LineDocMaker
A LineDocMaker that reads the uncompressed English Wikipedia dump.
Config properties:
keep.image.only.docs=false|true
Plus those available in LineDocMaker
See Also: LineDocMaker
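The property above is a plain boolean key/value pair. As a minimal sketch of how such a flag could be read, the following uses only `java.util.Properties` rather than the benchmark's own `Config` class. The class name `KeepImagesSketch`, the helper `readKeepImages`, and the choice of `false` as the default (taken from the order in which the values are listed above) are assumptions made for illustration, not part of the documented API.

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.Properties;

// Hypothetical helper class, for illustration only; not part of Lucene.
public class KeepImagesSketch {
    // Property name as documented for EnwikiDocMaker.
    static final String KEEP_IMAGES_PROP = "keep.image.only.docs";

    // Reads the flag; falls back to false when the property is absent.
    // The default of false is an assumption based on the listed order "false|true".
    static boolean readKeepImages(Properties props) {
        return Boolean.parseBoolean(props.getProperty(KEEP_IMAGES_PROP, "false"));
    }

    public static void main(String[] args) throws IOException {
        Properties props = new Properties();
        // Same key=value syntax as the property line shown above.
        props.load(new StringReader("keep.image.only.docs=true\n"));
        System.out.println(readKeepImages(props));
    }
}
```

In the benchmark itself the value would presumably reach `setConfig(Config config)` and end up in the `keepImages` field; the sketch only illustrates the parsing step.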
Field Summary | |
---|---|
protected boolean | keepImages |
Fields inherited from class org.apache.lucene.benchmark.byTask.feeds.BasicDocMaker |
---|
BODY_FIELD, BYTES_FIELD, config, DATE_FIELD, forever, ID_FIELD, indexVal, NAME_FIELD, storeVal, termVecVal, TITLE_FIELD |
Constructor Summary | |
---|---|
EnwikiDocMaker() |
Method Summary | |
---|---|
Document | makeDocument() - Create the next document. |
void | setConfig(Config config) - Set the properties. |
Methods inherited from class org.apache.lucene.benchmark.byTask.feeds.LineDocMaker |
---|
getNextDocData, makeDocument, numUniqueTexts, resetInputs |
Methods inherited from class org.apache.lucene.benchmark.byTask.feeds.BasicDocMaker |
---|
addBytes, addUniqueBytes, collectFiles, getByteCount, getCount, getHtmlParser, numUniqueBytes, printDocStatistics, resetUniqueBytes, setHTMLParser |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
protected boolean keepImages
Constructor Detail |
---|
public EnwikiDocMaker()
Method Detail |
---|
public void setConfig(Config config)
Description copied from interface: DocMaker
Set the properties.
Specified by: setConfig in interface DocMaker
Overrides: setConfig in class LineDocMaker
public Document makeDocument() throws java.lang.Exception
Description copied from interface: DocMaker
Create the next document.
Specified by: makeDocument in interface DocMaker
Overrides: makeDocument in class LineDocMaker
Throws: java.lang.Exception