it.unipd.dei.ims.circo.lucene
Class LuceneDocumentEmitter

java.lang.Object
  extended by it.unipd.dei.ims.circo.indexing.AbstractDocumentEmitter<D>
      extended by it.unipd.dei.ims.circo.indexing.BaseDocumentEmitter<D>
          extended by it.unipd.dei.ims.circo.indexing.CircoDocumentEmitter<org.apache.lucene.document.Document>
              extended by it.unipd.dei.ims.circo.lucene.LuceneDocumentEmitter
All Implemented Interfaces:
DocumentEmitter<org.apache.lucene.document.Document>, java.lang.Iterable<org.apache.lucene.document.Document>, java.util.Iterator<org.apache.lucene.document.Document>

public class LuceneDocumentEmitter
extends CircoDocumentEmitter<org.apache.lucene.document.Document>
implements DocumentEmitter<org.apache.lucene.document.Document>

Provides an implementation of the DocumentEmitter interface for Lucene.

Since:
0.10
Version:
0.10
Author:
Nicola Ferro

Field Summary
 
Fields inherited from class it.unipd.dei.ims.circo.indexing.AbstractDocumentEmitter
logger
 
Constructor Summary
LuceneDocumentEmitter(java.io.File inputDirectory, boolean compressed, CircoAnalyzer ca)
          Creates a new Lucene document emitter.
LuceneDocumentEmitter(java.lang.String inputDirectory, boolean compressed, CircoAnalyzer ca)
          Creates a new Lucene document emitter.
 
Method Summary
protected  org.apache.lucene.document.Document toDocument(Resource r)
          Maps a Resource to a document.
 
Methods inherited from class it.unipd.dei.ims.circo.indexing.CircoDocumentEmitter
close, hasNext, next
 
Methods inherited from class it.unipd.dei.ims.circo.indexing.BaseDocumentEmitter
endStreamChunk, getCurrentInputFile, getCurrentStreamChunk, getInputDirectory, hasNextResource, isCompressed, nextResource, startStreamChunk
 
Methods inherited from class it.unipd.dei.ims.circo.indexing.AbstractDocumentEmitter
equals, getIdentifier, hashCode, iterator, remove, toString
 
Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, wait, wait, wait
 
Methods inherited from interface it.unipd.dei.ims.circo.indexing.DocumentEmitter
close, getIdentifier, getInputDirectory, isCompressed
 
Methods inherited from interface java.lang.Iterable
iterator
 
Methods inherited from interface java.util.Iterator
hasNext, next, remove
 

Constructor Detail

LuceneDocumentEmitter

public LuceneDocumentEmitter(java.io.File inputDirectory,
                             boolean compressed,
                             CircoAnalyzer ca)
Creates a new Lucene document emitter.

Parameters:
inputDirectory - the directory where stream chunks are stored.
compressed - indicates whether the stream chunks are compressed or not.
ca - the CIRCO analyzer to be used for generating token streams.

LuceneDocumentEmitter

public LuceneDocumentEmitter(java.lang.String inputDirectory,
                             boolean compressed,
                             CircoAnalyzer ca)
Creates a new Lucene document emitter.

Parameters:
inputDirectory - the directory where stream chunks are stored.
compressed - indicates whether the stream chunks are compressed or not.
ca - the CIRCO analyzer to be used for generating token streams.
Method Detail

toDocument

protected org.apache.lucene.document.Document toDocument(Resource r)
                                                  throws it.unipd.dei.ims.ici.component.ComponentException
Description copied from class: CircoDocumentEmitter
Maps a Resource to a document.

Specified by:
toDocument in class CircoDocumentEmitter<org.apache.lucene.document.Document>
Parameters:
r - the resource to be mapped.
Returns:
the document corresponding to the given resource.
Throws:
it.unipd.dei.ims.ici.component.ComponentException - if something goes wrong while adding the document.

Copyright © 2008-2009 Information Management Systems (IMS) research group,
Department of Information Engineering, University of Padua, Italy.