it.unipd.dei.ims.circo.lucene.demo
Class ClefDocumentLexer

java.lang.Object
  extended by it.unipd.dei.ims.circo.lucene.demo.ClefDocumentLexer

public final class ClefDocumentLexer
extends java.lang.Object

Provides a basic parser for CLEF collections. The following collections are supported:

This class is a scanner generated by JFlex 1.4.1 on 10/02/05 13.06.

Since:
0.10
Version:
0.10
Author:
Giorgio Maria Di Nunzio

Field Summary
static int AAMU
           
static int AAMUDOCNO
           
static int AAMUTEXT
           
static int AAMUTITLE
           
static int AD
           
static int ADDOCNO
           
static int ADTEXT
           
static int ADTEXT2
           
static int ADTITLE
           
static int AGZ
           
static int AGZDOCNO
           
static int AGZTEXT
           
static int AGZTITLE
           
static int ATS
           
static int ATSDOCNO
           
static int ATSTEXT
           
static int ATSTITLE
           
static int DOC
           
static int EFE
           
static int EFEDOCNO
           
static int EFETEXT
           
static int EFETITLE
           
static int FRANKFURT
           
static int FRANKFURTDOCNO
          lexical states
static int FRANKFURTTEXT
           
static int FRANKFURTTITLE
           
static int GLASGOW
           
static int GLASGOWDOCNO
           
static int GLASGOWTEXT
           
static int GLASGOWTITLE
           
static int IZV
           
static int IZVDOCNO
           
static int IZVTEXT
           
static int IZVTITLE
           
static int LASTAMPA
           
static int LASTAMPADOCNO
           
static int LASTAMPATEXT
           
static int LASTAMPATITLE
           
static int LATIMES
           
static int LATIMESDOCNO
           
static int LATIMESTEXT
           
static int LATIMESTITLE
           
static int LEMONDE
           
static int LEMONDEDOCNO
           
static int LEMONDETEXT
           
static int LEMONDETITLE
           
static int MAGYAR
           
static int MHDOCNO
           
static int MHTEXT
           
static int MHTITLE
           
static int NH
           
static int NHDOCNO
           
static int NHTEXT
           
static int NHTEXT2
           
static int NHTITLE
           
static int SDA
           
static int SDADOCNO
           
static int SDATEXT
           
static int SDATITLE
           
static int SEGA
           
static int SEGADOCNO
           
static int SEGATEXT
           
static int SEGATITLE
           
static int SPIEGEL
           
static int SPIEGELDOCNO
           
static int SPIEGELTEXT
           
static int SPIEGELTITLE
           
static int YYEOF
          This character denotes the end of file.
static int YYINITIAL
           
 
Constructor Summary
ClefDocumentLexer(java.io.File f)
           
ClefDocumentLexer(java.io.InputStream in)
          Creates a new scanner.
ClefDocumentLexer(java.io.Reader in)
          Creates a new scanner. There is also a java.io.InputStream version of this constructor.
 
Method Summary
 java.util.Map<java.lang.String,java.lang.String[]> docs()
           
 boolean endFile()
           
 java.util.Map<java.lang.String,java.lang.String[]> getDocuments()
           
 void yybegin(int newState)
          Enters a new lexical state.
 char yycharat(int pos)
          Returns the character at position pos from the matched text.
 void yyclose()
          Closes the input stream.
 int yylength()
          Returns the length of the matched text region.
 int yylex()
          Resumes scanning until the next regular expression is matched, the end of input is encountered or an I/O-Error occurs.
 void yypushback(int number)
          Pushes the specified amount of characters back into the input stream.
 void yyreset(java.io.Reader reader)
          Resets the scanner to read from a new input stream.
 int yystate()
          Returns the current lexical state.
 java.lang.String yytext()
          Returns the text matched by the current regular expression.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

YYEOF

public static final int YYEOF
This character denotes the end of file.

See Also:
Constant Field Values

FRANKFURTDOCNO

public static final int FRANKFURTDOCNO
lexical states

See Also:
Constant Field Values

ADDOCNO

public static final int ADDOCNO
See Also:
Constant Field Values

LASTAMPATEXT

public static final int LASTAMPATEXT
See Also:
Constant Field Values

IZVTITLE

public static final int IZVTITLE
See Also:
Constant Field Values

AGZTEXT

public static final int AGZTEXT
See Also:
Constant Field Values

MHTITLE

public static final int MHTITLE
See Also:
Constant Field Values

AD

public static final int AD
See Also:
Constant Field Values

NHTITLE

public static final int NHTITLE
See Also:
Constant Field Values

IZVDOCNO

public static final int IZVDOCNO
See Also:
Constant Field Values

MHDOCNO

public static final int MHDOCNO
See Also:
Constant Field Values

AAMU

public static final int AAMU
See Also:
Constant Field Values

NHDOCNO

public static final int NHDOCNO
See Also:
Constant Field Values

LATIMESTEXT

public static final int LATIMESTEXT
See Also:
Constant Field Values

LASTAMPATITLE

public static final int LASTAMPATITLE
See Also:
Constant Field Values

ATS

public static final int ATS
See Also:
Constant Field Values

SEGATITLE

public static final int SEGATITLE
See Also:
Constant Field Values

LEMONDE

public static final int LEMONDE
See Also:
Constant Field Values

SEGA

public static final int SEGA
See Also:
Constant Field Values

LASTAMPADOCNO

public static final int LASTAMPADOCNO
See Also:
Constant Field Values

SPIEGELTEXT

public static final int SPIEGELTEXT
See Also:
Constant Field Values

SEGADOCNO

public static final int SEGADOCNO
See Also:
Constant Field Values

IZV

public static final int IZV
See Also:
Constant Field Values

SPIEGEL

public static final int SPIEGEL
See Also:
Constant Field Values

EFETEXT

public static final int EFETEXT
See Also:
Constant Field Values

LEMONDETITLE

public static final int LEMONDETITLE
See Also:
Constant Field Values

YYINITIAL

public static final int YYINITIAL
See Also:
Constant Field Values

AGZ

public static final int AGZ
See Also:
Constant Field Values

FRANKFURT

public static final int FRANKFURT
See Also:
Constant Field Values

LEMONDEDOCNO

public static final int LEMONDEDOCNO
See Also:
Constant Field Values

SEGATEXT

public static final int SEGATEXT
See Also:
Constant Field Values

SDATEXT

public static final int SDATEXT
See Also:
Constant Field Values

LATIMES

public static final int LATIMES
See Also:
Constant Field Values

AGZTITLE

public static final int AGZTITLE
See Also:
Constant Field Values

EFE

public static final int EFE
See Also:
Constant Field Values

NHTEXT

public static final int NHTEXT
See Also:
Constant Field Values

ADTEXT2

public static final int ADTEXT2
See Also:
Constant Field Values

AGZDOCNO

public static final int AGZDOCNO
See Also:
Constant Field Values

SDA

public static final int SDA
See Also:
Constant Field Values

NHTEXT2

public static final int NHTEXT2
See Also:
Constant Field Values

AAMUTITLE

public static final int AAMUTITLE
See Also:
Constant Field Values

AAMUTEXT

public static final int AAMUTEXT
See Also:
Constant Field Values

AAMUDOCNO

public static final int AAMUDOCNO
See Also:
Constant Field Values

DOC

public static final int DOC
See Also:
Constant Field Values

LATIMESTITLE

public static final int LATIMESTITLE
See Also:
Constant Field Values

GLASGOWTITLE

public static final int GLASGOWTITLE
See Also:
Constant Field Values

FRANKFURTTEXT

public static final int FRANKFURTTEXT
See Also:
Constant Field Values

LATIMESDOCNO

public static final int LATIMESDOCNO
See Also:
Constant Field Values

ATSTEXT

public static final int ATSTEXT
See Also:
Constant Field Values

MAGYAR

public static final int MAGYAR
See Also:
Constant Field Values

GLASGOWDOCNO

public static final int GLASGOWDOCNO
See Also:
Constant Field Values

GLASGOW

public static final int GLASGOW
See Also:
Constant Field Values

NH

public static final int NH
See Also:
Constant Field Values

LEMONDETEXT

public static final int LEMONDETEXT
See Also:
Constant Field Values

ADTEXT

public static final int ADTEXT
See Also:
Constant Field Values

ATSTITLE

public static final int ATSTITLE
See Also:
Constant Field Values

SPIEGELTITLE

public static final int SPIEGELTITLE
See Also:
Constant Field Values

EFETITLE

public static final int EFETITLE
See Also:
Constant Field Values

ATSDOCNO

public static final int ATSDOCNO
See Also:
Constant Field Values

LASTAMPA

public static final int LASTAMPA
See Also:
Constant Field Values

MHTEXT

public static final int MHTEXT
See Also:
Constant Field Values

SPIEGELDOCNO

public static final int SPIEGELDOCNO
See Also:
Constant Field Values

SDATITLE

public static final int SDATITLE
See Also:
Constant Field Values

EFEDOCNO

public static final int EFEDOCNO
See Also:
Constant Field Values

FRANKFURTTITLE

public static final int FRANKFURTTITLE
See Also:
Constant Field Values

ADTITLE

public static final int ADTITLE
See Also:
Constant Field Values

IZVTEXT

public static final int IZVTEXT
See Also:
Constant Field Values

SDADOCNO

public static final int SDADOCNO
See Also:
Constant Field Values

GLASGOWTEXT

public static final int GLASGOWTEXT
See Also:
Constant Field Values
Constructor Detail

ClefDocumentLexer

public ClefDocumentLexer(java.io.File f)
                  throws java.io.FileNotFoundException
Throws:
java.io.FileNotFoundException

ClefDocumentLexer

public ClefDocumentLexer(java.io.Reader in)
Creates a new scanner. There is also a java.io.InputStream version of this constructor.

Parameters:
in - the java.io.Reader to read input from.

ClefDocumentLexer

public ClefDocumentLexer(java.io.InputStream in)
Creates a new scanner. There is also a java.io.Reader version of this constructor.

Parameters:
in - the java.io.InputStream to read input from.
Method Detail

docs

public java.util.Map<java.lang.String,java.lang.String[]> docs()

endFile

public boolean endFile()

getDocuments

public java.util.Map<java.lang.String,java.lang.String[]> getDocuments()

yyclose

public final void yyclose()
                   throws java.io.IOException
Closes the input stream.

Throws:
java.io.IOException

yyreset

public final void yyreset(java.io.Reader reader)
Resets the scanner to read from a new input stream. Does not close the old reader, so the caller remains responsible for closing it. All internal variables are reset; the old input stream cannot be reused, because the internal buffer is discarded and lost. The lexical state is set to YYINITIAL.

Parameters:
reader - the new input stream

yystate

public final int yystate()
Returns the current lexical state.


yybegin

public final void yybegin(int newState)
Enters a new lexical state.

Parameters:
newState - the new lexical state

yytext

public final java.lang.String yytext()
Returns the text matched by the current regular expression.


yycharat

public final char yycharat(int pos)
Returns the character at position pos from the matched text. It is equivalent to yytext().charAt(pos), but faster.

Parameters:
pos - the position of the character to fetch. A value from 0 to yylength()-1.
Returns:
the character at position pos
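
The equivalence with yytext().charAt(pos) can be illustrated with a minimal sketch. MatchBufferSketch and its fields are hypothetical stand-ins, not part of ClefDocumentLexer; the sketch assumes the scanner keeps its input in a char array and tracks the matched region with two indices, as JFlex-generated scanners typically do.

```java
/** Hypothetical stand-in for the match bookkeeping of a JFlex-style scanner. */
final class MatchBufferSketch {
    private final char[] buffer;  // input buffer (assumed layout)
    private final int startRead;  // index of the first matched character
    private final int markedPos;  // index just past the last matched character

    MatchBufferSketch(String input, int startRead, int markedPos) {
        this.buffer = input.toCharArray();
        this.startRead = startRead;
        this.markedPos = markedPos;
    }

    /** Copies the matched region into a new String on every call. */
    String yytext() {
        return new String(buffer, startRead, markedPos - startRead);
    }

    /** Reads one character directly from the buffer; no String is built. */
    char yycharat(int pos) {
        return buffer[startRead + pos];
    }

    /** Length of the matched region. */
    int yylength() {
        return markedPos - startRead;
    }

    public static void main(String[] args) {
        // Made-up input; the matched region is the word "hello".
        MatchBufferSketch m = new MatchBufferSketch("abc hello xyz", 4, 9);
        for (int pos = 0; pos < m.yylength(); pos++) {
            // Both expressions yield the same character for every valid pos.
            System.out.println(m.yycharat(pos) + " " + m.yytext().charAt(pos));
        }
    }
}
```

In this sketch the speed difference comes from yytext() allocating a fresh String per call, while yycharat(pos) is a single array access.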

yylength

public final int yylength()
Returns the length of the matched text region.


yypushback

public void yypushback(int number)
Pushes the specified number of characters back into the input stream. They will be read again by the next call of the scanning method.

Parameters:
number - the number of characters to be read again. This number must not be greater than yylength()!
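
The effect of a pushback on the matched text can be sketched as follows. PushbackSketch and its match(int) helper are hypothetical and only imitate the behaviour described above; throwing IllegalArgumentException for an oversized pushback is a choice made for this sketch, not the documented behaviour of ClefDocumentLexer.

```java
/** Hypothetical stand-in showing how a pushback shortens the current match. */
final class PushbackSketch {
    private final char[] buffer;
    private int startRead;  // start of the current match
    private int markedPos;  // end of the current match; the next match starts here

    PushbackSketch(String input) {
        this.buffer = input.toCharArray();
    }

    /** Hypothetical scanning step: matches the next n characters of the input. */
    void match(int n) {
        if (n < 0 || markedPos + n > buffer.length) {
            throw new IllegalArgumentException("not enough input left");
        }
        startRead = markedPos;
        markedPos += n;
    }

    String yytext() {
        return new String(buffer, startRead, markedPos - startRead);
    }

    int yylength() {
        return markedPos - startRead;
    }

    /** Gives the last 'number' matched characters back to the input. */
    void yypushback(int number) {
        if (number < 0 || number > yylength()) {
            throw new IllegalArgumentException("number must be between 0 and yylength()");
        }
        markedPos -= number;
    }

    public static void main(String[] args) {
        PushbackSketch s = new PushbackSketch("TITLE:news");
        s.match(6);                       // current match covers "TITLE:"
        s.yypushback(1);                  // hand the ':' back to the input
        System.out.println(s.yytext());   // match is now one character shorter
        s.match(5);                       // the next match starts at the ':'
        System.out.println(s.yytext());
    }
}
```

Because the pushed-back characters leave the current match, yytext() and yylength() shrink accordingly, which is why the argument cannot exceed yylength().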

yylex

public int yylex()
          throws java.io.IOException
Resumes scanning until the next regular expression is matched, the end of input is encountered or an I/O-Error occurs.

Returns:
the next token
Throws:
java.io.IOException - if any I/O-Error occurs
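
A typical driver loop for a scanner of this kind might look like the sketch below. It assumes that yylex() returns YYEOF once the input is exhausted, which is the usual JFlex convention but is not stated on this page. TokenSource, fixedSource and drain are hypothetical names introduced for the example, and the value -1 for YYEOF is the JFlex default; check Constant Field Values for the actual constant.

```java
import java.io.IOException;

/** Hypothetical driver loop around a yylex()-style scanning method. */
final class ScanLoopSketch {
    /** Assumed end-of-file marker (JFlex default); not confirmed by this page. */
    static final int YYEOF = -1;

    /** Hypothetical interface mirroring the yylex() signature documented above. */
    interface TokenSource {
        int yylex() throws IOException;
    }

    /** Fake scanner for the example: returns the given tokens, then YYEOF. */
    static TokenSource fixedSource(int... tokens) {
        return new TokenSource() {
            private int next = 0;

            @Override
            public int yylex() {
                return next < tokens.length ? tokens[next++] : YYEOF;
            }
        };
    }

    /** Calls yylex() until YYEOF is returned and counts the tokens seen. */
    static int drain(TokenSource lexer) throws IOException {
        int count = 0;
        while (lexer.yylex() != YYEOF) {
            count++;
        }
        return count;
    }

    public static void main(String[] args) throws IOException {
        System.out.println(drain(fixedSource(1, 2, 3)));
    }
}
```

With the real class, the same loop would presumably be followed by a call such as getDocuments() to collect the parsed content and by yyclose() to release the input; the page does not document what docs(), endFile() or getDocuments() return, so that step is left out here.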

Copyright © 2008-2009 Information Management Systems (IMS) research group,
Department of Information Engineering, University of Padua, Italy.