|
|||||||||
PREV NEXT | FRAMES NO FRAMES |
sectiontype
and
proximitybreak
attributes encountered, as well as various
special markers used to locate where in the XML source text the indexed
text is stored.
FreqData.save(File)
.
SpanScorer.next()
and
SpanScorer.skipTo(int)
.
XMLTextProcessor.MAX_DELETION_BATCH
deletions.
lowerTerm
but less than upperTerm
.
BoostSet.getCachedSet(IndexReader, File, String)
instead.
index
if available, otherwise
it returns Character.MIN_VALUE
so that there is some sort
of a default
characters()
method from the SAX parser.
charBuf
buffer.
Chunk
s.ChunkedWordIter
as it progresses through
a document which has been broken into chunks.nodeWordCount
at which the current chunk begins.
Trace.tab()
.
accumText
member
where individual "virtual words" have been compacted down into special
offset markers.
value
contains any of the criteria
starting
at index start
and matching up to length length
StructuredFile.creatingSubfile
is true).
IdxTreeCuller
instance and call this method to
remove documents from indices when the associated source text no longer
exists.
XMLTextProcessor.curIdxSrc
Trace.tab()
is called.
XMLFormatter.tab()
is called.
FileUtils.createTempFile(net.sf.saxon.expr.XPathContext, java.lang.String, java.lang.String)
.
DiskHashWriter
.DefaultQueryProcessor.maxDocScore
)
EmptySpans.theInstance
.
ThreadWatcher.beginWatch(String,long,long)
.
FieldSpans.Entry
per field
o
is equal to this.
o
is equal to this.
QueryGenException
with the
given message.
QueryRequest
.FastTokenizer.DribbleReader
.
FastTokenizer.DribbleReader.fakeStr
FastTokenizer
).WordIter.getPos(int)
or WordIter.getPos(MarkPos,int)
WordIter.getPos(int)
or WordIter.getPos(MarkPos,int)
FileWalker
class is a utility class that simplifies
traversing all the files in a file-system directory, and optionally, in
any sub-directories.Trace.setOutputLevel(int)
.
WordIter.getPos(int)
using the iterator's current position.
XMLTextProcessor.queueText(IndexSource, boolean)
but not yet processed by
XMLTextProcessor.processQueuedTexts()
.
SpanRangeQuery.rewrite(IndexReader)
.
SpanWildcardQuery.rewrite(IndexReader)
.
SpanRangeQuery.rewrite(IndexReader)
.
XTFSaxonErrorListener.clearThreadErrors()
StructuredFile.setUserVersion(String)
.
StructuredStore.setUserVersion(String)
.
convert()
method that converts an HTML file into an XML string that can be
pre-filtered and added to a Lucene database by the
XMLTextProcessor
class.true
if the lower endpoint is inclusive
true
if the upper endpoint is inclusive
indexSearcher
to check if
the current document needs to be added to, updated in, or
removed from the index.
IndexSource
.indexReader
to check if the
current document needs to be added to, updated in, or removed
from the index.
SectionInfo
objects.
insertVirtualWords()
method to insert a specified number of virtual word symbols.
String
values
are equal.
String
values
are equal, optionally using the alternate value.
true
if the range query is inclusive
padEnd(String, int)
padStart(String, int)
TextConfig.reportLatency
is true.)
LazyTreeBuilder
, loading nodes on demand rather than holding all
of them in RAM.lazyReceiver
object
that translates SAX events to Saxon's internal Receiver API.
DiskHashReader
EmbeddedList
must implement this
interface.ContextMarker.MARK_NO_TERMS
ContextMarker.MARK_NO_TERMS
ContextMarker.MARK_NO_TERMS
ContextMarker
.FastTokenizer.DribbleReader.buf
processFiles()
.
FileWalker.processFile(String, String, String, String)
method.)
lowerTerm
but less than upperTerm
.
TagArray.curBlock
to point at it.
nodeWordCount
at which the next chunk begins.
SearchTree.VIRTUAL_MARKER
for more info.)
Path
class provides a number of utilities that makes
working with file system paths easier.convert()
method that converts the text in a PDF file into an XML string that can be
pre-filtered and added to a Lucene database by the
XMLTextProcessor
class.IndexSource
will have been
completed when this record is complete.
FastTokenizer.DribbleReader.buf
FastTokenizer.source
array
PackedByteBuf.bytes
Trace.writer
IdxTreeCleaner
instance and call this method to
remove "incomplete" documents from an index directory or a root
directory containing multiple indices.
IdxTreeDictMaker
instance and call this method to
create spelling dictionaries for one or more Lucene indices.
IdxTreeOptimizer
instance and call this method to
optimize one or more Lucene indices.
SectionInfo
instance onto the
top of the section stack.
QueryRoute.parse(NodeInfo)
QueryParser
.read
method of InputStream
.
IntList.calcSortMap()
LongList.calcSortMap()
TextServlet.saveWildcards(String)
.
StdTermFilter.saveWildcards(String)
.
include
which
have no overlap with spans from exclude
.
match
whose end
position is less than or equal to end
.
include
which
have no overlap with spans from exclude
.
include
which
have no overlap with spans from exclude
.
Span
s that orders them by ascending start position,
then end position.EmptySpans.theInstance
.
lowerTerm
but less than upperTerm
.
lowerTerm
but less than upperTerm
.
SpellWriter
, and provides
fast single- and multi-word spelling suggestions.SpellReader.open(File)
instead.
SpellReader
to obtain spelling suggestions.SpellWriter.open(File)
method.
StandardAnalyzer
, which performs
all the same functions plus queues words to a spelling dictionary.SpellWritingAnalyzer.STOP_WORDS
).
StructuredFile
.StructuredFile
.StructuredStore
.StructuredStore
.Trace.clearTabs()
.
Trace.debug(String)
.
Trace.error(String)
.
Trace.info(String)
.
Trace.more(String)
.
Trace.more(int, String)
.
Trace.tab()
Trace.untab()
Trace.warning(String)
.
Trace.defaultTabSize
(default 2) spaces.
Trace.tab()
is called.
XMLFormatter.tab()
is called.
WordIter.getPos(int)
or WordIter.getPos(MarkPos,int)
WordIter.getPos(int)
or WordIter.getPos(MarkPos,int)
WordIter.getPos(int)
or WordIter.getPos(MarkPos,int)
StandardTokenizer
filtered by a StandardFilter
, a SpellWritingFilter
, a LowerCaseFilter
and a StopFilter
.
SpanRecordingScorer.maxSpans
Trace
class provides a mechanism for logging output
messages to the console or any PrintStream or Writer.Trace.getThreadTrace()
.
Trace.tab()
, un-indenting subsequent output lines.
Trace.printStream
XMLTextProcessor
.ChunkMarkPos
by adding node number, word offset, and
section type information.LimIndexReader
except it also does a periodic check if
the request has taken too long and should kill itself.XtfSpecialTokensFilter
class is used by the
XTFTextAnalyzer
class to convert special "bump" count values in
text chunks to actual position increments for words prior to adding them
to a Lucene index.XtfSpecialTokensFilter
.
XTFTextAnalyzer
class performs the task of breaking up a
contiguous chunk of text into a list of separate words (tokens
in Lucene parlance.)XMLTextProcessor
class.
|
|||||||||
PREV NEXT | FRAMES NO FRAMES |