Document Selector Output
The following tags make up the XML output of the Document Selector stylesheet. They constitute a simple XML list of the files to be indexed in one (sub)directory of the document library.
Index List Tag
This tag is the outermost tag for the XML output fragment issued by the Document Selector stylesheet. It has the form:
<indexFiles>
FileToIndex
FileToIndex
…
FileToIndex
</indexFiles>
where
FileToIndex, FileToIndex...
is zero or more File To Index Tags (see below), one for each file to index in the directory.
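As a minimal sketch of the form above, a fragment for a directory with two files to index might look like the following. The file names are hypothetical placeholders, and only the required fileName attribute is shown.

```xml
<!-- Hypothetical example: file names are placeholders, not taken from a real library -->
<indexFiles>
  <file fileName="example1.xml"/>
  <file fileName="example2.pdf"/>
</indexFiles>
```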
File To Index Tag
One copy of this tag should be output by the Document Selector stylesheet for each file that must be indexed. It should appear within an Index List Tag container (see above). It has the form:
<file fileName = "FileName"
{format = "FileFormat"}
{preFilter = "PreFilterPath(s)"}
{displayStyle = "DocumentFormatterPath"}/>
where
fileName="FileName"
is a required attribute that specifies the name of a file to be indexed. Note that this value should contain only the file name itself, without any path information.
format="FileFormat"
is an optional attribute that specifies the format of a file to be indexed. Currently XML, PDF, HTML, plain text, and most Microsoft Word files are handled by the textIndexer, and the format attribute should correspondingly be set to XML, PDF, HTML, Text, or MSWord. If this attribute is omitted, the textIndexer will try to infer the file type from the file extension.
preFilter="PreFilterPath(s)"
is an optional attribute that specifies the path to the Pre-Filter stylesheet to be applied to this file. If the path is not absolute, it is assumed to be relative to the XTF base installation directory (i.e., XTF_HOME). Multiple pre-filters may be specified in a list, separated by ";" or "," characters. They are applied in the order listed: the original file is sent to the first pre-filter, its output is sent to the second pre-filter, whose output is sent to the third, and so on. If this attribute is omitted, no pre-filter is applied to the file.
displayStyle="DocumentFormatterPath"
is an optional attribute that specifies the path to the Document Formatter stylesheet to use for this file. If the path is not absolute, it is assumed to be relative to the XTF base installation directory (i.e., XTF_HOME). If this attribute is present, the textIndexer will create a special cache that the dynaXML servlet uses to display the current file more quickly. If this attribute is omitted, the cache is not created. For more details, see the discussion of Lazy Document Handling in the XTF Under the Hood guide.
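To illustrate how the optional attributes combine, here is a hedged sketch of a complete output fragment. All file names and stylesheet paths are hypothetical placeholders chosen for illustration. The relative paths would be resolved against XTF_HOME as described above.

```xml
<!-- Hypothetical example: every file name and stylesheet path below is a placeholder -->
<indexFiles>
  <!-- Explicit format, two pre-filters applied in the order listed, and a
       Document Formatter path so that the display cache is created -->
  <file fileName="example1.xml"
        format="XML"
        preFilter="style/example/firstPreFilter.xsl;style/example/secondPreFilter.xsl"
        displayStyle="style/example/docFormatter.xsl"/>
  <!-- Explicit format only: no pre-filter is applied and no cache is created -->
  <file fileName="example2.pdf" format="PDF"/>
  <!-- No format attribute: the textIndexer infers the type from the extension -->
  <file fileName="example3.html"/>
</indexFiles>
```

In this sketch the first entry's pre-filter list uses ";" as the separator. A "," would be read the same way according to the attribute description.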