eXtensible Text Framework (XTF)
The eXtensible Text Framework (XTF) is a flexible indexing and query tool that supports searching across collections of heterogeneous data and presents results in a highly configurable manner. The highlights of the XTF system are described in an online brochure [
PDF] .
Downloads and Documentation
Technical Overview
The system is divided into four components:
- crossQuery: The front-end to the collection search system.
- dynaXML: Interface to individual documents.
- Text Engine: Used by crossQuery and dynaXML to perform text searches.
- Indexer: Full-text indexer based on Lucene.
The following diagrams give a general overview of how documents are indexed, stored, queried, retrieved, and displayed using XTF (somewhat outdated).
- System architecture diagram: A general illustration showing the roles the XTF components play in the user experience. [GIF]
- Collection searching diagram: A more detailed view of the collection searching process, covering query parsing and results formatting. [GIF]
- Individual object display diagram: A more detailed view of the object display and internal search mechanisms, covering request parsing, authentication, and document formatting. [GIF]
- Text indexing diagram: An illustration of the workflow for the creation of collection indexes. [GIF]
Who uses XTF?
The CDL uses XTF as a building block for new services and has used it to replace a number of systems previously used for text searching (i.e., DLXS, Greenstone, DynaWeb). As of 2008, CDL has deployed XTF in the following ways:
XTF is also extensively used outside the CDL:
Other institutions exploring XTF include: University of Sydney; OhioLink; University of Texas at Austin; University of Virginia; University of Denver; and University of Kansas Digital Initiatives.
Support
Implementers
While CDL does not directly support XTF implementers, we do make a good-faith effort to address the needs of the XTF community through the following resources on SourceForge:
- The xtf-user email list for those trying to set-up and use XTF. It is monitored by the principal developers of the application.
- The Bug Tracker is the place to submit bug reports.
- The Support Request Tracker and Feature Request Tracker are alternative ways to bring to our attention documentation errors and potential new features, respectively. However, the xtf-user list is probably a better way to start.
Developers
SourceForge resources for XTF developers and others who are interested in contributing to the architecture
- The xtf-devel email list is where developers share their ideas. It also logs all CVS commits.
- The Patch Tracker allows developers to submit XTF patches for our approval.
- Access to the CVS repository is also available