Classes related to document reading and parsing.
Imported documents
Declaration Syntax
C# | Visual Basic | Visual C++ |
namespace Keyoti.SearchEngine.Documents
Namespace Keyoti.SearchEngine.Documents
namespace Keyoti.SearchEngine.Documents
Types
All Types | Classes | Enumerations |
Icon | Type | Description |
---|---|---|
DataSetRecordParser |
Parser of XML representation of record from a DataSet based source.
| |
Document |
Represents an indexed document.
| |
DocumentText |
Represents the text in a document, including title and static summary.
| |
DocxParser |
Summary description for DocxParser.
| |
FatalReaderException |
A Reader exception that is fatal.
| |
GenericIFilterParser |
A generic, IFilter based document parser, which can read text from any document for which an IFilter is installed on the system.
| |
HtmlDocumentParser |
Parses Html documents for words and links.
| |
HtmlStringParser |
Converts HTML to plain text
| |
LastModifiedDateNotAvailableException |
Signifies that a Last-Modified HTTP header wasn't provided in the HTTP response.
| |
OdsDocumentParser |
Parses Open Document Format (ODF) spreadsheet files (.ods).
| |
OdtDocumentParser |
Parses Open Document Format (ODF) word processor files (.odt).
| |
Parser |
Abstract base class for document parsers.
| |
ParserProvider |
Provides a parser for different document types.
| |
PdfDocumentParser |
Pdf Parser
| |
PdfDocumentTitleSource |
Where the title of a Pdf document is obtained from.
| |
PptxParser |
Parses PPTX (PowerPoint) files.
| |
PreloadedDocument |
A document with preset text.
| |
Reader |
Reads files from the filesystem and pages from servers via HTTP.
| |
ReaderException |
An exception that originated in the Reader.
| |
ReaderExceptionOccurredEventArgs |
Holds data about the WebException that caused an exception in the Reader class.
| |
RtfDocumentParser |
Rtf document parser
| |
TxtDocumentParser |
Parser for .txt documents.
| |
Word2002Parser |
Parses v97-2003 binary format documents.
| |
WordDocumentParser |
Master parser for Word documents, this class applies RTF or Word2002 Parsers according to actual structure of .doc files.
| |
XlsxParser |
.xlsx (MS Excel) document parser.
| |
XmlDocumentParser |
Reads plain text from an Xml document stream.
|