Classes related to document reading and parsing.
Imported documents
data:image/s3,"s3://crabby-images/1c45a/1c45ad35cb53f18029d24bb8c9e73de43222cbcd" alt=""
C# | Visual Basic | Visual C++ |
namespace Keyoti.SearchEngine.Documents
Namespace Keyoti.SearchEngine.Documents
namespace Keyoti.SearchEngine.Documents
data:image/s3,"s3://crabby-images/1c45a/1c45ad35cb53f18029d24bb8c9e73de43222cbcd" alt=""
All Types | Classes | Enumerations |
Icon | Type | Description |
---|---|---|
![]() | DataSetRecordParser |
Parser of XML representation of record from a DataSet based source.
|
![]() | Document |
Represents an indexed document.
|
![]() | DocumentText |
Represents the text in a document, including title and static summary.
|
![]() | DocxParser |
Summary description for DocxParser.
|
![]() | FatalReaderException |
A Reader exception that is fatal.
|
![]() | GenericIFilterParser |
A generic, IFilter based document parser, which can read text from any document for which an IFilter is installed on the system.
|
![]() | HtmlDocumentParser |
Parses Html documents for words and links.
|
![]() | HtmlStringParser |
Converts HTML to plain text
|
![]() | LastModifiedDateNotAvailableException |
Signifies that a Last-Modified HTTP header wasn't provided in the HTTP response.
|
![]() | OdsDocumentParser |
Parses Open Document Format (ODF) spreadsheet files (.ods).
|
![]() | OdtDocumentParser |
Parses Open Document Format (ODF) word processor files (.odt).
|
![]() | Parser |
Abstract base class for document parsers.
|
![]() | ParserProvider |
Provides a parser for different document types.
|
![]() | PdfDocumentParser |
Pdf Parser
|
![]() | PdfDocumentTitleSource |
Where the title of a Pdf document is obtained from.
|
![]() | PptxParser |
Parses PPTX (PowerPoint) files.
|
![]() | PreloadedDocument |
A document with preset text.
|
![]() | Reader |
Reads files from the filesystem and pages from servers via HTTP.
|
![]() | ReaderException |
An exception that originated in the Reader.
|
![]() | ReaderExceptionOccurredEventArgs |
Holds data about the WebException that caused an exception in the Reader class.
|
![]() | RtfDocumentParser |
Rtf document parser
|
![]() | TxtDocumentParser |
Parser for .txt documents.
|
![]() | Word2002Parser |
Parses v97-2003 binary format documents.
|
![]() | WordDocumentParser |
Master parser for Word documents, this class applies RTF or Word2002 Parsers according to actual structure of .doc files.
|
![]() | XlsxParser |
.xlsx (MS Excel) document parser.
|
![]() | XmlDocumentParser |
Reads plain text from an Xml document stream.
|