Keyoti SearchUnit API Docs
DataSetRecordParser Class
API DocumentationKeyoti.SearchEngine.DocumentsDataSetRecordParser
Keyoti SearchUnit v6
Parser of XML representation of record from a DataSet based source.
Declaration Syntax
C#Visual Basic
public class DataSetRecordParser : HtmlDocumentParser
Public Class DataSetRecordParser
	Inherits HtmlDocumentParser
Members
All MembersConstructorsMethodsProperties



IconMemberDescription
DataSetRecordParser(Configuration)
New instance

Configuration
Gets the instance of the Configuration class that holds the settings to be used.
(Inherited from Parser.)
CopyStream(Stream) (Inherited from Parser.)
DeriveEncoding(Stream)
Tries to find the encoding of a HTML file from the Content-type meta tag.
(Inherited from HtmlDocumentParser.)
DeriveTitleFromDocument(String)
Attempts to return the title of the document, based on the documentBody
(Inherited from HtmlDocumentParser.)
Encoding
The character encoding used in the document Stream, if applicable.
(Inherited from Parser.)
Equals(Object)
Determines whether the specified Object is equal to the current Object.
(Inherited from Object.)
Finalize()()()()
Allows an object to try to free resources and perform other cleanup operations before it is reclaimed by garbage collection.
(Inherited from Object.)
FindIgnoreRegions(String)
Finds all ignore regions in documentBody.
(Inherited from HtmlDocumentParser.)
FindUrlsInPlainText(String) (Inherited from Parser.)
GetHashCode()()()()
Serves as a hash function for a particular type.
(Inherited from Object.)
GetHiddenFooter(Uri, String, String)
Creates a footer with additional (hidden) indexed text, based on Uri, Title, meta tags etc.
(Inherited from Parser.)
GetNextWord(String)
Returns the next 'word' in rawBody, is iterative, so subsequent calls move to consecutive words.
(Inherited from HtmlDocumentParser.)
GetType()()()()
Gets the type of the current instance.
(Inherited from Object.)
GetWordsInUri(Uri)
Returns list of words as strings in an ArrayList, that are in the Uri
(Inherited from Parser.)
IsCurrentWordInTitle()()()()
Whether word last returned by GetNextWord is in title.
(Inherited from HtmlDocumentParser.)
IsInIgnoredRegion(ArrayList)
Determines whether current word (at wordStart) is in an ignored region.
(Inherited from Parser.)
IsStreamNeeded()()()() Obsolete.
Whether the parser would need a stream to be passed to it in order to perform a ReadText or ReadLinks operation.
(Inherited from Parser.)
MemberwiseClone()()()()
Creates a shallow copy of the current Object.
(Inherited from Object.)
ParseWords(String, ArrayList, WordCollection, StringBuilder, ArrayList)
Parses rawBody into descrete Word objects and places them in readDocumentWords.
(Inherited from Parser.)
PreprocessBreakChunk(String)
Applies any required processing to a chunk of text that typically forms either a word or whitespace block.
(Inherited from Parser.)
ProcessWordsToFinalIndexedList(WordCollection, Boolean)
Processes the list of all words found in the document and returns a list that should be index.
(Inherited from Parser.)
ProcessWordsToFinalIndexedList(WordCollection, Boolean, ArrayList)
Processes the list of all words found in the document and returns a list that should be index.
(Inherited from Parser.)
Read(Stream, Document, Encoding)
Reads a document and returns an object holding it's text and any links.
(Overrides HtmlDocumentParser.Read(Stream, Document, Encoding).)
ReadDocumentContent(Stream, Encoding)
Returns string read from 'stream'.
(Inherited from HtmlDocumentParser.)
ReadLinks(Stream, Encoding) Obsolete.
Reads links to other pages.
(Inherited from Parser.)
ReadMetaTable(String)
Reads the meta tags for a document.
(Inherited from HtmlDocumentParser.)
ReadText(Stream, Uri, Encoding) Obsolete.
Reads text and returns list of words and title
(Inherited from Parser.)
ResetWordPointers()()()()
Resets the current word being processed.
(Inherited from Parser.)
ToString()()()()
Returns a string that represents the current object.
(Inherited from Object.)
TruncateWordWithRepeatedChar(String)
Removes repeated non-letters from word.
(Inherited from Parser.)
WordEnd
The current word's end.
(Inherited from Parser.)
WordStart
The current word's start.
(Inherited from Parser.)
Inheritance Hierarchy
Object
Parser
 HtmlDocumentParser
  DataSetRecordParser

Assembly: Keyoti4.SearchEngine.Core (Module: Keyoti4.SearchEngine.Core.dll) Version: 2022.8.22.610