Keyoti SearchUnit API Docs
Configuration Class
API DocumentationKeyoti.SearchEngineConfiguration
Keyoti SearchUnit v6
Crawling, indexing and search parameters.
Declaration Syntax
C#Visual Basic
public class Configuration : INotifyPropertyChanged
Public Class Configuration
	Implements INotifyPropertyChanged
Members
All MembersConstructorsMethodsPropertiesFieldsEvents



IconMemberDescription
Configuration()()()()
AutoCompleteSuggestionWordMaxLength
Maximum length of words to suggest to the user in the auto-complete drop down, when words originate in the lexicon.

BoostFactorTagName
The boost factor tag name.

CacheCrawlLinks
Whether to store links found on a page before that page is crawled. Set false to conserve memory.

CaseSensitivity Obsolete.
The case sensitivity setting for building and searching.

CentralEventDispatcher
The centralized event dispatcher that will fire all central events.

CrawlSubdomains
Whether to let the crawler crawl [other] subdomains that it finds links to, otherwise the crawler will stay on the exact hostname that it started on.

CreateForwardIndex
Whether to create a 'forward index' which allows for result summary generation and result preview features.

credentials Obsolete.
Map of web-site domain-names/server-names to usernames and passwords.

CurrentIndexVersion
The index version that this build works with.

CurrentMajorVersion
The major version of this build.

CustomDataFilterLiveDataMaximum
The maximum amount of live data to report with results to the client, for AJAX custom data filters.

DataAccessExceptionsThreshold Obsolete.
Number of exceptions that will be tolerated in the data access layer.

DbCommandTimeout
The number of seconds to set CommandTimeout to for MSSQL commands.

DbImportPageSize
The number of records to read in each database import paging operation.

DbType
The storage method for the document index - possible values "sql" or "xml"

DefaultPageNames
List of strings with default page names, these pages are synonomous with a URL ending in a directory.

DocumentWorkTimeout
The length of time, in milliseconds, until the document work times out

Equals(Object)
Determines whether the specified Object is equal to the current Object.
(Inherited from Object.)
ErrorPageRedir
Error page URL pattern

EventHandlerAssemblyPath
The absolute file path of an external assembly that will be automatically registered to handle operational events.

exceptionThresholds Obsolete.
Contains exception thresholds.

ExpressionOperatorsCollection
Collection of logic operators, containing the default pair (AND,OR)

FileTypesSettings
Maps mime-type strings to file extensions.

Finalize()()()()
Allows an object to try to free resources and perform other cleanup operations before it is reclaimed by garbage collection.
(Inherited from Object.)
ForceGCollectOnDocRead
Whether to force a garbage collect before and after reading a document. ONLY set true when debugging memory issues.

GetApplicationDataFolderPath()()()()
Returns the folder under LocalApplicationData where install info is stored.

GetHashCode()()()()
Serves as a hash function for a particular type.
(Inherited from Object.)
GetMimeFromFileExtension(String)
Returns the mime-type for a particular file extension, as per settings in fileTypesSettings

GetType()()()()
Gets the type of the current instance.
(Inherited from Object.)
IgnoreBlockBeginPattern
Pattern that marks the start of a block that should be ignored in the search process

IgnoreBlockEndPattern
Pattern that marks the end of a block that should be ignored in the search process

IgnoreCommonResourceTypes
Whether to ignore links to common resource types, eg. .jpg, .gif, .js.

IgnoreDiacritics
Whether to ignore diacritic marks, eg. treat 'é' in a word the same as 'e', so 'sake' will find 'saké' (default is true).

IgnoreLastModifiedDate
Whether to ignore the 'last modified date' returned from the server, during build operations.

IgnoreSSLErrors
Whether to ignore SSL issues on the website being imported - eg. expired certificate.

ImpliedLogicOperator
The implied logic operator between query expression words.

IndexDirectory
The path to the index directory - eg "c:\documentIndex"

IndexedDocumentBufferSize
The number of documents to index to the memory buffer before writing to disk.

IndexingInterval
The length of time in days that must pass before the document is re-indexed.

IndexNumbers
Whether to index numbers.

LemmaLanguage
The language to look for lemmas in.

LemmaSearchEnabled
Whether to look for variations of query words, eg plural forms, see LemmaWeightFactor and LemmaLanguage also.

LemmaWeightFactor
Factor to adjust weight of lemma (word variations) matches.

LogFileMaxSize
The size in bytes at which log files are refreshed.

Logging
Whether to log activity and errors to files stored in the index directory.

MaxDocumentSizeToIndex
Documents smaller than this number of kilobytes will be indexed, and documents larger than this will be skipped.

MaximumCrawlDepth
Specifies the maximum 'link depth' that should be crawled.

MaxResponseHeaderLength
The length in kilobytes of the response headers. A value of -1 means no limit will be imposed on the response headers

MemberwiseClone()()()()
Creates a shallow copy of the current Object.
(Inherited from Object.)
OnNotifyPropertyChanged(String)
Fires the PropertyChanged event.

ParseMSWordDocUsingIFilter
Whether to parse MS Word .doc files using the Windows IFilter (COM interop), if false the pure .NET parser is used (default false).

ParseRTFDocUsingIFilter
Whether to use the built-in Windows IFilter to parse RTF files. Default is true.

PdfDocumentTitleSource
Where the title of a Pdf document is obtained from.

PropertyChanged
Fired when a property value changes.

ProxyAddress
The address of a proxy server to use to make outbound connections during indexing. Leave empty to not use a proxy.

ProxyPort
The port to connect to the proxy server on (if a proxy is used).

ReadDocumentInNewThread
Whether to read document text in a separate thread, to enable timeouts.

ReaderExceptionsThreshold
Number of exceptions that will be tolerated in the reader layer.

ReadGlobalConfigKey(String)
Reads a value from the global install config (located under the user's application data folder).

RequestThrottle
The length of time, in milliseconds, that the application will wait before each request is made, in order to limit the server load.

RequestTimeout
The length of time, in milliseconds, until the request times out

RespectsRobotsMetaTags
Whether to consider the robots meta tags (noindex, nofollow).

RespectsRobotsTXT
Whether to consider the robots.txt specifications

ResultSummaryType
The type of summary that will be shown.

ServiceRebuildFrequency
Indexing interval (hours)

ServiceRebuildHour
The hour when the reindexing will start (int hour, 0-23)

ServiceWillOptimizeIndex
Whether the service should automatically optimize the index for searching.

ServiceWillSendEmailLog
Whether the service should send an email log after it has run.

SmtpEmailFrom
The email from address that the log will be sent 'from'.

SmtpEmailTo
The email from address that the log will be sent to.

SmtpMailServer
The email server to use to send email with service run logs.

SmtpPassword
The email server password if required. Password will be encrypted and the unencrypted form is not available from configuration files or API.

SmtpServerPort
The email server port number to use to send email with service run logs.

SmtpUsername
The email server username if required.

StopWords
List of 'common' words which are not searchable.

SummaryTextSplitter
The string used to split apart summary text from different parts of the result document, default is '...'.

SummaryWordLength
Number of words to show in result summaries.

ToString()()()()
Returns a string that represents the current object.
(Inherited from Object.)
UpgradeCrawledURLsToHTTPS
If the start URL is HTTPS, then upgrade all crawled links to HTTPS.

URLCaseSensitive
Whether, during indexing, to compare URLs in a case sensitive manner (default is false) - some web servers are case sensitive, IIS is not.

UseFileSizeToIdentifyChange
Whether to use the file size of a document to identify if it has been changed or not since it was last indexed (also see IgnoreLastModifiedDate).

UserAgent
The user agent that the search engine identifies itself to the server as.

WebProxy
Proxy object to use for connections. Setting can speed up access to external servers.

WebProxySet
Whether a proxy has been set.

WebsiteCredentials
Map of web-site domain-names/server-names to usernames and passwords.

WordBreakingCharacters
Characters that will not be considered parts of 'words' and will break a string apart. Any space characters will be ignored.

WordNonBreakingCharacters
Characters that will be considered parts of 'words' and not break a string apart. Eg. To be able to search for strings like my_email@blah.com - ensure that @ . and _ are in this list of characters. Any space characters will be ignored.

WordsAroundNumber
Number of words to show around non-continguous summary occurrences.

WordWeightFactorBody
The weighting factor given to words in the document body - increase to make words in the body text more important.

WordWeightFactorTitle
The weighting factor given to words in the document title - increase to make words in the title text more important.

WordWeightFactorUrl
The weighting factor given to words in the document title - increase to make words in the URL more important.

xmlLocation Obsolete.
The path to the XML document index directory - eg "c:\documentIndex"

Inheritance Hierarchy
Object
Configuration

Assembly: Keyoti4.SearchEngine.Core (Module: Keyoti4.SearchEngine.Core.dll) Version: 2022.8.22.610