Searching Document Content

Enabling document searching allows users to search the content of document-type assets. 

Enabling content indexing may impact the relevancy of certain search results (such as keyword searches).

Searching document content

Keyword search

  1. Open the search bar dropdown menu and click Keyword.
  2. Type your search term into the search bar.
  3. Press Enter on your keyboard.

Do not include quotation marks in content searches, even when searching for quoted text such as a dialogue segment.
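If search terms are pasted from a script or transcript, the quotation marks can be removed before the term is entered. The following is a minimal sketch of that cleanup step; the function name strip_quotes is hypothetical and is not part of NetX, and it assumes that only double quotation marks (straight or curly) need to be removed.

```python
# Hypothetical helper, not part of NetX: removes double quotation marks
# from a search term before it is typed into the search bar.
DOUBLE_QUOTES = '"\u201c\u201d'  # straight, left curly, right curly


def strip_quotes(term: str) -> str:
    """Return the term without double quotation marks and extra spaces."""
    cleaned = term.translate({ord(ch): None for ch in DOUBLE_QUOTES})
    # Collapse any runs of whitespace left behind by the removal.
    return " ".join(cleaned.split())


print(strip_quotes('"to be or not to be"'))
```

Apostrophes are left untouched so that words such as "don't" are not altered.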

Advanced search

  1. Open the search bar dropdown menu and click Advanced.
  2. Select Contents from the Search type dropdown menu.
  3. Type your search term into the Value line. 
  4. Click Search.

Supported file formats

The following file types can be indexed for content searching:

  • csv
  • docx/doc
  • eml
  • html
  • pdf
  • pptx/ppt
  • rtf
  • swf
  • txt
  • vtt
  • xlsx
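To check ahead of an upload whether a file's content is eligible for indexing, the list above can be expressed as a simple extension lookup. This is an illustrative sketch only: the function name is hypothetical, and matching extensions without regard to case is an assumption rather than documented NetX behavior.

```python
from pathlib import Path

# Extensions taken from the supported file formats list above.
SUPPORTED_EXTENSIONS = {
    "csv", "docx", "doc", "eml", "html", "pdf",
    "pptx", "ppt", "rtf", "swf", "txt", "vtt", "xlsx",
}


def is_content_indexable(filename: str) -> bool:
    """Return True if the file extension appears in the supported list."""
    # Assumption: extension matching is case-insensitive.
    extension = Path(filename).suffix.lower().lstrip(".")
    return extension in SUPPORTED_EXTENSIONS


for name in ["report.PDF", "photo.jpg", "captions.vtt"]:
    print(name, is_content_indexable(name))
```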

Configuring document indexing

The following properties must be enabled before any document content will be indexed, whether on upload or when reindexing the asset:

Property: search.keywordIncludeContents

Description: If this property is true, the contents of document files will be searchable while performing both keyword and advanced content searches. If false, keyword search results will not include document content.

Value options: true / false

Requires restart? Yes

Property: searchEngine.contentIndexingEnabled

Description: If this property is true, document text will be indexed when a file of a supported format is uploaded or reindexed. Note that if you are turning this property on for the first time, any documents uploaded before the change will need to be reindexed or their content will remain unsearchable. Setting this property's value to false will disable document content indexing.

Value options: true / false

Requires restart? Yes
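Because documents uploaded before content indexing was turned on stay unsearchable until they are reindexed, it can help to identify them by date. The sketch below shows that comparison under stated assumptions: the function name and both timestamps are hypothetical values you would supply yourself, and nothing here calls a NetX API.

```python
from datetime import datetime


def needs_reindex(last_indexed: datetime, indexing_enabled_at: datetime) -> bool:
    """Return True if the asset was last uploaded or reindexed before
    searchEngine.contentIndexingEnabled was set to true."""
    return last_indexed < indexing_enabled_at


# Hypothetical dates, for illustration only.
enabled_at = datetime(2024, 1, 10)
print(needs_reindex(datetime(2024, 1, 5), enabled_at))
print(needs_reindex(datetime(2024, 2, 1), enabled_at))
```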

 

Advanced settings

By default, there is a restriction on the size of documents the system will index. The following properties adjust this default: 

Property: searchEngine.contentIndexingMaxSize

Description: The maximum file size, in bytes, of an asset whose content NetX will index while uploading and reindexing assets. If an asset is larger than this value, its content will only be indexed up to this size. This property applies to doc, html, pdf, ppt, rtf, swf, xlsx, and docx file types. The default value is 25000000 (25 MB).

Type: number, in bytes

Value options: No

Property: searchEngine.contentIndexingMaxPlainSize

Description: The maximum file size, in bytes, of an asset whose content NetX will index while uploading and reindexing assets. If an asset is larger than this value, its content will only be indexed up to this size. This property applies to csv, eml, and text file types. The default value is 5000000 (5 MB).

Type: number, in bytes

Value options: No
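The effect of the two size limits can be illustrated with a short calculation. This is a sketch of the rule as described above, not NetX code. It assumes the values are in bytes (as the Type line and the defaults suggest), that "text" refers to the txt extension, and that formats not named under either property (such as pptx and vtt) have no documented limit. All names are hypothetical.

```python
from pathlib import Path
from typing import Optional

# Default values quoted in the property descriptions, in bytes.
MAX_SIZE = 25_000_000        # searchEngine.contentIndexingMaxSize
MAX_PLAIN_SIZE = 5_000_000   # searchEngine.contentIndexingMaxPlainSize

RICH_TYPES = {"doc", "html", "pdf", "ppt", "rtf", "swf", "xlsx", "docx"}
PLAIN_TYPES = {"csv", "eml", "txt"}  # assumption: "text" means txt


def indexed_bytes(filename: str, file_size: int) -> Optional[int]:
    """Return how many bytes of the file would be indexed, or None when
    neither property is documented as applying to the file type."""
    extension = Path(filename).suffix.lower().lstrip(".")
    if extension in RICH_TYPES:
        limit = MAX_SIZE
    elif extension in PLAIN_TYPES:
        limit = MAX_PLAIN_SIZE
    else:
        return None
    # Content is indexed only up to the configured limit.
    return min(file_size, limit)


print(indexed_bytes("manual.pdf", 40_000_000))
print(indexed_bytes("data.csv", 6_000_000))
```

With the defaults, a 40,000,000-byte pdf would have only its first 25,000,000 bytes indexed, and a 6,000,000-byte csv only its first 5,000,000.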
