Content Comparison

...

On the server where the DAM Center is installed, navigate to the Cognitive Service directory (typically “Webs/<yourDAM>/DigizuiteCore/cognitiveservice”).
Edit the “appsettings.json”-file. The following parameters in the “ComputerVisionDetails”-section are relevant:

Parameter	Description
OcrKey	The key from the Computer Vision resource created above. (One of the “KEY” entries in the image above)
OcrServerUri	The URI from the Computer Vision resource created above. (The “Endpoint” entry in the image above)
OcrExtractFromPdf	If true, the text contents of PDF files are extracted when new PDF files are uploaded to the DAM.
OcrExtractFromImage	If true, the text contents of images are extracted when new images are uploaded to the DAM.
OcrLetAzureRequestFiles	If false, files are explicitly uploaded to the Computer Vision client. Otherwise, Azure will request the files from the DAM Center. Setting this to true is expected to be more efficient, but it requires that the DAM Center can be accessed by Azure. Thus, ensure that the DAM Center is not behind a strict firewall if this is set to true.
OcrTaskDelayLength	We regularly check the status of ongoing content extractions in the Computer Vision client. This gives the time interval between each check. The larger the time interval is, the less requests are made to Azure. However, it then also takes more time for the extracted contents of files to be available in the “Asset Content” metafield. You most likely don’t have to change this.

Once the information has been provided, and the “appsettings.json”-file has been saved, the contents of PDFs and/or images are extracted when PDFs/images are uploaded.

...

Info
The metafield “Asset content” is automatically created when installing or upgrading to 5.5. The field is created in metagroup “Content”, and it is very important that this exact field is used in the configuration as the GUID of the metadata field is used as a dependency in the system.

Including asset content in searches (When using Solr)

The extracted contents of assets can be included in freetext searches by adding the metafield “Asset content” in the search “DigiZuite_System_Framework_Search“ as a freetext input parameter. The “Asset content” metafield can be added as a freetext input parameter by doing the following:

...

Info
The contents of existing assets can be extracted by republishing the assets.

Including asset content in searches (When using ElasticSearch (MM 5.6+))

Go into “Generate settings” => “Asset search”, and add the field “Asset content” to “Freetext search fields”.

If the “Asset content” field is not available, make sure it’s available from the metadata editor in MM and readable for all the people that needs to search it, otherwise the Search engine will not index it.

...

Important Information

Please be aware of the following when using the Computer Vision resource:

...

Version	Old Version 1	New Version 2
Changes made by	Rasmus Hjelmberg Duemose Hansen	Rasmus Hjelmberg Duemose Hansen
Saved on	Oct 18, 2021	Sept 27, 2022

Content Comparison

Versions Compared

Key

Including asset content in searches (When using Solr)

Including asset content in searches (When using ElasticSearch (MM 5.6+))

Important Information