...
Log in to the Azure portal (https://portal.azure.com/).
Search for “Cognitive Services”.
Click “Add” to add a new Cognitive Service.
Search for “Computer Vision” and create a new client.
...
The Computer Vision resource has a key and a server URI, both of which are needed shortly. They are listed under “Keys and Endpoints” in the Computer Vision resource.
...
Content extraction can now be enabled in the Cognitive Service, which is part of Digizuite Core.
On the server where the DAM Center is installed, navigate to the Cognitive Service directory (typically “Webs/<yourDAM>/DigizuiteCore/cognitiveservice”).
Edit the “appsettings.json” file. The following parameters in the “ComputerVisionDetails” section are relevant:
Parameter | Description |
---|---|
OcrKey | The key from the Computer Vision resource created above. |
OcrServerUri | The URI from the Computer Vision resource created above. It is called “Endpoint” in the “Keys and endpoints” section of the Computer Vision resource. |
OcrExtractFromPdf | If true, the text contents of PDF files are extracted when the PDF files are uploaded to the DAM. |
OcrExtractFromImage | If true, the text contents of images are extracted when the images are uploaded to the DAM. |
OcrLetAzureRequestFiles | If false, files are uploaded explicitly to the Computer Vision client. If true, Azure requests the files from the DAM Center instead. Setting this to true is expected to be more efficient, but it requires that Azure can reach the DAM Center, so ensure that the DAM Center is not behind a strict firewall if this is set to true. |
OcrTaskDelayLength | The status of ongoing content extractions in the Computer Vision client is checked regularly. This parameter gives the time interval between checks. The larger the interval, the fewer requests are made to Azure, but the longer it takes for the extracted contents of files to become available in the “Asset Content” metafield. |
OcrLanguage |
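
As a rough illustration of the parameters above, the following Python sketch checks a “ComputerVisionDetails” section before the service is restarted. The parameter names are taken from the table; everything else is an assumption made for the example: the expected value types (string and boolean), the placeholder key and endpoint, and the helper name `check_computer_vision_details`, which is hypothetical and not part of Digizuite. OcrTaskDelayLength and OcrLanguage are left out because their value formats are not stated here.

```python
import json

# Parameter names come from the table above.
# The expected types are assumptions, not confirmed by the Digizuite documentation.
EXPECTED_TYPES = {
    "OcrKey": str,
    "OcrServerUri": str,
    "OcrExtractFromPdf": bool,
    "OcrExtractFromImage": bool,
    "OcrLetAzureRequestFiles": bool,
}


def check_computer_vision_details(settings: dict) -> list:
    """Return a list of problems found in the ComputerVisionDetails section."""
    section = settings.get("ComputerVisionDetails")
    if not isinstance(section, dict):
        return ["ComputerVisionDetails section is missing"]

    problems = []
    for name, expected in EXPECTED_TYPES.items():
        if name not in section:
            problems.append(f"{name} is missing")
        elif not isinstance(section[name], expected):
            problems.append(f"{name} should be of type {expected.__name__}")

    # Assumption: the Azure endpoint is an https URL.
    uri = section.get("OcrServerUri")
    if isinstance(uri, str) and not uri.startswith("https://"):
        problems.append("OcrServerUri should start with https://")
    return problems


# Placeholder values only; substitute the key and endpoint of your own resource.
example = json.loads("""
{
  "ComputerVisionDetails": {
    "OcrKey": "<your-key>",
    "OcrServerUri": "https://<your-resource>.cognitiveservices.azure.com/",
    "OcrExtractFromPdf": true,
    "OcrExtractFromImage": true,
    "OcrLetAzureRequestFiles": false
  }
}
""")

print(check_computer_vision_details(example))  # prints []
```

To check a real installation, the same function could be given the result of `json.load` on the “appsettings.json” file, provided that file is plain JSON without comments.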
2. Including asset contents in searches
...