Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Record view / Improve support of HTML in abstract
When harvesting some ODS portal eg. ``` {"@id":"4284","@type":"simpleurl","owner":["70"],"ownerGroup":[14697],"ownerUser":["70"],"site":{"name":"odwb","uuid":"1f4e3e33-be43-486e-a371-f9fed31d4df1","account":{"use":false,"username":[],"password":[]},"url":"https://www.odwb.be/api/explore/v2.0/catalog/datasets?limit=10&offset=0&timezone=UTC&include_links=false&include_app_metas=false","icon":"blank.png","loopElement":"/datasets","numberOfRecordPath":"/total_count","recordIdPath":"/dataset/dataset_id","pageSizeParam":"limit","pageFromParam":"offset","toISOConversion":"schema:iso19115-3.2018:convert/fromJsonOpenDataSoft"},"content":{"validate":"NOVALIDATION","importxslt":"none","batchEdits":"[]"},"options":{"every":"0 0 0 ? * *","oneRunOnly":false,"overrideUuid":"SKIP","status":"active"},"privileges":[{"@id":"1","operation":[{"@name":"view"},{"@name":"dynamic"},{"@name":"download"}]}],"ifRecordExistAppendPrivileges":false,"info":{"lastRun":"2024-09-09T08:21:08.124161Z","running":false,"result":{"added":"1036","atomicDatasetRecords":"0","badFormat":"0","collectionDatasetRecords":"0","datasetUuidExist":"0","privilegesAppendedOnExistingRecord":"0","doesNotValidate":"0","xpathFilterExcluded":"0","duplicatedResource":"0","fragmentsMatched":"0","fragmentsReturned":"0","fragmentsUnknownSchema":"0","incompatible":"0","recordsBuilt":"0","recordsUpdated":"0","removed":"0","serviceRecords":"0","subtemplatesAdded":"0","subtemplatesRemoved":"0","subtemplatesUpdated":"0","total":"1036","unchanged":"0","unknownSchema":"0","unretrievable":"0","updated":"0","thumbnails":"0","thumbnailsFailed":"0"}}} ``` Some records use HTML in abstract eg. https://www.odwb.be/explore/dataset/ares-referentiel-des-bassins-enseignement-qualifiant-formation-emploi-efe/information/?disjunctive.te_arrond_adm&disjunctive.te_province&disjunctive.te_region Linky directive breaks the HTML and should not be used when the text is HTML.
- Loading branch information