Is it possible to install and configure presidio from a databricks environment without internet access? #1435
-
Hi, I'm trying to configure presidio in a databricks environment with limited access to the internet. When I do so it always attempts to download en_core_web_lg from the internet. Even when I first dowload en_core_web_lg manually and load it to spacy, I always get an error after |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 2 replies
-
In case someone else comes across this, I found the solution was to download the wheel file from https://spacy.io/models/en upload it to dbfs, then copy it from dbfs to the file system (dbutils.fs.cp). then install it with: |
Beta Was this translation helpful? Give feedback.
In case someone else comes across this, I found the solution was to download the wheel file from https://spacy.io/models/en upload it to dbfs, then copy it from dbfs to the file system (dbutils.fs.cp). then install it with:
!pip install /tmp/en_core_web_lg-3.7.1-py3-none-any.whl
after that presidio no longer tires to download en_core_web_lg when envoked.