Error using some pretrained pipelines in Spark/PySpark 3.x #2738

muhammetsnts · 2021-04-06T09:15:44Z

Description

When I try to use match_chunks and match_datetime pretrained pipelines in sparknlp_version 3.0.1, get errors while downloading these pipelines.

Current Behavior

Steps to Reproduce

pipeline = PretrainedPipeline('match_chunks', lang='en')
pipeline = PretrainedPipeline('match_datetime', lang='en')

Context

Your Environment

Spark NLP version sparknlp.version(): 3.0.1
Apache NLP version spark.version: 3.1.1
Java version java -version: openjdk version "1.8.0_282"

The text was updated successfully, but these errors were encountered:

maziyarpanahi · 2021-04-09T07:52:20Z

@Digaari Only report anything related to public and open-source. We are not responsible for anything else.

Digaari · 2021-04-12T04:48:17Z

Faced similar issue with check_spelling_dl in the same environment.

maziyarpanahi · 2021-06-24T11:19:05Z

Any model/pipeline with that specific error means they are not trained/saved in Spark 3.x/Scala 2.12, they need to be trained/saved using Spark 3.x.

Please make a list and pass it to the Models Hub team to fix them

muhammetsnts · 2021-06-27T07:26:52Z

@Digaari here is a list of pipelines that not working on spark 3.x.

clean_slang
check_spelling_dl
match_chunks
match_datetime

muhammetsnts · 2021-12-20T11:39:52Z

@maziyarpanahi this issue is still open and we've tested these models, they are still broken. As you said, @josejuanmartinez can assign someone from modelshub team.

maziyarpanahi · 2021-12-20T14:12:19Z

Thanks @muhammetsnts

These pipelines need to be re-do/re-uploaded by using Apache Spark 3.x. Some models/pipelines need two copies one for spark 2.x and one for soark 3.x.

So these work in Spark 2.x but the 3.x are missing. Please assign a member to save and upload these pipelines with the same version/metadata but on spark 3.x this time.

cholojuanito · 2022-04-07T22:45:45Z

I'm still running into this issue with the following environment

Spark NLP version sparknlp.version(): 3.4.2
Apache NLP version spark.version: 3.1.2
Java version java -version: openjdk version "11.0.14"

muhammetsnts assigned maziyarpanahi Apr 6, 2021

maziyarpanahi added bug models_hub pretrained models and pipelines labels Apr 6, 2021

maziyarpanahi changed the title ~~Error using some pretrained pipelines~~ Error using some pretrained pipelines in Spark/PySpark 3.x Apr 6, 2021

JohnSnowLabs deleted a comment from Digaari Apr 9, 2021

maziyarpanahi mentioned this issue Jan 4, 2022

2022-01-04-check_spelling_dl_en #6706

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Error using some pretrained pipelines in Spark/PySpark 3.x #2738

Error using some pretrained pipelines in Spark/PySpark 3.x #2738

muhammetsnts commented Apr 6, 2021

maziyarpanahi commented Apr 9, 2021

Digaari commented Apr 12, 2021

maziyarpanahi commented Jun 24, 2021

muhammetsnts commented Jun 27, 2021

muhammetsnts commented Dec 20, 2021

maziyarpanahi commented Dec 20, 2021

cholojuanito commented Apr 7, 2022 •

edited

Loading

Error using some pretrained pipelines in Spark/PySpark 3.x #2738

Error using some pretrained pipelines in Spark/PySpark 3.x #2738

Comments

muhammetsnts commented Apr 6, 2021

Description

Current Behavior

Steps to Reproduce

Context

Your Environment

maziyarpanahi commented Apr 9, 2021

Digaari commented Apr 12, 2021

maziyarpanahi commented Jun 24, 2021

muhammetsnts commented Jun 27, 2021

muhammetsnts commented Dec 20, 2021

maziyarpanahi commented Dec 20, 2021

cholojuanito commented Apr 7, 2022 • edited Loading

cholojuanito commented Apr 7, 2022 •

edited

Loading