[SPARK-45107][PYTHON][DOCS] Refine docstring of explode #42860

allisonwang-db · 2023-09-08T23:28:44Z

What changes were proposed in this pull request?

This PR refines the docstring of function explode by adding more examples.

Why are the changes needed?

To improve PySpark documentation.

Does this PR introduce any user-facing change?

No

How was this patch tested?

doctest

Was this patch authored or co-authored using generative AI tooling?

No

LuciferYang · 2023-09-11T05:15:49Z

Could you re-trigger the CI, @allisonwang-db?

LuciferYang · 2023-09-12T03:31:15Z

python/pyspark/sql/functions.py

+    >>> from pyspark.sql import Row
+    >>> df = spark.createDataFrame([Row(a=1, list1=[1, 2], list2=[3, 4])])
+    >>> df.select(sf.explode(df.list1).alias("list1"), "list2") \
+    ...     .select("list1", sf.explode(df.list2).alias("list2")).show()


It seems the test failure is related to this one:

**********************************************************************
File "/__w/spark/spark/python/pyspark/sql/functions.py", line 286, in pyspark.sql.functions.explode
Failed example:
    df.select(sf.explode(df.list1).alias("list1"), "list2") ... .select("list1", sf.explode(df.list2).alias("list2")).show()
Exception raised:
    Traceback (most recent call last):
      File "/usr/local/pypy/pypy3.8/lib/pypy3.8/doctest.py", line 1338, in __run
        exec(compile(example.source, filename, "single",
      File "<doctest pyspark.sql.functions.explode[19]>", line 1
        df.select(sf.explode(df.list1).alias("list1"), "list2") ... .select("list1", sf.explode(df.list2).alias("list2")).show()
        ^
    SyntaxError: invalid syntax
**********************************************************************
1 of 33 in pyspark.sql.functions.explode
***Test Failed*** 1 failures.
/usr/local/pypy/pypy3.8/lib/pypy3.8/runpy.py:127: RuntimeWarning: 'pyspark.sql.functions' found in sys.modules after import of package 'pyspark.sql', but prior to execution of 'pyspark.sql.functions'; this may result in unpredictable behaviour
  warn(RuntimeWarning(msg))
/__w/spark/spark/python/pyspark/sql/udtf.py:163: UserWarning: Arrow optimization for Python UDTFs cannot be enabled: PyArrow >= 4.0.0 must be installed; however, it was not found.. Falling back to using regular Python UDTFs.
  warnings.warn(
Had test failures in pyspark.sql.functions with pypy3; see logs.
Error: running /__w/spark/spark/python/run-tests --modules=pyspark-sql,pyspark-testing --parallelism=1 ; received return code 255
Error: Process completed with exit code 19.
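One plausible reading of this log: the failed example shows both physical lines fused into one, with a literal `...` in the middle. That is what happens when a trailing backslash sits inside a non-raw docstring, because Python consumes the backslash-newline while parsing the string literal, before doctest ever sees it. The sketch below reproduces the effect without Spark. The names `NON_RAW`, `RAW`, `example_sources`, and `compiles` are hypothetical and only for illustration; whether the docstring in `functions.py` is raw is an assumption, not something the thread states.

```python
import doctest

# Non-raw docstring: the trailing backslash-newline is consumed when the
# string literal is parsed, so the two physical lines become one line.
NON_RAW = """
>>> total = sum([1, 2]) \
...     + 3
"""

# Raw docstring: the backslash survives, so doctest sees a prompt line
# followed by a proper "..." continuation line.
RAW = r"""
>>> total = sum([1, 2]) \
...     + 3
"""


def example_sources(docstring):
    """Return the source text of each example doctest extracts."""
    parser = doctest.DocTestParser()
    return [example.source for example in parser.get_examples(docstring)]


def compiles(source):
    """Report whether doctest's compile step would accept this source."""
    try:
        compile(source, "<doctest>", "single")
        return True
    except SyntaxError:
        return False


if __name__ == "__main__":
    for label, doc in (("non-raw", NON_RAW), ("raw", RAW)):
        source = example_sources(doc)[0]
        print(label, repr(source), "compiles:", compiles(source))
```

Making the docstring raw, or dropping the backslash in favour of parentheses around the chained call, are two common ways around this. The thread does not say which approach the PR adopted.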

zhengruifeng · 2023-09-20T05:55:00Z

starting python compilation test...
python compilation succeeded.

starting black test...
black checks failed:
Oh no! 💥 💔 💥 The required version `23.9.1` does not match the running version `22.6.0`!
Please run 'dev/reformat-python' script.
1
Error: Process completed with exit code 1.

please rebase

zhengruifeng · 2023-09-20T23:59:58Z

thanks, merged to master

github-actions bot added SQL PYTHON labels Sep 8, 2023

HyukjinKwon approved these changes Sep 10, 2023

allisonwang-db force-pushed the spark-45107-refine-explode branch from f48f7d7 to b49fd09 Compare September 11, 2023 18:24

LuciferYang reviewed Sep 12, 2023

allisonwang-db added 3 commits September 20, 2023 09:54

refine

87b29ae

fix

50712ce

retrigger build

078c5c2

allisonwang-db force-pushed the spark-45107-refine-explode branch from 0049b31 to 078c5c2 Compare September 20, 2023 16:54

zhengruifeng closed this in c1dc9ae Sep 20, 2023
