COMMITS
/ python/pyspark/sql/dataframe.py March 25, 2026
R
[SPARK-56179][PYTHON] Consolidate error classes for type mismatch - part 2
Ruifeng Zheng committed
March 24, 2026
R
[SPARK-56179][PYTHON] Consolidate error classes for type mismatch - part 1
Ruifeng Zheng committed
March 19, 2026
H
Revert "[SPARK-53675][PYTHON] Add str support in withColumn and withColumns in PySpark"
Hyukjin Kwon committed
March 18, 2026
Z
March 16, 2026
Y
[SPARK-55986][PYTHON] Upgrade black to 26.3.1
yangjie01 committed
March 13, 2026
T
[SPARK-55967][PYTHON] Unify column conversion for connect dataframe
Tian Gao committed
March 11, 2026
T
[SPARK-55954][PYTHON] Remove the incorrect overload type hint for fillna
Tian Gao committed
T
[SPARK-55955][PYTHON] Remove overload type hint for drop
Tian Gao committed
February 9, 2026
F
[SPARK-55229][SPARK-55231][PYTHON] Implement DataFrame.zipWithIndex in PySpark
Fangchen Li committed
February 5, 2026
D
[MINOR][PS] Convert loop append in Pyspark to list comprehension
Devin Petersohn committed
February 4, 2026
R
[MINOR][PYTHON][TESTS] Skip the doctest of toJSON
Ruifeng Zheng committed
January 19, 2026
R
[SPARK-55090][PYTHON][CONNECT] Implement `DataFrame.toJSON` in Python Client
Ruifeng Zheng committed
January 12, 2026
D
[SPARK-54337][PS] Add support for PyCapsule to Pyspark
Devin Petersohn committed
December 17, 2025
R
[SPARK-54728][PYTHON][DOCS] Remove a wrong note in `dataframe.isEmpty`
Ruifeng Zheng committed
December 10, 2025
Y
[SPARK-54653][DOCS] Add cross-session note to cache/persist public APIs
Yan Yan committed
November 26, 2025
Y
[SPARK-53597][PYTHON][SQL][DOCS] Add `asTable()` and `TableArg` examples
Yicong-Huang committed
November 13, 2025
R
[SPARK-54318][PYTHON][DOCS] Fix doctests in `pyspark.sql.dataframe`
Ruifeng Zheng committed
November 6, 2025
R
[SPARK-54211][PYTHON][DOCS] Update exmple of mapInArrow to use arrow builtin function
Ruifeng Zheng committed
September 22, 2025
S
[SPARK-53429][PYTHON] Support Direct Passthrough Partitioning in the PySpark Dataframe API
Shujing Yang committed
September 18, 2025
R
[SPARK-53632][PYTHON][DOCS][TESTS] Reenable doctest for `DataFrame.pandas_api`
Ruifeng Zheng committed
R
[SPARK-53630][PYTHON][DOCS][TESTS] Reenable doctest for `Dataframe.freqItems`
Ruifeng Zheng committed
R
[SPARK-53619][PYTHON][DOCS][TESTS] Enable doctests for toArrow/toPandas/mapInArrow/mapInPandas
Ruifeng Zheng committed
September 17, 2025
R
[SPARK-53606][DOCS] Fix MapInPandas/MapInArrow examples with barrier
Ruifeng Zheng committed
September 16, 2025
R
[MINOR][PYTHON][DOCS] Correct the examples of `toPandas` and `toArrow`
Ruifeng Zheng committed
August 18, 2025
R
[SPARK-53302][PYTHON][TESTS] Make doctest of df.unpivot deterministic
Ruifeng Zheng committed
July 16, 2025
M
[MINOR][DOCS] Fix example 4 in DataFrame.na.replace()
Misha Brukman committed
April 10, 2025
R
[SPARK-51754][PYTHON][DOCS][TESTS] Make `sampleBy` doctest deterministic
Ruifeng Zheng committed
April 9, 2025
K
[MINOR][PYTHON][DOCS] Fix a typo in pyspark documentation
Kiril Isakov committed
February 10, 2025
X
[SPARK-50753][PYTHON][DOCS] Add pyspark plotting to API documentation
Xinrong Meng committed
February 7, 2025
X
[SPARK-51103][PYTHON][DOCS] Add DataFrame conversion to table argument to API documentation
Xinrong Meng committed
February 6, 2025
N
[SPARK-51098][DOCS] Link exceptAll and subtract in Python DataFrame docs
Nicholas Chammas committed
January 10, 2025
R
[MINOR][DOCS] Add new dataframe methods to API references
Ruifeng Zheng committed
T
[SPARK-50778][PYTHON] Add metadataColumn to PySpark DataFrame
Takuya Ueshin committed
January 7, 2025
X
[SPARK-50392][PYTHON] DataFrame conversion to table argument in Spark Classic
Xinrong Meng committed
January 6, 2025
T
[SPARK-50694][SQL] Support renames in subqueries
Takuya Ueshin committed
December 26, 2024
December 23, 2024
X
[SPARK-50646][PYTHON][DOCS] Document explicit style of pyspark plotting
Xinrong Meng committed
December 6, 2024
T
[SPARK-50132][SQL][PYTHON] Add DataFrame API for Lateral Joins
Takuya Ueshin committed
December 4, 2024
K
[MINOR] Fix some typos
Kaz committed
November 28, 2024
W
[SPARK-50130][SQL][FOLLOWUP] Simplify the implementation of col.outer()
Wenchen Fan committed
November 12, 2024
T
[SPARK-50130][SQL][PYTHON] Add DataFrame APIs for scalar and exists subqueries
Takuya Ueshin committed
October 9, 2024
J
[SPARK-49857][SQL] Add storageLevel to Dataset localCheckpoint API
Julek Sompolski committed
October 7, 2024
R
[SPARK-49890][PYTHON] Extract the preparation of df.sample to parent class
Ruifeng Zheng committed
September 28, 2024
R
[SPARK-49810][PYTHON] Extract the preparation of `DataFrame.sort` to parent class
Ruifeng Zheng committed
September 25, 2024
R
[SPARK-49609][PYTHON][FOLLOWUP] Correct the typehint for `filter` and `where`
Ruifeng Zheng committed
September 20, 2024
X
[SPARK-49531][PYTHON][CONNECT] Support line plot with plotly backend
Xinrong Meng committed
September 15, 2024
D
Revert "[SPARK-49531][PYTHON][CONNECT] Support line plot with plotly backend"
Dongjoon Hyun committed
September 13, 2024
X
[SPARK-49531][PYTHON][CONNECT] Support line plot with plotly backend
Xinrong Meng committed
September 11, 2024
A
[SPARK-49553][PYTHON][DOCS] Remove the experimental API notes for pandas related functions
allisonwang-db committed
September 10, 2024
H
[SPARK-49567][PYTHON] Use `classic` instead of `vanilla` from PySpark code base
Haejoon Lee committed