2020-10-08 14:57:58 +02:00
|
|
|
.. Licensed to the Apache Software Foundation (ASF) under one
|
|
|
|
|
.. or more contributor license agreements. See the NOTICE file
|
|
|
|
|
.. distributed with this work for additional information
|
|
|
|
|
.. regarding copyright ownership. The ASF licenses this file
|
|
|
|
|
.. to you under the Apache License, Version 2.0 (the
|
|
|
|
|
.. "License"); you may not use this file except in compliance
|
|
|
|
|
.. with the License. You may obtain a copy of the License at
|
|
|
|
|
|
|
|
|
|
.. http://www.apache.org/licenses/LICENSE-2.0
|
|
|
|
|
|
|
|
|
|
.. Unless required by applicable law or agreed to in writing,
|
|
|
|
|
.. software distributed under the License is distributed on an
|
|
|
|
|
.. "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
|
|
|
|
|
.. KIND, either express or implied. See the License for the
|
|
|
|
|
.. specific language governing permissions and limitations
|
|
|
|
|
.. under the License.
|
|
|
|
|
|
|
|
|
|
.. _api.compute:
|
|
|
|
|
.. currentmodule:: pyarrow.compute
|
|
|
|
|
|
|
|
|
|
Compute Functions
|
|
|
|
|
=================
|
|
|
|
|
|
|
|
|
|
Aggregations
|
|
|
|
|
------------
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
2021-09-27 18:29:09 +02:00
|
|
|
all
|
|
|
|
|
any
|
2021-09-24 09:01:46 -04:00
|
|
|
approximate_median
|
2020-10-08 14:57:58 +02:00
|
|
|
count
|
2021-10-05 10:22:38 -04:00
|
|
|
count_distinct
|
2025-11-20 12:35:31 +01:00
|
|
|
first
|
|
|
|
|
first_last
|
2021-05-22 21:39:04 +08:00
|
|
|
index
|
2025-11-20 12:35:31 +01:00
|
|
|
kurtosis
|
|
|
|
|
last
|
2021-09-24 09:01:46 -04:00
|
|
|
max
|
2020-10-08 14:57:58 +02:00
|
|
|
mean
|
2021-09-24 09:01:46 -04:00
|
|
|
min
|
2020-10-08 14:57:58 +02:00
|
|
|
min_max
|
|
|
|
|
mode
|
2025-03-04 17:21:38 +01:00
|
|
|
pivot_wider
|
2021-08-10 17:02:52 +02:00
|
|
|
product
|
2021-09-24 09:01:46 -04:00
|
|
|
quantile
|
2025-11-20 12:35:31 +01:00
|
|
|
skew
|
2020-10-08 14:57:58 +02:00
|
|
|
stddev
|
|
|
|
|
sum
|
2021-09-24 09:01:46 -04:00
|
|
|
tdigest
|
2020-10-08 14:57:58 +02:00
|
|
|
variance
|
|
|
|
|
|
2023-01-26 14:01:01 -03:00
|
|
|
..
|
|
|
|
|
Nullary aggregate functions (count_all) aren't exposed in pyarrow.compute,
|
|
|
|
|
so they aren't listed here.
|
|
|
|
|
|
2022-05-31 15:23:09 +02:00
|
|
|
Cumulative Functions
|
|
|
|
|
--------------------
|
|
|
|
|
|
2024-04-30 17:27:26 -08:00
|
|
|
Cumulative functions are vector functions that perform a running accumulation on
|
|
|
|
|
their input using a given binary associative operation with an identity element
|
|
|
|
|
(a monoid) and output an array containing the corresponding intermediate running
|
|
|
|
|
values. The input is expected to be of numeric type. By default these functions
|
2023-06-23 00:03:16 +08:00
|
|
|
do not detect overflow. They are also
|
2022-05-31 15:23:09 +02:00
|
|
|
available in an overflow-checking variant, suffixed ``_checked``, which
|
|
|
|
|
throws an ``ArrowInvalid`` exception when overflow is detected.
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
cumulative_sum
|
|
|
|
|
cumulative_sum_checked
|
2023-06-23 00:03:16 +08:00
|
|
|
cumulative_prod
|
|
|
|
|
cumulative_prod_checked
|
|
|
|
|
cumulative_max
|
2025-11-20 12:35:31 +01:00
|
|
|
cumulative_mean
|
2023-06-23 00:03:16 +08:00
|
|
|
cumulative_min
|
2022-05-31 15:23:09 +02:00
|
|
|
|
2020-10-08 14:57:58 +02:00
|
|
|
Arithmetic Functions
|
|
|
|
|
--------------------
|
|
|
|
|
|
2021-07-15 17:09:40 -04:00
|
|
|
By default these functions do not detect overflow. Most functions are also
|
2021-06-08 12:37:56 +02:00
|
|
|
available in an overflow-checking variant, suffixed ``_checked``, which
|
2020-10-08 14:57:58 +02:00
|
|
|
throws an ``ArrowInvalid`` exception when overflow is detected.
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
2021-05-20 01:46:11 +00:00
|
|
|
abs
|
|
|
|
|
abs_checked
|
2020-10-08 14:57:58 +02:00
|
|
|
add
|
|
|
|
|
add_checked
|
|
|
|
|
divide
|
|
|
|
|
divide_checked
|
2025-11-20 12:35:31 +01:00
|
|
|
exp
|
|
|
|
|
expm1
|
2020-10-08 14:57:58 +02:00
|
|
|
multiply
|
|
|
|
|
multiply_checked
|
2021-09-27 18:29:09 +02:00
|
|
|
negate
|
|
|
|
|
negate_checked
|
2021-04-13 12:46:08 -07:00
|
|
|
power
|
|
|
|
|
power_checked
|
2021-07-15 17:09:40 -04:00
|
|
|
sign
|
2022-02-20 10:00:07 -05:00
|
|
|
sqrt
|
|
|
|
|
sqrt_checked
|
2021-07-15 17:09:40 -04:00
|
|
|
subtract
|
|
|
|
|
subtract_checked
|
2021-06-30 11:19:58 +02:00
|
|
|
|
2021-09-27 18:29:09 +02:00
|
|
|
Bit-wise Functions
|
|
|
|
|
------------------
|
2021-06-30 11:19:58 +02:00
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
bit_wise_and
|
|
|
|
|
bit_wise_not
|
|
|
|
|
bit_wise_or
|
|
|
|
|
bit_wise_xor
|
2021-09-27 18:29:09 +02:00
|
|
|
shift_left
|
|
|
|
|
shift_left_checked
|
|
|
|
|
shift_right
|
|
|
|
|
shift_right_checked
|
2020-10-08 14:57:58 +02:00
|
|
|
|
2021-07-16 15:25:58 -04:00
|
|
|
Rounding Functions
|
|
|
|
|
------------------
|
|
|
|
|
|
ARROW-12744: [C++][Compute] Add rounding kernel
This PR adds rounding compute functions, namely "round" and "round_to_multiple".
* `round(x, RoundOptions(ndigits, round_mode))` - round `x` to the precision indicated by `ndigits`
* `round_to_multiple(x, RoundToMultipleOptions(multiple, round_mode))` - round `x` to scale of `multiple`
Rounding modes supported are: DOWN, UP, TOWARDS_ZERO, TOWARDS_INFINITY, HALF_DOWN, HALF_UP, HALF_TOWARDS_ZERO, HALF_TOWARDS_INFINITY, HALF_TO_EVEN, HALF_TO_ODD.
By default tie-breaking modes round to the nearest integer and resolve ties with HALF_TO_EVEN.
The rounding functions expect floating-point inputs and return output of the same type. Integral inputs are implicitly type-casted and output is float64.
Closes #10349 from edponce/ARROW-12744-Add-rounding-kernel
Authored-by: Eduardo Ponce <edponce00@gmail.com>
Signed-off-by: Antoine Pitrou <antoine@python.org>
2021-09-13 19:26:29 +02:00
|
|
|
Rounding functions displace numeric inputs to an approximate value with a simpler
|
|
|
|
|
representation based on the rounding criterion.
|
2021-07-16 15:25:58 -04:00
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
ceil
|
|
|
|
|
floor
|
ARROW-12744: [C++][Compute] Add rounding kernel
This PR adds rounding compute functions, namely "round" and "round_to_multiple".
* `round(x, RoundOptions(ndigits, round_mode))` - round `x` to the precision indicated by `ndigits`
* `round_to_multiple(x, RoundToMultipleOptions(multiple, round_mode))` - round `x` to scale of `multiple`
Rounding modes supported are: DOWN, UP, TOWARDS_ZERO, TOWARDS_INFINITY, HALF_DOWN, HALF_UP, HALF_TOWARDS_ZERO, HALF_TOWARDS_INFINITY, HALF_TO_EVEN, HALF_TO_ODD.
By default tie-breaking modes round to the nearest integer and resolve ties with HALF_TO_EVEN.
The rounding functions expect floating-point inputs and return output of the same type. Integral inputs are implicitly type-casted and output is float64.
Closes #10349 from edponce/ARROW-12744-Add-rounding-kernel
Authored-by: Eduardo Ponce <edponce00@gmail.com>
Signed-off-by: Antoine Pitrou <antoine@python.org>
2021-09-13 19:26:29 +02:00
|
|
|
round
|
2025-11-20 12:35:31 +01:00
|
|
|
round_binary
|
ARROW-12744: [C++][Compute] Add rounding kernel
This PR adds rounding compute functions, namely "round" and "round_to_multiple".
* `round(x, RoundOptions(ndigits, round_mode))` - round `x` to the precision indicated by `ndigits`
* `round_to_multiple(x, RoundToMultipleOptions(multiple, round_mode))` - round `x` to scale of `multiple`
Rounding modes supported are: DOWN, UP, TOWARDS_ZERO, TOWARDS_INFINITY, HALF_DOWN, HALF_UP, HALF_TOWARDS_ZERO, HALF_TOWARDS_INFINITY, HALF_TO_EVEN, HALF_TO_ODD.
By default tie-breaking modes round to the nearest integer and resolve ties with HALF_TO_EVEN.
The rounding functions expect floating-point inputs and return output of the same type. Integral inputs are implicitly type-casted and output is float64.
Closes #10349 from edponce/ARROW-12744-Add-rounding-kernel
Authored-by: Eduardo Ponce <edponce00@gmail.com>
Signed-off-by: Antoine Pitrou <antoine@python.org>
2021-09-13 19:26:29 +02:00
|
|
|
round_to_multiple
|
2021-07-16 15:25:58 -04:00
|
|
|
trunc
|
|
|
|
|
|
2021-07-07 02:46:04 +00:00
|
|
|
Logarithmic Functions
|
|
|
|
|
---------------------
|
|
|
|
|
|
|
|
|
|
Logarithmic functions are also supported, and also offer ``_checked``
|
|
|
|
|
variants which detect domain errors.
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
ln
|
|
|
|
|
ln_checked
|
|
|
|
|
log10
|
|
|
|
|
log10_checked
|
|
|
|
|
log1p
|
|
|
|
|
log1p_checked
|
|
|
|
|
log2
|
|
|
|
|
log2_checked
|
2021-08-13 11:46:22 -04:00
|
|
|
logb
|
|
|
|
|
logb_checked
|
2021-07-07 02:46:04 +00:00
|
|
|
|
2021-06-30 18:30:37 +02:00
|
|
|
Trigonometric Functions
|
|
|
|
|
-----------------------
|
|
|
|
|
|
|
|
|
|
Trigonometric functions are also supported, and also offer ``_checked``
|
|
|
|
|
variants which detect domain errors where appropriate.
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
acos
|
|
|
|
|
acos_checked
|
|
|
|
|
asin
|
|
|
|
|
asin_checked
|
|
|
|
|
atan
|
|
|
|
|
atan2
|
|
|
|
|
cos
|
|
|
|
|
cos_checked
|
|
|
|
|
sin
|
|
|
|
|
sin_checked
|
|
|
|
|
tan
|
|
|
|
|
tan_checked
|
|
|
|
|
|
2025-11-20 12:35:31 +01:00
|
|
|
Hyperbolic Trigonometric Functions
|
|
|
|
|
----------------------------------
|
|
|
|
|
|
|
|
|
|
Hyperbolic trigonometric functions are also supported, and, where applicable, also offer ``_checked``
|
|
|
|
|
variants which detect domain errors if needed.
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
acosh
|
|
|
|
|
acosh_checked
|
|
|
|
|
asinh
|
|
|
|
|
atanh
|
|
|
|
|
atanh_checked
|
|
|
|
|
cosh
|
|
|
|
|
sinh
|
|
|
|
|
tanh
|
|
|
|
|
|
2020-10-08 14:57:58 +02:00
|
|
|
Comparisons
|
|
|
|
|
-----------
|
|
|
|
|
|
2024-05-16 03:30:14 -08:00
|
|
|
These functions expect two inputs of the same type. If one of the inputs is ``null``
|
2020-10-08 14:57:58 +02:00
|
|
|
they return ``null``.
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
equal
|
|
|
|
|
greater
|
|
|
|
|
greater_equal
|
|
|
|
|
less
|
|
|
|
|
less_equal
|
|
|
|
|
not_equal
|
|
|
|
|
|
2021-06-03 21:40:29 -04:00
|
|
|
These functions take any number of arguments of a numeric or temporal type.
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
2021-06-16 20:19:27 +02:00
|
|
|
max_element_wise
|
|
|
|
|
min_element_wise
|
2021-06-03 21:40:29 -04:00
|
|
|
|
2020-10-08 14:57:58 +02:00
|
|
|
Logical Functions
|
2020-10-19 14:55:19 +02:00
|
|
|
-----------------
|
2020-10-08 14:57:58 +02:00
|
|
|
|
|
|
|
|
These functions normally emit a null when one of the inputs is null. However, Kleene
|
|
|
|
|
logic variants are provided (suffixed ``_kleene``). See User Guide for details.
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
and_
|
|
|
|
|
and_kleene
|
2021-09-27 18:29:09 +02:00
|
|
|
and_not
|
|
|
|
|
and_not_kleene
|
2020-10-08 14:57:58 +02:00
|
|
|
invert
|
|
|
|
|
or_
|
|
|
|
|
or_kleene
|
|
|
|
|
xor
|
|
|
|
|
|
|
|
|
|
String Predicates
|
|
|
|
|
-----------------
|
|
|
|
|
|
2021-06-08 12:37:56 +02:00
|
|
|
In these functions an empty string emits false in the output. For ASCII
|
2020-10-08 14:57:58 +02:00
|
|
|
variants (prefixed ``ascii_``) a string element with non-ASCII characters
|
|
|
|
|
emits false in the output.
|
|
|
|
|
|
2021-06-08 12:37:56 +02:00
|
|
|
The first set of functions emit true if the input contains only
|
2020-10-08 14:57:58 +02:00
|
|
|
characters of a given class.
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
ascii_is_alnum
|
|
|
|
|
ascii_is_alpha
|
|
|
|
|
ascii_is_decimal
|
|
|
|
|
ascii_is_lower
|
|
|
|
|
ascii_is_printable
|
|
|
|
|
ascii_is_space
|
|
|
|
|
ascii_is_upper
|
|
|
|
|
utf8_is_alnum
|
|
|
|
|
utf8_is_alpha
|
|
|
|
|
utf8_is_decimal
|
|
|
|
|
utf8_is_digit
|
|
|
|
|
utf8_is_lower
|
|
|
|
|
utf8_is_numeric
|
|
|
|
|
utf8_is_printable
|
|
|
|
|
utf8_is_space
|
|
|
|
|
utf8_is_upper
|
|
|
|
|
|
|
|
|
|
The second set of functions also consider the order of characters
|
|
|
|
|
in the string element.
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
ascii_is_title
|
|
|
|
|
utf8_is_title
|
|
|
|
|
|
2021-06-08 12:37:56 +02:00
|
|
|
The third set of functions examines string elements on
|
2020-10-08 14:57:58 +02:00
|
|
|
a byte-by-byte basis.
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
string_is_ascii
|
|
|
|
|
|
2021-09-27 18:29:09 +02:00
|
|
|
String Transforms
|
|
|
|
|
-----------------
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
ascii_capitalize
|
|
|
|
|
ascii_lower
|
|
|
|
|
ascii_reverse
|
|
|
|
|
ascii_swapcase
|
|
|
|
|
ascii_title
|
|
|
|
|
ascii_upper
|
|
|
|
|
binary_length
|
2021-11-04 10:43:21 -04:00
|
|
|
binary_repeat
|
2021-09-27 18:29:09 +02:00
|
|
|
binary_replace_slice
|
2021-12-10 09:36:56 -05:00
|
|
|
binary_reverse
|
2021-09-27 18:29:09 +02:00
|
|
|
replace_substring
|
|
|
|
|
replace_substring_regex
|
|
|
|
|
utf8_capitalize
|
|
|
|
|
utf8_length
|
|
|
|
|
utf8_lower
|
2025-11-20 12:35:31 +01:00
|
|
|
utf8_normalize
|
2021-09-27 18:29:09 +02:00
|
|
|
utf8_replace_slice
|
|
|
|
|
utf8_reverse
|
|
|
|
|
utf8_swapcase
|
|
|
|
|
utf8_title
|
|
|
|
|
utf8_upper
|
|
|
|
|
|
|
|
|
|
String Padding
|
|
|
|
|
--------------
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
ascii_center
|
|
|
|
|
ascii_lpad
|
|
|
|
|
ascii_rpad
|
|
|
|
|
utf8_center
|
|
|
|
|
utf8_lpad
|
|
|
|
|
utf8_rpad
|
2025-07-01 01:34:30 -07:00
|
|
|
utf8_zero_fill
|
2021-09-27 18:29:09 +02:00
|
|
|
|
|
|
|
|
String Trimming
|
|
|
|
|
---------------
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
ascii_ltrim
|
|
|
|
|
ascii_ltrim_whitespace
|
|
|
|
|
ascii_rtrim
|
|
|
|
|
ascii_rtrim_whitespace
|
|
|
|
|
ascii_trim
|
|
|
|
|
ascii_trim_whitespace
|
|
|
|
|
utf8_ltrim
|
|
|
|
|
utf8_ltrim_whitespace
|
|
|
|
|
utf8_rtrim
|
|
|
|
|
utf8_rtrim_whitespace
|
|
|
|
|
utf8_trim
|
|
|
|
|
utf8_trim_whitespace
|
|
|
|
|
|
2021-05-21 08:52:47 -04:00
|
|
|
String Splitting
|
|
|
|
|
----------------
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
2021-09-27 18:29:09 +02:00
|
|
|
ascii_split_whitespace
|
2021-05-21 08:52:47 -04:00
|
|
|
split_pattern
|
|
|
|
|
split_pattern_regex
|
|
|
|
|
utf8_split_whitespace
|
|
|
|
|
|
2021-06-16 20:19:27 +02:00
|
|
|
String Component Extraction
|
|
|
|
|
---------------------------
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
extract_regex
|
2025-11-20 12:35:31 +01:00
|
|
|
extract_regex_span
|
2021-06-16 20:19:27 +02:00
|
|
|
|
|
|
|
|
String Joining
|
|
|
|
|
--------------
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
binary_join
|
|
|
|
|
binary_join_element_wise
|
|
|
|
|
|
2021-09-27 18:29:09 +02:00
|
|
|
String Slicing
|
|
|
|
|
--------------
|
2020-10-08 14:57:58 +02:00
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
2022-11-15 16:57:24 +05:30
|
|
|
binary_slice
|
2021-10-01 09:59:44 -04:00
|
|
|
utf8_slice_codeunits
|
2020-10-08 14:57:58 +02:00
|
|
|
|
2021-09-27 18:29:09 +02:00
|
|
|
Containment Tests
|
2020-10-08 14:57:58 +02:00
|
|
|
-----------------
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
2021-06-07 20:45:36 +02:00
|
|
|
count_substring
|
2021-06-10 16:02:22 -04:00
|
|
|
count_substring_regex
|
2021-06-08 12:37:56 +02:00
|
|
|
ends_with
|
2021-05-22 15:19:39 +08:00
|
|
|
find_substring
|
2021-06-28 16:15:59 +02:00
|
|
|
find_substring_regex
|
2020-10-08 14:57:58 +02:00
|
|
|
index_in
|
|
|
|
|
is_in
|
2021-05-20 10:44:09 -04:00
|
|
|
match_like
|
2020-10-08 14:57:58 +02:00
|
|
|
match_substring
|
2021-04-06 17:30:38 +02:00
|
|
|
match_substring_regex
|
2021-06-08 12:37:56 +02:00
|
|
|
starts_with
|
2022-01-04 20:42:28 +01:00
|
|
|
indices_nonzero
|
2020-10-08 14:57:58 +02:00
|
|
|
|
2021-09-27 18:29:09 +02:00
|
|
|
Categorizations
|
|
|
|
|
---------------
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
is_finite
|
|
|
|
|
is_inf
|
|
|
|
|
is_nan
|
|
|
|
|
is_null
|
|
|
|
|
is_valid
|
2022-04-21 13:55:15 -05:00
|
|
|
true_unless_null
|
2021-09-27 18:29:09 +02:00
|
|
|
|
|
|
|
|
Selecting / Multiplexing
|
|
|
|
|
------------------------
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
case_when
|
|
|
|
|
choose
|
|
|
|
|
coalesce
|
|
|
|
|
if_else
|
|
|
|
|
|
2020-10-08 14:57:58 +02:00
|
|
|
Conversions
|
|
|
|
|
-----------
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
cast
|
2022-01-04 16:31:01 -05:00
|
|
|
ceil_temporal
|
|
|
|
|
floor_temporal
|
|
|
|
|
round_temporal
|
2023-03-23 05:20:06 -03:00
|
|
|
run_end_decode
|
|
|
|
|
run_end_encode
|
2021-09-27 18:29:09 +02:00
|
|
|
strftime
|
2020-10-08 14:57:58 +02:00
|
|
|
strptime
|
|
|
|
|
|
2021-09-27 18:29:09 +02:00
|
|
|
Temporal Component Extraction
|
2021-09-27 16:44:26 +02:00
|
|
|
-----------------------------
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
day
|
|
|
|
|
day_of_week
|
|
|
|
|
day_of_year
|
|
|
|
|
hour
|
2025-11-20 12:35:31 +01:00
|
|
|
is_dst
|
|
|
|
|
is_leap_year
|
2021-09-27 16:44:26 +02:00
|
|
|
iso_week
|
|
|
|
|
iso_year
|
2021-09-27 18:29:09 +02:00
|
|
|
iso_calendar
|
2021-09-27 16:44:26 +02:00
|
|
|
microsecond
|
|
|
|
|
millisecond
|
|
|
|
|
minute
|
|
|
|
|
month
|
|
|
|
|
nanosecond
|
|
|
|
|
quarter
|
|
|
|
|
second
|
|
|
|
|
subsecond
|
2021-09-27 18:29:09 +02:00
|
|
|
us_week
|
2022-02-10 15:47:24 -05:00
|
|
|
us_year
|
2021-09-27 18:29:09 +02:00
|
|
|
week
|
2021-09-27 16:44:26 +02:00
|
|
|
year
|
2022-01-04 20:35:24 -05:00
|
|
|
year_month_day
|
2021-09-27 16:44:26 +02:00
|
|
|
|
2021-09-29 09:50:37 +02:00
|
|
|
Temporal Difference
|
|
|
|
|
-------------------
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
day_time_interval_between
|
|
|
|
|
days_between
|
|
|
|
|
hours_between
|
|
|
|
|
microseconds_between
|
|
|
|
|
milliseconds_between
|
|
|
|
|
minutes_between
|
|
|
|
|
month_day_nano_interval_between
|
|
|
|
|
month_interval_between
|
|
|
|
|
nanoseconds_between
|
|
|
|
|
quarters_between
|
|
|
|
|
seconds_between
|
|
|
|
|
weeks_between
|
|
|
|
|
years_between
|
|
|
|
|
|
2021-09-27 18:29:09 +02:00
|
|
|
Timezone Handling
|
|
|
|
|
-----------------
|
2021-07-14 11:42:00 -04:00
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
2021-09-27 18:29:09 +02:00
|
|
|
assume_timezone
|
2023-12-20 11:06:57 +01:00
|
|
|
local_timestamp
|
2021-07-14 11:42:00 -04:00
|
|
|
|
2025-11-20 12:35:31 +01:00
|
|
|
Random Number Generation
|
|
|
|
|
------------------------
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
random
|
|
|
|
|
|
2021-09-27 18:29:09 +02:00
|
|
|
Associative Transforms
|
|
|
|
|
----------------------
|
2020-10-08 14:57:58 +02:00
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
2025-11-20 12:35:31 +01:00
|
|
|
dictionary_decode
|
2021-09-27 18:29:09 +02:00
|
|
|
dictionary_encode
|
|
|
|
|
unique
|
|
|
|
|
value_counts
|
2020-10-08 14:57:58 +02:00
|
|
|
|
2021-09-27 18:29:09 +02:00
|
|
|
Selections
|
|
|
|
|
----------
|
2020-10-08 14:57:58 +02:00
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
2021-09-27 18:29:09 +02:00
|
|
|
array_filter
|
|
|
|
|
array_take
|
|
|
|
|
drop_null
|
|
|
|
|
filter
|
2025-11-20 12:35:31 +01:00
|
|
|
inverse_permutation
|
|
|
|
|
scatter
|
2025-12-29 08:14:36 +01:00
|
|
|
take
|
2020-10-08 14:57:58 +02:00
|
|
|
|
2021-09-27 18:29:09 +02:00
|
|
|
Sorts and Partitions
|
2020-10-08 14:57:58 +02:00
|
|
|
--------------------
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
2021-09-27 18:29:09 +02:00
|
|
|
array_sort_indices
|
2025-11-20 12:35:31 +01:00
|
|
|
bottom_k_unstable
|
2020-10-08 14:57:58 +02:00
|
|
|
partition_nth_indices
|
2025-11-20 12:35:31 +01:00
|
|
|
rank
|
|
|
|
|
rank_normal
|
|
|
|
|
rank_quantile
|
2021-09-10 10:42:31 -04:00
|
|
|
select_k_unstable
|
2020-10-08 14:57:58 +02:00
|
|
|
sort_indices
|
2025-11-20 12:35:31 +01:00
|
|
|
top_k_unstable
|
|
|
|
|
|
|
|
|
|
Statistical Functions
|
|
|
|
|
---------------------
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
winsorize
|
2020-10-08 14:57:58 +02:00
|
|
|
|
|
|
|
|
Structural Transforms
|
|
|
|
|
---------------------
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
2023-02-12 06:32:50 +01:00
|
|
|
fill_null
|
2022-01-05 15:48:10 -05:00
|
|
|
fill_null_backward
|
|
|
|
|
fill_null_forward
|
2021-09-20 15:39:12 -04:00
|
|
|
list_element
|
2020-10-08 14:57:58 +02:00
|
|
|
list_flatten
|
|
|
|
|
list_parent_indices
|
2022-11-15 18:56:16 +01:00
|
|
|
list_slice
|
2021-09-20 15:39:12 -04:00
|
|
|
list_value_length
|
2021-09-27 18:29:09 +02:00
|
|
|
make_struct
|
2022-02-03 08:38:33 -05:00
|
|
|
map_lookup
|
2021-09-27 18:29:09 +02:00
|
|
|
replace_with_mask
|
2021-11-10 14:27:04 +01:00
|
|
|
struct_field
|
2021-12-09 13:53:34 +01:00
|
|
|
|
2023-06-29 20:43:23 +08:00
|
|
|
Pairwise Functions
|
|
|
|
|
------------------
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
pairwise_diff
|
2025-11-20 12:35:31 +01:00
|
|
|
pairwise_diff_checked
|
2023-06-29 20:43:23 +08:00
|
|
|
|
2021-12-09 13:53:34 +01:00
|
|
|
Compute Options
|
|
|
|
|
---------------
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
ArraySortOptions
|
|
|
|
|
AssumeTimezoneOptions
|
|
|
|
|
CastOptions
|
|
|
|
|
CountOptions
|
2025-11-20 12:35:31 +01:00
|
|
|
CumulativeOptions
|
2022-05-31 15:23:09 +02:00
|
|
|
CumulativeSumOptions
|
2021-12-09 13:53:34 +01:00
|
|
|
DayOfWeekOptions
|
|
|
|
|
DictionaryEncodeOptions
|
|
|
|
|
ElementWiseAggregateOptions
|
|
|
|
|
ExtractRegexOptions
|
2025-11-20 12:35:31 +01:00
|
|
|
ExtractRegexSpanOptions
|
2021-12-09 13:53:34 +01:00
|
|
|
FilterOptions
|
|
|
|
|
IndexOptions
|
2025-12-29 08:14:36 +01:00
|
|
|
InversePermutationOptions
|
2021-12-09 13:53:34 +01:00
|
|
|
JoinOptions
|
2025-11-20 12:35:31 +01:00
|
|
|
ListFlattenOptions
|
2022-11-15 18:56:16 +01:00
|
|
|
ListSliceOptions
|
2021-12-09 13:53:34 +01:00
|
|
|
MakeStructOptions
|
2022-02-03 08:38:33 -05:00
|
|
|
MapLookupOptions
|
2021-12-09 13:53:34 +01:00
|
|
|
MatchSubstringOptions
|
|
|
|
|
ModeOptions
|
|
|
|
|
NullOptions
|
|
|
|
|
PadOptions
|
2023-06-29 20:43:23 +08:00
|
|
|
PairwiseOptions
|
2021-12-09 13:53:34 +01:00
|
|
|
PartitionNthOptions
|
2025-03-04 17:21:38 +01:00
|
|
|
PivotWiderOptions
|
2021-12-09 13:53:34 +01:00
|
|
|
QuantileOptions
|
2025-11-20 12:35:31 +01:00
|
|
|
RandomOptions
|
|
|
|
|
RankOptions
|
|
|
|
|
RankQuantileOptions
|
2021-12-09 13:53:34 +01:00
|
|
|
ReplaceSliceOptions
|
|
|
|
|
ReplaceSubstringOptions
|
2025-11-20 12:35:31 +01:00
|
|
|
RoundBinaryOptions
|
2021-12-09 13:53:34 +01:00
|
|
|
RoundOptions
|
2022-01-04 16:31:01 -05:00
|
|
|
RoundTemporalOptions
|
2021-12-09 13:53:34 +01:00
|
|
|
RoundToMultipleOptions
|
2023-03-23 05:20:06 -03:00
|
|
|
RunEndEncodeOptions
|
2021-12-09 13:53:34 +01:00
|
|
|
ScalarAggregateOptions
|
2025-12-29 08:14:36 +01:00
|
|
|
ScatterOptions
|
2021-12-09 13:53:34 +01:00
|
|
|
SelectKOptions
|
|
|
|
|
SetLookupOptions
|
2025-11-20 12:35:31 +01:00
|
|
|
SkewOptions
|
2021-12-09 13:53:34 +01:00
|
|
|
SliceOptions
|
|
|
|
|
SortOptions
|
|
|
|
|
SplitOptions
|
|
|
|
|
SplitPatternOptions
|
|
|
|
|
StrftimeOptions
|
|
|
|
|
StrptimeOptions
|
|
|
|
|
StructFieldOptions
|
|
|
|
|
TakeOptions
|
|
|
|
|
TDigestOptions
|
|
|
|
|
TrimOptions
|
2025-11-20 12:35:31 +01:00
|
|
|
Utf8NormalizeOptions
|
2021-12-09 13:53:34 +01:00
|
|
|
VarianceOptions
|
2022-01-04 20:35:24 -05:00
|
|
|
WeekOptions
|
2025-11-20 12:35:31 +01:00
|
|
|
WinsorizeOptions
|
|
|
|
|
ZeroFillOptions
|
|
|
|
|
|
|
|
|
|
Functions Registry
|
|
|
|
|
------------------
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
call_function
|
|
|
|
|
call_tabular_function
|
|
|
|
|
get_function
|
|
|
|
|
list_functions
|
2022-09-29 14:11:46 +05:30
|
|
|
|
|
|
|
|
User-Defined Functions
|
|
|
|
|
----------------------
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
2025-11-20 12:35:31 +01:00
|
|
|
register_aggregate_function
|
2022-09-29 14:11:46 +05:30
|
|
|
register_scalar_function
|
2025-11-20 12:35:31 +01:00
|
|
|
register_tabular_function
|
|
|
|
|
register_vector_function
|
2024-01-30 09:16:53 +01:00
|
|
|
UdfContext
|
2025-03-31 03:59:41 +08:00
|
|
|
|
|
|
|
|
Expression Functions
|
|
|
|
|
--------------------
|
|
|
|
|
|
|
|
|
|
.. autosummary::
|
|
|
|
|
:toctree: ../generated/
|
|
|
|
|
|
|
|
|
|
field
|
|
|
|
|
scalar
|