turicreate.SArray.rolling_sum

SArray.rolling_sum(window_start, window_end, min_observations=None)

Calculate a new SArray of the sum of different subsets over this SArray.

Also known as a “moving sum” or “running sum”. The subset that the sum is calculated over is defined as an inclusive range relative to the position to each value in the SArray, using window_start and window_end. For a better understanding of this, see the examples below.

Parameters:
window_start : int

The start of the subset to calculate the sum relative to the current value.

window_end : int

The end of the subset to calculate the sum relative to the current value. Must be greater than window_start.

min_observations : int

Minimum number of non-missing observations in window required to calculate the sum (otherwise result is None). None signifies that the entire window must not include a missing value. A negative number throws an error.

Returns:
out : SArray

Examples

>>> import pandas
>>> sa = SArray([1,2,3,4,5])
>>> series = pandas.Series([1,2,3,4,5])

A rolling sum with a window including the previous 2 entries including the current: >>> sa.rolling_sum(-2,0) dtype: int Rows: 5 [None, None, 6, 9, 12]

Pandas equivalent: >>> pandas.rolling_sum(series, 3) 0 NaN 1 NaN 2 6 3 9 4 12 dtype: float64

Same rolling sum operation, but 2 minimum observations: >>> sa.rolling_sum(-2,0,min_observations=2) dtype: int Rows: 5 [None, 3, 6, 9, 12]

Pandas equivalent: >>> pandas.rolling_sum(series, 3, min_periods=2) 0 NaN 1 3 2 6 3 9 4 12 dtype: float64

A rolling sum with a size of 3, centered around the current: >>> sa.rolling_sum(-1,1) dtype: int Rows: 5 [None, 6, 9, 12, None]

Pandas equivalent: >>> pandas.rolling_sum(series, 3, center=True) 0 NaN 1 6 2 9 3 12 4 NaN dtype: float64

A rolling sum with a window including the current and the 2 entries following: >>> sa.rolling_sum(0,2) dtype: int Rows: 5 [6, 9, 12, None, None]

A rolling sum with a window including the previous 2 entries NOT including the current: >>> sa.rolling_sum(-2,-1) dtype: int Rows: 5 [None, None, 3, 5, 7]