turicreate.SArray.random_split — Turi Create API 6.4.1 documentation (original) (raw)
SArray.
random_split
(fraction, seed=None)¶
Randomly split the rows of an SArray into two SArrays. The first SArray contains M rows, sampled uniformly (without replacement) from the original SArray. M is approximately the fraction times the original number of rows. The second SArray contains the remaining rows of the original SArray.
Parameters: | fraction : float Approximate fraction of the rows to fetch for the first returned SArray. Must be between 0 and 1. seed : int, optional Seed for the random number generator used to split. |
---|---|
Returns: | out : tuple [SArray] Two new SArrays. |
Examples
Suppose we have an SArray with 1,024 rows and we want to randomly split it into training and testing datasets with about a 90%/10% split.
sa = turicreate.SArray(range(1024)) sa_train, sa_test = sa.random_split(.9, seed=5) print(len(sa_train), len(sa_test)) 922 102