Tall Arrays - MATLAB & Simulink

Arrays with more rows than fit in memory

Tall arrays provide a way to work with data backed by a datastore that can have millions or billions of rows. You can create tall numeric arrays, cell arrays, categoricals, strings, datetimes, durations, or calendar durations, and you can use any of these tall types as variables in a tall table or tall timetable. Many operations and functions work the same way with tall arrays as they do with in-memory MATLAB® arrays, but most results are evaluated only when you request them explicitly using gather. MATLAB automatically optimizes the queued calculations by minimizing the number of passes through the data. For more information, see Tall Arrays for Out-of-Memory Data.
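
The deferred evaluation described above can be illustrated with a minimal sketch. It assumes a small in-memory table stands in for real out-of-memory data, and the variable names (T, tt, m, s) are hypothetical. Calling mapreducer(0) keeps evaluation in the local MATLAB session.

```matlab
% Run tall array calculations in the local MATLAB session.
mapreducer(0);

% Hypothetical sample data: a small in-memory table converted to a tall table.
T = table((1:10)', ((1:10)').^2, 'VariableNames', {'x', 'y'});
tt = tall(T);

% These calls only queue the calculations; nothing is evaluated yet.
m = mean(tt.y);
s = sum(tt.x);

% gather executes the queued operations. Requesting both results in one
% call lets MATLAB share passes through the data between them.
[mVal, sVal] = gather(m, s);
```

Until gather is called, m and s are unevaluated tall arrays. Gathering several results in a single call is generally preferable to gathering them one at a time, because each separate call can trigger its own pass through the data.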

For more information about integrating with big data systems or compiling tall array algorithms, see Extend Tall Arrays with Other Products.

Functions

Creation and Evaluation

tall - Create tall array
datastore - Create datastore for large collections of data
gather - Collect tall array into memory after executing queued operations
write - Write tall array to local and remote locations for checkpointing
mapreducer - Define execution environment for mapreduce or tall arrays
tallrng - Control random number generation for tall arrays
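
As a rough sketch of how these functions might fit together, the following example creates a datastore from a small CSV file, converts it to a tall table, checkpoints a filtered subset with write, and reads the checkpoint back. The file name sample.csv, the variable names id and value, and the temporary folders are all hypothetical, chosen only for illustration.

```matlab
% Evaluate tall arrays in the local MATLAB session and reset the
% tall array random number generator to its default state.
mapreducer(0);
tallrng('default');

% Hypothetical sample data written to a temporary folder.
srcDir = tempname;
mkdir(srcDir);
sample = table((1:100)', rand(100, 1), 'VariableNames', {'id', 'value'});
writetable(sample, fullfile(srcDir, 'sample.csv'));

% Create a datastore for the file and a tall table backed by it.
ds = datastore(fullfile(srcDir, 'sample.csv'));
tt = tall(ds);

% Queue a filtering operation; it is not executed yet.
subset = tt(tt.id > 50, :);

% Checkpoint the filtered rows. The output folder must not already
% contain files. write evaluates the queued operations.
outDir = tempname;
write(outDir, subset);

% Reload the checkpoint as a new tall table and count its rows.
restored = tall(datastore(outDir));
n = gather(height(restored));
```

Writing intermediate results to disk in this way means later calculations can start from the checkpoint instead of repeating the earlier passes through the original data.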

Determine Type

Develop Custom Algorithms

Topics