数据存储是一个用于读取单个文件或者文件或数据集合的对象。它相当于一个存储库,用来存储具有相同结构和格式的数据。例如,数据存储中每个文件包含的数据必须具有相同的类型(如数字或文本)、以相同顺序显示并用相同的分隔符分隔。
Choose the right datastore based on the file format of your data or
application.
以下示例说明如何为包含表格数据的大型文本文件创建数据存储,然后采用逐个块或逐个文件的方式读取和处理数据。
This example shows how to create a datastore for a collection of images, read the image files, and find the images with the maximum average hue, saturation, and brightness (HSV).
This example shows how to create a datastore for key-value pair data in a MAT-file that is the output of mapreduce.
This example shows how to create a datastore for a Sequence file containing key-value data.
处理 Amazon S3™、Microsoft® Azure® Storage Blob 或 HDFS™ 中的远程数据。
Setup a datastore on your machine that can be loaded and processed on another machine
or cluster.
Create a fully customized datastore for your custom or proprietary data.
This example shows how to develop a custom datastore that supports writing
operations.
After implementing your custom datastore, follow this test procedure to qualify your
custom datastore.