Object Storage Dataset¶
To get an an iterable style object from large/out-of-memory CSV files stored in local file-system or Cloud storage platforms like Amazon-S3, Google cloud storage, and Azure blob storage.
The ObjectStorageDataset provides support for tensor-based, out-of-memory datasets for the iterable-style interface. The ObjectStorageDataset is not available by default when you install PyTorch, so you need to install it separately in your Python environment using:
pip install osds
and once installed, import the class in your runtime using:
from osds.utils import ObjectStorageDataset