Object Storage Dataset

To get an an iterable style object from large/out-of-memory CSV files stored in local file-system or Cloud storage platforms like Amazon-S3, Google cloud storage, and Azure blob storage.

The ObjectStorageDataset provides support for tensor-based, out-of-memory datasets for the iterable-style interface. The ObjectStorageDataset is not available by default when you install PyTorch, so you need to install it separately in your Python environment using:

pip install osds

and once installed, import the class in your runtime using:

from osds.utils import ObjectStorageDataset