
Datasets

Datasets store data in a variety of file formats. Uploading a dataset is the first step toward using it in other capabilities, such as generators running in a pipeline. The output of a data generation pipeline is itself stored as a dataset, which can be downloaded or used by other tools.

Uploading a Dataset

ursactl create dataset [--project project] [--type type] source path

The source is the local file you’re uploading to the DSS. The path is a filename (optionally with folders) that identifies the dataset in the remote system; enclose it in quotes if it contains spaces. On success, the command prints the UUID that identifies the dataset.

The command tries to detect the content type of the file you are uploading. If it cannot, specify the MIME type with the --type option.

You can override the default project with the --project option.

For example:

ursactl create dataset foo.xml 'example/rdf-xml/file'
accepted: true
id: d1276197-bd55-4e56-86c3-b6fc695e17eb

In this example, the dataset metadata record was created and the file contents were uploaded and attached.
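
When uploading from a script, it can help to set the project and MIME type explicitly and keep only the UUID for later commands. The following is a minimal sketch, not part of ursactl: `upload_dataset` is a hypothetical helper name, and it assumes the success output contains an `id: <uuid>` line in the format shown in the example above.

```shell
# Hypothetical wrapper (not part of ursactl).
# Arguments: $1 = project, $2 = MIME type, $3 = local source file, $4 = remote path.
# Uploads the file and prints only the UUID of the new dataset.
# Assumes the success output includes an "id: <uuid>" line, as in the example above.
upload_dataset() {
  ursactl create dataset --project "$1" --type "$2" "$3" "$4" |
    awk '$1 == "id:" { print $2; exit }'
}

# Example call (the project name is a placeholder):
#   dataset_id=$(upload_dataset my-project application/rdf+xml foo.xml 'example/rdf-xml/file')
```

If no `id:` line is printed, the helper prints nothing, so a caller can treat an empty result as a failed upload.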

Listing Datasets

ursactl list datasets [--project project]

You can override the default project with the --project option.

For example:

ursactl list datasets
| Path                 | UUID                                 |   Size | Type                |
|----------------------+--------------------------------------+--------+---------------------|
| example/rdf-xml/file | ef0d8f8b-04fd-49ea-a6b9-66a4142e2360 |   1178 | application/rdf+xml |
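
To look up the UUID for a known path in a script, the table can be filtered with standard tools. This is a sketch that assumes the pipe-delimited layout shown above (Path in the first column, UUID in the second); `dataset_uuid_for_path` is a hypothetical helper name, not an ursactl command.

```shell
# Hypothetical helper (not part of ursactl).
# Reads the table printed by `ursactl list datasets` on stdin and prints the
# UUID of the row whose Path cell equals the first argument exactly.
# Assumes the column layout shown above: | Path | UUID | Size | Type |
dataset_uuid_for_path() {
  awk -F'|' -v want="$1" '
    NF >= 5 {
      path = $2; uuid = $3
      gsub(/^ +| +$/, "", path)   # trim the padding around the Path cell
      gsub(/^ +| +$/, "", uuid)   # trim the padding around the UUID cell
      if (path == want) { print uuid; exit }
    }'
}

# Example call:
#   ursactl list datasets | dataset_uuid_for_path 'example/rdf-xml/file'
```

The separator row has fewer fields than a data row, so it is skipped; a path that is not listed produces no output.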

Downloading a Dataset

ursactl get dataset dataset dest

The dataset is the path or the UUID of the dataset you are downloading. The dest is the local file you’re downloading the dataset into.
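
One way to check an upload is to download the dataset again and compare it with the local file. The sketch below is an illustration under stated assumptions: `verify_dataset` is a hypothetical helper name, and it assumes that the downloaded bytes are identical to the uploaded file and that `ursactl get dataset` writes to a destination that does not exist yet.

```shell
# Hypothetical helper (not part of ursactl).
# Arguments: $1 = dataset path or UUID, $2 = local file to compare against.
# Downloads the dataset into a fresh temporary directory and succeeds only if
# the download matches the local file byte for byte.
verify_dataset() {
  dir=$(mktemp -d) || return 1
  ursactl get dataset "$1" "$dir/download" && cmp -s "$2" "$dir/download"
  status=$?
  rm -rf "$dir"   # clean up the temporary copy in every case
  return "$status"
}

# Example call:
#   verify_dataset 'example/rdf-xml/file' foo.xml && echo "dataset matches foo.xml"
```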

Deleting a Dataset

ursactl delete dataset dataset

The dataset is the path or the UUID of the dataset you are deleting.