Datasets
Datasets store data in a variety of file formats. Uploading a dataset is the first step to using it with other capabilities, such as generators running in a pipeline. The output of a data generation pipeline is also stored as a dataset and can be downloaded or used by other tools.
Uploading a Dataset
ursactl create dataset [--project project] [--type type] source path
The source is the local file you’re uploading to the DSS.
The path is a filename (with optional folders) that identifies the dataset in the remote system (if it has spaces, enclose it in quotes).
The dataset will be identified by the UUID that is printed on success.
The command tries to detect what kind of content you are uploading. If it cannot, you can specify the MIME type with the --type option.
You can override the default project with the --project option.
For example:
ursactl create dataset foo.xml 'example/rdf-xml/file'
accepted: true
id: d1276197-bd55-4e56-86c3-b6fc695e17eb
In this example, the dataset metadata record was created and the file contents were uploaded and attached.
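Because later commands accept the UUID, a script may want to capture it from the upload output. The following is a minimal sketch, assuming the command writes the two lines shown above to standard output in exactly that form; the variable names are illustrative, and the sample text stands in for a real invocation so the snippet runs without a server.

```shell
# Sample output copied from the example above. In a real script this would be
# captured instead (assumption: the output goes to stdout):
#   output=$(ursactl create dataset foo.xml 'example/rdf-xml/file')
output='accepted: true
id: d1276197-bd55-4e56-86c3-b6fc695e17eb'

# Take the second field of the line whose first field is "id:".
dataset_id=$(printf '%s\n' "$output" | awk '$1 == "id:" { print $2 }')

echo "$dataset_id"
```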
Listing Datasets
ursactl list datasets [--project project]
You can override the default project with the --project option.
For example:
ursactl list datasets
| Path | UUID | Size | Type |
|----------------------+--------------------------------------+--------+---------------------|
| example/rdf-xml/file | ef0d8f8b-04fd-49ea-a6b9-66a4142e2360 | 1178 | application/rdf+xml |
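If you know a dataset's path and need its UUID, the table can be filtered with standard tools. This is a rough sketch that assumes the listing keeps the pipe-delimited layout shown above; the sample text is the output from the example, and the variable names are illustrative.

```shell
# Sample listing copied from the example above. In a real script:
#   listing=$(ursactl list datasets)
listing='| Path | UUID | Size | Type |
|----------------------+--------------------------------------+--------+---------------------|
| example/rdf-xml/file | ef0d8f8b-04fd-49ea-a6b9-66a4142e2360 | 1178 | application/rdf+xml |'

# Split each row on "|", trim the padding around the Path and UUID columns,
# and print the UUID of the row whose Path matches.
uuid=$(printf '%s\n' "$listing" | awk -F'|' -v want='example/rdf-xml/file' '
  { path = $2; gsub(/^ +| +$/, "", path) }
  path == want { id = $3; gsub(/^ +| +$/, "", id); print id }')

echo "$uuid"
```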
Downloading a Dataset
ursactl get dataset dataset dest
The dataset is the path or the UUID of the dataset you are downloading.
The dest is the local file you’re downloading the dataset into.
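This page does not say what happens when the destination file already exists, so a cautious script might check first. The wrapper below is a sketch only; fetch_dataset is a hypothetical helper name, not part of ursactl.

```shell
# Hypothetical helper: download a dataset (by path or UUID) into a local file,
# refusing to proceed if that file already exists.
fetch_dataset() {
  dataset=$1
  dest=$2
  if [ -e "$dest" ]; then
    echo "refusing to overwrite $dest" >&2
    return 1
  fi
  ursactl get dataset "$dataset" "$dest"
}

# Usage, with the path from the earlier example:
#   fetch_dataset 'example/rdf-xml/file' ./foo-copy.xml
```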
Deleting a Dataset
ursactl delete dataset dataset
The dataset is the path or the UUID of the dataset you are deleting.
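The command takes one dataset at a time, so removing several means calling it in a loop. The sketch below assumes ursactl exits with a non-zero status when a delete fails, which this page does not state; delete_datasets is a hypothetical helper name.

```shell
# Hypothetical helper: delete every dataset given as an argument (path or UUID).
# Continues after a failure and returns non-zero if any delete failed.
# Assumption: ursactl signals failure through its exit status.
delete_datasets() {
  failed=0
  for dataset in "$@"; do
    if ! ursactl delete dataset "$dataset"; then
      echo "could not delete: $dataset" >&2
      failed=1
    fi
  done
  return "$failed"
}

# Usage, with the path and UUID from the earlier examples:
#   delete_datasets 'example/rdf-xml/file' d1276197-bd55-4e56-86c3-b6fc695e17eb
```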