Create Dataset
Create Dataset
POST/api/2/auto_eval/dataset/create
Create a new Dataset. Use UploadDatasetFile to upload files to the dataset.
Request
- application/json
Body
Human-readable name for the dataset, if one exists.
Human-readable description of the dataset, if one exists.
PseudoLabel job fields.
The project to add the new dataset to
Responses
- 200
Successful operation
- application/json
- Schema
- Example (from schema)
Schema
Array [
]
dataset
object
required
A Dataset in the most basic sense: metadata and ownership, but nothing tied to its data.
The ID of the dataset.
Human-readable name for the dataset, if one exists.
Human-readable description of the dataset, if one exists.
The ID of the user who owns the dataset.
Possible values: [JOB_STATUS_UNSPECIFIED
, JOB_STATUS_QUEUED
, JOB_STATUS_RUNNING
, JOB_STATUS_COMPLETED
, JOB_STATUS_CANCELLED
, JOB_STATUS_FAILED
]
columns
object[]
required
The ID of the dataset file.
Index of the column within the dataset file.
The literal name for the column.
Datatypes for a column in a dataset file. We likely don't need everything here, but it's good to be explicit, for example to avoid unknowingly coercing int64 values into int32. Encoding for text is UTF_8 unless indicated otherwise.
Possible values: [DATASET_COLUMN_D_TYPE_UNSPECIFIED
, DATASET_COLUMN_D_TYPE_INT32
, DATASET_COLUMN_D_TYPE_INT64
, DATASET_COLUMN_D_TYPE_FLOAT32
, DATASET_COLUMN_D_TYPE_FLOAT64
, DATASET_COLUMN_D_TYPE_STRING
, DATASET_COLUMN_D_TYPE_BYTES
, DATASET_COLUMN_D_TYPE_ANY
]
labelState
object
The state of the latest labeling job for the dataset
The status of the latest general pseudo-labeling job for the dataset
Possible values: [JOB_STATUS_UNSPECIFIED
, JOB_STATUS_QUEUED
, JOB_STATUS_RUNNING
, JOB_STATUS_COMPLETED
, JOB_STATUS_CANCELLED
, JOB_STATUS_FAILED
]
aka user general instructions
if the labeling status is error, this field may contain an error message
{
"dataset": {
"id": "string",
"createdAt": "2024-07-29T15:51:28.071Z",
"updatedAt": "2024-07-29T15:51:28.071Z",
"name": "string",
"description": "string",
"ownerUserId": "string",
"numRows": 0,
"numCols": 0,
"initializationStatus": "JOB_STATUS_UNSPECIFIED",
"initializationError": "string",
"columns": [
{
"id": "string",
"createdAt": "2024-07-29T15:51:28.071Z",
"updatedAt": "2024-07-29T15:51:28.071Z",
"index": 0,
"literalName": "string",
"dtype": "DATASET_COLUMN_D_TYPE_UNSPECIFIED"
}
],
"labelState": {
"labelingStatus": "JOB_STATUS_UNSPECIFIED",
"promptTemplate": "string",
"error": "string"
}
}
}