Class: Dataset
A dataset either produced as part of MBO or used in the process of producing an MBO dataset.
URI: schema:Dataset
Slots
| Name | Cardinality and Range | Description | Inheritance |
|---|---|---|---|
| id | 1 String |
An identifier chosen from the range given to your work package by WP1 | direct |
| metadataPublisherId | 1 PersonOrOrganization |
The MBO Persistent IDentifier (mPID) of the person who entered this row of da... | direct |
| metadataDescribedForActionId | 1 Action |
The Action which resulted in this metadata record being described | direct |
| name | 1 String |
A name/title | direct |
| containsVariablesMboIds | 1..* PropertyValue |
The MBO Ids of the PropertyValues describing what was measured or calculated ... | direct |
| description | 0..1 String |
A description for this entity | direct |
| landingPage | * SchemaURL |
The digital location where the dataset can be acquired | direct |
| aboutTaxonMboIds | * Taxon |
The MBO Ids of the taxa observed in your dataset | direct |
| spatialCoveragePlaceMboId | * Place |
A place defining the spatial coverage of the dataset | direct |
| temporalCoverage | 0..1 String |
The temporal range which this dataset covers | direct |
| dataDownloadMboIds | * DataDownload |
The different formats in which the dataset can be accessed or downloaded | direct |
| authorId | * PersonOrOrganization |
The Permanent Identifier of person or organization who created this entity | direct |
| contributorIds | * PersonOrOrganization |
The Permanent Identifiers of people or organizations who contributed to the e... | direct |
| ownerId | 0..1 PersonOrOrganization |
The Permanent Identifier of an person or organization who owns the entity | direct |
| maintainerId | 0..1 PersonOrOrganization |
The Permanent Identifier of an person or organization who maintains this enti... | direct |
| publisherId | 0..1 PersonOrOrganization |
The Permanent Identifier of an person or organization who created the entity | direct |
| basedOnIds | * Uri |
The URL PID of any datasets which yours is based on | direct |
| hasPartIds | * Uri |
Pipe-delimited list of the parts which comprise the aggregate dataset being r... | direct |
| publishingStatusMboId | 0..1 PublishingStatusDefinedTerm |
The publishing status of the entity | direct |
| embargoStatementMboId | 0..1 EmbargoStatement |
an MBO identifier for Embargo information | direct |
| dateCreated | 0..1 String |
direct | |
| dateModified | * String |
direct | |
| datePublished | 0..1 String |
direct | |
| inProgressDate | 0..1 String |
The point in time when you expect that the thing (e | direct |
| licenseMboId | 0..1 License |
The MBO PID of the license which covers the entity you are describing | direct |
| conditionsOfAccess | 0..1 String |
A description of how the data may be accessed; for use when data is not publi... | direct |
| keywords | * String |
Key words classifying this entity | direct |
| audienceMboIds | * Audience |
The intended audiences for this entity | direct |
Usages
| used by | used in | type | used |
|---|---|---|---|
| Action | resultingDatasetMboIds | range | Dataset |
| DataDownload | datasetMboId | range | Dataset |
| DatasetComment | commentAboutDatasetMboId | range | Dataset |
| EmbargoStatement | embargoedDatasetMboId | range | Dataset |
Identifier and Mapping Information
Schema Source
- from schema: https://w3id.org/marco-bolo/csv-input-classes
Mappings
| Mapping Type | Mapped Value |
|---|---|
| self | schema:Dataset |
| native | mbo:Dataset |
LinkML Source
Direct
name: Dataset
description: A dataset either produced as part of MBO or used in the process of producing
an MBO dataset.
from_schema: https://w3id.org/marco-bolo/csv-input-classes
slots:
- id
- metadataPublisherId
- metadataDescribedForActionId
- name
- containsVariablesMboIds
- description
- landingPage
- aboutTaxonMboIds
- spatialCoveragePlaceMboId
- temporalCoverage
- dataDownloadMboIds
- authorId
- contributorIds
- ownerId
- maintainerId
- publisherId
- basedOnIds
- hasPartIds
- publishingStatusMboId
- embargoStatementMboId
- dateCreated
- dateModified
- datePublished
- inProgressDate
- licenseMboId
- conditionsOfAccess
- keywords
- audienceMboIds
class_uri: schema:Dataset
Induced
name: Dataset
description: A dataset either produced as part of MBO or used in the process of producing
an MBO dataset.
from_schema: https://w3id.org/marco-bolo/csv-input-classes
attributes:
id:
name: id
description: 'An identifier chosen from the range given to your work package by
WP1.
'
title: MBO Permanent Identifier
comments:
- "This is the identifier (mPID) for this row of the spreadsheet \nand for whatever\
\ information is being described in this row. \nIt must start with `mbo_`, only\
\ use printable ASCII characters\nand should be unique in this column. \n"
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
identifier: true
alias: id
owner: Dataset
domain_of:
- Action
- HowTo
- HowToStep
- HowToTip
- Dataset
- PersonOrOrganization
- ContactPoint
- License
- PropertyValue
- DataDownload
- DatasetComment
- SoftwareSourceCode
- SoftwareApplication
- Service
- EmbargoStatement
- DefinedTerm
- Place
- GeoShape
- MonetaryGrant
- Taxon
- Audience
- Document
- Instrument
- Platform
range: string
required: true
pattern: ^mbo_[a-zA-Z0-9_-]+$
metadataPublisherId:
name: metadataPublisherId
description: 'The MBO Persistent IDentifier (mPID) of the person who entered this
row of data.
'
title: Data Entry Person (mPID - you)
comments:
- 'Should be an mPID from the first column of Person.csv
'
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:creator
alias: metadataPublisherId
owner: Dataset
domain_of:
- Action
- HowTo
- HowToStep
- HowToTip
- Dataset
- Person
- Organization
- ContactPoint
- License
- PropertyValue
- DataDownload
- DatasetComment
- SoftwareSourceCode
- SoftwareApplication
- Service
- EmbargoStatement
- DefinedTerm
- Place
- GeoShape
- MonetaryGrant
- Taxon
- Audience
- Document
- Instrument
- Platform
range: PersonOrOrganization
required: true
multivalued: false
metadataDescribedForActionId:
name: metadataDescribedForActionId
description: 'The [Action](#action) which resulted in this metadata record being
described.
Is likely to be the Action associated with a MARCO-BOLO Task.
'
title: Data Entered for Action (mPID)
comments:
- "The mPID from the first column of the Action.csv. \nNote that you can reference\
\ the same mPID in multiple rows.\n\nUses the <https://w3id.org/marco-bolo/isResultOf>\
\ predicate but ultimately ends up being \nrepresented as a triple in the form\
\ `<action> schema:result <this-entity>`.\n"
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: mbo:isResultOf
alias: metadataDescribedForActionId
owner: Dataset
domain_of:
- Action
- HowTo
- HowToStep
- HowToTip
- Dataset
- Person
- Organization
- ContactPoint
- License
- PropertyValue
- DataDownload
- DatasetComment
- SoftwareSourceCode
- SoftwareApplication
- EmbargoStatement
- DefinedTerm
- Place
- GeoShape
- MonetaryGrant
- Taxon
- Audience
- Document
- Instrument
- Platform
range: Action
required: true
multivalued: false
name:
name: name
description: A name/title
title: Name
comments:
- "The commonly used name. \nFor example: `MBA` for Marine Biological Association\n"
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:name
alias: name
owner: Dataset
domain_of:
- Action
- HowTo
- HowToStep
- HowToTip
- Dataset
- Organization
- ContactPoint
- License
- PropertyValue
- DataDownload
- SoftwareSourceCode
- SoftwareApplication
- Service
- Place
- GeoShape
- MonetaryGrant
- Taxon
- Audience
- Document
- Instrument
- Platform
range: string
required: true
multivalued: false
containsVariablesMboIds:
name: containsVariablesMboIds
description: 'The MBO Ids of the PropertyValues describing what was measured or
calculated in your dataset; these should include EBVs, EOVs, etc.
'
title: Contains Variables (PropertyValue mPIDs)
comments:
- 'mPID(s) from the first column of the PropertyValue.csv
Pipe-delimited when there are multiple values
'
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:variableMeasured
alias: containsVariablesMboIds
owner: Dataset
domain_of:
- Dataset
range: PropertyValue
required: true
multivalued: true
description:
name: description
description: A description for this entity.
title: Description
comments:
- 'A concise one-sentence summary of the main point or purpose.
'
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:description
alias: description
owner: Dataset
domain_of:
- Action
- HowTo
- HowToStep
- HowToTip
- Dataset
- Organization
- ContactPoint
- License
- PropertyValue
- DataDownload
- SoftwareSourceCode
- SoftwareApplication
- Service
- DefinedTerm
- Place
- GeoShape
- MonetaryGrant
- Document
- Instrument
- Platform
range: string
required: false
multivalued: false
landingPage:
name: landingPage
description: The digital location where the dataset can be acquired.
title: Landing Pages (URLs)
comments:
- "For example: `https://example.com`, `http://example.com`, \n`ftp://ftp.example.org/data`,\
\ `doi:10.1000/example123`\n\nPipe-delimited when there are multiple values\n"
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:url
alias: landingPage
owner: Dataset
domain_of:
- Dataset
- Document
subproperty_of: url
range: schemaURL
required: false
multivalued: true
aboutTaxonMboIds:
name: aboutTaxonMboIds
description: The MBO Ids of the taxa observed in your dataset.
title: Taxa (mPIDs)
comments:
- 'mPID(s) from the first column of the Taxon.csv
Pipe-delimited when there are multiple values
'
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:about
alias: aboutTaxonMboIds
owner: Dataset
domain_of:
- Dataset
- Document
range: Taxon
required: false
multivalued: true
spatialCoveragePlaceMboId:
name: spatialCoveragePlaceMboId
description: A place defining the spatial coverage of the dataset.
title: Spatial Coverage (Place - mPID)
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:spatialCoverage
alias: spatialCoveragePlaceMboId
owner: Dataset
domain_of:
- Dataset
- Document
range: Place
required: false
multivalued: true
temporalCoverage:
name: temporalCoverage
description: 'The temporal range which this dataset covers.
'
title: Temporal Coverage
comments:
- 'For example: `2012-01/2013-02`.
Must conform to the [ISO8601 time interal specification](https://en.wikipedia.org/wiki/ISO_8601#Time_intervals).
The MBO project only currently supports the start&end variety.
'
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:temporalCoverage
alias: temporalCoverage
owner: Dataset
domain_of:
- Dataset
- Document
range: string
required: false
pattern: ^([+-]?\d{4}((-?(-?(W[0-5][0-9]|[0-1][0-9])|([0-3][0-9][0-9])|((W[0-5][0-9]-?[0-7]|[0-3][0-9][0-9]|[0-1][0-9]-?[0-3][0-9])(T[0-2][0-9](:?[0-5][0-9](:?[0-5][0-9](\.\d+)?)?)?(Z|([+-][0-2][0-9](:?[0-5][0-9])?))?)?)))?)?)/(\.\.|[+-]?\d{4}((-?(-?(W[0-5][0-9]|[0-1][0-9])|([0-3][0-9][0-9])|((W[0-5][0-9]-?[0-7]|[0-3][0-9][0-9]|[0-1][0-9]-?[0-3][0-9])(T[0-2][0-9](:?[0-5][0-9](:?[0-5][0-9](\.\d+)?)?)?(Z|([+-][0-2][0-9](:?[0-5][0-9])?))?)?)))?)?)$
dataDownloadMboIds:
name: dataDownloadMboIds
description: The different formats in which the dataset can be accessed or downloaded.
title: Data Downloads (mPIDs)
comments:
- 'a list of Pipe-delimited mPIDs from the first column of DataDownload.csv
'
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:distribution
alias: dataDownloadMboIds
owner: Dataset
domain_of:
- Dataset
range: DataDownload
required: false
multivalued: true
authorId:
name: authorId
description: The Permanent Identifier of person or organization who created this
entity.
title: Author (mPID)
comments:
- 'Should be an mPID from the first column of either Person.csv or Organization.csv
'
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:author
alias: authorId
owner: Dataset
domain_of:
- Dataset
- DataDownload
- DatasetComment
- SoftwareSourceCode
- SoftwareApplication
- Document
range: PersonOrOrganization
required: false
multivalued: true
contributorIds:
name: contributorIds
description: The Permanent Identifiers of people or organizations who contributed
to the entity.
title: Contributors (mPIDs)
comments:
- 'Should be an mPID from the first column of either Person.csv or Organization.csv
Pipe-delimited when there are multiple values.
'
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:contributor
alias: contributorIds
owner: Dataset
domain_of:
- HowToStep
- Dataset
- DataDownload
- SoftwareSourceCode
- SoftwareApplication
- Document
range: PersonOrOrganization
required: false
multivalued: true
ownerId:
name: ownerId
description: 'The Permanent Identifier of an person or organization who owns the
entity.
'
title: Owner (mPID)
comments:
- 'Should be an mPID from the first column of either Person.csv or Organization.csv
'
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:copyrightHolder
alias: ownerId
owner: Dataset
domain_of:
- Dataset
- DataDownload
- SoftwareSourceCode
- SoftwareApplication
- Document
- Instrument
- Platform
range: PersonOrOrganization
required: false
multivalued: false
maintainerId:
name: maintainerId
description: The Permanent Identifier of an person or organization who maintains
this entity.
title: Maintainer (mPID)
comments:
- 'Should be an mPID from the first column of either Person.csv or Organization.csv
'
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:maintainer
alias: maintainerId
owner: Dataset
domain_of:
- Dataset
- DataDownload
- SoftwareSourceCode
- SoftwareApplication
- Document
- Instrument
range: PersonOrOrganization
required: false
multivalued: false
publisherId:
name: publisherId
description: The Permanent Identifier of an person or organization who created
the entity.
title: Publisher (mPID)
comments:
- 'Should be an mPID from the first column of either Person.csv or Organization.csv
'
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:publisher
alias: publisherId
owner: Dataset
domain_of:
- Dataset
- DataDownload
- SoftwareSourceCode
- SoftwareApplication
- Document
range: PersonOrOrganization
required: false
multivalued: false
basedOnIds:
name: basedOnIds
description: 'The URL PID of any datasets which yours is based on.
'
title: Based On (URL PIDs)
comments:
- "WARNING: There are no foreign key checks.\n\nFor example: `https://example.com`,\
\ `http://example.com`, \n`ftp://ftp.example.org/data`, `doi:10.1000/example123`\n\
\nPipe-delimited when there are multiple values\n"
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:isBasedOn
alias: basedOnIds
owner: Dataset
domain_of:
- Dataset
- Document
range: uri
required: false
multivalued: true
hasPartIds:
name: hasPartIds
description: "Pipe-delimited list of the parts which comprise the aggregate dataset\
\ being referenced. \n"
title: Has Parts (URL PIDs)
comments:
- "WARNING: There are no foreign key checks.\nYou are required to use the full\
\ URI here to reference pre-existing Datasets.\n\nFor example: `https://example.com`,\
\ `http://example.com`, \n`ftp://ftp.example.org/data`, `doi:10.1000/example123`\n\
\nPipe-delimited when there are multiple values\n"
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:hasPart
alias: hasPartIds
owner: Dataset
domain_of:
- Dataset
range: uri
required: false
multivalued: true
publishingStatusMboId:
name: publishingStatusMboId
description: The publishing status of the entity.
title: Publishing Status (mPID)
comments:
- 'Should be an mPID from the first column of PublishingStatusDefinedTerm.csv
'
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:creativeWorkStatus
alias: publishingStatusMboId
owner: Dataset
domain_of:
- Dataset
- DataDownload
- SoftwareSourceCode
- SoftwareApplication
- Document
range: PublishingStatusDefinedTerm
required: false
multivalued: false
embargoStatementMboId:
name: embargoStatementMboId
description: an MBO identifier for Embargo information
title: Embargo Statement (mPID)
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:usageInfo
alias: embargoStatementMboId
owner: Dataset
domain_of:
- Dataset
- Document
range: EmbargoStatement
required: false
multivalued: false
dateCreated:
name: dateCreated
title: Date Created
comments:
- 'A date in ISO8601, YYYY, YYYY-MM, or YYYY-MM-DD format
For example: `2025`
`2025-12`
`2025-12-31`
'
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:dateCreated
alias: dateCreated
owner: Dataset
domain_of:
- Dataset
- DataDownload
- SoftwareSourceCode
- SoftwareApplication
- Document
range: string
required: false
multivalued: false
pattern: ^(\d{4}(-\d{2}(-\d{2})?)?)$
dateModified:
name: dateModified
title: Dates Modified
comments:
- 'A date in ISO8601, YYYY, YYYY-MM, or YYYY-MM-DD format
For example: `2025`
`2025-12`
`2025-12-31`
Pipe-delimited when there are multiple values
'
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:dateModified
alias: dateModified
owner: Dataset
domain_of:
- Dataset
- DataDownload
- SoftwareSourceCode
- SoftwareApplication
- Document
range: string
required: false
multivalued: true
pattern: ^(\d{4}(-\d{2}(-\d{2})?)?)$
datePublished:
name: datePublished
title: Date Published
comments:
- 'A date in ISO8601, YYYY, YYYY-MM, or YYYY-MM-DD format
For example: `2025`
`2025-12`
`2025-12-31`
'
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:datePublished
alias: datePublished
owner: Dataset
domain_of:
- Dataset
- DataDownload
- SoftwareSourceCode
- SoftwareApplication
- Document
range: string
required: false
multivalued: false
pattern: ^(\d{4}(-\d{2}(-\d{2})?)?)$
inProgressDate:
name: inProgressDate
description: The point in time when you expect that the thing (e.g. data, document)
will be accessible.
title: In Progress Date
comments:
- 'A date in ISO8601, YYYY, YYYY-MM, or YYYY-MM-DD format
For example: `2025`
`2025-12`
`2025-12-31`
'
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: mbo:inProgressDate
alias: inProgressDate
owner: Dataset
domain_of:
- Dataset
- Document
range: string
required: false
multivalued: false
pattern: ^(\d{4}(-\d{2}(-\d{2})?)?)$
licenseMboId:
name: licenseMboId
description: The MBO PID of the license which covers the entity you are describing.
title: License (mPID)
comments:
- 'If you are unsure, please use ''undefined'' as a placeholder until
you can update it.
If restriction is needed, please use ''Restricted. Please contact product owner.''
'
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:license
alias: licenseMboId
owner: Dataset
domain_of:
- Dataset
- DataDownload
- Document
range: License
required: false
conditionsOfAccess:
name: conditionsOfAccess
description: A description of how the data may be accessed; for use when data
is not publicly accessible.
title: Conditions of Access
comments:
- "For example: `Access controlled by [Country] Fisheries Management Authority\
\ due to commercial sensitivity \nand resource management considerations. Contact\
\ [specific office/person] at [contact details]`\n"
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:conditionsOfAccess
alias: conditionsOfAccess
owner: Dataset
domain_of:
- Dataset
- Document
range: string
required: false
multivalued: false
keywords:
name: keywords
description: Key words classifying this entity.
title: Keywords
comments:
- 'Separate multiple keyword with a ''|''.
e.g. Whales|Population Genetics
'
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:keywords
alias: keywords
owner: Dataset
domain_of:
- Dataset
- Organization
- SoftwareSourceCode
- SoftwareApplication
- Document
- Instrument
- Platform
range: string
required: false
multivalued: true
audienceMboIds:
name: audienceMboIds
description: 'The intended audiences for this entity.
'
title: Audiences (mPIDs)
comments:
- 'Should be an mPID from the first column of Audience.csv
'
from_schema: https://w3id.org/marco-bolo/csv-input-classes
rank: 1000
slot_uri: schema:audience
alias: audienceMboIds
owner: Dataset
domain_of:
- HowToStep
- HowToTip
- Dataset
- DataDownload
- Service
- Document
range: Audience
required: false
multivalued: true
class_uri: schema:Dataset