Class: Dataset

A dataset either produced as part of MBO or used in the process of producing an MBO dataset.

URI: schema:Dataset

Slots

Name | Cardinality and Range | Description | Inheritance
id | 1 String | An identifier chosen from the range given to your work package by WP1 | direct
metadataPublisherId | 1 PersonOrOrganization | The MBO Persistent IDentifier (mPID) of the person who entered this row of da... | direct
metadataDescribedForActionId | 1 Action | The Action which resulted in this metadata record being described | direct
name | 1 String | A name/title | direct
containsVariablesMboIds | 1..* PropertyValue | The MBO Ids of the PropertyValues describing what was measured or calculated ... | direct
description | 0..1 String | A description for this entity | direct
landingPage | * SchemaURL | The digital location where the dataset can be acquired | direct
aboutTaxonMboIds | * Taxon | The MBO Ids of the taxa observed in your dataset | direct
spatialCoveragePlaceMboId | * Place | A place defining the spatial coverage of the dataset | direct
temporalCoverage | 0..1 String | The temporal range which this dataset covers | direct
dataDownloadMboIds | * DataDownload | The different formats in which the dataset can be accessed or downloaded | direct
authorId | * PersonOrOrganization | The Permanent Identifier of the person or organization who created this entity | direct
contributorIds | * PersonOrOrganization | The Permanent Identifiers of people or organizations who contributed to the e... | direct
ownerId | 0..1 PersonOrOrganization | The Permanent Identifier of a person or organization who owns the entity | direct
maintainerId | 0..1 PersonOrOrganization | The Permanent Identifier of a person or organization who maintains this enti... | direct
publisherId | 0..1 PersonOrOrganization | The Permanent Identifier of a person or organization who published the entity | direct
basedOnIds | * Uri | The URL PID of any datasets which yours is based on | direct
hasPartIds | * Uri | Pipe-delimited list of the parts which comprise the aggregate dataset being r... | direct
publishingStatusMboId | 0..1 PublishingStatusDefinedTerm | The publishing status of the entity | direct
embargoStatementMboId | 0..1 EmbargoStatement | An MBO identifier for embargo information | direct
dateCreated | 0..1 String | | direct
dateModified | * String | | direct
datePublished | 0..1 String | | direct
inProgressDate | 0..1 String | The point in time when you expect that the thing (e.g. data, document) will be accessible | direct
licenseMboId | 0..1 License | The MBO PID of the license which covers the entity you are describing | direct
conditionsOfAccess | 0..1 String | A description of how the data may be accessed; for use when data is not publi... | direct
keywords | * String | Key words classifying this entity | direct
audienceMboIds | * Audience | The intended audiences for this entity | direct
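
The cardinalities listed for the slots can be checked mechanically before a row is submitted. The following is a minimal sketch, assuming one Dataset row has been read into a dict of strings keyed by slot name (for example with `csv.DictReader`) and that multivalued cells are pipe-delimited, as the slot comments describe. The helper names `split_cell` and `parse_row` and all sample identifiers are hypothetical; they are not part of any MBO tooling.

```python
# Slots with cardinality 1 or 1..* in the Slots table.
REQUIRED = [
    "id",
    "metadataPublisherId",
    "metadataDescribedForActionId",
    "name",
    "containsVariablesMboIds",
]

# Slots with cardinality * or 1..* in the Slots table.
MULTIVALUED = {
    "containsVariablesMboIds", "landingPage", "aboutTaxonMboIds",
    "spatialCoveragePlaceMboId", "dataDownloadMboIds", "authorId",
    "contributorIds", "basedOnIds", "hasPartIds", "dateModified",
    "keywords", "audienceMboIds",
}


def split_cell(cell):
    """Split a pipe-delimited cell into trimmed, non-empty values."""
    return [part.strip() for part in (cell or "").split("|") if part.strip()]


def parse_row(row):
    """Return (values, problems) for one row given as {slot name: cell text}."""
    values, problems = {}, []
    for slot, cell in row.items():
        if slot in MULTIVALUED:
            values[slot] = split_cell(cell)
        else:
            # Single-valued cells are kept whole; an empty cell becomes None.
            values[slot] = (cell or "").strip() or None
    for slot in REQUIRED:
        if not values.get(slot):
            problems.append(f"missing required slot: {slot}")
    return values, problems


# Hypothetical example row; the identifiers are made up for illustration.
row = {
    "id": "mbo_example_dataset_1",
    "metadataPublisherId": "mbo_example_person_1",
    "metadataDescribedForActionId": "mbo_example_action_1",
    "name": "Example dataset",
    "containsVariablesMboIds": "mbo_example_var_1|mbo_example_var_2",
    "keywords": "Whales|Population Genetics",
}
values, problems = parse_row(row)
```

Here `values["keywords"]` is a two-item list and `problems` is empty. The sketch checks presence only; it does not confirm that a referenced mPID exists in the corresponding CSV.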

Usages

used by | used in | type | used
Action | resultingDatasetMboIds | range | Dataset
DataDownload | datasetMboId | range | Dataset
DatasetComment | commentAboutDatasetMboId | range | Dataset
EmbargoStatement | embargoedDatasetMboId | range | Dataset
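
Because these slots in other classes point at Dataset rows, it can be useful to confirm that every referenced mPID actually exists. The following is a minimal sketch of such a check, assuming each sheet has been loaded as a list of row dicts. The function name `find_dangling_references` is hypothetical, and the multivalued flags are an assumption inferred from the slot names (`...Ids` versus `...Id`), not taken from the schema.

```python
# Slots from the Usages table whose range is Dataset: (class, slot, multivalued).
# The multivalued flags are an assumption based on the slot names.
DATASET_REFERENCES = [
    ("Action", "resultingDatasetMboIds", True),
    ("DataDownload", "datasetMboId", False),
    ("DatasetComment", "commentAboutDatasetMboId", False),
    ("EmbargoStatement", "embargoedDatasetMboId", False),
]


def find_dangling_references(dataset_ids, sheets):
    """Return (class, row id, slot, value) for references to unknown Dataset ids.

    `sheets` maps a class name to a list of row dicts keyed by slot name.
    """
    known = set(dataset_ids)
    dangling = []
    for class_name, slot, multivalued in DATASET_REFERENCES:
        for row in sheets.get(class_name, []):
            cell = (row.get(slot) or "").strip()
            refs = cell.split("|") if multivalued else [cell]
            for ref in (r.strip() for r in refs):
                if ref and ref not in known:
                    dangling.append((class_name, row.get("id"), slot, ref))
    return dangling


# Hypothetical data: one Action references a Dataset id that does not exist.
sheets = {
    "Action": [{"id": "mbo_a1", "resultingDatasetMboIds": "mbo_d1|mbo_d9"}],
    "DataDownload": [{"id": "mbo_dl1", "datasetMboId": "mbo_d1"}],
}
dangling = find_dangling_references(["mbo_d1"], sheets)
```

With the sample data, `dangling` contains a single entry for `mbo_d9`, the only reference that does not resolve.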

Identifier and Mapping Information

Schema Source

  • from schema: https://w3id.org/marco-bolo/csv-input-classes

Mappings

Mapping Type | Mapped Value
self | schema:Dataset
native | mbo:Dataset

LinkML Source

Direct

name: Dataset
description: A dataset either produced as part of MBO or used in the process of producing
  an MBO dataset.
from_schema: https://w3id.org/marco-bolo/csv-input-classes
slots:
- id
- metadataPublisherId
- metadataDescribedForActionId
- name
- containsVariablesMboIds
- description
- landingPage
- aboutTaxonMboIds
- spatialCoveragePlaceMboId
- temporalCoverage
- dataDownloadMboIds
- authorId
- contributorIds
- ownerId
- maintainerId
- publisherId
- basedOnIds
- hasPartIds
- publishingStatusMboId
- embargoStatementMboId
- dateCreated
- dateModified
- datePublished
- inProgressDate
- licenseMboId
- conditionsOfAccess
- keywords
- audienceMboIds
class_uri: schema:Dataset

Induced

name: Dataset
description: A dataset either produced as part of MBO or used in the process of producing
  an MBO dataset.
from_schema: https://w3id.org/marco-bolo/csv-input-classes
attributes:
  id:
    name: id
    description: 'An identifier chosen from the range given to your work package by
      WP1.

      '
    title: MBO Permanent Identifier
    comments:
    - "This is the identifier (mPID) for this row of the spreadsheet \nand for whatever\
      \ information is being described in this row. \nIt must start with `mbo_`, only\
      \ use printable ASCII characters\nand should be unique in this column. \n"
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    identifier: true
    alias: id
    owner: Dataset
    domain_of:
    - Action
    - HowTo
    - HowToStep
    - HowToTip
    - Dataset
    - PersonOrOrganization
    - ContactPoint
    - License
    - PropertyValue
    - DataDownload
    - DatasetComment
    - SoftwareSourceCode
    - SoftwareApplication
    - Service
    - EmbargoStatement
    - DefinedTerm
    - Place
    - GeoShape
    - MonetaryGrant
    - Taxon
    - Audience
    - Document
    - Instrument
    - Platform
    range: string
    required: true
    pattern: ^mbo_[a-zA-Z0-9_-]+$
  metadataPublisherId:
    name: metadataPublisherId
    description: 'The MBO Persistent IDentifier (mPID) of the person who entered this
      row of data.

      '
    title: Data Entry Person (mPID - you)
    comments:
    - 'Should be an mPID from the first column of Person.csv

      '
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:creator
    alias: metadataPublisherId
    owner: Dataset
    domain_of:
    - Action
    - HowTo
    - HowToStep
    - HowToTip
    - Dataset
    - Person
    - Organization
    - ContactPoint
    - License
    - PropertyValue
    - DataDownload
    - DatasetComment
    - SoftwareSourceCode
    - SoftwareApplication
    - Service
    - EmbargoStatement
    - DefinedTerm
    - Place
    - GeoShape
    - MonetaryGrant
    - Taxon
    - Audience
    - Document
    - Instrument
    - Platform
    range: PersonOrOrganization
    required: true
    multivalued: false
  metadataDescribedForActionId:
    name: metadataDescribedForActionId
    description: 'The [Action](#action) which resulted in this metadata record being
      described.


      Is likely to be the Action associated with a MARCO-BOLO Task.

      '
    title: Data Entered for Action (mPID)
    comments:
    - "The mPID from the first column of the Action.csv. \nNote that you can reference\
      \ the same mPID in multiple rows.\n\nUses the <https://w3id.org/marco-bolo/isResultOf>\
      \ predicate but ultimately ends up being \nrepresented as a triple in the form\
      \ `<action> schema:result <this-entity>`.\n"
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: mbo:isResultOf
    alias: metadataDescribedForActionId
    owner: Dataset
    domain_of:
    - Action
    - HowTo
    - HowToStep
    - HowToTip
    - Dataset
    - Person
    - Organization
    - ContactPoint
    - License
    - PropertyValue
    - DataDownload
    - DatasetComment
    - SoftwareSourceCode
    - SoftwareApplication
    - EmbargoStatement
    - DefinedTerm
    - Place
    - GeoShape
    - MonetaryGrant
    - Taxon
    - Audience
    - Document
    - Instrument
    - Platform
    range: Action
    required: true
    multivalued: false
  name:
    name: name
    description: A name/title
    title: Name
    comments:
    - "The commonly used name. \nFor example: `MBA` for Marine Biological Association\n"
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:name
    alias: name
    owner: Dataset
    domain_of:
    - Action
    - HowTo
    - HowToStep
    - HowToTip
    - Dataset
    - Organization
    - ContactPoint
    - License
    - PropertyValue
    - DataDownload
    - SoftwareSourceCode
    - SoftwareApplication
    - Service
    - Place
    - GeoShape
    - MonetaryGrant
    - Taxon
    - Audience
    - Document
    - Instrument
    - Platform
    range: string
    required: true
    multivalued: false
  containsVariablesMboIds:
    name: containsVariablesMboIds
    description: 'The MBO Ids of the PropertyValues describing what was measured or
      calculated in your dataset; these should include EBVs, EOVs, etc.

      '
    title: Contains Variables (PropertyValue mPIDs)
    comments:
    - 'mPID(s) from the first column of the PropertyValue.csv

      Pipe-delimited when there are multiple values

      '
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:variableMeasured
    alias: containsVariablesMboIds
    owner: Dataset
    domain_of:
    - Dataset
    range: PropertyValue
    required: true
    multivalued: true
  description:
    name: description
    description: A description for this entity.
    title: Description
    comments:
    - 'A concise one-sentence summary of the main point or purpose.

      '
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:description
    alias: description
    owner: Dataset
    domain_of:
    - Action
    - HowTo
    - HowToStep
    - HowToTip
    - Dataset
    - Organization
    - ContactPoint
    - License
    - PropertyValue
    - DataDownload
    - SoftwareSourceCode
    - SoftwareApplication
    - Service
    - DefinedTerm
    - Place
    - GeoShape
    - MonetaryGrant
    - Document
    - Instrument
    - Platform
    range: string
    required: false
    multivalued: false
  landingPage:
    name: landingPage
    description: The digital location where the dataset can be acquired.
    title: Landing Pages (URLs)
    comments:
    - "For example: `https://example.com`, `http://example.com`, \n`ftp://ftp.example.org/data`,\
      \ `doi:10.1000/example123`\n\nPipe-delimited when there are multiple values\n"
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:url
    alias: landingPage
    owner: Dataset
    domain_of:
    - Dataset
    - Document
    subproperty_of: url
    range: schemaURL
    required: false
    multivalued: true
  aboutTaxonMboIds:
    name: aboutTaxonMboIds
    description: The MBO Ids of the taxa observed in your dataset.
    title: Taxa (mPIDs)
    comments:
    - 'mPID(s) from the first column of the Taxon.csv

      Pipe-delimited when there are multiple values

      '
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:about
    alias: aboutTaxonMboIds
    owner: Dataset
    domain_of:
    - Dataset
    - Document
    range: Taxon
    required: false
    multivalued: true
  spatialCoveragePlaceMboId:
    name: spatialCoveragePlaceMboId
    description: A place defining the spatial coverage of the dataset.
    title: Spatial Coverage (Place - mPID)
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:spatialCoverage
    alias: spatialCoveragePlaceMboId
    owner: Dataset
    domain_of:
    - Dataset
    - Document
    range: Place
    required: false
    multivalued: true
  temporalCoverage:
    name: temporalCoverage
    description: 'The temporal range which this dataset covers.

      '
    title: Temporal Coverage
    comments:
    - 'For example: `2012-01/2013-02`.


      Must conform to the [ISO 8601 time interval specification](https://en.wikipedia.org/wiki/ISO_8601#Time_intervals).

      The MBO project currently supports only the start/end variety.

      '
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:temporalCoverage
    alias: temporalCoverage
    owner: Dataset
    domain_of:
    - Dataset
    - Document
    range: string
    required: false
    pattern: ^([+-]?\d{4}((-?(-?(W[0-5][0-9]|[0-1][0-9])|([0-3][0-9][0-9])|((W[0-5][0-9]-?[0-7]|[0-3][0-9][0-9]|[0-1][0-9]-?[0-3][0-9])(T[0-2][0-9](:?[0-5][0-9](:?[0-5][0-9](\.\d+)?)?)?(Z|([+-][0-2][0-9](:?[0-5][0-9])?))?)?)))?)?)/(\.\.|[+-]?\d{4}((-?(-?(W[0-5][0-9]|[0-1][0-9])|([0-3][0-9][0-9])|((W[0-5][0-9]-?[0-7]|[0-3][0-9][0-9]|[0-1][0-9]-?[0-3][0-9])(T[0-2][0-9](:?[0-5][0-9](:?[0-5][0-9](\.\d+)?)?)?(Z|([+-][0-2][0-9](:?[0-5][0-9])?))?)?)))?)?)$
  dataDownloadMboIds:
    name: dataDownloadMboIds
    description: The different formats in which the dataset can be accessed or downloaded.
    title: Data Downloads (mPIDs)
    comments:
    - 'A pipe-delimited list of mPIDs from the first column of DataDownload.csv

      '
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:distribution
    alias: dataDownloadMboIds
    owner: Dataset
    domain_of:
    - Dataset
    range: DataDownload
    required: false
    multivalued: true
  authorId:
    name: authorId
    description: The Permanent Identifier of the person or organization who created this
      entity.
    title: Author (mPID)
    comments:
    - 'Should be an mPID from the first column of either Person.csv or Organization.csv

      '
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:author
    alias: authorId
    owner: Dataset
    domain_of:
    - Dataset
    - DataDownload
    - DatasetComment
    - SoftwareSourceCode
    - SoftwareApplication
    - Document
    range: PersonOrOrganization
    required: false
    multivalued: true
  contributorIds:
    name: contributorIds
    description: The Permanent Identifiers of people or organizations who contributed
      to the entity.
    title: Contributors (mPIDs)
    comments:
    - 'Should be an mPID from the first column of either Person.csv or Organization.csv

      Pipe-delimited when there are multiple values.

      '
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:contributor
    alias: contributorIds
    owner: Dataset
    domain_of:
    - HowToStep
    - Dataset
    - DataDownload
    - SoftwareSourceCode
    - SoftwareApplication
    - Document
    range: PersonOrOrganization
    required: false
    multivalued: true
  ownerId:
    name: ownerId
    description: 'The Permanent Identifier of a person or organization who owns the
      entity.

      '
    title: Owner (mPID)
    comments:
    - 'Should be an mPID from the first column of either Person.csv or Organization.csv

      '
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:copyrightHolder
    alias: ownerId
    owner: Dataset
    domain_of:
    - Dataset
    - DataDownload
    - SoftwareSourceCode
    - SoftwareApplication
    - Document
    - Instrument
    - Platform
    range: PersonOrOrganization
    required: false
    multivalued: false
  maintainerId:
    name: maintainerId
    description: The Permanent Identifier of a person or organization who maintains
      this entity.
    title: Maintainer (mPID)
    comments:
    - 'Should be an mPID from the first column of either Person.csv or Organization.csv

      '
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:maintainer
    alias: maintainerId
    owner: Dataset
    domain_of:
    - Dataset
    - DataDownload
    - SoftwareSourceCode
    - SoftwareApplication
    - Document
    - Instrument
    range: PersonOrOrganization
    required: false
    multivalued: false
  publisherId:
    name: publisherId
    description: The Permanent Identifier of a person or organization who published
      the entity.
    title: Publisher (mPID)
    comments:
    - 'Should be an mPID from the first column of either Person.csv or Organization.csv

      '
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:publisher
    alias: publisherId
    owner: Dataset
    domain_of:
    - Dataset
    - DataDownload
    - SoftwareSourceCode
    - SoftwareApplication
    - Document
    range: PersonOrOrganization
    required: false
    multivalued: false
  basedOnIds:
    name: basedOnIds
    description: 'The URL PID of any datasets which yours is based on.

      '
    title: Based On (URL PIDs)
    comments:
    - "WARNING: There are no foreign key checks.\n\nFor example: `https://example.com`,\
      \ `http://example.com`, \n`ftp://ftp.example.org/data`, `doi:10.1000/example123`\n\
      \nPipe-delimited when there are multiple values\n"
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:isBasedOn
    alias: basedOnIds
    owner: Dataset
    domain_of:
    - Dataset
    - Document
    range: uri
    required: false
    multivalued: true
  hasPartIds:
    name: hasPartIds
    description: "Pipe-delimited list of the parts which comprise the aggregate dataset\
      \ being referenced. \n"
    title: Has Parts (URL PIDs)
    comments:
    - "WARNING: There are no foreign key checks.\nYou are required to use the full\
      \ URI here to reference pre-existing Datasets.\n\nFor example: `https://example.com`,\
      \ `http://example.com`, \n`ftp://ftp.example.org/data`, `doi:10.1000/example123`\n\
      \nPipe-delimited when there are multiple values\n"
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:hasPart
    alias: hasPartIds
    owner: Dataset
    domain_of:
    - Dataset
    range: uri
    required: false
    multivalued: true
  publishingStatusMboId:
    name: publishingStatusMboId
    description: The publishing status of the entity.
    title: Publishing Status (mPID)
    comments:
    - 'Should be an mPID from the first column of PublishingStatusDefinedTerm.csv

      '
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:creativeWorkStatus
    alias: publishingStatusMboId
    owner: Dataset
    domain_of:
    - Dataset
    - DataDownload
    - SoftwareSourceCode
    - SoftwareApplication
    - Document
    range: PublishingStatusDefinedTerm
    required: false
    multivalued: false
  embargoStatementMboId:
    name: embargoStatementMboId
    description: An MBO identifier for embargo information.
    title: Embargo Statement (mPID)
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:usageInfo
    alias: embargoStatementMboId
    owner: Dataset
    domain_of:
    - Dataset
    - Document
    range: EmbargoStatement
    required: false
    multivalued: false
  dateCreated:
    name: dateCreated
    title: Date Created
    comments:
    - 'A date in ISO 8601 format: YYYY, YYYY-MM, or YYYY-MM-DD

      For example: `2025`

      `2025-12`

      `2025-12-31`

      '
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:dateCreated
    alias: dateCreated
    owner: Dataset
    domain_of:
    - Dataset
    - DataDownload
    - SoftwareSourceCode
    - SoftwareApplication
    - Document
    range: string
    required: false
    multivalued: false
    pattern: ^(\d{4}(-\d{2}(-\d{2})?)?)$
  dateModified:
    name: dateModified
    title: Dates Modified
    comments:
    - 'A date in ISO 8601 format: YYYY, YYYY-MM, or YYYY-MM-DD

      For example: `2025`

      `2025-12`

      `2025-12-31`


      Pipe-delimited when there are multiple values

      '
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:dateModified
    alias: dateModified
    owner: Dataset
    domain_of:
    - Dataset
    - DataDownload
    - SoftwareSourceCode
    - SoftwareApplication
    - Document
    range: string
    required: false
    multivalued: true
    pattern: ^(\d{4}(-\d{2}(-\d{2})?)?)$
  datePublished:
    name: datePublished
    title: Date Published
    comments:
    - 'A date in ISO 8601 format: YYYY, YYYY-MM, or YYYY-MM-DD

      For example: `2025`

      `2025-12`

      `2025-12-31`

      '
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:datePublished
    alias: datePublished
    owner: Dataset
    domain_of:
    - Dataset
    - DataDownload
    - SoftwareSourceCode
    - SoftwareApplication
    - Document
    range: string
    required: false
    multivalued: false
    pattern: ^(\d{4}(-\d{2}(-\d{2})?)?)$
  inProgressDate:
    name: inProgressDate
    description: The point in time when you expect that the thing (e.g. data, document)
      will be accessible.
    title: In Progress Date
    comments:
    - 'A date in ISO 8601 format: YYYY, YYYY-MM, or YYYY-MM-DD

      For example: `2025`

      `2025-12`

      `2025-12-31`

      '
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: mbo:inProgressDate
    alias: inProgressDate
    owner: Dataset
    domain_of:
    - Dataset
    - Document
    range: string
    required: false
    multivalued: false
    pattern: ^(\d{4}(-\d{2}(-\d{2})?)?)$
  licenseMboId:
    name: licenseMboId
    description: The MBO PID of the license which covers the entity you are describing.
    title: License (mPID)
    comments:
    - 'If you are unsure, please use ''undefined'' as a placeholder until

      you can update it.

      If restriction is needed, please use ''Restricted. Please contact product owner.''

      '
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:license
    alias: licenseMboId
    owner: Dataset
    domain_of:
    - Dataset
    - DataDownload
    - Document
    range: License
    required: false
  conditionsOfAccess:
    name: conditionsOfAccess
    description: A description of how the data may be accessed; for use when data
      is not publicly accessible.
    title: Conditions of Access
    comments:
    - "For example: `Access controlled by [Country] Fisheries Management Authority\
      \ due to commercial sensitivity \nand resource management considerations. Contact\
      \ [specific office/person] at [contact details]`\n"
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:conditionsOfAccess
    alias: conditionsOfAccess
    owner: Dataset
    domain_of:
    - Dataset
    - Document
    range: string
    required: false
    multivalued: false
  keywords:
    name: keywords
    description: Key words classifying this entity.
    title: Keywords
    comments:
    - 'Separate multiple keywords with a ''|''.

      e.g. Whales|Population Genetics

      '
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:keywords
    alias: keywords
    owner: Dataset
    domain_of:
    - Dataset
    - Organization
    - SoftwareSourceCode
    - SoftwareApplication
    - Document
    - Instrument
    - Platform
    range: string
    required: false
    multivalued: true
  audienceMboIds:
    name: audienceMboIds
    description: 'The intended audiences for this entity.

      '
    title: Audiences (mPIDs)
    comments:
    - 'Should be an mPID from the first column of Audience.csv

      '
    from_schema: https://w3id.org/marco-bolo/csv-input-classes
    rank: 1000
    slot_uri: schema:audience
    alias: audienceMboIds
    owner: Dataset
    domain_of:
    - HowToStep
    - HowToTip
    - Dataset
    - DataDownload
    - Service
    - Document
    range: Audience
    required: false
    multivalued: true
class_uri: schema:Dataset
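
The `pattern` constraints in the induced definition can be applied to a row before submission. The following is a minimal sketch, assuming the row is a dict of strings keyed by slot name and that each pipe-delimited item in a multivalued cell is checked on its own. The two regular expressions are copied from the `id` and date slots above; the function name `pattern_problems` and the sample values are hypothetical. The longer `temporalCoverage` pattern is deliberately left out of this sketch.

```python
import re

# Patterns copied verbatim from the induced slot definitions.
ID_PATTERN = re.compile(r"^mbo_[a-zA-Z0-9_-]+$")
DATE_PATTERN = re.compile(r"^(\d{4}(-\d{2}(-\d{2})?)?)$")

# Slots that carry the date pattern in the induced definition.
DATE_SLOTS = ("dateCreated", "dateModified", "datePublished", "inProgressDate")


def pattern_problems(row):
    """Return a list of pattern violations for one row (empty if none)."""
    problems = []
    # fullmatch avoids accepting a value with a trailing newline.
    if not ID_PATTERN.fullmatch(row.get("id") or ""):
        problems.append("id does not match ^mbo_[a-zA-Z0-9_-]+$")
    for slot in DATE_SLOTS:
        cell = row.get(slot) or ""
        # dateModified is multivalued; splitting on "|" is harmless for the others.
        for value in (v.strip() for v in cell.split("|")):
            if value and not DATE_PATTERN.fullmatch(value):
                problems.append(
                    f"{slot}: {value!r} is not YYYY, YYYY-MM, or YYYY-MM-DD"
                )
    return problems


# Hypothetical rows for illustration.
ok = pattern_problems(
    {"id": "mbo_ds_1", "dateCreated": "2025-12", "dateModified": "2025|2025-12-31"}
)
bad = pattern_problems({"id": "ds_1", "dateCreated": "12/2025"})
```

In this sketch `ok` is empty and `bad` reports both the `id` and the `dateCreated` value. The date pattern checks shape only, so a value such as `2025-13-99` would still pass and would need a separate calendar check.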