Skip to content

Class: Analysis

Represents the computational processing applied to data from a sequencing experiment, or from another analysis. This can be described at the level of individual analysis steps in a workflow/pipeline, or more generally for the workflow/pipeline as a whole.

URI: https://w3id.org/fga-wg/schema/bundle/Analysis

 classDiagram
    class Analysis
    click Analysis href "../Analysis/"
      Analysis : analysis_description

      Analysis : analysis_external_id

      Analysis : analysis_id

      Analysis : analysis_input_sources





        Analysis --> "1..*" InputSource : analysis_input_sources
        click InputSource href "../InputSource/"



      Analysis : analysis_label

      Analysis : analysis_main_tool





        Analysis --> "0..1" Any : analysis_main_tool
        click Any href "../Any/"



      Analysis : analysis_main_tool_version

      Analysis : analysis_protocol

      Analysis : analysis_study_ref

      Analysis : analysis_type





        Analysis --> "1" Term : analysis_type
        click Term href "../Term/"



      Analysis : analysis_workflow

Example

Example JSON
{
  "analysis_description": "ENCODE3 ChIP-seq pipeline on GRCH38 with replicated peak calling using MACS.",
  "analysis_external_id": "encode:ENCAN718KHT",
  "analysis_id": "analysis:ENCAN718KHT",
  "analysis_input_sources": [
    {
      "biological_replicate_labels": [
        "1",
        "2"
      ],
      "inputsource_ref": "experiment:ENCSR000DPJ",
      "qualified_relation": "prov:wasInformedBy",
      "technical_replicate_labels": [
        "1_1",
        "2_1"
      ]
    },
    {
      "biological_replicate_labels": [
        "1",
        "2"
      ],
      "date_of_retrieval": "2016-04-19",
      "inputsource_external_ref": "https://www.encodeproject.org/files/GRCh38_no_alt_analysis_set_GCA_000001405.15",
      "qualified_relation": "https://bioschemas.org/FormalParameter",
      "technical_replicate_labels": [
        "1_1",
        "2_1"
      ]
    }
  ],
  "analysis_label": "ENCODE3 ChIP-seq pipeline, GRCH38, replicated peak calling",
  "analysis_main_tool": "biotools:macs",
  "analysis_main_tool_version": "2.10",
  "analysis_protocol": "https://www.encodeproject.org/documents/7009beb8-340b-4e71-b9db-53bb020c7fe2/@@download/attachment/ChIP-seq_pipeline_overview.pdf",
  "analysis_study_ref": "study:S-EPMC7391744",
  "analysis_type": {
    "id": "edam:operation_3222",
    "label": "Peak calling"
  },
  "analysis_workflow": "encode:ENCPL272XAE"
}

Slots

Name Cardinality and Range Description Inheritance
analysis_external_id 0..1
Curie
External, globally unique identifier for the experiment. direct
analysis_id 1
Curie
Internal identifier for the experiment (unique within the metadata deposit). direct
analysis_label 1
String
A human-readable description of the analysis, short enough to be used for listings within software user interfaces, tables, illustration legends, etc. direct
analysis_description 0..1
String
Human-readable description of the analysis. direct
analysis_study_ref 0..1
Curie
Internal reference to the study within which the analysis has been carried out. direct
analysis_input_sources 1..*
InputSource
External or internal references to sources for the input data analyzed. Internal references should lead to FileCollection, File, Experiment, or Analysis objects. direct
analysis_type 1
Term
The type of analysis carried out. direct
analysis_main_tool 0..1
Any or 
String or 
Curie
Main software tool used for the analysis. direct
analysis_main_tool_version 0..1
String
Version of the main software tool used for the analysis. direct
analysis_protocol 0..1
Uriorcurie
Document describing the analysis protocol that was followed. direct
analysis_workflow 0..1
Uriorcurie
External reference to the analysis workflow, with availability in at least one machine-operable form (e.g. CWL, Nextflow, ...). direct

Usages

used by used in type used
Bundle analyses range Analysis

Identifier and Mapping Information

Schema Source

  • from schema: https://w3id.org/fga-wg/schema/bundle

Mappings

Mapping Type Mapped Value
self https://w3id.org/fga-wg/schema/bundle/Analysis
native https://w3id.org/fga-wg/schema/bundle/Analysis

LinkML Source

Direct

name: Analysis
description: Represents the computational processing applied to data from a sequencing
  experiment, or from another analysis. This can be described at the level of individual
  analysis steps in a workflow/pipeline, or more generally for the workflow/pipeline
  as a whole.
from_schema: https://w3id.org/fga-wg/schema/bundle
slots:
- analysis_external_id
- analysis_id
- analysis_label
- analysis_description
- analysis_study_ref
- analysis_input_sources
- analysis_type
- analysis_main_tool
- analysis_main_tool_version
- analysis_protocol
- analysis_workflow

Induced

name: Analysis
description: Represents the computational processing applied to data from a sequencing
  experiment, or from another analysis. This can be described at the level of individual
  analysis steps in a workflow/pipeline, or more generally for the workflow/pipeline
  as a whole.
from_schema: https://w3id.org/fga-wg/schema/bundle
attributes:
  analysis_external_id:
    name: analysis_external_id
    description: External, globally unique identifier for the experiment.
    examples:
    - value: encode:ENCAN718KHT
    from_schema: https://w3id.org/fga-wg/schema/bundle
    rank: 1000
    owner: Analysis
    domain_of:
    - Analysis
    range: curie
  analysis_id:
    name: analysis_id
    description: 'Internal identifier for the experiment (unique within the metadata
      deposit). '
    examples:
    - value: analysis:ENCAN718KHT
    from_schema: https://w3id.org/fga-wg/schema/bundle
    rank: 1000
    identifier: true
    owner: Analysis
    domain_of:
    - Analysis
    range: curie
    required: true
  analysis_label:
    name: analysis_label
    description: A human-readable description of the analysis, short enough to be
      used for listings within software user interfaces, tables, illustration legends,
      etc.
    examples:
    - value: ENCODE3 ChIP-seq pipeline, GRCH38, replicated peak calling
    from_schema: https://w3id.org/fga-wg/schema/bundle
    rank: 1000
    owner: Analysis
    domain_of:
    - Analysis
    range: string
    required: true
    pattern: ^.{1,60}$
  analysis_description:
    name: analysis_description
    description: Human-readable description of the analysis.
    examples:
    - value: ENCODE3 ChIP-seq pipeline on GRCH38 with replicated peak calling using
        MACS.
    from_schema: https://w3id.org/fga-wg/schema/bundle
    rank: 1000
    owner: Analysis
    domain_of:
    - Analysis
    range: string
  analysis_study_ref:
    name: analysis_study_ref
    description: Internal reference to the study within which the analysis has been
      carried out.
    examples:
    - value: study:S-EPMC7391744
    from_schema: https://w3id.org/fga-wg/schema/bundle
    rank: 1000
    owner: Analysis
    domain_of:
    - Analysis
    range: curie
  analysis_input_sources:
    name: analysis_input_sources
    description: External or internal references to sources for the input data analyzed.
      Internal references should lead to FileCollection, File, Experiment, or Analysis
      objects.
    examples:
    - object:
        inputsource_ref: experiment:ENCSR000DPJ
        qualified_relation: prov:wasInformedBy
        biological_replicate_labels:
        - '1'
        - '2'
        technical_replicate_labels:
        - '1_1'
        - '2_1'
    - object:
        inputsource_external_ref: https://www.encodeproject.org/files/GRCh38_no_alt_analysis_set_GCA_000001405.15
        qualified_relation: https://bioschemas.org/FormalParameter
        biological_replicate_labels:
        - '1'
        - '2'
        technical_replicate_labels:
        - '1_1'
        - '2_1'
        date_of_retrieval: '2016-04-19'
    from_schema: https://w3id.org/fga-wg/schema/bundle
    rank: 1000
    owner: Analysis
    domain_of:
    - Analysis
    range: InputSource
    required: true
    multivalued: true
  analysis_type:
    name: analysis_type
    description: The type of analysis carried out.
    examples:
    - object:
        id: edam:operation_3222
        label: Peak calling
    from_schema: https://w3id.org/fga-wg/schema/bundle
    rank: 1000
    owner: Analysis
    domain_of:
    - Analysis
    range: Term
    required: true
  analysis_main_tool:
    name: analysis_main_tool
    description: Main software tool used for the analysis.
    examples:
    - value: biotools:macs
    from_schema: https://w3id.org/fga-wg/schema/bundle
    rank: 1000
    owner: Analysis
    domain_of:
    - Analysis
    range: Any
    any_of:
    - range: string
    - range: curie
  analysis_main_tool_version:
    name: analysis_main_tool_version
    description: Version of the main software tool used for the analysis.
    examples:
    - value: '2.10'
    from_schema: https://w3id.org/fga-wg/schema/bundle
    rank: 1000
    owner: Analysis
    domain_of:
    - Analysis
    range: string
  analysis_protocol:
    name: analysis_protocol
    description: Document describing the analysis protocol that was followed.
    examples:
    - value: https://www.encodeproject.org/documents/7009beb8-340b-4e71-b9db-53bb020c7fe2/@@download/attachment/ChIP-seq_pipeline_overview.pdf
    from_schema: https://w3id.org/fga-wg/schema/bundle
    rank: 1000
    owner: Analysis
    domain_of:
    - Analysis
    range: uriorcurie
  analysis_workflow:
    name: analysis_workflow
    description: External reference to the analysis workflow, with availability in
      at least one machine-operable form (e.g. CWL, Nextflow, ...).
    examples:
    - value: encode:ENCPL272XAE
    from_schema: https://w3id.org/fga-wg/schema/bundle
    rank: 1000
    owner: Analysis
    domain_of:
    - Analysis
    range: uriorcurie