Nov 20, 2024

Public workspacePopulating metadata templates for NCBI submissions using PulseNet 2.0 V.3

  • 1US Food and Drug Administration;
  • 2US FDA-CFSAN
  • GenomeTrakr
    Tech. support email: genomeTrakr@fda.hhs.gov
Icon indicating open access to content
QR code linking to this content
Protocol CitationRuth Timme, Maria Balkey, Tina Pfefer, Candace Hope Bias 2024. Populating metadata templates for NCBI submissions using PulseNet 2.0. protocols.io https://dx.doi.org/10.17504/protocols.io.3byl4qn4ovo5/v3Version created by Maria Balkey
License: This is an open access protocol distributed under the terms of the Creative Commons Attribution License,  which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Protocol status: Working
We use this protocol and it's working
Created: February 09, 2024
Last Modified: November 20, 2024
Protocol Integer ID: 109475
Keywords: NCBI submission, biosample, SRA, metadata, bioproject, PulseNet, PulseNet 2.0
Abstract
PURPOSE: to define the standard operating procedure for collecting isolate metadata using PulseNet 2.0 for submission of food/environmental isolates to NCBI.

SCOPE: to provide a standardized procedure to collect isolate metadata using PulseNet 2.0 for submission of food/environmental isolates to NCBI.

RESPONSIBILITIES- SOP Responsible Officials: Ruth Timme, Maria Balkey

The GenomeTrakr Network Management will be responsible to monitor GenomeTrakr submissions processed through PulseNet 2.0 and ensure that all GT labs are familiar with the mandatory metadata fields required for submission of GenomeTrakr sequencing records to NCBI.

V3: Added dropdown menus from controlled vocabulary to sequenced by and project name to metadata template PulseNet_Bionumerics_Isolate_Metadata
V4: Changes in metadata template PulseNet_Bionumerics_Isolate_Metadata.
- Added dropdown menus from controlled vocabulary to collected_by , SourceCountryState
- Added fields: collected by, isolation source
- Added mapping table of attribute names.
- Remove requirement to send biosample update to NCBI to make changes on sequenced by and project name.
V5: Changes implemented to handle NCBI submissions for GenomeTrakr through the PulseNet 2.0 platform.
V6: New version of spreadsheet with updated dropdown menu. Add values to NCBI Templates


Safety warnings
We will be updating this protocol with each new PulseNet 2.0 upgrade that impacts NCBI submissions
Data Ownership
Laboratory scientists that use PN2.0 to submit GenomeTrakr data to NCBI shall be included in the PulseNet NCBI submission group.

Please reach out to PulseNet to ensure your name is included in this group.

Membership in the PulseNet submission group will grant you the correct privileges for viewing and managing your GenomeTrakr submissions within the NCBI submission portal. If you login to NCBI and *don't* see your GenomeTrakr submissions, you likely not a member.
Metadata SampleSheet preparation

Populate the metadata spreadsheet form before uploading your sequencing run or linking NCBI sequencing records within PulseNet 2.0.

Please download the following metadata template, Guidance is included in the first tab.
Download OneHealthEntericMetadataPN.xlsxOneHealthEntericMetadataPN.xlsx153KB

Once you have filled out the template, import metadata according PulseNet Instructions
Note
In order to access PulseNet Instructions, make sure you have successfully logged in at the SAMS Partner Portal.
Creation of NCBI Submission Templates

Before submitting data to NCBI, set up sample specific submission templates in PulseNet 2.0 according to PulseNet Instructions, following GenomeTrakr mapping guidelines included in this protocol.

Map each field name to the corresponding field value as described in steps
Go togo to step #3.1 for Food isolates
Go togo to step #3.2 for Food Facility inspection isolates
Go togo to step #3.3 for Animal isolates
Go togo to step #3.4 for Farm or other environmental isolates.

Make sure to add the the GenomeTrakr-required fields to each template depending on the source type designation, using the Add Field functionality in the Pipeline Template section.

The creation of NCBI Submission Templates in PulseNet 2.0 will only be done once, then the saved templates can be used for all future submissions.
NCBI Template for Food Isolates
If the Strain name is mapped to the WGS_id, ensure the Submitter Provided Unique ID is mapped to the key.
If the Strain name is mapped to the key, ensure the Submitter Provided Unique ID is mapped to the WGS_id.
ABC
Field NameField TypeField Value
Strain nameMappingWGS_id OR Key
Sample nameMappingKey
Submitter Provided Unique IDMappingWGS_id OR Key
SPUID namespaceValueGenomeTrakr
BioProject accessionMappingBioProject
AuthorMappingSequenced_by
SerovarMappingSerotype_wgs
SourceTypeMappingSourceType
Isolate Name Alias if SourceType equals HumanValuemissing
Isolate Name Alias if SourceType does not equal HumanMappingKey
Isolation source if SourceType does not equal HumanMappingSourceSite, TypeDetails
Isolation source if SourceType equals HumanValuemissing
Geographical origin if SourceType equals HumanValuemissing
Geographical origin if SourceType does not equal HumanMappingGeo_loc_name
Organism nameMappingGenus, Species
Collection / isolation date if SourceType equals HumanValuemissing
Collection / isolation date if SourceType does not equal HumanMappingIsolatDate YYYY-mm-DD
Collected by if SourceType equals HumanValuemissing
Collected by if SourceType does not equal HumanMappingCollected_By
Instrument modelMappingInstrument_Model
Library strategyValueWGS
Library sourceValueGENOMIC
Library selectionValueRANDOM
Library layoutValuePAIRED
Library nameMappingLibrary
Project nameMappingProject
Sequenced by if SourceType equals HumanValuemissing
Sequenced by if SourceType does not equal HumanMappingSequenced_by
ADD FIELD…
intended consumerMappingIntended_consumer
food product origin geographic locationMappingFood_origin
food processing methodMappingFood_processing_method
purpose of samplingMappingPurpose_of_sampling
Save submission template according to PulseNet Instructions as -GenomeTrakr-FoodGroup-.

NCBI Template for Food Facility Environment Isolates

If the Strain name is mapped to the WGS_id, ensure the Submitter Provided Unique ID is mapped to the key.
If the Strain name is mapped to the key, ensure the Submitter Provided Unique ID is mapped to the WGS_id.

ABC
Field NameField TypeField Value
Strain nameMappingWGS_id OR Key
Sample nameMappingKey
Submitter Provided Unique IDMappingWGS_id OR Key
SPUID namespaceValueGenomeTrakr
BioProject accessionMappingBioProject
AuthorMappingSequenced_by
SerovarMappingSerotype_wgs
SourceTypeMappingSourceType
Isolate Name Alias if SourceType equals HumanValuemissing
Isolate Name Alias if SourceType does not equal HumanMappingKey
Isolation source if SourceType does not equal HumanMappingSourceSite, TypeDetails
Isolation source if SourceType equals HumanValuemissing
Geographical origin if SourceType equals HumanValuemissing
Geographical origin if SourceType does not equal HumanMappingGeo_loc_name
Organism nameMappingGenus, Species
Collection / isolation date if SourceType equals HumanValuemissing
Collection / isolation date if SourceType does not equal HumanMappingIsolatDate YYYY-mm-DD
Collected by if SourceType equals HumanValuemissing
Collected by if SourceType does not equal HumanMappingCollected_By
Instrument modelMappingInstrument_Model
Library strategyValueWGS
Library sourceValueGENOMIC
Library selectionValueRANDOM
Library layoutValuePAIRED
Library nameMappingLibrary
Project nameMappingProject
Sequenced by if SourceType equals HumanValuemissing
Sequenced by if SourceType does not equal HumanMappingSequenced_by
ADD FIELD…
facility typeMappingFacility_type
food type processed in facilityMappingFood_type_processed
purpose of samplingMappingPurpose_of_sampling
Save submission template according to PulseNet Instructions as
-GenomeTrakr-FoodFacilityEnvironment-.
NCBI Template for Animal Isolates

If the Strain name is mapped to the WGS_id, ensure the Submitter Provided Unique ID is mapped to the key.
If the Strain name is mapped to the key, ensure the Submitter Provided Unique ID is mapped to the WGS_id.
ABC
Field NameField TypeField Value
Strain nameMappingWGS_id OR Key
Sample nameMappingKey
Submitter Provided Unique IDMappingWGS_id OR Key
SPUID namespaceValueGenomeTrakr
BioProject accessionMappingBioProject
AuthorMappingSequenced_by
SerovarMappingSerotype_wgs
SourceTypeMappingSourceType
Isolate Name Alias if SourceType equals HumanValuemissing
Isolate Name Alias if SourceType does not equal HumanMappingKey
Isolation source if SourceType does not equal HumanMappingSourceSite, TypeDetails
Isolation source if SourceType equals HumanValuemissing
Geographical origin if SourceType equals HumanValuemissing
Geographical origin if SourceType does not equal HumanMappingGeo_loc_name
Organism nameMappingGenus, Species
Collection / isolation date if SourceType equals HumanValuemissing
Collection / isolation date if SourceType does not equal HumanMappingIsolatDate YYYY-mm-DD
Collected by if SourceType equals HumanValuemissing
Collected by if SourceType does not equal HumanMappingCollected_By
Instrument modelMappingInstrument_Model
Library strategyValueWGS
Library sourceValueGENOMIC
Library selectionValueRANDOM
Library layoutValuePAIRED
Library nameMappingLibrary
Project nameMappingProject
Sequenced by if SourceType equals HumanValuemissing
Sequenced by if SourceType does not equal HumanMappingSequenced_by
ADD FIELD…
hostMappingHost
animal environmentMappingAnimal_env
purpose of samplingMappingPurpose_of_sampling
Save submission template according to PulseNet Instructions as -GenomeTrakr-AnimalHost-.
NCBI Template for Farm Environment Isolates

If the Strain name is mapped to the WGS_id, ensure the Submitter Provided Unique ID is mapped to the key.
If the Strain name is mapped to the key, ensure the Submitter Provided Unique ID is mapped to the WGS_id.
ABC
Field NameField TypeField Value
Strain nameMappingWGS_id OR Key
Sample nameMappingKey
Submitter Provided Unique IDMappingWGS_id OR Key
SPUID namespaceValueGenomeTrakr
BioProject accessionMappingBioProject
AuthorMappingSequenced_by
SerovarMappingSerotype_wgs
SourceTypeMappingSourceType
Isolate Name Alias if SourceType equals HumanValuemissing
Isolate Name Alias if SourceType does not equal HumanMappingKey
Isolation source if SourceType does not equal HumanMappingSourceSite, TypeDetails
Isolation source if SourceType equals HumanValuemissing
Geographical origin if SourceType equals HumanValuemissing
Geographical origin if SourceType does not equal HumanMappingGeo_loc_name
Organism nameMappingGenus, Species
Collection / isolation date if SourceType equals HumanValuemissing
Collection / isolation date if SourceType does not equal HumanMappingIsolatDate YYYY-mm-DD
Collected by if SourceType equals HumanValuemissing
Collected by if SourceType does not equal HumanMappingCollected_By
Instrument modelMappingInstrument_Model
Library strategyValueWGS
Library sourceValueGENOMIC
Library selectionValueRANDOM
Library layoutValuePAIRED
Library nameMappingLibrary
Project nameMappingProject
Sequenced by if SourceType equals HumanValuemissing
Sequenced by if SourceType does not equal HumanMappingSequenced_by
ADD FIELD…
local-scale environmental contextMappingEnv_local_scale
environmental mediumMappingEnv_medium
purpose of samplingMappingPurpose_of_sampling
Save submission template according to PulseNet Instructions as -GenomeTrakr-FarmEnvironment-.
NCBI Submissions

Submit data to NCBI according to PulseNet Instructions, if you have several submissions from different sample types, make sure to group records accordingly and select the appropriate custom NCBI GenomeTrakr-labeled Template. If NCBI accessions are not available at PulseNet 2.0 in 1 business day, please contact NCBI (pd-help@ncbi.nlm.nih.gov) and PulseNet to troubleshoot issues with submissions.

Contact GenomeTrakr by email genometrakr@fda.hhs.gov if issues with submissions are delayed for more than 3 days. GenomeTrakr can support urgent submissions if needed.