| Field Name at BioNumerics NCBI Submission Prompt | Field Name at
NCBI | Field Name in
BioNumerics Submission Metadata Template | Description |
| BioProject accession | BioProject | NCBI bioproject | The accession number of the BioProject(s) to which the BioSample belongs (PRJNAxxxxxx). **Double check that you are submitting to the correct BioProject (the organism name must match the one designated for your BioProject). For species that fall outside of NCBI pathogen detection, we recommend establishing a separate multi-species "research" bioproject for publishing data outside of the structured Pathogen Detection surveillance effort. |
| Attribute package | attribute_package | attribute_package | This field provides the pathogen type (or “isolation type”). Allowed values are
“Pathogen.cl” (for human clinical pathogens) or “Pathogen.env” (for
environmental, food, or animal clinical isolates). The value provided in this
field drives validation of other fields and cannot be left blank. |
| Strain name | strain | Key | This
is the authoritative ID used for foodborne pathogen genomic epidemiology and
within NCBI Pathogen Detection. Although the strain ID can have any format,
we suggest that it be unique, concise, and consistent within your laboratory
(e.g. CFSAN123456). |
| Serovar | serovar | Serovar | The
organism serovar/serotype name should include the most descriptive
information you have at time of submission, adhering to proper nomenclature
in NCBI taxonomy database:
https://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi. Check spelling
carefully! |
| Isolate name
alias | isolate name alias | isolate_name_alias | Other
IDs associated with this isolate. Separate with ';' if more than one |
| Project name | project name | ProjectName | Name
of the project within which the sequencing was organized |
| Collected by | collected by | collected by | Full
name of laboratory or agency that collected the sample or has taken over
curation of the physical isolate. The name should be written out in
full, (with minor exceptions) and be consistent across multiple submissions. Example: Washington State Department of
Health. |
| Collection /
Isolate date | collection date | IsolateDate | Date
on which the sample was collected. Populate using ISO 8601 standard:
“YYYY-mm-dd”, “YYYY-mm” or “YYYY” (e.g., 1990–10–30, 1990–10, or 1990). Including the month or month/day of
collection is extremely valuable for accessing seasonality in the database. |
| Geographical
origin | geographic location | SourceCountryState | Populate
the geographic origin of the food product. Include the country name if
imported, or the "Country: state/territory/province" if domestic.
Include multiple locations if necessary, delimited by semi colon. |
| Geographical
coordinates | latitute and
longitude | lat_long | The
geographical coordinates of the location where the sample was collected.
Specify as degrees latitude and longitude in format "d[d.dddd] N|S
d[dd.dddd] W|E", eg, 38.98 N 77.11 W.
If information is unavailable for any mandatory field, please enter
'not collected', 'not applicable' or
'missing' as appropriate. |
| Isolation
source | isolation source | isolation source | Free
text, short description of sample source. Avoid generic terms such as
patient, sample, food, surface, clinical, product, source, or
environment. Example: bagged romaine
lettuce. |
| Host | host | host | For
human, animal, and plant hosts, include the full taxonomic name of the host
when available, "Homo sapiens" or "Bos Taurus". Animal
livestock terms are also acceptable entries, e.g. porcine, bovine, equine,
etc. |
| Host disease | host disease | host_disease | Name
of relevant disease, e.g. Salmonella gastroenteritis. Choose an ontological
term from https://bioportal.bioontology.org/ontologies/DOID or
https://www.ncbi.nlm.nih.gov/mesh.
Attribute is mandatory for Pathogen.cl isolates (human clinical
isolates) or include "missing" if unkown. Leave blank if not
relevant. |
| Sequenced by | sequenced by | SequencedBy | The
name of the agency that generated the sequence, e.g., Centers for Disease
Control and Prevention |
| Source name/
type | source type | SourceType | Controlled
vocabulary describing the isolation_source. Choose the best fit term: Human,
Animal, Food, Environmental, Other. |
| Organism
name | Organism | organism | The
organism name should include the most descriptive information you have at
time of submission, adhering to proper nomenclature in NCBI taxonomy
database: https://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi. Check
spelling carefully! Levels of valid organism names are as follows:
Genus species:
Salmonella enterica
Listeria monocytogenes
Genus species and subspecies:
Salmonella enterica subsp. enterica
Determined serotype or serovar (trad or
WGS-based):
Escherichia coli O104:H7
Salmonella enterica subsp. enterica serovar Agnoa
Salmonella enterica subsp. diarizonae serovar 16:z10:e,n,x,z15
Listeria monocytogenes serotype 1/2a
If NCBI doesn’t have the desired organism name, enter the name determined
by your laboratory. After submission, a “taxonomy consult” will take place to
evaluate the new name. Sometimes the organism name is changed to a canonical
serovar name and the submission proceeds. It is also possible that the
serovar is a novel one not currently in the NCBI database and the Taxonomy
team will work with the submitter to get the new name added to the database. |