A | B | C | |
Parameter | Tool | Description | |
Sample | List of Pairs | Sample name from list of pairs | |
0xGenomeCov | Bowtie2, samtools, ivar_trim | Percentage of nucleotides that do not cover the genome at all (zero times) | |
<10xGenomeCov | Bowtie2, samtools, ivar_trim | Percentage of nucleotides that barely cover the genome (less than 10 times) | |
nReads | Bowtie2 | Total number of reads | |
avgLen | Bowtie2, samtools | Average read length | |
avgLenPassFilt | Bowtie2, samtools, ivar_trim | Average read length after iVar_trim filtering* | |
avgQual | Bowtie2, samtools | Average read quality | |
avgQualPassFilt | Bowtie2, samtools, ivar_trim | Average read length after iVar_trim filtering* | |
avgCovPassQual | Bowtie2, samtools, ivar_trim | Average number and percentage of nts from sequence reads that map to the genome | |
readsAlign | Bowtie2, samtools | Number and percentage of reads that aligned to the reference sequence. | |
readsAlignPassFilt | Bowtie2, samtools, ivar_trim | Number and percentage of reads that aligned to the reference sequence after iVar_trim filtering*. | |
SNR | Bowtie2, ivar_trim, Python3 and Pandas | SNR is "Signal to Noise Ratio". When the sequence dataset contains one paired sequence set containing "negativeControl" in its name, SNR will be calculated. Otherwise, the SNR metric will return "NA". The SNR calculation is as follows, for each sequence file pair: readsAlignPassFilt / readsAlignPassFilt_negativeContol | |
humanReads | Kraken2 | Number and percentage of reads classified as Homo sapiens | |
SARS-CoV-2Reads | Kraken2 | Number and percentage of reads classified as SARS - CoV - 2 | |
syntheticSeqsReads | Kraken2 | Number and percentage of reads classified as non - biological sequences | |
quality_control_method_name | SSQuAWK | Name of the method or pipline used to evaluate sequence quality | |
quality_control_method_version | 4.0.2 | Version number of the quality control pipeline or method used | |
quality_control_determination | Result of the quality control accessment. Blank if pass/fail thresholds have not been estabolished or "sequence flagged for potential quality control issues" if relevant. | ||
quality_control_issues | More information for sequences that have a QC flag issue |
A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | Q | R | S | |
Sample | 0xGenomeCov | <10xGenomeCov | nReads | avgLen | avgLenPassFilt | avgQual | avgQualPassFilt | avgCovPassQual | readsAlign | readsAlignPassFilt | SNR | humanReads | SARS-CoV-2Reads | syntheticSeqsReads | quality_control_method_name | quality_control_method_version | quality_control_determination | quality_control_determination | |
SRR16828363.fastq.gz | 107nt (0%) | 138nt (0%) | 632664 | 151 | 151 | 37.8 | 37.9 | 688X | 138637 (21%) | 136327 (21%) | NA | 1517 reads (0.48%) | 71091 reads (22.47%) | 224206 reads (70.88%) | SSQuAWK | 4.0.2 | |||
SRR16828364.fastq.gz | 76nt (0%) | 107nt (0%) | 458116 | 151 | 151 | 37.8 | 37.9 | 890X | 179913 (39%) | 176348 (38%) | NA | 863 reads (0.38%) | 90751 reads (39.62%) | 47920 reads (20.92%) | SSQuAWK | 4.0.2 | |||
SRR16828365.fastq.gz | 76nt (0%) | 153nt (0%) | 351980 | 151 | 151 | 37.8 | 37.9 | 272X | 54928 (15%) | 53958 (15%) | NA | 874 reads (0.50%) | 27782 reads (15.79%) | 52862 reads (30.04%) | SSQuAWK | 4.0.2 |
A | B | C | D | E | F | G | |
QC bin | Subjective definition | % Genome uncovered (10X) | Average coverage | Other observations | SRA submission | FDA CFSAN Dashboard | |
A | No QC issues evident | ~5% | ~1000X | Majority of reads are SARS-CoV-2 | “quality_control_determination” | Included | |
B | Minor QC issues | 6% - 40% | ~100X | “quality_control_determination” | Included | ||
C | Insufficient coverage | 40% - 95% | < 100X | Insufficient data mapped for confidence | “quality_control_determination” | Excluded | |
F | Significant QC and/or study design issues | >95% | < 10X | Suspected contamination (SNR low), | Do not submit | N/A |