COVID-19: Alternate pre processing

Annotation: 1a/6: This is the alternate read preprocessing workflow for use with the shared COVID-19: Raw Data starting history. (Which can be imported via "Shared Data" -> "Histories" in the menu)

StepAnnotation
Step 1: Input dataset collection
select at runtime
Step 2: Input dataset collection
select at runtime
Step 3: fastp
Paired Collection
Output dataset 'output' from step 1
Adapter Trimming Options:
False
Empty.
Empty.
Global trimming options:
Not available.
Not available.
Not available.
Not available.
Overrepresented Sequence Analysis:
False
Not available.
Filter Options:
Quality filtering options:
False
20
20
Not available.
Length filtering options:
False
50
Low complexity filtering options:
False
Not available.
Read Modification Options:
Automatic trimming for Illumina NextSeq/NovaSeq data
Not available.
Disable polyX trimming
UMI processing:
False
Empty.
Not available.
Empty.
Per read cutting by quality options:
False
False
Not available.
Not available.
Base correction by overlap analysis options:
False
Output Options:
True
True
Step 4: NanoPlot
batch
fastq
Output dataset 'output' from step 2
Options for filtering or transforming input prior to plotting:
Not available.
Not available.
False
Not available.
True
False
False
Not available.
Nothing selected.
False
Options for customizing the plots created:
Nothing selected.
png
Nothing selected.
False
False
Step 5: FastQC
Output dataset 'output' from step 2
select at runtime
select at runtime
select at runtime
False
Not available.
7
Step 6: Map with minimap2
Use a built-in genome index
hg38
Indexing options:
False
Not available.
Not available.
Not available.
Single
Output dataset 'output' from step 2
Oxford Nanopore read to reference mapping. Slightly more sensitive for Oxford Nanopore to reference mapping (-k15). For PacBio reads, HPC minimizers consistently leads to faster performance and more sensitive results in comparison to normal minimizers. For Oxford Nanopore data, normal minimizers are better, though not much. The effectiveness of HPC is determined by the sequencing error mode.
Set advanced mapping options:
Not available.
Not available.
Not available.
Not available.
Not available.
Not available.
Not available.
Not available.
False
Not available.
Not available.
Set advanced alignment options:
Not available.
Not available.
Not available.
Not available.
Not available.
Not available.
Not available.
Not available.
Nothing selected.
Set advanced output options:
BAM
False
False
Not available.
Nothing selected.
False
False
Step 7: MultiQC
Results
Results 1
fastp
Output dataset 'report_json' from step 3
Empty.
Empty.
False
Step 8: Map with BWA-MEM
Use a built-in genome index
hg38
Paired Collection
Output dataset 'output_paired_coll' from step 3
Empty.
Do not set
1.Simple Illumina mode
Step 9: MultiQC
Results
Results 1
FastQC
FastQC outputs
FastQC output 1
Raw data
Output dataset 'text_file' from step 5
Empty.
Empty.
False
Step 10: Filter SAM or BAM, output SAM or BAM
Output dataset 'alignment_output' from step 6
Include header
Not available.
yes
The read is unmapped
Nothing selected.
Empty.
Empty.
select at runtime
False
Empty.
BAM (-b)
Step 11: Filter SAM or BAM, output SAM or BAM
Output dataset 'bam_output' from step 8
Include header
Not available.
yes
The read is unmapped The mate is unmapped
Nothing selected.
Empty.
Empty.
select at runtime
False
Empty.
BAM (-b)
Step 12: MergeSamFiles
Output dataset 'output1' from step 10
False
False
Comments
Lenient
Step 13: MergeSamFiles
Output dataset 'output1' from step 11
False
False
Comments
Lenient
Step 14: Samtools fastx
Output dataset 'outFile' from step 12
FASTQ
Not available.
False
False
unspecific
False
Empty.
no
Nothing selected.
Nothing selected.
Nothing selected.
No
Step 15: Samtools fastx
Output dataset 'outFile' from step 13
FASTQ
Not available.
False
False
READ1 READ2
False
Empty.
no
Nothing selected.
Nothing selected.
Nothing selected.
No