1PP_trimming

Introduction

The trimming step removes low-quality nucleotides from the reads produced by the sequencer. In Cohesive, the 1PP_trimming analysis runs trimmomatic and fastqc. fastqc reports quality metrics for the reads, both the raw reads (before trimming) and the resulting trimmed reads.
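As a rough illustration of the two tools involved, the sketch below builds (without running) a paired-end Trimmomatic command line and a FastQC command line. The trimming steps `SLIDINGWINDOW:4:20` and `MINLEN:36`, the file names, and the presence of a `trimmomatic` wrapper executable on the PATH are assumptions made for the example; the parameters Cohesive actually uses are not stated on this page.

```python
import shlex
from typing import List, Sequence


def trimmomatic_pe_command(
    r1: str,
    r2: str,
    out_prefix: str,
    steps: Sequence[str] = ("SLIDINGWINDOW:4:20", "MINLEN:36"),  # assumed values
) -> List[str]:
    """Build a paired-end Trimmomatic command as an argument list.

    Trimmomatic PE takes two inputs and four outputs, in this order:
    R1 paired, R1 unpaired, R2 paired, R2 unpaired.
    """
    return [
        "trimmomatic", "PE",  # assumes a wrapper script named 'trimmomatic'
        r1, r2,
        f"{out_prefix}_R1_paired.fastq", f"{out_prefix}_R1_unpaired.fastq",
        f"{out_prefix}_R2_paired.fastq", f"{out_prefix}_R2_unpaired.fastq",
        *steps,
    ]


def fastqc_command(fastq_files: Sequence[str], outdir: str) -> List[str]:
    """Build a FastQC command that writes its reports into outdir."""
    return ["fastqc", "--outdir", outdir, *fastq_files]


if __name__ == "__main__":
    # Hypothetical file names, for illustration only.
    cmd = trimmomatic_pe_command("sample_R1.fastq", "sample_R2.fastq", "sample")
    print(shlex.join(cmd))
    print(shlex.join(fastqc_command(["sample_R1_paired.fastq"], "qc")))
```

Trimmomatic's paired-end mode writes one unpaired file per mate; how Cohesive produces its single unpaired output file is not described here, so the sketch leaves the two files separate.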

[Figure: UML diagram]

Run Analysis 1PP_trimming

Once the 1PP_trimming analysis has been selected from the run analyses interface, the user can choose which bioinformatics tool to use. The tool available for 1PP_trimming is:

  • trimmomatic - Read trimming tool for Illumina NGS data

Input is selected in the wizard's last section: "step_0SQ_rawreads__fastq" is for internal fastq files from sequencers (code 20XX.TE.XXXX.X.X), while "step_0SQ_rawreads__external" is for imported fastq files (code 20XX.EXT.XXXX.X.X). The input selection UI also offers an advanced input selection mode, which allows all supported input file types to be selected at once.
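The mapping between sample codes and input steps can be sketched as a small lookup. This is only an illustration: it assumes that each X in the code patterns stands for one or more digits, which this page does not confirm, and the function name `input_step_for` is hypothetical rather than part of Cohesive.

```python
import re

# Assumption: every "X" in 20XX.TE.XXXX.X.X / 20XX.EXT.XXXX.X.X is numeric.
_INTERNAL = re.compile(r"^20\d{2}\.TE\.\d+\.\d+\.\d+$")
_EXTERNAL = re.compile(r"^20\d{2}\.EXT\.\d+\.\d+\.\d+$")


def input_step_for(code: str) -> str:
    """Return the input step name that matches a sample code."""
    if _INTERNAL.match(code):
        return "step_0SQ_rawreads__fastq"     # internal fastq from sequencers
    if _EXTERNAL.match(code):
        return "step_0SQ_rawreads__external"  # imported fastq
    raise ValueError(f"unrecognised sample code: {code!r}")


if __name__ == "__main__":
    # Hypothetical codes, for illustration only.
    print(input_step_for("2023.TE.1234.1.1"))
    print(input_step_for("2023.EXT.1234.1.1"))
```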

After the requested analysis is launched, a link to Check analysis is created. The system notifies the user when the analysis has been launched successfully and again once execution has ended.

Output directory

Please refer to Cohesive's specific Wiki page for information on file download.

The output directory is available at the link in the download page or at the link present in the analysis summary card, and has the following structure: results > YEAR > ID > 1PP_trimming > DSXXXXXXXX-DTXXXXXX_trimmomatic. That path contains 3 directories:

  • meta: ("metadata") contains log and configuration files.
  • qc: ("quality check") contains 2 directories (meta and result). For this analysis, the quality check is performed with fastqc.
  • result: contains the analysis' output files.
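The layout described above can be expressed as a short path-building sketch. The helper names `output_dir` and `subdirs` are hypothetical, and the values passed in (year, ID, and the DSXXXXXXXX-DTXXXXXX prefix) are placeholders to be replaced with those of an actual run.

```python
from pathlib import PurePosixPath
from typing import Dict


def output_dir(year: str, sample_id: str, run_prefix: str) -> PurePosixPath:
    """results > YEAR > ID > 1PP_trimming > <run_prefix>_trimmomatic"""
    return (
        PurePosixPath("results")
        / year
        / sample_id
        / "1PP_trimming"
        / f"{run_prefix}_trimmomatic"
    )


def subdirs(base: PurePosixPath) -> Dict[str, PurePosixPath]:
    """The directories found under the output directory."""
    return {
        "meta": base / "meta",                # log and configuration files
        "qc_meta": base / "qc" / "meta",      # fastqc metadata
        "qc_result": base / "qc" / "result",  # fastqc reports
        "result": base / "result",            # trimmed reads
    }


if __name__ == "__main__":
    base = output_dir("YEAR", "ID", "DSXXXXXXXX-DTXXXXXX")
    for name, path in subdirs(base).items():
        print(f"{name}: {path}")
```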

The table below lists files available in the output directory structure, alongside some useful information.

| File | Description | Location |
|---|---|---|
| DSXXXXXXXX-DTXXXXXX_ID_R1_trimmomatic.fastq | trimmed read 1 (R1) | result directory |
| DSXXXXXXXX-DTXXXXXX_ID_R2_trimmomatic.fastq | trimmed read 2 (R2) | result directory |
| DSXXXXXXXX-DTXXXXXX_ID_unpaired_trimmomatic.fastq | trimmed unpaired reads | result directory |
| DSXXXXXXXX-DTXXXXXX_ID_R1_trimmomatic_fastqc.html | R1 reads quality | qc directory > result |
| DSXXXXXXXX-DTXXXXXX_ID_R1_trimmomatic_fastqc.zip | R1 reads quality (zip file) | qc directory > result |
| DSXXXXXXXX-DTXXXXXX_ID_R2_trimmomatic_fastqc.html | R2 reads quality | qc directory > result |
| DSXXXXXXXX-DTXXXXXX_ID_R2_trimmomatic_fastqc.zip | R2 reads quality (zip file) | qc directory > result |
| DSXXXXXXXX-DTXXXXXX_ID_unpaired_trimmomatic_fastqc.html | unpaired reads quality | qc directory > result |
| DSXXXXXXXX-DTXXXXXX_ID_unpaired_trimmomatic_fastqc.zip | unpaired reads quality (zip file) | qc directory > result |
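The naming scheme in the table is regular enough to generate programmatically, which can be handy when checking that a download is complete. The sketch below is a minimal example of that idea; `expected_files` and `missing_files` are hypothetical helpers, not part of Cohesive, and the prefix and ID must be replaced with the values of a real run.

```python
from pathlib import Path
from typing import List


def expected_files(run_prefix: str, sample_id: str) -> List[str]:
    """List the nine output files, relative to the output directory."""
    paths = []
    for label in ("R1", "R2", "unpaired"):
        base = f"{run_prefix}_{sample_id}_{label}_trimmomatic"
        paths.append(f"result/{base}.fastq")            # trimmed reads
        paths.append(f"qc/result/{base}_fastqc.html")   # fastqc report
        paths.append(f"qc/result/{base}_fastqc.zip")    # fastqc archive
    return paths


def missing_files(output_dir: Path, run_prefix: str, sample_id: str) -> List[str]:
    """Return the expected files that are not present under output_dir."""
    return [
        rel for rel in expected_files(run_prefix, sample_id)
        if not (output_dir / rel).is_file()
    ]


if __name__ == "__main__":
    for rel in expected_files("DSXXXXXXXX-DTXXXXXX", "ID"):
        print(rel)
```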