Automation Proposal for the Intermediate Steps in the 16S FFPE Samples Analysis Pipeline

  1. Elsa Martin De Arribas
  2. Kelly Conde Pérez
  3. Pablo Aja Macaya
  4. Juan A. Vallejo
  5. Margarita Poza 1
  6. Susana Ladra 1
  1. 1 Universidade da Coruña
    info

    Universidade da Coruña

    La Coruña, España

    ROR https://ror.org/01qckj285

Libro:
VI Congreso XoveTIC: impulsando el talento científico
  1. Manuel Lagos Rodríguez (ed. lit.)
  2. Álvaro Leitao Rodríguez (ed. lit.)
  3. Tirso Varela Rodeiro (ed. lit.)
  4. Javier Pereira Loureiro (coord.)
  5. Manuel Francisco González Penedo (coord.)

Editorial: Servizo de Publicacións ; Universidade da Coruña

Año de publicación: 2023

Congreso: XoveTIC (6. 2023. A Coruña)

Tipo: Aportación congreso

Resumen

In the day-to-day work of bioinformatics, the use of integrated software packages, which encompass a wide range of tools, enables the development of pipelines for omics data analysis. Within the various existing pipelines, we focus on the analysis of the 16S rRNA gene as it allows for the study of diversity and taxonomy of prokaryotic microorganisms such as Bacteria and Archaea. However, these pipelines often involve a sequence of multiple tools that require intermediate steps before further processing can proceed, as in the case between Cutadapt and DADA2. In fact, in a typical pipeline, the values for DADA2 input arguments ’trunc-len-f’ and ’trunc-len-r’ are extracted from the output of Cutadapt. The best approach for selecting optimal values (aka the trimming positions) is graphically visualizing Cutadapt output and manually selecting the most accurate trimming position length. Therefore, we propose the automation of this specific intermediate step between Cutadapt and DADA2 tools, by selecting values displayed in the graphs that meet the filtering criteria. This automation has been incorporated into a custom pipeline for the analysis of the microbiome in 16S paired-end samples from colorectal cancer patients, and could potentially serve as a standardization approach in these processes