CORRAL: detecting microbial eukaryotes in metagenomic data

  • Before starting
  • Align raw reads to the EukDetect reference using Bowtie2
  • Identify eukaryotes based on Markov clustering

Before starting

Align raw reads to the EukDetect reference using Bowtie2

# paired end data for multiple samples
for READ1 in *R1_001.fastq.gz
do
		READ2=${READ1//1_001.fastq.gz/2_001.fastq.gz}
		SAMPLE=${READ1//_R1_001.fastq.gz}
    bowtie2 --omit-sec-seq --no-discordant --no-unal -x /data/reference_db/eukdetect/ncbi_eukprot_met_arch_markers.fna -k 10,10 -p 104 -1 $READ1 -2 $READ2 -S ${SAMPLE}_.sam
		echo "done aligning reads from $SAMPLE to the eukdetect reference!"
done

Identify eukaryotes based on Markov clustering