Degenerate adaptor sequences for detecting PCR duplicates in reduced representation sequencing data improve genotype calling accuracy.

Molecular Ecology Resources (2014-08-19)
M M Y Tin, F E Rheindt, E Cros, A S Mikheyev
ABSTRACT

RAD-tag is a powerful tool for high-throughput genotyping. It relies on PCR amplification of the starting material, following enzymatic digestion and sequencing adaptor ligation. Amplification introduces duplicate reads into the data, which arise from the same template molecule and are statistically nonindependent, potentially introducing errors into genotype calling. In shotgun sequencing data, duplicates are removed by filtering reads starting at the same position in the alignment. However, restriction enzymes target specific locations within the genome, causing reads to start in the same place, and making it difficult to estimate the extent of PCR duplication. Here, we introduce a slight change to the Illumina sequencing adaptor chemistry, appending a unique four-base tag to the first index read, which allows duplicate discrimination in aligned data. This approach was validated on the Illumina MiSeq platform, using double-digest libraries of ants (Wasmannia auropunctata) and yeast (Saccharomyces cerevisiae) with known genotypes, producing modest though statistically significant gains in the odds of calling a genotype accurately. More importantly, removing duplicates also corrected for strong sample-to-sample variability of genotype calling accuracy seen in the ant samples. For libraries prepared from low-input degraded museum bird samples (Mixornis gularis), which had low complexity, having been generated from relatively few starting molecules, adaptor tags show that virtually all of the genotypes were called with inflated confidence as a result of PCR duplicates. Quantification of library complexity by adaptor tagging does not significantly increase the difficulty of the overall workflow or its cost, but corrects for differences in quality between samples and permits analysis of low-input material.
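
The duplicate-discrimination idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the `Read` record, its fields, the locus strings, and the rule of keeping the highest-quality read are all assumptions made for illustration. The only details taken from the abstract are that reads from a restriction site start at the same position and that a four-base tag from the index read distinguishes template molecules.

```python
from typing import NamedTuple


class Read(NamedTuple):
    """Hypothetical aligned-read record (field names are illustrative)."""
    name: str
    locus: str           # assumed alignment key, e.g. "chr1:1042:+"
    tag: str             # four-base degenerate tag taken from the index read
    mean_quality: float  # assumed per-read quality summary


def mark_duplicates(reads):
    """Keep one read per (locus, tag) pair and return (kept, duplicates).

    Reads that share both the alignment start and the tag are treated as
    PCR copies of one template. Keeping the highest-quality copy is an
    assumption, not something stated in the abstract.
    """
    best = {}
    for read in reads:
        key = (read.locus, read.tag)
        if key not in best or read.mean_quality > best[key].mean_quality:
            best[key] = read
    kept = list(best.values())
    kept_names = {r.name for r in kept}
    duplicates = [r for r in reads if r.name not in kept_names]
    return kept, duplicates


# A four-base tag over {A, C, G, T} has 4**4 possible sequences.
N_TAGS = 4 ** 4

reads = [
    Read("r1", "chr1:1042:+", "ACGT", 35.0),
    Read("r2", "chr1:1042:+", "ACGT", 38.0),  # same locus and tag as r1
    Read("r3", "chr1:1042:+", "TTGA", 36.0),  # same locus, different tag
    Read("r4", "chr2:77:-", "ACGT", 30.0),    # different locus
]
kept, duplicates = mark_duplicates(reads)
print([r.name for r in kept], [r.name for r in duplicates], N_TAGS)
```

In this toy input, r1 and r2 collapse into one observation while r3 is retained as a distinct template, which position-only filtering could not do at a restriction site. With only 256 possible tags, two independent templates at the same locus can occasionally carry the same tag by chance, so a scheme like this may slightly over-collapse reads at deeply covered loci.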

MATERIALS
| Brand | Product Description |
|---|---|
| Sigma-Aldrich | Sodium chloride solution, 5 M in H2O, BioReagent, Molecular Biology, suitable for cell culture |
| Sigma-Aldrich | Sodium chloride, random crystals, 99.9% trace metals basis |
| Sigma-Aldrich | Sodium chloride-35Cl, 99 atom % 35Cl |
| Sigma-Aldrich | Sodium chloride solution, 5 M |
| Sigma-Aldrich | Sodium chloride solution, 0.85% |
| Sigma-Aldrich | Sodium chloride, tested according to Ph. Eur. |
| Sigma-Aldrich | Sodium chloride, 99.999% trace metals basis |
| Sigma-Aldrich | Sodium chloride, BioXtra, ≥99.5% (AT) |
| Sigma-Aldrich | Sodium chloride solution, BioUltra, Molecular Biology, ~5 M in H2O |
| Sigma-Aldrich | Sodium chloride, AnhydroBeads, −10 mesh, 99.999% trace metals basis |
| Sigma-Aldrich | Sodium chloride, Molecular Biology, DNase, RNase, and protease, none detected, ≥99% (titration) |
| Supelco | Sodium chloride, Pharmaceutical Secondary Standard; Certified Reference Material |
| Sigma-Aldrich | Sodium chloride solution, 0.9% in water, BioXtra, suitable for cell culture |
| Supelco | Sodium chloride, reference material for titrimetry, certified by BAM, >99.5% |
| Sigma-Aldrich | Sodium chloride, BioReagent, suitable for cell culture, suitable for insect cell culture, suitable for plant cell culture, ≥99% |
| Sigma-Aldrich | Sodium chloride, meets analytical specification of Ph. Eur., BP, USP, 99.0-100.5% |
| Sigma-Aldrich | Sodium chloride, tablet |
| Sigma-Aldrich | Sodium chloride, BioPerformance Certified, ≥99% (titration), suitable for insect cell culture, suitable for plant cell culture |
| Sigma-Aldrich | Sodium chloride, BioUltra, Molecular Biology, ≥99.5% (AT) |
| Sigma-Aldrich | Sodium chloride, Vetec, reagent grade, 99% |
| USP | Levothyroxine, United States Pharmacopeia (USP) Reference Standard |