NIST - National Institute of Standards and Technology

01/18/2025 | Press release | Archived content

Robust discrimination between closely related species of salmon based on DNA short-reads

Published
January 18, 2025

Author(s)

Antonio Possolo, Mary Gregg, Angela Folz, Debra Ellisor

Abstract

Closely related species of Salmonidae, including Pacific and Atlantic salmon, can be distinguished from one another based on nucleotide sequences from the cytochrome c oxidase sub-unit 1 mitochondrial gene (CO1), using ensembles of short-reads aligned to genetic barcodes that serve as digital proxies for the holotypes of the relevant species. This is accomplished by exploiting both the nucleotide sequences and their quality scores recorded in a FASTQ file obtained via Next Generation sequencing of mitochondrial DNA extracted from Coho salmon caught with hook and line in the Gulf of Alaska. The alignment is done using MUSCLE 5.2 (Edgar, 2022), applied to multiple versions of each short-read perturbed according to the nucleobase identification error probabilities underlying the quality scores. The Damerau-Levenshtein distance was used to determine the genetic barcode of the candidate species that is closest to each aligned, perturbed short-read. The "votes" that the sampled short-reads cast for the different candidate species are then pooled and converted into identification probabilities, using weights determined by the entropy of the short-read specific identification probability distributions. This novel approach to discrimination between closely related species is used for robust value-assignment to reference materials supporting the ascertainment of the authenticity of seafood, for example NIST Reference Materials 8256 and 8257 (Coho salmon) (Ellisor et al, 2021).
Citation
Analytical and Bioanalytical Chemistry
Pub Type
Journals

Keywords

Coho, barcoding, DNA, MUSCLE, short-read, alignment, nominal, entropy

Citation

Possolo, A. , Gregg, M. , Folz, A. and Ellisor, D. (2025), Robust discrimination between closely related species of salmon based on DNA short-reads, Analytical and Bioanalytical Chemistry, [online], https://doi.org/10.1007/s00216-024-05724-9, https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=958721 (Accessed January 20, 2025)

Additional citation formats

Issues

If you have any questions about this publication or are having problems accessing it, please contact [email protected].