Letter frequencies in this type of text are represented graphically on a WebLogo and numerically using a position-weight matrix. FASTA (“fast-uh”) files store this type of text, which is often provided alongside a chromatogram with green, red, blue, and black peaks. The letter N indicates lack of consensus at a position or motif in one of these strings. One of these strings is queried against a database in a (*) BLAST (“blast”) search, which aligns these texts to identify SNPs (“snips”) or to construct a phylogenetic tree. One of these strings with three billion characters, measured with only a few gaps from telomere-to-telomere, was the output of the Human Genome Project. For 10 points, what type of text uses strings of As, Cs, Ts, and Gs to represent genes? ■END■
ANSWER: DNA sequences [or sequencing data; accept DNA or deoxyribonucleic acid sequences; accept nucleic acid or nucleotide sequences; accept RNA or ribonucleic acid sequences; accept genes or gene sequences or genomes or genetic sequences until “genome” is read, and prompt on it afterwards; accept protein sequences, but reject “proteins” alone]
<Adam Silverman, Science - Biology> ~25461~ <Editor: Adam Silverman>
= Average correct buzz position