TY - GEN
T1 - In-depth analysis of interrelation between quality scores and real errors in Illumina reads
AU - Kwon, Sunyoung
AU - Park, Seunghyun
AU - Lee, Byunghan
AU - Yoon, Sungroh
PY - 2013
Y1 - 2013
N2 - In sequencing results, a quality score is reported for each base, representing the probability that the base is called incorrectly. The notion of quality scores was initially developed for conventional Sanger sequencing, but is now widely used for next-generation sequencing techniques, including Illumina. In this paper, we carry out an in-depth analysis of the quality scores reported for Illumina reads and show how they relate to real errors in the reads. We confirmed a strong interrelation between quality scores and real errors in Illumina reads, and observed that, in paired-end data, reverse reads tend to have lower quality scores than forward reads. In addition, we discovered other interesting patterns from quality score analysis. Our hope is that the findings in this paper will be helpful for designing error-correction and/or filtering methods for next-generation sequencing.
AB - In sequencing results, a quality score is reported for each base, representing the probability that the base is called incorrectly. The notion of quality scores was initially developed for conventional Sanger sequencing, but is now widely used for next-generation sequencing techniques, including Illumina. In this paper, we carry out an in-depth analysis of the quality scores reported for Illumina reads and show how they relate to real errors in the reads. We confirmed a strong interrelation between quality scores and real errors in Illumina reads, and observed that, in paired-end data, reverse reads tend to have lower quality scores than forward reads. In addition, we discovered other interesting patterns from quality score analysis. Our hope is that the findings in this paper will be helpful for designing error-correction and/or filtering methods for next-generation sequencing.
UR - http://www.scopus.com/inward/record.url?scp=84886491089&partnerID=8YFLogxK
U2 - 10.1109/EMBC.2013.6609580
DO - 10.1109/EMBC.2013.6609580
M3 - Conference contribution
C2 - 24109767
AN - SCOPUS:84886491089
SN - 9781457702167
T3 - Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS
SP - 635
EP - 638
BT - 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2013
T2 - 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2013
Y2 - 3 July 2013 through 7 July 2013
ER -