Wherefore art thou mouse dbSNP VCF file?
Posted by Pedja Grujic on Dec 16, 2011
Every once in a while I come across a problem that surprises me. Getting a VCF file for the Mouse genome (mm9) is one of those problems. We use the GATK extensively internally, and it has standardized around the VCF format (rightfully so), so when validating, annotating, and recalibrating variants, one requires a VCF file.
Variant Calling on Ion Torrent Data
Posted by David Jenkins on Oct 25, 2011
Variant calling, the detection of SNPs and INDELs, plays a particularly important role in Ion Torrent data because of the platform's propensity for homopolymer errors. It’s particularly challenging to distinguish the insertions and deletions introduced by sequencing errors from true differences from the reference sequence. The Ion Torrent analysis pipeline includes a variant calling plugin that assists in the identification of SNPs and INDELs. Using SAMtools, the plugin produces a variant sample report with settings adjusted to match the error model of Ion Torrent data. Using recently sequenced E. coli DH10B data from a 316 chip and an artificially mutated E. coli genome, the variant analysis plugin settings were compared to other SAMtools settings to find the settings that recover the most true variants while avoiding false positives. This mutated genome and a genome containing only homopolymer errors were also used to compare Illumina’s MiSeq technology to Ion Torrent in terms of variant analysis.
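To make the comparison concrete, here is a minimal sketch of how calls made against an artificially mutated genome could be scored. It is not the plugin's code or the method used in the post: the `Variant` tuple layout, the `CallStats` type, and the `score_calls` function are all hypothetical names chosen for illustration, and the sketch assumes the introduced mutations and the called variants are already available as sets of (chromosome, position, ref, alt) tuples.

```python
from typing import NamedTuple, Set, Tuple

# Hypothetical representation: (chromosome, 1-based position, ref allele, alt allele)
Variant = Tuple[str, int, str, str]


class CallStats(NamedTuple):
    """Counts from comparing called variants to the known (introduced) ones."""
    true_positives: int
    false_positives: int
    false_negatives: int

    @property
    def precision(self) -> float:
        # Fraction of called variants that are real
        called = self.true_positives + self.false_positives
        return self.true_positives / called if called else 0.0

    @property
    def recall(self) -> float:
        # Fraction of real variants that were called
        expected = self.true_positives + self.false_negatives
        return self.true_positives / expected if expected else 0.0


def score_calls(truth: Set[Variant], called: Set[Variant]) -> CallStats:
    """Score one set of calls against the mutations known to be in the genome."""
    return CallStats(
        true_positives=len(truth & called),
        false_positives=len(called - truth),
        false_negatives=len(truth - called),
    )


# Made-up example data: three introduced mutations, three calls
truth = {("chr", 100, "A", "G"), ("chr", 250, "C", "CT"), ("chr", 400, "GA", "G")}
called = {("chr", 100, "A", "G"), ("chr", 250, "C", "CT"), ("chr", 777, "T", "TT")}
stats = score_calls(truth, called)
print(stats, round(stats.precision, 2), round(stats.recall, 2))
```

Running `score_calls` once per settings profile gives comparable precision and recall figures. Exact tuple matching is a simplification: the same INDEL inside a homopolymer run can be reported at different positions, so a real comparison would likely need to normalize variants first.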
