<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Chromosome Chronicles</title>
	<atom:link href="http://www.chromosomechronicles.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.chromosomechronicles.com</link>
	<description>Genetics 2.0: Intelligent design and unnatural selection...</description>
	<lastBuildDate>Mon, 28 Jun 2010 20:38:43 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>Sequence Variants and the Genomic Databases: Standardizing the Nomenclature</title>
		<link>http://www.chromosomechronicles.com/2010/06/22/sequence-variants-and-the-genomic-databases-standardizing-the-nomenclature/</link>
		<comments>http://www.chromosomechronicles.com/2010/06/22/sequence-variants-and-the-genomic-databases-standardizing-the-nomenclature/#comments</comments>
		<pubDate>Tue, 22 Jun 2010 17:41:24 +0000</pubDate>
		<dc:creator>Alex</dc:creator>
				<category><![CDATA[Bioinformatics]]></category>
		<category><![CDATA[cDNA]]></category>
		<category><![CDATA[gDNA]]></category>
		<category><![CDATA[GenBank]]></category>
		<category><![CDATA[HGVS]]></category>
		<category><![CDATA[Locus Reference Genome]]></category>
		<category><![CDATA[Mutation Nomenclature]]></category>
		<category><![CDATA[Reference Sequence]]></category>
		<category><![CDATA[RefSeq]]></category>

		<guid isPermaLink="false">http://www.chromosomechronicles.com/?p=466</guid>
		<description><![CDATA[Genomic sequence variants, including single nucleotide polymorphisms, insertions, deletions, and even deleterious substitutions, are often listed with a deprecated nomenclature and incorrect nucleic acid or amino acid number making it difficult in practice to locate these mutations for either research or clinical purposes. A static reference genome assembly (for instance, the Locus Reference Genomic sequence), an up to date system of nomenclature (mutation name according to gDNA position instead of cDNA position), and more useful indexing between gDNA, cDNA and protein sequences (exon indexing) will allow for a more useful variation browser to be created. ]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.chromosomechronicles.com%2F2010%2F06%2F22%2Fsequence-variants-and-the-genomic-databases-standardizing-the-nomenclature%2F" onclick="pageTracker._trackPageview('/outgoing/api.tweetmeme.com/share?url=http_3A_2F_2Fwww.chromosomechronicles.com_2F2010_2F06_2F22_2Fsequence-variants-and-the-genomic-databases-standardizing-the-nomenclature_2F&amp;referer=');"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.chromosomechronicles.com%2F2010%2F06%2F22%2Fsequence-variants-and-the-genomic-databases-standardizing-the-nomenclature%2F&amp;source=chromchron&amp;style=normal&amp;service=bit.ly" height="61" width="50" /><br />
			</a>
		</div>
<p style="text-align: center;"><img class="aligncenter size-large wp-image-472" title="UCSC AMPD1" src="http://www.chromosomechronicles.com/wp-content/uploads/2010/06/UCSC-AMPD1-1024x591.png" alt="" width="502" height="290" /></p>
<p>For those interested in clinical diagnostics of genetic diseases, the ability to use the molecular information presented within our various genomic databases is somewhat limited. If you attempt to locate the common p.Q12X mutation within the NCBI Reference Sequence of the AMPD1 gene (which causes Adenosine Monophosphate Deaminase Deficiency), you will find that the 12th codon does not correspond to Glutamic Acid (the mutation, according to the current RefSeq gene (<a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Search&amp;db=Nucleotide&amp;term=NM_000036&amp;doptcmdl=GenBank&amp;tool=genome.ucsc.edu" target="_blank" onclick="pageTracker._trackPageview('/outgoing/www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Search_amp_db=Nucleotide_amp_term=NM_000036_amp_doptcmdl=GenBank_amp_tool=genome.ucsc.edu&amp;referer=');">NM_000036.2</a>), is actually p.Q45X) . For researchers and clinicians looking for SNP primers or probes for specific mutations, the lack of fidelity and congruency between reported mutations and the &#8220;curated&#8221; databases is more than a headache. I have personally spent hours chasing down one mutation only to remain uncertain as to whether or not I had located the proper nucleotide in the end.</p>
<p>What is the issue? Did the original researchers not properly locate the mutation? Is the Reference Sequence incorrect? What can be done to increase the ease with which we can not only locate a specific phenotype-associated variant, but determine the nucleotides immediately surrounding that locus as well?</p>
<h3>Understanding the Genomic Databases</h3>
<p>The National Center for Biotechnology Information (NCBI) provides an excellent <a title="RefSeq vs GenBank" href="http://www.ncbi.nlm.nih.gov/bookshelf/br.fcgi?book=handbook&amp;part=ch1#GenBank_ASM" target="_blank" onclick="pageTracker._trackPageview('/outgoing/www.ncbi.nlm.nih.gov/bookshelf/br.fcgi?book=handbook_amp_part=ch1_GenBank_ASM&amp;referer=');">summary</a> of the differences between its two major databases: <a title="RefSeq" href="http://www.ncbi.nlm.nih.gov/RefSeq/" target="_blank" onclick="pageTracker._trackPageview('/outgoing/www.ncbi.nlm.nih.gov/RefSeq/?referer=');">RefSeq</a> and <a title="GenBank" href="http://www.ncbi.nlm.nih.gov/genbank/" target="_blank" onclick="pageTracker._trackPageview('/outgoing/www.ncbi.nlm.nih.gov/genbank/?referer=');">GenBank</a>. The major distinction for those interested in clinical genetics is the fact that RefSeq is curated and GenBank is not. GenBank is essentially a public repository of all DNA sequences made available by researchers who are responsible for updating and maintaining their own submitted sequences. RefSeq is the database that attempts to provide one sequences as the &#8220;Reference Sequence,&#8221; usually determined by looking at the most common variants among the GenBank entries. RefSeq sequences are occasionally updated which is why it is always important to include the accession number of the gene when using the Reference Sequence.</p>
<p>RefSeq has greater utility because it provides linked records between the genomic DNA, the mRNA transcript (and cDNA record), and the translated protein. It would seem that locating a non-synonymous mutation should take just a few clicks.</p>
<h3>An Arcane Nomenclature System</h3>
<p>Okay, I admit, arcane is a bit harsh, but it is time to update our nomenclature system. The Human Genome Variation Society (<a title="HGVS" href="http://www.hgvs.org/mutnomen/recs-DNA.html" target="_blank" onclick="pageTracker._trackPageview('/outgoing/www.hgvs.org/mutnomen/recs-DNA.html?referer=');">HGVS</a>) provides the guidelines that are currently considered the best practice in naming sequence variants. They rely on coding DNA (cDNA) and essentially number the cDNA beginning with 1 for the first Adenine in the initiation (ATG) codon. There are obvious drawbacks to using cDNA, the most important being lack of numerical assignments for intronic nucleotides.</p>
<p>More and more intronic mutations are being found to account for deleterious phenotypes. Yet, the original (deprecated) system for naming intron variants looks something like this: IVS3+22G&gt;A (interpretation: the G 22 basepairs from the beginning of intron 3 is changed to an A). There always seems to be some ambiguity when I come across mutations described this way: we can also write IVS3-98G&gt;A (the G 98 basepairs from the end of intron 3 is changed to an A).</p>
<p>More recently, it has been recommend to describe intronic mutations according to the closest possible cDNA location. Our mutation above might now be described as c.195+22G&gt;A (cDNA nucleotide 195 is the last nucleotide found on exon 3 (which immediately precedes intron 3). Or, we might write it as c.196-98G&gt;A.</p>
<p>Let&#8217;s be practical for a second. This is <em>terrible </em>indexing. If I want to find an intronic mutation, it will be indexed according to the cDNA record of the relevant gene. Fine, but if I search the cDNA for nucleotide 195, the intron is spliced out of the transcript, so I cannot locate the sequence of interest there. Instead, I have to take the last few base pairs from exon 3 of the cDNA transcript, open up the gDNA (genomic DNA) transcript, and search for these base pairs. Then, once I have discovered where this exon ends in the genomic DNA, I must count 22 base pairs forward to locate the nucleotide of interest (this will all be in vain if the mutation was reported incorrectly in the literature).</p>
<p>The lesson here: although the RefSeq genomic DNA is linked to the cDNA and the protein records, the actual nucleotides/codons/amino acids are not indexed together.</p>
<h3>A Standardized Index: Making Sure Mutation Nomenclature is Static</h3>
<p>The cDNA naming convention grew out of the fact that the ability to completely sequence genomic DNA for the major part of the 20th century was very limited. It made sense for mutations to be reported based on the cDNA because it reflected the two more easily obtainable (and more static) records of that time: the mRNA transcript and the protein sequence. However, now that the Human Genome Project has paved the way for easier sequencing, and we are constantly improving a standard Reference Sequence, it makes much more sense to name mutations in terms of their genomic DNA (gDNA) location.</p>
<p>As reported by <a title="Locus Reference Genomic sequences: an improved basis for describing human DNA variants." href="http://www.ncbi.nlm.nih.gov/pubmed/20398331" target="_blank" onclick="pageTracker._trackPageview('/outgoing/www.ncbi.nlm.nih.gov/pubmed/20398331?referer=');">Flicek et al.</a>, the Locus Reference Genomic (LRG) sequence format is being developed specifically for the purpose of accurately indexing genomic variants. The LRG will provide a static record for every gene of interest, and this record WILL NOT CHANGE. It may be annotated, etc., but in the interest of preserving reported mutations, the backbone sequence (and subsequent base pair numbers will not change).</p>
<p>I have been using the Reference Sequence in a similar function whenever I locate a mutation for testing purposes. For example, I was searching for the p.G380R mutation in the FGFR3 gene which is a common cause of Achondroplasia. The cDNA variant I was interested in was c.G1138A. After searching for the mutation within the gene transcript, I located the surrounding base pairs, and determined that the genomic name for the mutation is g.G10458A. In my system, I start with the Adenine of the initiation codon as 1 for the gDNA as well as the cDNA. Thus, the gDNA and cDNA index numbers diverge once the first intron begins. For point mutations, I have written a script that automatically accomplishes this by interacting with the UCSC genome browser. An indexing revelation is what helps this script to work.</p>
<h3>Relational Indexing: Relating the cDNA to the gDNA</h3>
<p>I have been developing my own library of RefSeq genome transcripts, but I have been indexing them more efficiently. My gDNA files, as previously mentioned,  are labeled 1 at the beginning of the initiation codon, however, the non-coding transcribed region preceding the initiation codon is labeled from -1 downward beginning at the nucleotide immediately preceding ATG (there is no 0). From the start codon, the gDNA positions are numbered sequentially. Similiarly, the cDNA transcript begins with the start codon, and is numbered from 1 until the end of the stop codon.</p>
<p>To relate the cDNA to the gDNA, I use exon indexing. For cDNA, every nucleotide is assigned to an exon, and these nucleotides are numbered from 1 until the end of the exon. Thus, if exon 3 is 79 base pairs long, the nucleotides within exon 3 will also be numbered from 1 to 79. For gDNA, every nucleotide is assigned to an exon or intron, and these nucleotides are numbered from 1 until the end of the exon/intron. Thus, we can relate the cDNA position to the gDNA position through the exon indexing.</p>
<p>For example, if I want to determine the gDNA position of cDNA locus 285, the analysis is very simple:</p>
<ol>
<li>Find nucleotide 285 within the cDNA file.</li>
<li>Determine the exon of nucleotide 285 (exon 2).</li>
<li>Determine the index within exon 2 of the nucleotide (exon 2, position 83).</li>
<li>Locate exon 2, position 83 within the gDNA file.</li>
<li>Determine the gDNA position associated with exon 2, position 83 (382).</li>
</ol>
<p style="text-align: center;"><img class="aligncenter size-large wp-image-467" title="Exon Indexing" src="http://www.chromosomechronicles.com/wp-content/uploads/2010/06/ExonIndexing-1024x505.png" alt="" width="502" height="248" /></p>
<p>In this way, we have effortlessly determined that c.285 corresponds to g.382.</p>
<h3>Developing a Functional, Interactive Reference Assembly</h3>
<p>Although the data files are linked between RefSeq gDNA, cDNA and amino acid sequences, the files are not indexed to the extent that it is entirely useful. Ultimately, I would like to be able to browse a cDNA/gDNA sequence, perform a base-pair change, determine if this change is synonymous or non-synonymous, and find out if the change has been associated with any disease phenotypes (which would entail linking individual nucleotides to <a title="OMIM" href="http://www.ncbi.nlm.nih.gov/omim" target="_blank" onclick="pageTracker._trackPageview('/outgoing/www.ncbi.nlm.nih.gov/omim?referer=');">OMIM</a> records). The development of such a functional variation browser will require a lot of forethought, smart programming, and a great deal of curation. However, I believe that development of the Locus Reference Genomic (LRG) sequence is a step in the right direction.</p>
<h3>References:</h3>
<p><span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=Genome+medicine&amp;rft_id=info%3Apmid%2F20398331&amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;rft.atitle=Locus+Reference+Genomic+sequences%3A+an+improved+basis+for+describing+human+DNA+variants.&amp;rft.issn=&amp;rft.date=2010&amp;rft.volume=2&amp;rft.issue=4&amp;rft.spage=24&amp;rft.epage=&amp;rft.artnum=&amp;rft.au=Dalgleish+R&amp;rft.au=Flicek+P&amp;rft.au=Cunningham+F&amp;rft.au=Astashyn+A&amp;rft.au=Tully+RE&amp;rft.au=Proctor+G&amp;rft.au=Chen+Y&amp;rft.au=McLaren+WM&amp;rft.au=Larsson+P&amp;rft.au=Vaughan+BW&amp;rft.au=B%C3%A9roud+C&amp;rft.au=Dobson+G&amp;rft.au=Lehv%C3%A4slaiho+H&amp;rft.au=Taschner+PE&amp;rft.au=den+Dunnen+JT&amp;rft.au=Devereau+A&amp;rft.au=Birney+E&amp;rft.au=Brookes+AJ&amp;rft.au=Maglott+DR&amp;rfe_dat=bpr3.included=1;bpr3.tags=Biology%2CComputer+Science%2CDatabases%2C+Genetics+%2C+Bioinformatics">Dalgleish R, Flicek P, Cunningham F, Astashyn A, Tully RE, Proctor G, Chen Y, McLaren WM, Larsson P, Vaughan BW, Béroud C, Dobson G, Lehväslaiho H, Taschner PE, den Dunnen JT, Devereau A, Birney E, Brookes AJ, &amp; Maglott DR (2010). Locus Reference Genomic sequences: an improved basis for describing human DNA variants. <span style="font-style: italic;">Genome medicine, 2</span> (4) PMID: <a rev="review" href="http://www.ncbi.nlm.nih.gov/pubmed/20398331" onclick="pageTracker._trackPageview('/outgoing/www.ncbi.nlm.nih.gov/pubmed/20398331?referer=');">20398331</a></span></p>
<img src="http://www.chromosomechronicles.com/?ak_action=api_record_view&id=466&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://www.chromosomechronicles.com/2010/06/22/sequence-variants-and-the-genomic-databases-standardizing-the-nomenclature/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Chromosome Chronicles Launches News Section</title>
		<link>http://www.chromosomechronicles.com/2010/04/05/chromosome-chronicles-launches-news-section/</link>
		<comments>http://www.chromosomechronicles.com/2010/04/05/chromosome-chronicles-launches-news-section/#comments</comments>
		<pubDate>Mon, 05 Apr 2010 19:06:02 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Genetics News]]></category>

		<guid isPermaLink="false">http://www.chromosomechronicles.com/?p=460</guid>
		<description><![CDATA[The Chromosome Chronicles has launched a news section for users to submit and vote on relevant genetics news.]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.chromosomechronicles.com%2F2010%2F04%2F05%2Fchromosome-chronicles-launches-news-section%2F" onclick="pageTracker._trackPageview('/outgoing/api.tweetmeme.com/share?url=http_3A_2F_2Fwww.chromosomechronicles.com_2F2010_2F04_2F05_2Fchromosome-chronicles-launches-news-section_2F&amp;referer=');"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.chromosomechronicles.com%2F2010%2F04%2F05%2Fchromosome-chronicles-launches-news-section%2F&amp;source=chromchron&amp;style=normal&amp;service=bit.ly" height="61" width="50" /><br />
			</a>
		</div>
<p style="text-align: center;"><a href="http://news.chromosomechronicles.com" onclick="pageTracker._trackPageview('/outgoing/news.chromosomechronicles.com?referer=');"><img class="aligncenter size-medium wp-image-461" title="Chromosome.Chronicles.News" src="http://www.chromosomechronicles.com/wp-content/uploads/2010/04/Chromosome.Chronicles.News_-300x168.png" alt="" width="300" height="168" /></a></p>
<p style="text-align: left;">Inspired by one of my favorite sites, <a title="HackerNews" href="http://news.ycombinator.com" target="_blank" onclick="pageTracker._trackPageview('/outgoing/news.ycombinator.com?referer=');">HackerNews</a>, I&#8217;ve decided to launch my own version: The Chromosome Chronicles news section! Here, ANYONE can submit genetics news (I also have a few reputable news sites feeding to it). But I encourage anyone to submit, whether it is a new blog post, a funny genetics picture.The news allows all users and visitors to then vote up or vote down a submitted article. Also, I will be tweeting on my twitter account the most recent top stories <a title="Twitter - ChromChron" href="http://twitter.com/chromchron" target="_blank" onclick="pageTracker._trackPageview('/outgoing/twitter.com/chromchron?referer=');">@chromchron</a>. Please submit!</p>
<img src="http://www.chromosomechronicles.com/?ak_action=api_record_view&id=460&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://www.chromosomechronicles.com/2010/04/05/chromosome-chronicles-launches-news-section/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The 22nd Carnival of Evolution</title>
		<link>http://www.chromosomechronicles.com/2010/04/02/the-22nd-carnival-of-evolution/</link>
		<comments>http://www.chromosomechronicles.com/2010/04/02/the-22nd-carnival-of-evolution/#comments</comments>
		<pubDate>Fri, 02 Apr 2010 19:10:36 +0000</pubDate>
		<dc:creator>Alex</dc:creator>
				<category><![CDATA[Evolution]]></category>
		<category><![CDATA[Adaptation]]></category>
		<category><![CDATA[Carnival of Evolution]]></category>

		<guid isPermaLink="false">http://www.chromosomechronicles.com/?p=446</guid>
		<description><![CDATA[The 22nd Blog Carnival of Evolution is now up at Beetles in the Bush! Check it out to see 26 great posts on evolution and perspectives on a diverse range of evolution-related fields. The Chromosome Chronicles article on In Silico Models of Evolution is featured in this edition. Stop over for some good old Darwinian reading.]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.chromosomechronicles.com%2F2010%2F04%2F02%2Fthe-22nd-carnival-of-evolution%2F" onclick="pageTracker._trackPageview('/outgoing/api.tweetmeme.com/share?url=http_3A_2F_2Fwww.chromosomechronicles.com_2F2010_2F04_2F02_2Fthe-22nd-carnival-of-evolution_2F&amp;referer=');"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.chromosomechronicles.com%2F2010%2F04%2F02%2Fthe-22nd-carnival-of-evolution%2F&amp;source=chromchron&amp;style=normal&amp;service=bit.ly" height="61" width="50" /><br />
			</a>
		</div>
<p style="text-align: center;"><a href="http://beetlesinthebush.wordpress.com/2010/04/02/carnival-of-evolution-22/" onclick="pageTracker._trackPageview('/outgoing/beetlesinthebush.wordpress.com/2010/04/02/carnival-of-evolution-22/?referer=');"><img class="aligncenter size-medium wp-image-447" title="CoEButton" src="http://www.chromosomechronicles.com/wp-content/uploads/2010/04/CoEButton-300x300.jpg" alt="" width="300" height="300" /></a></p>
<p>The 22nd Blog Carnival of Evolution is now up at <a title="Beetles in the Bush - 22nd Carnival of Evolution" href="http://beetlesinthebush.wordpress.com/2010/04/02/carnival-of-evolution-22/" target="_blank" onclick="pageTracker._trackPageview('/outgoing/beetlesinthebush.wordpress.com/2010/04/02/carnival-of-evolution-22/?referer=');">Beetles in the Bush!</a> Check it out to see 26 great posts on evolution and perspectives on a diverse range of evolution-related fields. The Chromosome Chronicles article on <a title="Modeling Evolution in vitro and in silico" href="http://www.chromosomechronicles.com/2010/03/12/modeling-evolution-in-vitro-and-in-silico/" target="_blank"><em>In Silico</em> Models of Evolution</a> is featured in this edition. Stop over for some good old Darwinian reading.</p>
<img src="http://www.chromosomechronicles.com/?ak_action=api_record_view&id=446&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://www.chromosomechronicles.com/2010/04/02/the-22nd-carnival-of-evolution/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Divorcers and Polygamists: Evolutionary Superstars?</title>
		<link>http://www.chromosomechronicles.com/2010/04/01/divorcers-and-polygamists-evolutionary-superstars/</link>
		<comments>http://www.chromosomechronicles.com/2010/04/01/divorcers-and-polygamists-evolutionary-superstars/#comments</comments>
		<pubDate>Thu, 01 Apr 2010 20:16:35 +0000</pubDate>
		<dc:creator>Alex</dc:creator>
				<category><![CDATA[Evolution]]></category>
		<category><![CDATA[Darwin]]></category>
		<category><![CDATA[Divorce]]></category>
		<category><![CDATA[Evolutionary Fitness]]></category>
		<category><![CDATA[Polygamy]]></category>

		<guid isPermaLink="false">http://www.chromosomechronicles.com/?p=428</guid>
		<description><![CDATA[Evolutionary fitness, in a nutshell, is a measure of how many offspring a person has. To have more offspring is to be more successful (evolutionarily). Polygamists and serial divorcers put themselves at an "evolutionary advantage" by having more wives.]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.chromosomechronicles.com%2F2010%2F04%2F01%2Fdivorcers-and-polygamists-evolutionary-superstars%2F" onclick="pageTracker._trackPageview('/outgoing/api.tweetmeme.com/share?url=http_3A_2F_2Fwww.chromosomechronicles.com_2F2010_2F04_2F01_2Fdivorcers-and-polygamists-evolutionary-superstars_2F&amp;referer=');"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.chromosomechronicles.com%2F2010%2F04%2F01%2Fdivorcers-and-polygamists-evolutionary-superstars%2F&amp;source=chromchron&amp;style=normal&amp;service=bit.ly" height="61" width="50" /><br />
			</a>
		</div>
<p style="text-align: center;"><a href="http://www.chromosomechronicles.com/wp-content/uploads/2010/03/brighamyoung.jpg"><img class="aligncenter size-medium wp-image-429" title="Brigham Young" src="http://www.chromosomechronicles.com/wp-content/uploads/2010/03/brighamyoung-231x300.jpg" alt="" width="231" height="300" /></a></p>
<p style="text-align: center;"><a title="Brigham Young" href="http://en.wikipedia.org/wiki/Brigham_Young" target="_blank" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Brigham_Young?referer=');">Brigham Young</a> and 21 of his wives.</p>
<p style="text-align: left;"><a title="Fitness Wikipedia" href="http://en.wikipedia.org/wiki/Fitness_(biology)" target="_blank" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Fitness_biology?referer=');">Evolutionary fitness</a> is one of the most important concepts in Darwinian evolution. Essentially, fitness can be regarded as a measure of how much of an individual&#8217;s genes are passed on to the next generation. More specifically, the higher proportion of the next generation that is comprised of your genes, the more fit, evolutionarily, you are.</p>
<h3>Fitness: A short example</h3>
<p style="text-align: left;">For example, say I have one copy of a particular version of a gene: Gene X. No one else in my population of 100 individuals has it. Since we all have two copies of every gene, the frequency of Gene X can be regarded as 1 out of 200. Let&#8217;s assume that I pass on Gene X to four of my children in the next generation of 100 individuals for this population. Gene X has now increased from 1 out of 200 to 4 out of 200. Since the frequency of Gene X increased from the first to the second generation, Gene X can be regarded as a gene of higher evolutionary fitness.</p>
<h3>Maximizing Your Fitness</h3>
<p>Okay, now that we&#8217;ve reviewed fitness as the measure<a href="http://www.guzer.com/pictures/marzipan_babies.php" onclick="pageTracker._trackPageview('/outgoing/www.guzer.com/pictures/marzipan_babies.php?referer=');"><img class="alignright size-medium wp-image-432" title="marzipan_babies" src="http://www.chromosomechronicles.com/wp-content/uploads/2010/03/marzipan_babies-300x210.jpg" alt="" width="300" height="210" /></a> for evolutionary success, it begs the question: how can I maximize my evolutionary success? Answer: have as many children as possible (who will survive to one day themselves reproduce). This way, your genes will constitute a higher proportion of the future generations of gene pools.</p>
<p>An obvious limitation to the number of children a person can have is childbirth itself. Nine months, a painful delivery, and then years of caring for the child requires a <em>huge</em> investment in resources. Moreover, women might be viewed as a &#8220;limiting step&#8221; in this process (I hope I don&#8217;t get shot for that one). In particular, mating with only one women would surely slow a man down if his main goal was to produce as many offspring as possible.</p>
<h3>Polygamy: Evolutionarily Advantageous</h3>
<p><a title="Joseph Smith Jr." href="http://en.wikipedia.org/wiki/Joseph_Smith,_Jr." target="_blank" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Joseph_Smith_Jr.?referer=');">Joseph Smith Jr.</a> was on to something when he founded the Church of the Latter Day Saints. Specifically, his many, many, many wives made him highly successful (from a Darwinian standpoint), although he publicly condemned the practice. By some accounts he may have had <a title="Children of Joseph Smith Jr." href="http://en.wikipedia.org/wiki/Children_of_Joseph_Smith,_Jr." target="_blank" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Children_of_Joseph_Smith_Jr.?referer=');">20 children</a> with his multiple wives. While the privilege of polygamy had previously only been reserved for the highest of alpha males (like the one and only <a title="Genghis Khan" href="http://en.wikipedia.org/wiki/Genghis_Khan" target="_blank" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Genghis_Khan?referer=');">Genghis Khan</a>, to whom a large proportion of China can claim some relation), Smith had labored to include multiple wives in the doctrine of his faith. Simply join his church and you become an evolutionary celebrity. To this day, there are many living in Utah who can claim some relation to Joseph Smith Jr.</p>
<h3>Divorce/Second Marriages: The Compromise in the name of Monogamy</h3>
<p>Let&#8217;s assume (or believe) that all human actions are still driven by primal evolutionary urges. Having multiple wives surely fits this bill since it allows for greater evolutionary fitness. However, polygamy is outlawed in western culture, so outside of those who practice it underground, it is not a reliable option for men who wish to increase their fitness.</p>
<p>For men, the divorce/second marriage life route allows for more than one family, more than one wife, just not at the same time. By Darwinian standards, those who get divorced and remarry are actually more successful than those who are monogamous their whole lives (assuming that they have more children). Men who get divorced and remarry will be passing on a higher proportion of genetic material into the next generation.</p>
<p><a href="http://www.flickr.com/photos/lel4nd/3914799933/" onclick="pageTracker._trackPageview('/outgoing/www.flickr.com/photos/lel4nd/3914799933/?referer=');"><img class="alignleft size-medium wp-image-434" title="wedding_rings" src="http://www.chromosomechronicles.com/wp-content/uploads/2010/03/wedding_rings-300x192.jpg" alt="" width="240" height="154" /></a>Let&#8217;s take this argument a step further. What if the behavior: &#8220;Get divorced, remarry&#8221; has a genetic factor that predisposes a man to engaging in this behavior. If this behavior results in higher fitness (higher % of genes passed on by those who exhibit the behavior), then one might argue that a higher proportion of men in the next generation will have genes that predispose them to get divorced. By this logic, if divorce is an activity that makes men more fit (evolutionarily), then we can expect to see divorce rates rise in future generations!</p>
<h3>Is Maximizing Our Darwinian Fitness our <em>True</em> Goal in Life?</h3>
<p>This question gets to the heart of our discussion. Is maximizing the number of children we have our major goal in life? For some species, yes. For humans, no. Our sentience seems to have made us immune to many of the evolutionary pressures that drive other animals&#8217; behaviors. For example, if having your own genes passed on in your children was so important, then why is adoption so popular? Also, why do many couples elect to not have children ever? These actions seem to be at odds with the idea of evolutionary fitness, yet many members of our species engage in them.</p>
<p>Finally, if maximizing evolutionary fitness was really the ultimate goal in life, then why wouldn&#8217;t all men (and women) go around the world donating sperm (and eggs) to all of the banks they can find. I can think of no quicker way to maximize the number of genetic offspring you have. Hopefully, no one does that because it would be kind of creepy.</p>
<img src="http://www.chromosomechronicles.com/?ak_action=api_record_view&id=428&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://www.chromosomechronicles.com/2010/04/01/divorcers-and-polygamists-evolutionary-superstars/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Analysis of 23andMe&#8217;s Genotyping: High Accuracy of Illumina Platform Confirmed by Comparing Siblings</title>
		<link>http://www.chromosomechronicles.com/2010/03/27/analysis-of-23andmes-genotyping-high-accuracy-of-illumina-platform-confirmed-by-comparing-siblings/</link>
		<comments>http://www.chromosomechronicles.com/2010/03/27/analysis-of-23andmes-genotyping-high-accuracy-of-illumina-platform-confirmed-by-comparing-siblings/#comments</comments>
		<pubDate>Sun, 28 Mar 2010 01:09:36 +0000</pubDate>
		<dc:creator>Alex</dc:creator>
				<category><![CDATA[Analyze Your Own SNPs]]></category>
		<category><![CDATA[Bioinformatics]]></category>
		<category><![CDATA[Consumer Genetics]]></category>
		<category><![CDATA[23andMe]]></category>
		<category><![CDATA[Genotyping]]></category>
		<category><![CDATA[Haplotypes]]></category>
		<category><![CDATA[Illumina]]></category>
		<category><![CDATA[SNPs]]></category>

		<guid isPermaLink="false">http://www.chromosomechronicles.com/?p=412</guid>
		<description><![CDATA[An analysis of 23andMe genotype data between siblings to check for genotyping errors reveals high accuracy (99.15%) of genotype calls. ]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.chromosomechronicles.com%2F2010%2F03%2F27%2Fanalysis-of-23andmes-genotyping-high-accuracy-of-illumina-platform-confirmed-by-comparing-siblings%2F" onclick="pageTracker._trackPageview('/outgoing/api.tweetmeme.com/share?url=http_3A_2F_2Fwww.chromosomechronicles.com_2F2010_2F03_2F27_2Fanalysis-of-23andmes-genotyping-high-accuracy-of-illumina-platform-confirmed-by-comparing-siblings_2F&amp;referer=');"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.chromosomechronicles.com%2F2010%2F03%2F27%2Fanalysis-of-23andmes-genotyping-high-accuracy-of-illumina-platform-confirmed-by-comparing-siblings%2F&amp;source=chromchron&amp;style=normal&amp;service=bit.ly" height="61" width="50" /><br />
			</a>
		</div>
<p style="text-align: center;"><a href="http://www.chromosomechronicles.com/wp-content/uploads/2010/03/iscan.jpg"><img class="aligncenter size-full wp-image-421" title="iScan" src="http://www.chromosomechronicles.com/wp-content/uploads/2010/03/iscan.jpg" alt="" width="265" height="290" /></a></p>
<p>Microarray genotyping platforms report high accuracy. Of course, this is given that you use their protocols, ideal conditions, etc. Depending on the genotyping facility, this accuracy may be even more tenuous. I recently set out to get some good estimates for the rate of genotyping errors from the <a title="Illumina" href="http://illumina.com/" target="_blank" onclick="pageTracker._trackPageview('/outgoing/illumina.com/?referer=');">Illumina</a> assay employed by <a title="23andMe" href="http://23andme.com" target="_blank" onclick="pageTracker._trackPageview('/outgoing/23andme.com?referer=');">23andMe</a>.</p>
<p>First let me refer back to a previous post about <a title="Phasing: Determining Which SNPs are Inherited together" href="http://www.chromosomechronicles.com/2009/09/08/phasing-determining-which-snps-are-inherited-together/" target="_blank">determining haplotype</a> when you have two parents and a child. By determining haplotypes for all of my siblings and I, I am able to compare regions where we share both parental haplotypes, or shared diplotypes. In my family, there are four siblings (including myself), and I have had us all tested at 23andMe. To analyze genotyping errors, I decided to compare informative SNPs from shared haplotypes between my three brothers and I.</p>
<h3>Experimental Outline:</h3>
<ol>
<li>Determine Transmitted Parental Haplotypes from all four siblings.</li>
<li>Determine Where Both alleles are Shared for all four Siblings.</li>
<li>Find out how many genotyping errors occurred in this region.</li>
</ol>
<p>Determining the parental haplotypes is simple, and my method for doing this has already been described. Determining where the four siblings share alleles was accomplished using a program I wrote called <a title="Chromosoft - NucleOlap" href="http://chromosoft.org/products/nucleolap/" target="_blank" onclick="pageTracker._trackPageview('/outgoing/chromosoft.org/products/nucleolap/?referer=');">NucleOlap</a>. This program compares informative SNPs from paternal and maternal haplotypes between children and produces a nice output. It is designed to recognize candidate regions responsible for dominant or recessive genetic mutations given each child&#8217;s affected or unaffected status. However, if all children are affected, it is the same thing as analyzing haplotype sharing. Check out the <a title="NucleOlap Analysis Method" href="http://chromosoft.org/wp-content/uploads/2010/03/NucleOlap.Analysis.Method.pdf" target="_blank" onclick="pageTracker._trackPageview('/outgoing/chromosoft.org/wp-content/uploads/2010/03/NucleOlap.Analysis.Method.pdf?referer=');">documentation</a>.</p>
<p>For every pair of siblings, it is expected that, on average, they will share both parental haplotypes for 25% of their genome. Add a 3rd sibling, and that 25% falls to 6.25%. For four siblings, it is expected that both haplotypes are shared for only 1.56% of their genome. According to release 36.1 of the Human Genome, the haploid length of the autosomes is 2,864,255,922 base pairs! I can expect that my three brothers and I share both parental haplotypes for 44,753,999 haploid base pairs.</p>
<p><a href="http://www.chromosomechronicles.com/wp-content/uploads/2010/03/siblings.png"><img class="alignleft size-medium wp-image-413" title="Sibling Diplotype Sharing" src="http://www.chromosomechronicles.com/wp-content/uploads/2010/03/siblings-300x300.png" alt="" width="240" height="240" /></a>The NucleOlap analysis found that my siblings and I all shared both haplotypes in six regions for a total of 47,656,130 haploid base pairs. Pretty good! The ideogram to the left shows the regions where my three brothers and I share the exact same genes from both parents. Shared regions occur on chromosomes 1, 2, 6, 13, 16, and 18. NucleOlap also provided me with the starting and ending positions (and SNPs) for each region. To determine where genotyping errors occured, I compared the raw data for these regions with each child (the program output is not affected by genotyping errors because it is able to recognize and ignore them).</p>
<p>The analysis occurred by gathering<a href="http://www.chromosomechronicles.com/wp-content/uploads/2010/03/Error-Example.png"><img class="alignright size-medium wp-image-415" title="Error Example" src="http://www.chromosomechronicles.com/wp-content/uploads/2010/03/Error-Example-300x111.png" alt="" width="300" height="111" /></a> the SNPs in the identified shared regions and lining them up parallel to one another in Microsoft Excel. I then checked to see that all four siblings had the same genotype for each SNP (as they are expected to). A sample of how this worked is shown in the picture to the right.</p>
<p>My analysis revealed that 10,079 SNPs were contained within the regions where my brothers and I share diplotypes. Of these 10,079 SNPs, only 86 of them had any genotyping errors! <a href="http://www.chromosomechronicles.com/wp-content/uploads/2010/03/Error-Summary-Table.png"><img class="alignleft size-full wp-image-417" title="Error Summary Table" src="http://www.chromosomechronicles.com/wp-content/uploads/2010/03/Error-Summary-Table.png" alt="" width="218" height="148" /></a>This means that the genotyping calling was 99.15% accurate for these regions. Moreover, of the errors recorded, 79 of them occurred when there was a genotype call for some of the siblings and a null call (&#8211;) for others. Only 7 errors occurred where there was inconsistency in the genotype assigned to the siblings. The results are summarized in the table to the left.</p>
<p>My conclusion: the genotyping error rate is very low, less than 1% for the Illumina platform used by 23andMe. Even taking null calls into account, this number is still below 1%. My siblings and I shared 99.15% genotype identity in a region where we all share both parental haplotypes. I am very pleased with the accuracy.</p>
<td width="129" height="20" align="right"><span style="font-size: small;"><span><br />
</span></span></td>
<img src="http://www.chromosomechronicles.com/?ak_action=api_record_view&id=412&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://www.chromosomechronicles.com/2010/03/27/analysis-of-23andmes-genotyping-high-accuracy-of-illumina-platform-confirmed-by-comparing-siblings/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>
