<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-7936872944635478061</id><updated>2011-12-17T04:54:50.146-08:00</updated><category term='cancer'/><category term='visualization'/><category term='web-services'/><category term='clinical-trials'/><category term='illumina'/><category term='cbio-portal'/><category term='pathways'/><category term='funding'/><category term='rna-seq'/><category term='broad'/><category term='nhgri'/><category term='science-policy'/><category term='cloud'/><category term='go'/><category term='bioinformatics'/><category term='hubs'/><category term='networks'/><category term='breast-cancer'/><category term='personalized-medicine'/><category term='ovarian'/><category term='rstats'/><category term='python'/><category term='gene-ontology'/><category term='parp'/><category term='amazon'/><category term='yeast'/><category term='sequencing'/><category term='software-engineering'/><category term='disease'/><category term='genomics'/><category term='nih'/><category term='simpledb'/><category term='rnai'/><category term='mercurial'/><category term='tcga'/><category term='y2h'/><title type='text'>BioBits</title><subtitle type='html'>A Blog about bioinformatics, scientific computing and cancer genomics.</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://www.biobits.org/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://www.biobits.org/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>23</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-3183686497918889378</id><published>2011-12-17T04:54:00.000-08:00</published><updated>2011-12-17T04:54:50.153-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='nhgri'/><category scheme='http://www.blogger.com/atom/ns#' term='sequencing'/><category scheme='http://www.blogger.com/atom/ns#' term='cancer'/><title type='text'>The Road to Genomic Based Cancer Care</title><content type='html'>&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/-TInkKVA40gU/TuyKxRAHsiI/AAAAAAAAAVM/J0eIfCuybrs/s1600/genomics.jpg" imageanchor="1" style="clear: right; float: right; margin-bottom: 1em; margin-left: 1em;"&gt;&lt;img border="0" height="132" src="http://3.bp.blogspot.com/-TInkKVA40gU/TuyKxRAHsiI/AAAAAAAAAVM/J0eIfCuybrs/s200/genomics.jpg" width="200" /&gt;&lt;/a&gt;&lt;/div&gt;A blog post on clinical-based sequencing at &lt;a href="http://www.massgenomics.org/"&gt;www.massgenomics.org&lt;/a&gt; (&lt;a href="http://www.massgenomics.org/2011/11/the-great-divide-cancer-genomics-and-clinical-care.html"&gt;The Great Divide: &amp;nbsp;Cancer Genomics and Clinical Care&lt;/a&gt;) recently caught my eye. &amp;nbsp;The author (Dan Koboldt at Washington University in St. Louis) described the efforts of academic cancer centers, such as Johns Hopkins, to screen all incoming patients for “clinically actionionable” sequence mutations. &amp;nbsp;Dan applauded the efforts, but made a compelling case that they did not go far enough. &amp;nbsp;Why not go for whole genome or at least whole-exome?&lt;br /&gt;&lt;br /&gt;Dan’s post was November 23, 2011. &amp;nbsp;Much has happened since then.&lt;br /&gt;&lt;br /&gt;First, Arul Chinnaiyan and colleagues at the University of Michigan published the initial results of their &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/22133722"&gt;Michigan Oncology Sequencing Project (MI-ONCOSEQ) protocol&lt;/a&gt;. &amp;nbsp;The Michigan group tested their protocol on two tumor-derived xenografts, and two patients and performed whole genome sequencing of the tumor, whole exome sequencing of tumor and normal, and RNA-Seq. &amp;nbsp;Turn-around time for all genomic profiling was approximately 3-4 weeks. &amp;nbsp;Genomic results were then interpreted and debated by a multidisciplinary Sequencing Tumor Board (STB), consisting of experts in clinical oncology, pathology, cancer biology, bioethics, bioinformatics and clinical genetics. &amp;nbsp;The STB was specifically charged with interpreting the genomic findings, and determining if the patient could benefit from any current or future clinical trials. &amp;nbsp;If you want to understand how genomic-based medicine could actually be done in the very near future, this is a must read.&lt;br /&gt;&lt;br /&gt;Second, on December 6, the &lt;a href="http://www.nih.gov/news/health/dec2011/nhgri-06.htm"&gt;NHGRI announced $40 million in funding&lt;/a&gt; over the next four years to support “Clinical Sequencing Exploratory Research Projects”. &amp;nbsp;Five research projects were funded, each focusing on the clinical implications of next-generation sequencing. &amp;nbsp;For example, how can hospitals integrate genomic data into medical records, how does genomic data affect the course of treatment, and how do patients process and understand genomic data? &amp;nbsp;Funding will go to Baylor College of Medicine, Brigham and Woman's Hospital, Children's Hospital of Philadelphia, University of North Carolina, and University of Washington, Seattle. &amp;nbsp;In one particularly compelling example, UNC will be performing &lt;a href="http://news.unchealthcare.org/news/2011/december/unc-scientists-funded-to-study-genome-sequencing-in-clinical-settings"&gt;whole exome sequencing on ~750 patients&lt;/a&gt;, primarily focusing on younger cancer patients and patients with a strong family history of cancer.&lt;br /&gt;&lt;br /&gt;Both of these developments are real, not science fiction, and very encouraging. &amp;nbsp;MI-ONCOSEQ is one of the most concrete examples of how next-generation sequencing could be integrated into clinical care and benefit real patients. &amp;nbsp;The plans for the NHGRI-funded projects could also provide a real model for future genomic-based cancer care. &amp;nbsp;When you add these developments to the growing list of new industry initiatives that aim to perform or analyze next-generation cancer genomics data (including &lt;a href="http://www.knome.com/"&gt;Knome&lt;/a&gt;, &lt;a href="http://personalgenome.com/"&gt;Personal Genome Diagnostics&lt;/a&gt;, and the&amp;nbsp;&lt;a href="http://www.kewgroupllc.com/"&gt;KEW Group&lt;/a&gt;), the future of clinical-based cancer genomics appears very promising. &amp;nbsp;This is tempered, however by the reality that much of cancer genomics has yet to be translated into new drugs, new drug combinations and new clinical trials. &amp;nbsp;The great divide in cancer care may therefore not necessarily be sequencing. &amp;nbsp;This will probably happen sooner than we all think. &amp;nbsp;Rather, the great divide is likely to be translating new genomic data into new treatments and new clinical trials, and extracting the tiny bits of a person’s genome that are actually useful and relevant to making them better. &amp;nbsp;That will be much harder, and there is much work to be done. &amp;nbsp;So, get back to work.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Relevant Links:&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Sameek Roychowdhury et al. &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/22133722"&gt;Personalized Oncology Through Integrative High-Throughput Sequencing: A Pilot Study&lt;/a&gt;. Science Translational Medicine, Nov. 30, 2011. &amp;nbsp;First results from the Michigan MI-ONCOSEQ protocol for genome-based cancer care.&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://news.unchealthcare.org/news/2011/december/unc-scientists-funded-to-study-genome-sequencing-in-clinical-settings"&gt;UNC scientists funded to study genome sequencing in clinical settings.&lt;/a&gt;&amp;nbsp;&amp;nbsp;Describes UNC’s plans for clinical sequencing. &amp;nbsp;One of five recipients of the NHGRI Clinical Sequencing Exploratory Research Project.&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://www.nih.gov/news/health/dec2011/nhgri-06.htm"&gt;NHGRI broadens sequencing program focus on inherited diseases, medical applications.&lt;/a&gt;&amp;nbsp;&amp;nbsp;Press release from NHGRI, describing the Clinical Sequencing Exploratory Research Project&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://www.massgenomics.org/2011/11/the-great-divide-cancer-genomics-and-clinical-care.html"&gt;The Great Divide: &amp;nbsp;Cancer Genomics and Clinical Care&lt;/a&gt;. &amp;nbsp;Blog post, making the case for routine whole genome sequencing in cancer care.&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://www.technologyreview.com/biomedicine/37993/"&gt;Making Genomics Routine in Cancer Care&lt;/a&gt;. &amp;nbsp;Profiles the KEW Group (co-founded by &amp;nbsp;Raju Kucherlapati at Harvard Medical School) and their plans to integrate genomics into cancer care across a network of cancer centers.&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-3183686497918889378?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/3183686497918889378/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2011/12/road-to-genomic-based-cancer-care.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/3183686497918889378'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/3183686497918889378'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2011/12/road-to-genomic-based-cancer-care.html' title='The Road to Genomic Based Cancer Care'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/-TInkKVA40gU/TuyKxRAHsiI/AAAAAAAAAVM/J0eIfCuybrs/s72-c/genomics.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-6369296590678842533</id><published>2011-08-31T12:50:00.000-07:00</published><updated>2011-08-31T12:50:28.899-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='clinical-trials'/><category scheme='http://www.blogger.com/atom/ns#' term='tcga'/><category scheme='http://www.blogger.com/atom/ns#' term='ovarian'/><category scheme='http://www.blogger.com/atom/ns#' term='parp'/><category scheme='http://www.blogger.com/atom/ns#' term='cancer'/><title type='text'>A new piece of the PARP Puzzle</title><content type='html'>&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-D_fJfcScN5o/Tl6NhMXCpEI/AAAAAAAAATo/WFwEEtm2gBw/s1600/puzzle.jpg" imageanchor="1" style="clear: right; float: right; margin-bottom: 1em; margin-left: 1em;"&gt;&lt;img border="0" height="200" src="http://4.bp.blogspot.com/-D_fJfcScN5o/Tl6NhMXCpEI/AAAAAAAAATo/WFwEEtm2gBw/s200/puzzle.jpg" width="200" /&gt;&lt;/a&gt;&lt;/div&gt;A recent paper by Gelman et. al. offers compelling new evidence for the effective use of PARP inhibitors in treating ovarian cancer.&amp;nbsp; PARP inhibitors have been previously identified as effective in ovarian cancer patients with germline &lt;i&gt;BRCA1&lt;/i&gt; and &lt;i&gt;BRCA2&lt;/i&gt; mutations.&amp;nbsp; However, only ~16% of ovarian patients harbor a BRCA germline defect.&amp;nbsp; The study by Gelman et. al. therefore set out to identify if patients without germline defects --- possibly with other DNA repair defects --- would respond to the PARP inhibitor Olaparib.&lt;br /&gt;&lt;br /&gt;The study included 91 patients (65 with ovarian cancer and 26 with breast cancer), and found that a portion of ovarian cancer patients without germline BRCA defects did indeed respond.&amp;nbsp; Specifcally: &lt;br /&gt;&lt;blockquote&gt;"confirmed objective responses were seen in seven (41%; 95% CI 22-64) of 17 patients with BRCA1 or BRCA2 mutations and 11 (24%; 14-38) of 46 without mutations."&lt;/blockquote&gt;This is quite an exciting finding -- it means that PARP inhibitors could be used to treat a much larger subset of ovarian cancer patients -- and not just those with BRCA germline defects.&amp;nbsp; Of course, these are still early clinical findings, and further clinical trials will be needed, and are no doubt already underway.&lt;br /&gt;&lt;br /&gt;This study also dovetails well with the recent &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/21720365"&gt;TCGA ovarian findings&lt;/a&gt;, indicating that BRCA or Homologous Recombination (HR) defects may be present in up to 50% of high-grade serous ovarian patients.&amp;nbsp; What exactly do the non-BRCA responders have in common?&amp;nbsp; Do they share a common genomic event, such as &lt;i&gt;BRCA1&lt;/i&gt; hypermethylation, a somatic BRCA mutation, or some other as yet unknown genomic event?&amp;nbsp; And, can the HR genomic events identified by TCGA be used to stratify responders and non-responders?&lt;br /&gt;&lt;br /&gt;As stated to by Gelman, et. al, repeat biopsies of many patients in this study were taken, and whole transciptome sequencing is already underway.&amp;nbsp; We are therefore one step closer to answering many of these questions and discovering genomic biomarkers predictive of PARP response.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;References:&lt;/b&gt; &lt;br /&gt;&lt;ul&gt;&lt;li&gt;Gelmon KA, Tischkowitz M, Mackay H, et. al.&amp;nbsp; &lt;b&gt;Olaparib in patients with recurrent high-grade serous or poorly differentiated ovarian carcinoma or triple-negative breast cancer: a phase 2, multicentre, open-label, non-randomised study.&amp;nbsp; &lt;/b&gt;Lancet Oncol. 2011 Sep;12(9):852-61. Epub 2011 Aug 19.&amp;nbsp; [&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/21862407"&gt;Abstract&lt;/a&gt;].&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Cancer Genome Atlas Research Network.&amp;nbsp;&lt;b&gt; Integrated genomic analyses of ovarian carcinoma. &lt;/b&gt;&lt;i&gt;Nature&lt;/i&gt;. 2011 Jun 29;474(7353):609-15.&amp;nbsp; [&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/21720365"&gt;Abstract&lt;/a&gt;].&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-6369296590678842533?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/6369296590678842533/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2011/08/new-piece-of-parp-puzzle.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/6369296590678842533'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/6369296590678842533'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2011/08/new-piece-of-parp-puzzle.html' title='A new piece of the PARP Puzzle'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/-D_fJfcScN5o/Tl6NhMXCpEI/AAAAAAAAATo/WFwEEtm2gBw/s72-c/puzzle.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-3288017405891465563</id><published>2011-08-26T10:22:00.000-07:00</published><updated>2011-08-26T10:22:44.140-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mercurial'/><category scheme='http://www.blogger.com/atom/ns#' term='software-engineering'/><title type='text'>Branching with Mercurial:  A (Sandbox) Tutorial</title><content type='html'>&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/-ON_y9g0TU8c/TlfHyeA5ugI/AAAAAAAAATg/4qoixESDufI/s1600/sandbox.png" imageanchor="1" style="clear: right; float: right; margin-bottom: 1em; margin-left: 1em;"&gt;&lt;img border="0" height="132" src="http://2.bp.blogspot.com/-ON_y9g0TU8c/TlfHyeA5ugI/AAAAAAAAATg/4qoixESDufI/s200/sandbox.png" width="200" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;a href="http://mercurial.selenic.com/"&gt;Mercurial&lt;/a&gt; is a great distributed source control management tool.&amp;nbsp; According to many, one of its great strengths lies in its ability to do branching.&lt;br /&gt;&lt;br /&gt;There are a number of good posts on the subject:&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://hgbook.red-bean.com/read/managing-releases-and-branchy-development.html"&gt;Managing releases and branchy development&lt;/a&gt;, from Mercurial: The Definitive Guide, by Bryan O'Sullivan; &lt;/li&gt;&lt;li&gt;&lt;a href="http://stevelosh.com/blog/2010/05/mercurial-workflows-stable-default/"&gt;Mercurial Workflows:  Stable &amp;amp; Default&lt;/a&gt;,&amp;nbsp; by Steve Losh;&lt;/li&gt;&lt;li&gt;&lt;a href="http://mercurial.selenic.com/wiki/Branch"&gt;Branching&lt;/a&gt;, from the Mercurial web site.&lt;/li&gt;&lt;/ul&gt;If you are like me, however, you probably don't want your first experimental steps in branching done on a real mercurial repository.&amp;nbsp; Much better to play around in a safe sandbox, where you can try things out, and freely experiment.&lt;br /&gt;&lt;br /&gt;To that end, I have put together a short sandbox tutorial that shows how to do branching with a local mercurial repository that you can create from scratch.&lt;br /&gt;&lt;br /&gt;In the tutorial, I follow Steve Losh's recommended set-up of having just two branches:&amp;nbsp; &lt;b&gt;&lt;i&gt;default&lt;/i&gt;&lt;/b&gt;, which is the branch for new development;&amp;nbsp; and &lt;i&gt;&lt;b&gt;stable&lt;/b&gt;&lt;/i&gt;, which is the branch for stable code, which requires occasional, but high-priority bug fixes. &lt;br /&gt;&lt;br /&gt;&lt;span style="font-size: large;"&gt;&lt;b&gt;Getting Started:&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Create a new repository from scratch:&lt;br /&gt;&lt;br /&gt;&lt;div style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;&amp;gt;mkdir sandbox&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;&amp;gt;cd sandbox&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;&amp;gt;hg init&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;br /&gt;Create a new file:&amp;nbsp; hello_world.txt.&amp;nbsp; Its contents:&lt;br /&gt;&lt;br /&gt;&lt;div style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;hello1&lt;br /&gt;hello2&lt;br /&gt;hello3&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;br /&gt;Add and commit the file: &lt;br /&gt;&lt;br /&gt;&lt;div style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;span style="font-size: x-small;"&gt;&lt;b&gt;&lt;span style="font-size: small;"&gt;&amp;gt;hg add hello_world.txt&lt;br /&gt;&amp;gt;hg commit -m "Initial commit" hello_world.txt &lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;span style="font-size: large;"&gt;&lt;b&gt;Creating a stable branch: &lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;A few days go by, and our code base is ready for a major release.&amp;nbsp; We want to keep this code base "stable", so that we can add quick bug fixes later.&amp;nbsp; So, we branch and commit:&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size: small;"&gt;&lt;b style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&amp;gt;hg branch stable&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;div style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;marked working directory as branch stable &lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;&amp;gt;hg commit -m "Created stable branch"&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;br /&gt;To see all the available branches, and verify our change went into effect:&lt;br /&gt;&lt;br /&gt;&lt;div style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;&amp;gt;hg branches&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;span style="font-size: x-small;"&gt;&lt;b&gt;&lt;span style="font-size: small;"&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;stable&amp;nbsp;&amp;nbsp;&amp;nbsp; 1:1c25f1208528&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;span style="font-size: small;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-size: x-small;"&gt;&lt;b&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;span style="font-size: small;"&gt;default&amp;nbsp;&amp;nbsp; 0:3afd60d8ec4a&lt;/span&gt; &lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;/div&gt;&lt;span style="font-size: large;"&gt;&lt;b&gt;Applying a bug fix to the stable branch, and merging with default: &lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;A few more days go by.&amp;nbsp; Development continues on the default branch.&amp;nbsp; We discover a major bug, and have to push it out quickly.&amp;nbsp; We first get the stable branch code:&lt;br /&gt;&lt;div style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;&amp;gt; hg update -C stable&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;span style="font-size: x-small;"&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;1 files updated, 0 files merged, 0 files removed, 0 files unresolved&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;and, again verify what branch we are working on:&lt;br /&gt;&lt;div style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;br /&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;&amp;gt; hg branch&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;span style="font-size: small;"&gt;&lt;b style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;stable&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;We now make our major bug fix.&amp;nbsp; We modify hello_world.txt as follows:&lt;br /&gt;&lt;br /&gt;&lt;div style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;hello1: major bug fix&lt;br /&gt;hello2&lt;br /&gt;hello3&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;br /&gt;We commit the change:&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&amp;gt;hg commit -m "Fixed bug" hello_world.txt&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt; &lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;This is an important bug fix, so we need to merge the change in stable back into default.&lt;br /&gt;&lt;br /&gt;First, we switch to the default branch and do a complete update:&lt;br /&gt;&lt;br /&gt;&lt;div style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;&amp;gt;hg update -C default&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt; 1 files updated, 0 files merged, 0 files removed, 0 files unresolved&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;Now merge with the stable branch:&lt;br /&gt;&lt;br /&gt;&lt;div style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;&amp;gt;hg merge stable&lt;br /&gt;1 files updated, 0 files merged, 0 files removed, 0 files unresolved&lt;br /&gt;(branch merge, don't forget to commit)&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;br /&gt;And, commit the changes:&lt;br /&gt;&lt;br /&gt;&lt;div style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;hg commit -m "Merging with stable branch"&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;br /&gt;The bug fix has now been applied to both stable and default, and we are done.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size: large;"&gt;&lt;b&gt;Major Releases:&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;There is only one wrinkle left to work out.&amp;nbsp; A few more weeks go by...&amp;nbsp; The default branch is now ready for release.&amp;nbsp; We now want the default branch to be the new stable branch.&amp;nbsp; What to do?&amp;nbsp; We check out the stable branch and merge in all the changes from the default branch.&lt;br /&gt;&lt;br /&gt;First, update to the stable branch:&lt;br /&gt;&lt;br /&gt;&lt;div style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;&amp;gt;hg update -C stable&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;br /&gt;Then, merge everything from default into stable:&lt;br /&gt;&lt;br /&gt;&lt;div style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;/div&gt;&lt;div style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;&amp;gt;hg merge default &lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;br /&gt;and, commit the changes:&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size: small;"&gt;&lt;b style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;hg commit -m "Merging in default" &lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Repeat as necessary.&amp;nbsp; Hopefully, the sandbox set-up allows you enough freedom to play around before working on a real repository.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-3288017405891465563?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/3288017405891465563/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2011/08/branching-with-mercurial-sandbox.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/3288017405891465563'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/3288017405891465563'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2011/08/branching-with-mercurial-sandbox.html' title='Branching with Mercurial:  A (Sandbox) Tutorial'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/-ON_y9g0TU8c/TlfHyeA5ugI/AAAAAAAAATg/4qoixESDufI/s72-c/sandbox.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-4396340273898002515</id><published>2011-08-26T06:35:00.000-07:00</published><updated>2011-08-26T06:46:42.829-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='go'/><category scheme='http://www.blogger.com/atom/ns#' term='gene-ontology'/><title type='text'>The Unsung Heroes of GO</title><content type='html'>&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/-_ampLu9IEvU/Tleg9WONvCI/AAAAAAAAATc/eeKzXQ-T_WQ/s1600/go.jpg" imageanchor="1" style="clear: right; float: right; margin-bottom: 1em; margin-left: 1em;"&gt;&lt;img border="0" height="238" src="http://1.bp.blogspot.com/-_ampLu9IEvU/Tleg9WONvCI/AAAAAAAAATc/eeKzXQ-T_WQ/s320/go.jpg" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;The &lt;a href="http://www.geneontology.org/"&gt;Gene Ontology&lt;/a&gt; (GO) is so pervasive in bioinformatics, that I imagine most of us simply take it for granted.&amp;nbsp; However, it's instructive to realize that GO has only been around since 1999, and far from being a static ontology reflecting unchanging biology, GO has continually evolved since its inception to keep pace with new biological research.&amp;nbsp; &lt;br /&gt;&lt;br /&gt;A recent paper by &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/21819553"&gt;Leonelli et. al.&lt;/a&gt; drives this point home, describing how the GO ontology has evolved over time.&amp;nbsp; The paper describes multiple "ontology shifts", including dealing with biological anomalies, expanding biological scope, and mirroring scientific advance --- all of which need to be integrated and managed in order to keep GO up to date.&amp;nbsp; This is serious, hard work, requiring outreach to many in the biological community.&amp;nbsp; Imagine getting a roomful of biologists to agree on the meaning of "gametogenesis", and you can start to see where the real work of ontology development takes place.&lt;br /&gt;&lt;br /&gt;So, the next time you run GO enrichment analysis, or any of the other dozens of &lt;a href="http://www.geneontology.org/GO.tools.shtml"&gt;GO tools&lt;/a&gt;, think of all the hard work involved and give a little thanks to the unsung heroes of GO. &amp;nbsp; &lt;br /&gt;&lt;br /&gt;&lt;b&gt;References: &lt;/b&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&amp;nbsp;Leonelli S, Diehl AD, Christie KR, Harris MA, Lomax J., &lt;b&gt;How the Gene Ontology Evolves&lt;/b&gt;. &lt;i&gt;BMC Bioinformatics&lt;/i&gt;. 2011 Aug 5;12(1):325. [&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/21819553"&gt;Abstract&lt;/a&gt;]&lt;span id="goog_2078567202"&gt;&lt;/span&gt;&lt;span id="goog_2078567203"&gt;&lt;/span&gt; [&lt;a href="http://www.biomedcentral.com/1471-2105/12/325"&gt;Full Text&lt;/a&gt;].&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-4396340273898002515?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/4396340273898002515/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2011/08/unsung-heroes-of-go.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/4396340273898002515'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/4396340273898002515'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2011/08/unsung-heroes-of-go.html' title='The Unsung Heroes of GO'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/-_ampLu9IEvU/Tleg9WONvCI/AAAAAAAAATc/eeKzXQ-T_WQ/s72-c/go.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-1029269160017356436</id><published>2011-08-23T04:08:00.000-07:00</published><updated>2011-08-26T06:46:56.964-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='illumina'/><category scheme='http://www.blogger.com/atom/ns#' term='visualization'/><category scheme='http://www.blogger.com/atom/ns#' term='rna-seq'/><title type='text'>ExpressionPlot:  New web-based tool for analyzing RNA-Seq Data</title><content type='html'>&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/-hG_3AIUWNRg/TlOH9-ArhVI/AAAAAAAAATY/61Dv85Zot3M/s1600/exp.png" imageanchor="1" style="clear: right; float: right; margin-bottom: 1em; margin-left: 1em;"&gt;&lt;img border="0" height="112" src="http://3.bp.blogspot.com/-hG_3AIUWNRg/TlOH9-ArhVI/AAAAAAAAATY/61Dv85Zot3M/s320/exp.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;Genome Biology has &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed?term=21797991[uid]"&gt;new paper on ExpressionPlot&lt;/a&gt;:&amp;nbsp; a web-based tool for analyzing RNA-Seq data.&lt;br /&gt;&lt;br /&gt;ExpressionPlot includes a back-end processing pipeline for importing and processing raw RNA-Seq and Affymetrix microarray based data, plus a front-end web user interface for analyzing and visualizing the results.&lt;br /&gt;&lt;br /&gt;You can check out the web interface via the &lt;a href="http://99.9.208.223/cgi-bin/expressionplot/home.pl"&gt;prototype server&lt;/a&gt;, which is pre-loaded with a number of RNA-Seq and Affymetrix data sets.&amp;nbsp; More details and installation instructions are available at:&amp;nbsp; &lt;a href="http://www.expressionplot.com/"&gt;http://www.expressionplot.com/&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: left;"&gt;Notably, the authors &lt;a href="http://blog.expressionplot.com/2011/03/15/idea-challenge-2011-illuminas-data-excellence-award/#more-34"&gt;submitted ExpressionPlot to iDEA&lt;/a&gt; (Illumina’s Data Excellence Award) Challenge&amp;nbsp; for "Most Creative Visualization", but the award was eventually awarded to &lt;a href="http://genomeview.org/content/genomeview-wins-most-creative-visualization-award-illumina-idea-challenge"&gt;GenomeView.&lt;/a&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;&lt;b&gt;References:&lt;/b&gt;&lt;/div&gt;&lt;ul&gt;&lt;li&gt;&amp;nbsp;Friedman BA, Maniatis T. &lt;b&gt;ExpressionPlot: A web-based framework for analysis of RNA-Seq and microarray gene expression data.&lt;/b&gt; &lt;i&gt;Genome Biol.&lt;/i&gt; 2011 Jul 28;12(7):R69. [&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed?term=21797991[uid]"&gt;Abstract&lt;/a&gt;] [&lt;a href="http://genomebiology.com/2011/12/7/R69/abstract"&gt;Full Text&lt;/a&gt;].&lt;b style="display: block; text-align: center;"&gt; &lt;/b&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-1029269160017356436?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/1029269160017356436/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2011/08/expressionplot-new-web-based-tool-for.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/1029269160017356436'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/1029269160017356436'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2011/08/expressionplot-new-web-based-tool-for.html' title='ExpressionPlot:  New web-based tool for analyzing RNA-Seq Data'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/-hG_3AIUWNRg/TlOH9-ArhVI/AAAAAAAAATY/61Dv85Zot3M/s72-c/exp.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-9126443281675767203</id><published>2011-08-17T11:50:00.000-07:00</published><updated>2011-08-26T06:47:48.963-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='broad'/><category scheme='http://www.blogger.com/atom/ns#' term='rnai'/><category scheme='http://www.blogger.com/atom/ns#' term='cancer'/><title type='text'>Project Achilles and the Search for Essential Genes in Cancer</title><content type='html'>&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-WM496AOgRF8/TkwLLVVXODI/AAAAAAAAATA/JFB3QLfcqeM/s1600/achilles-arrow-heel.jpeg" imageanchor="1" style="clear: right; float: right; margin-bottom: 1em; margin-left: 1em;"&gt;&lt;img border="0" height="188" src="http://4.bp.blogspot.com/-WM496AOgRF8/TkwLLVVXODI/AAAAAAAAATA/JFB3QLfcqeM/s200/achilles-arrow-heel.jpeg" width="200" /&gt;&lt;/a&gt;&lt;/div&gt;A recent PNAS paper by Cheung et. al. reports the latest findings from the Broad Institute's Project Achilles.&amp;nbsp; The goal of the project is to use high-throughput &lt;a href="http://www.broadinstitute.org/blog/five-questions-david-root-rna-interference-explained"&gt;RNAi screening&lt;/a&gt; to systematically identify genes essential for proliferation and survival in multiple cancer cell lines.&lt;br /&gt;&lt;br /&gt;There are (obviously) many details involved in the RNAi screen developed for Project Achilles, and you find a complete description in the &lt;a href="http://www.pnas.org/content/105/51/20380.short"&gt;first PNAS paper&lt;/a&gt;.&amp;nbsp; However, the simplest explanation is that the screen takes one cell line at a time, systematically knocks down one gene at a time (via RNAi), and then measures what effect (if any) this specific knock-down has on cell-proliferation.&amp;nbsp; If a knock-down results in a dramatic decrease in cell-proliferation, one can infer that the target gene is functionally important or essential to the cancer phenotype.&amp;nbsp; Significantly, one can then also combine the complete set of functional data with primary tumor data, e.g. mutation or copy number data, to hone in on those primary genomic alterations most likely to drive tumorigenesis.&lt;br /&gt;&lt;br /&gt;The PNAS paper reports findings for 102 human cancer cell lines, each of which was screened with 54,020 shRNAs, targeting a total of 11,195 genes.&amp;nbsp; Complete data from the project is available via the Broad's &lt;a href="http://www.broadinstitute.org/IGP/home"&gt;Integrative Genomics Portal&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Other than the data sets themselves, which will be hugely important for helping to interpret future cancer genomic studies, the most significant findings of the paper relate to ovarian cancer.&amp;nbsp; Specifically, by combining RNAi data with primary genomic data from TCGA, the group made the novel observation that &lt;a href="http://tinyurl.com/3c2kxh7"&gt;PAX8 is both frequently amplified in serous ovarian cancer&lt;/a&gt;, and essential to proliferation in multiple ovarian cancer cell cells.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;References:&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Cheung, et. al, &lt;b&gt;Systematic investigation of genetic vulnerabilities across cancer cell lines reveals lineage-specific dependencies in ovarian cancer&lt;/b&gt;. (&lt;i&gt;PNAS&lt;/i&gt;, &lt;span class="slug-metadata-note ahead-of-print"&gt;&lt;span class="slug-ahead-of-print-date"&gt;July 11, 2011) &lt;/span&gt;&lt;/span&gt;&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/21746896"&gt;[Abstract].&lt;/a&gt;&amp;nbsp; &lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;Background on RNAi: &amp;nbsp; &lt;a href="http://www.broadinstitute.org/blog/five-questions-david-root-rna-interference-explained"&gt;Five questions for David Root: RNA Interference explained.&lt;/a&gt; &lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-9126443281675767203?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/9126443281675767203/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2011/08/project-achilles-and-search-for.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/9126443281675767203'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/9126443281675767203'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2011/08/project-achilles-and-search-for.html' title='Project Achilles and the Search for Essential Genes in Cancer'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/-WM496AOgRF8/TkwLLVVXODI/AAAAAAAAATA/JFB3QLfcqeM/s72-c/achilles-arrow-heel.jpeg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-8832620133383137957</id><published>2011-07-14T17:38:00.000-07:00</published><updated>2011-07-14T17:38:29.296-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='bioinformatics'/><category scheme='http://www.blogger.com/atom/ns#' term='personalized-medicine'/><category scheme='http://www.blogger.com/atom/ns#' term='genomics'/><title type='text'>Bioinformatics Challenges for Pesonalized Medicine</title><content type='html'>&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/-SEA9SUa505M/Th-IfpB4OhI/AAAAAAAAAS8/jrnA60qVGpY/s1600/personalized.png" imageanchor="1" style="clear: right; float: right; margin-bottom: 1em; margin-left: 1em;"&gt;&lt;img border="0" height="175" src="http://2.bp.blogspot.com/-SEA9SUa505M/Th-IfpB4OhI/AAAAAAAAAS8/jrnA60qVGpY/s200/personalized.png" width="200" /&gt;&lt;/a&gt;&lt;/div&gt;A recent review by &lt;a href="http://www.citeulike.org/user/cerami/article/9318168"&gt;Fernald et. al. &lt;/a&gt;describes key bioinformatics challenges for personalized medicine.&amp;nbsp; The key challenges break down into four categories, and in the discussion below, I have recast these categories as questions:&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;How can we better process large-scale genomic data? &lt;/b&gt;&amp;nbsp; Next generation sequencing technology may be one of the most profound enablers of personalized medicine, but there remain significant challenges in processing read data.&amp;nbsp; Challenges include accurate mapping and alignment of short-read sequences; identification of indels, copy number variants and structural variants;&amp;nbsp; and improved quality control metrics to adjust with the inherent errors rates associated with next-gen sequencing technology.&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;How can we better predict the functional effect of specific genomic variations?&amp;nbsp; &lt;/b&gt;Key challenges include developing better computational methods to predict the functional consequences of genomic variants, as well as better methods for prioritizing gene candidates for experimental validation.&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;How can we better translate genomic insights into disease-specific insights?&amp;nbsp; &lt;/b&gt;Key challenges include use of genome wide association study (GWAS) approaches to identify patient response to drugs, integrating multiple genomic data types, and integrating biological pathway data.&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;How do we get all this to real patients?&amp;nbsp; &lt;/b&gt;Here, the authors note that many physicians are currently unprepared to incorporate personal genomic information into regular clinical practice.&amp;nbsp; However, they also not some significant advances, in particular the use of SNP testing to determine dosage of the anti-coagulant drug &lt;a href="http://en.wikipedia.org/wiki/Warfarin"&gt;warfarin&lt;/a&gt;.&lt;/li&gt;&lt;/ul&gt;The road to personalized medicine is certainly fraught with challenges.&amp;nbsp; We may soon be able to economically sequence the full genome of every patient, but bioinformatics has a long way to go to make full sense of this data.&lt;br /&gt;&lt;br /&gt;One minor quibble with this paper if that it pays scant attention to cancer genomics, one of the few areas in which we are already seeing and are very likely to see additional examples in personalized medicine.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-8832620133383137957?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/8832620133383137957/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2011/07/bioinformatics-challenges-for.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/8832620133383137957'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/8832620133383137957'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2011/07/bioinformatics-challenges-for.html' title='Bioinformatics Challenges for Pesonalized Medicine'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/-SEA9SUa505M/Th-IfpB4OhI/AAAAAAAAAS8/jrnA60qVGpY/s72-c/personalized.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-653472655997667308</id><published>2011-07-05T07:47:00.000-07:00</published><updated>2011-07-05T07:47:29.199-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='rna-seq'/><category scheme='http://www.blogger.com/atom/ns#' term='genomics'/><category scheme='http://www.blogger.com/atom/ns#' term='cancer'/><title type='text'>Chromothripsis</title><content type='html'>&lt;a href="http://1.bp.blogspot.com/-5rFD_B_L-SM/ThMbsStWMEI/AAAAAAAAASk/np8RkGxbYE8/s1600/abstract.jpg" imageanchor="1" style="clear: right; float: right; margin-bottom: 1em; margin-left: 1em;"&gt;&lt;img border="0" height="200" src="http://1.bp.blogspot.com/-5rFD_B_L-SM/ThMbsStWMEI/AAAAAAAAASk/np8RkGxbYE8/s200/abstract.jpg" width="200" /&gt;&lt;/a&gt; This is a fascinating &lt;a href="http://www.citeulike.org/user/cerami/article/8521741"&gt;Cell paper from the Sanger Institute&lt;/a&gt;, published in January 2011.&amp;nbsp; As depicted in the graphical abstract (shown at right), the authors find evidence for a new type of genomic rearrangement, referred to as&amp;nbsp; &lt;b&gt;chromothripsis:&amp;nbsp;&lt;/b&gt; (Greek, chromos for chromosome;&amp;nbsp; thripsis, shattering into pieces).&amp;nbsp;&lt;br /&gt;&lt;br /&gt;In chromothripsis, a single catastrophic event causes the shattering of one of more chromosomes.&amp;nbsp; Cells then attempt to repair the shattered chromosome, resulting in 10s to 100s of stitched DNA fragments (many of which are detected up by paired-end RNA sequencing).&amp;nbsp; That cells are able to survive such a catostrophic event would appear very unlikely, but a few cells clearly do, and the authors provide evidence that these cells not only survive, but have some type of selective advantage that promotes cancer development.&lt;br /&gt;&lt;br /&gt;The current model of cancer development assumes a model of "gradualism" whereby cells acquire mutations through a series of progressive steps.&amp;nbsp; In the case of chromothripsis, however, we have a new model, whereby cells acquire multiple genomic alterations in a single catastrophic step.&amp;nbsp; This is quite new.&amp;nbsp; However, according to the authors, chromothripsis may not be all that prevalent.&amp;nbsp; For example, they find evidence of chromothripsis in 2%-3% of cells lines and primary tumor samples, but a much larger 25% of bone cancers.&lt;br /&gt;&lt;br /&gt;The paper describes the major steps in first identifying chromothripsis.&amp;nbsp; This includes an initial paired-end RNA-seq analysis of a patient with Chronic Lympocytic Leukemia (CLL), and the identification of a strange pattern of rearrangements.&amp;nbsp; The observations included multiple genomic rearrangements, all confined to a single chromosomal region, chromosome 4q, and alteration of copy number states between 1 and 2 only.&amp;nbsp;&lt;br /&gt;&lt;br /&gt;The authors next searched for a similar pattern of genomic rearrangement in a large panel of cell lines and primary tumors, and selected a few cell lines for further RNA-seq analysis.&amp;nbsp; The paper also includes a Monte Carlo simulation providing simulated evidence that the large-scale rearrangements observed would be very unlikely to occur under the current model of "gradual" rearrangements.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-653472655997667308?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/653472655997667308/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2011/07/chromothripsis.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/653472655997667308'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/653472655997667308'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2011/07/chromothripsis.html' title='Chromothripsis'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/-5rFD_B_L-SM/ThMbsStWMEI/AAAAAAAAASk/np8RkGxbYE8/s72-c/abstract.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-9117328428063617127</id><published>2011-07-01T06:24:00.000-07:00</published><updated>2011-07-01T06:27:55.923-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='rna-seq'/><category scheme='http://www.blogger.com/atom/ns#' term='breast-cancer'/><category scheme='http://www.blogger.com/atom/ns#' term='genomics'/><category scheme='http://www.blogger.com/atom/ns#' term='cancer'/><title type='text'>Using RNA-Seq to Identify Fusion Genes in Breast Cancer</title><content type='html'>The Institute for Molecular Medicine Finland has recently published a paper in Genome Biology on using RNA-Seq to identify &lt;a href="http://www.citeulike.org/user/cerami/article/8685462"&gt;fusion genes in breast cancer&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;As noted by the authors, fusion genes have been identified as important oncogenic alterations in leukemias, lyphompas, sarcomas, and prostate cancer.&amp;nbsp; The role of fusion genes in breast cancer has not, however, been fully explored.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://4.bp.blogspot.com/-C_ZlBDg4B0U/Tg3HQM_mPCI/AAAAAAAAASg/WKragufDs3Y/s1600/paired.jpg" imageanchor="1" style="clear: right; float: right; margin-bottom: 1em; margin-left: 1em;"&gt;&lt;img border="0" height="129" src="http://4.bp.blogspot.com/-C_ZlBDg4B0U/Tg3HQM_mPCI/AAAAAAAAASg/WKragufDs3Y/s320/paired.jpg" width="320" /&gt;&lt;/a&gt;Paired-end RNA sequencing is a relatively new and useful tool in systematically identifying fusion genes in cell lines and primary cancer samples.&amp;nbsp; In this technique, one only sequences expressed genes, and one sequences  36 to 100 basepairs of both ends of 200 to 500 base pair long  molecules.&amp;nbsp; One then maps each end to the genome.&amp;nbsp; If one end maps to  one gene, and the other end maps to another gene, and these genes are a  sufficient distance apart (e.g. they are not directly adjacent), and  you have multiple reads to support this, you have evidence of a gene  fusion (see figure above).&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://4.bp.blogspot.com/-f4TkgI6UzzM/Tg3FeySIaHI/AAAAAAAAASc/0KLVvRSh9bM/s1600/rna_seq_breast.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="180" src="http://4.bp.blogspot.com/-f4TkgI6UzzM/Tg3FeySIaHI/AAAAAAAAASc/0KLVvRSh9bM/s200/rna_seq_breast.jpg" width="200" /&gt;&lt;/a&gt;The Finland study focused on well-known breast cancer cell lines, including MCF-7, BT-474, and KPL-4.&amp;nbsp; Each cell line was analyzed via RNA-Seq, reads were mapped to the genome, putative novel gene fusions were identified and many gene fusions were then validated via RT-PCR.&amp;nbsp; One novel gene fusion, involving &lt;i&gt;VAPB-IKZF3&lt;/i&gt; (highlighted at left) was also knocked down via RNA interference, resulting in a direct impact on cell growth.&lt;br /&gt;&lt;br /&gt;The next step, of course will be to determine if these novel fusion genes exist in primary breast tumors, and with what frequency.&amp;nbsp; But, this paper provides an interesting insight into many things to come, and we are very likely to soon see comprehensive search for gene fusions via RNA-seq added to many large-scale projects, including the &lt;a href="http://cancergenome.nih.gov/"&gt;TCGA&lt;/a&gt;.&lt;br /&gt;&amp;nbsp;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-9117328428063617127?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/9117328428063617127/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2011/07/using-rna-seq-to-identify-fusion-genes.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/9117328428063617127'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/9117328428063617127'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2011/07/using-rna-seq-to-identify-fusion-genes.html' title='Using RNA-Seq to Identify Fusion Genes in Breast Cancer'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/-C_ZlBDg4B0U/Tg3HQM_mPCI/AAAAAAAAASg/WKragufDs3Y/s72-c/paired.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-2005633580597044540</id><published>2011-06-30T10:35:00.000-07:00</published><updated>2011-06-30T10:38:57.992-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='bioinformatics'/><title type='text'>g:Profiler:  2011 Update</title><content type='html'>&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/-vQA20nEg43U/Tgyv0oVEb5I/AAAAAAAAASY/rGxfzRS8lJo/s1600/header.png" imageanchor="1" style="clear: right; float: right; margin-bottom: 1em; margin-left: 1em;"&gt;&lt;img border="0" height="50" src="http://2.bp.blogspot.com/-vQA20nEg43U/Tgyv0oVEb5I/AAAAAAAAASY/rGxfzRS8lJo/s200/header.png" width="200" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;a href="http://biit.cs.ut.ee/gprofiler/"&gt;g:Profiler&lt;/a&gt; is a web-based tool for analyzing lists of genes.&amp;nbsp; A recent &lt;a href="http://www.citeulike.org/user/cerami/article/9392410"&gt;paper&lt;/a&gt; in Nucleic Acids Research describes its 2011 update.&amp;nbsp; It's always great to see software updates in bioinformatics, as many projects lose funding or lose key people, or both, and never really progress further.&amp;nbsp; Not so for g:Profiler, which has added a number of new features and data sets.&lt;br /&gt;&lt;br /&gt;The core feature of g:Profiler remains its g:GOSt Gene Group Functional Profiling feature.&amp;nbsp; This feature enables you to upload a set of genes, e.g. differentially expressed genes from a microarray study, and identify enriched gene sets (determined via a hypergeometic test) across multiple resources, including Gene Ontology, KEGG or Reactome pathways, etc.&lt;br /&gt;&lt;br /&gt;The 2011 update also includes an interesting network enrichment feature, which determines whether your input gene list is enriched for interacting genes, providing evidence that your input gene list contains specific network modules.&amp;nbsp; The network enrichment test is novel in that it identifies "core" genes, defined as those genes in the input list which interact with each other v. "neighborhood" genes, defined as all partners of core genes, which are not in the original input list.&amp;nbsp; A tally of all interacting genes in the input list is then compared to a tally of all interacting genes in the neighborhood, and a hypergeometric test is then used to determine statistical enrichment.&lt;br /&gt;&lt;br /&gt;g:Profiler also includes a very useful ID conversion service, a programmatic web service, and an R package.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-2005633580597044540?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/2005633580597044540/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2011/06/gprofiler.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/2005633580597044540'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/2005633580597044540'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2011/06/gprofiler.html' title='g:Profiler:  2011 Update'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/-vQA20nEg43U/Tgyv0oVEb5I/AAAAAAAAASY/rGxfzRS8lJo/s72-c/header.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-3454154727413639687</id><published>2011-06-29T06:18:00.000-07:00</published><updated>2011-06-29T06:23:23.499-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='genomics'/><category scheme='http://www.blogger.com/atom/ns#' term='cancer'/><category scheme='http://www.blogger.com/atom/ns#' term='pathways'/><title type='text'>PathScan:  New Gene Set Analysis Method from Wash U</title><content type='html'>&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/-FDKqwGA-ptw/Tgsc4ZJNlBI/AAAAAAAAASU/uSoGAXLr3n8/s1600/washu" imageanchor="1" style="clear: right; float: right; margin-bottom: 1em; margin-left: 1em;"&gt;&lt;img border="0" height="63" src="http://3.bp.blogspot.com/-FDKqwGA-ptw/Tgsc4ZJNlBI/AAAAAAAAASU/uSoGAXLr3n8/s200/washu" width="200" /&gt;&lt;/a&gt;&lt;/div&gt;Wash U. has recently published a new method, &lt;a href="http://www.citeulike.org/user/cerami/article/9172966"&gt;PathScan&lt;/a&gt; for determining the mutational significance of gene sets in cancer.&lt;br /&gt;&lt;br /&gt;The method belongs to the broad category of gene set analysis methods in cancer genomics, whereby one takes observed mutations in cancer and compares these mutations against predefined gene sets, as defined by, e.g.&amp;nbsp; &lt;a href="http://www.genome.jp/kegg/"&gt;KEGG&lt;/a&gt; or &lt;a href="http://www.broadinstitute.org/gsea/msigdb/index.jsp"&gt;MSigDB&lt;/a&gt;.&amp;nbsp; One then determines whether each predefined gene set is enriched for mutations.&lt;br /&gt;&lt;br /&gt;In contrast to other gene set analysis methods, PathScan has two distinct features:&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;First, it takes into account the variability in gene length.&amp;nbsp; This is biologically grounded in the fact that larger genes contain more mutations under the null hypothesis of random mutations.&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Second, PathScan does not use a "tally" based or "pool-based" technique.&amp;nbsp; In these techniques, one tallies all the mutations within a gene set across all samples, and performs one statistical test.&amp;nbsp; Rather PathScan performs a statistical test on each sample individually, and then integrates the results of each test into an overall p-value.&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;The authors make a statistical case for the second feature, but not necessarily a biological one.&amp;nbsp; And, here I wish the authors went into more detail about the biological basis of their approach.&amp;nbsp; For example, if one performs a statistical test on a single sample, and a pathway is identified as significant, this implies that multiple genes within the pathway are mutated in this one sample.&lt;br /&gt;&lt;br /&gt;By this logic, PathScan focuses of identifying a specific class of pathway alterations --- for example, a single gene mutation is not enough to provide tumorigenic advantage, but 2 or 3 mutations are.&amp;nbsp; By focusing on this class of alterations, PathScan can easily miss other types of alterations, e.g. where one dominant mutation is enough to cause tumorigenic advantage, or where 2 or 3 low-frequency, but mutually exclusive mutations predominate, any one of which can result it tumorigenic advantage.&lt;br /&gt;&lt;br /&gt;A more thorough description of the classes of pathway alterations that would be identified and potentially missed by PathScan would definitely clarify the paper.&amp;nbsp; The paper would also benefit from clarifying how the Background Mutation Rate (BMR) is estimated.&amp;nbsp; Is it estimated from all samples?&amp;nbsp; Or, does the method use a sample-specific BMR?&amp;nbsp; If it is estimated from all samples, the sample-specific statistical test would appear flawed.&amp;nbsp; However, if it is a sample-specific BMR, this is another novel feature that should more clear be highlighted.&lt;br /&gt;&lt;br /&gt;PathScan software is available as Perl code, and available from &lt;a href="http://genome.wustl.edu/software/pathscan"&gt;Wash U.&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-3454154727413639687?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/3454154727413639687/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2011/06/pathscan-new-gene-set-analysis-method.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/3454154727413639687'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/3454154727413639687'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2011/06/pathscan-new-gene-set-analysis-method.html' title='PathScan:  New Gene Set Analysis Method from Wash U'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/-FDKqwGA-ptw/Tgsc4ZJNlBI/AAAAAAAAASU/uSoGAXLr3n8/s72-c/washu' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-867196358182424306</id><published>2011-06-28T05:56:00.000-07:00</published><updated>2011-08-26T06:51:15.613-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='web-services'/><category scheme='http://www.blogger.com/atom/ns#' term='cloud'/><category scheme='http://www.blogger.com/atom/ns#' term='genomics'/><title type='text'>Three Takes on Cloud Computing and Genomics</title><content type='html'>&lt;a href="http://4.bp.blogspot.com/-ZZBpeUkn82k/TgnK57O4v4I/AAAAAAAAASQ/fqGJznxYYcQ/s1600/cloud_0.jpg" imageanchor="1" style="clear: right; float: right; margin-bottom: 1em; margin-left: 1em;"&gt;&lt;img border="0" height="130" src="http://4.bp.blogspot.com/-ZZBpeUkn82k/TgnK57O4v4I/AAAAAAAAASQ/fqGJznxYYcQ/s200/cloud_0.jpg" width="200" /&gt;&lt;/a&gt;If you want to get a good take on cloud computing, the best option is to simply head over to &lt;a href="http://aws.amazon.com/free/"&gt;Amazon Web Services&lt;/a&gt;, create an account, and poke around within the free usage tier.&lt;br /&gt;&lt;br /&gt;For greater context on cloud computing and genomics, check out these three recent papers:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://www.citeulike.org/user/cerami/article/7127379"&gt;The case for cloud computing in genomic informatics&lt;/a&gt; (May 2010):&amp;nbsp; A frequently cited paper by Lincoln Stein (Ontario Institute for Cancer Research), describing how advances in next-gen sequencing threaten the conventional genomics informatics ecosystem, and why cloud computing may provide a feasible and cost-effective option.&amp;nbsp; Paper also includes a frequently referenced figure illustrating historical trends in storage prices v. DNA sequencing costs.&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&amp;nbsp;&lt;a href="http://www.citeulike.org/user/cerami/article/7465973"&gt;Cloud computing and DNA data race&lt;/a&gt; (July 2010):&amp;nbsp; A concise overview of cloud computing, from the makers of &lt;a href="http://bowtie-bio.sourceforge.net/crossbow/index.shtml"&gt;Crossbow&lt;/a&gt;, a cloud-compute pipeline for genome analysis.&amp;nbsp; Includes a concise overview of the MapReduce algorithm, and the best line opener of the bunch:&amp;nbsp; "In the race between DNA sequencing and computer speed, sequencing is winning by a mile."&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://www.citeulike.org/user/cerami/article/7578650"&gt;Translational bioinformatics in the cloud: an affordable alternative&lt;/a&gt; (August 2010).&amp;nbsp; An apples-to-apples comparison of doing an expression quantitative trait loci (eQTL) analysis on a local compute cluster v. Amazon EC2.&amp;nbsp; Illustrates just how hard it is to do a cost-comparison.&amp;nbsp; It's relatively easy to determine the cost of Amazon EC2, as you get an itemized bill.&amp;nbsp; But, estimating the cost of a local computer cluster is actually quite involved, and requires that you take into account many difficult to measure factors, including staff time, electricity, back-up power systems, etc. &amp;nbsp; &amp;nbsp; &lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-867196358182424306?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/867196358182424306/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2011/06/three-takes-on-cloud-computing-and.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/867196358182424306'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/867196358182424306'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2011/06/three-takes-on-cloud-computing-and.html' title='Three Takes on Cloud Computing and Genomics'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/-ZZBpeUkn82k/TgnK57O4v4I/AAAAAAAAASQ/fqGJznxYYcQ/s72-c/cloud_0.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-302960313406098691</id><published>2011-06-27T05:48:00.000-07:00</published><updated>2011-06-27T05:48:58.964-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='tcga'/><category scheme='http://www.blogger.com/atom/ns#' term='ovarian'/><category scheme='http://www.blogger.com/atom/ns#' term='cancer'/><title type='text'>The genesis and evolution of high-grade serous ovarian cancer</title><content type='html'>&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/-h7KSRdPPbyQ/Tgh5B6h2Z-I/AAAAAAAAASI/bthOCgZNB4o/s1600/nrc2946-f2.jpg" imageanchor="1" style="clear: right; float: right; margin-bottom: 1em; margin-left: 1em;"&gt;&lt;img border="0" height="400" src="http://3.bp.blogspot.com/-h7KSRdPPbyQ/Tgh5B6h2Z-I/AAAAAAAAASI/bthOCgZNB4o/s400/nrc2946-f2.jpg" width="232" /&gt;&lt;/a&gt;&lt;/div&gt;In anticipation of the TCGA ovarian paper, scheduled for release in Nature on June 30, 2011, check out David Bowtell's recent piece in Nature Reviews Center on &lt;a href="http://www.citeulike.org/user/cerami/article/8127545"&gt;The genesis and evolution of high-grade serous ovarian cancer&lt;/a&gt;.&amp;nbsp; It provides an excellent background and context for understanding the TCGA study.&amp;nbsp;&lt;br /&gt;&lt;br /&gt;A summary of the main points discussed in the paper:&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;worldwide, more than 100,000 women die each year of ovarian cancer (the 5th leading cause of cancer death).&lt;/li&gt;&lt;li&gt;two-thirds of ovarian cancer deaths are associated with high-grade serous subtype (the focus of TCGA profiling).&lt;/li&gt;&lt;li&gt;TP53 mutation is a primary hallmark of high-grade serous.&lt;/li&gt;&lt;li&gt;homologous recombination repair defects (including, but not limited to BRCA mutations) are a secondary hallmark of high-grade serous.&lt;/li&gt;&lt;li&gt; high-grade serous tumors are marked by pronounced genomic instability.&lt;/li&gt;&lt;li&gt;recent studies have identified four robust mRNA expression sub-types of high-grade serous.&lt;/li&gt;&lt;li&gt;understanding extent of BRCA pathway defects has important clinical implications for use of PARP inhibitors. &lt;/li&gt;&lt;li&gt;"a comprehensive catalog of all possible mechanisms of inactivation of the HR DNA repair in a large series of HG-SOC is needed." (something that TCGA focused on, and will be included in June 30 paper).&lt;/li&gt;&lt;li&gt;many genomic parallels between high-grade serous and basal-like breast cancer, including BRCA dsyfunction, TP53 loss, and chromosomal instability.&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-302960313406098691?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/302960313406098691/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2011/06/genesis-and-evolution-of-high-grade.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/302960313406098691'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/302960313406098691'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2011/06/genesis-and-evolution-of-high-grade.html' title='The genesis and evolution of high-grade serous ovarian cancer'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/-h7KSRdPPbyQ/Tgh5B6h2Z-I/AAAAAAAAASI/bthOCgZNB4o/s72-c/nrc2946-f2.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-4773443262352145793</id><published>2011-05-27T07:41:00.000-07:00</published><updated>2011-05-27T07:50:02.517-07:00</updated><title type='text'>Creating Integrated PDF Reports in R</title><content type='html'>This R-Hack shows how to create integrated reports of both plots and results of statistical tests in one integrated PDF.&lt;br /&gt;&lt;br /&gt;It works by using the &lt;a href="http://cran.r-project.org/web/packages/gplots/index.html" style="font-family: &amp;quot;Trebuchet MS&amp;quot;,sans-serif;"&gt;gplots&lt;/a&gt; library.&amp;nbsp; The&lt;span style="font-family: &amp;quot;Trebuchet MS&amp;quot;,sans-serif;"&gt; &lt;/span&gt;&lt;a href="http://rss.acs.unt.edu/Rdoc/library/gplots/html/textplot.html" style="font-family: &amp;quot;Trebuchet MS&amp;quot;,sans-serif;"&gt;textplot()&lt;/a&gt; command displays text output in a graphics window or a previously opened PDF device.&amp;nbsp; The &lt;a href="http://rss.acs.unt.edu/Rdoc/library/gplots/html/sinkplot.html" style="font-family: &amp;quot;Trebuchet MS&amp;quot;,sans-serif;"&gt;sinkplot()&lt;/a&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt; &lt;/span&gt;command diverts standard text output to a graphics device, such as a previously opened PDF device.&lt;br /&gt;&lt;br /&gt;To use &lt;span style="font-family: &amp;quot;Trebuchet MS&amp;quot;,sans-serif;"&gt;sinkplot()&lt;/span&gt;, you call &lt;span style="font-family: &amp;quot;Trebuchet MS&amp;quot;,sans-serif;"&gt;sinkplot("start").&lt;/span&gt; After this, any commands which would have been displayed to the screen will be diverted to the graphical device.&amp;nbsp; You then call &lt;span style="font-family: &amp;quot;Trebuchet MS&amp;quot;,sans-serif;"&gt;sinkplot("plot")&lt;/span&gt; to signal the end of diversion.&lt;br /&gt;&lt;br /&gt;When you prepend all this with a call to &lt;span style="font-family: &amp;quot;Trebuchet MS&amp;quot;,sans-serif;"&gt;pdf([file name])&lt;/span&gt;, you can divert everything into one integrated, multi-page PDF.&amp;nbsp; This enables you to create integrated reports of your statistical work, and I find it is considerably easier than any of the other more sophisticated options, e.g. &lt;a href="http://www.stat.uni-muenchen.de/%7Eleisch/Sweave/"&gt;Sweave&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;A complete, bare bones example is shown below.&amp;nbsp; It generates this &lt;a href="http://cbio.mskcc.org/%7Ecerami/blog/report.pdf"&gt;PDF&lt;/a&gt;:&lt;br /&gt;&lt;code&gt;&lt;br /&gt;#!/usr/bin/Rscript --no-save&lt;br /&gt;# Illustrates How to Create an Integrated PDF Report in R&lt;br /&gt;# Ethan Cerami&lt;br /&gt;&lt;br /&gt;library ("gplots")&lt;br /&gt;&lt;br /&gt;# Save all results to External PDF&lt;br /&gt;pdf("report.pdf")&lt;br /&gt;&lt;br /&gt;# Create Title Page for Report via textplot() command&lt;br /&gt;title = "Sleep Report\n\n"&lt;br /&gt;line1 = "This report includes:&amp;nbsp; \n\n"&lt;br /&gt;line2 = "1.&amp;nbsp; Box Plot\n"&lt;br /&gt;line3 = "2.&amp;nbsp; T-Test"&lt;br /&gt;cover = paste (title, line1, line2, line3)&lt;br /&gt;&lt;br /&gt;# Output via textplot&lt;br /&gt;textplot (cover, col="darkblue", valign="top", cex=1.5)&lt;br /&gt;&lt;br /&gt;# Create the Box Plot&lt;br /&gt;plot(extra ~ group, data = sleep, main="Sleep Data")&lt;br /&gt;&lt;br /&gt;# Perform a t-test&lt;br /&gt;# Start capture&lt;br /&gt;sinkplot("start")&lt;br /&gt;t.test(extra ~ group, data = sleep)&lt;br /&gt;noquote ("If p&amp;lt;0.05, we can conclude that the two groups are different.")&lt;br /&gt;sinkplot("plot",col="darkblue", valign="top", cex=0.8)&lt;br /&gt;title("t-test")&lt;br /&gt;&lt;br /&gt;# End PDF&lt;br /&gt;dev.off()&lt;br /&gt;&lt;/code&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-4773443262352145793?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/4773443262352145793/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2011/05/creating-integrated-pdf-reports-in-r.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/4773443262352145793'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/4773443262352145793'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2011/05/creating-integrated-pdf-reports-in-r.html' title='Creating Integrated PDF Reports in R'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-8525294998756471573</id><published>2011-05-19T13:28:00.000-07:00</published><updated>2011-08-28T06:39:36.473-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='cbio-portal'/><category scheme='http://www.blogger.com/atom/ns#' term='tcga'/><category scheme='http://www.blogger.com/atom/ns#' term='python'/><category scheme='http://www.blogger.com/atom/ns#' term='rstats'/><title type='text'>Plotting with Python (move over R)</title><content type='html'>R is a great platform for statistics.  But, I still find the syntax non-intuitive, and can never seem to "just do" anything without lots of referencing to old code or googling.  Contrast this to Python, which is so simple and spare.  Even if you can't remember how to do something, you think of the simplest way to do it, and that's probably the correct syntax in Python.&lt;br /&gt;&lt;br /&gt;With that in mind, I have been trying out &lt;a href="http://matplotlib.sourceforge.net/"&gt;matplotlib&lt;/a&gt; as an alternative for all the statistics plots that I would normally do in R.  It is beautiful. Simple.  And, much easier than R.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Tip:&amp;nbsp;&lt;/b&gt; if you are on Mac OS X, and want an easy way to install matplotlib, check out &lt;a href="http://stronginference.com/scipy-superpack/"&gt;Scipy Superpack for Mac OSX&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Here is example code from the getting started guide:&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;# Do a simple plot&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;import matplotlib.pyplot as plt&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;plt.plot([1,2,3,4], [1,4,9,16], 'ro')&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;plt.axis([0, 6, 0, 20])&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;,Courier,monospace;"&gt;plt.show()&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/--SZ4b-XIMyc/TdV6O7I5KtI/AAAAAAAAARg/d9fxGelYOPo/s1600/plot.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="300" src="http://2.bp.blogspot.com/--SZ4b-XIMyc/TdV6O7I5KtI/AAAAAAAAARg/d9fxGelYOPo/s400/plot.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;This plots x v. y, specifies the points as red circles, and sets the x/y axis:&lt;br /&gt;&lt;br /&gt;That was easy enough to convince me.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Here is a longer example&lt;/b&gt; that illustrates how to plot mRNA v. Methylation data from the TCGA ovarian project.&amp;nbsp; Most of the code gets and parses the data from our &lt;a href="http://cbioportal.org/"&gt;cBio Cancer Genomics Portal&lt;/a&gt;, and the last few lines are focused on plotting:&lt;br /&gt;&lt;br /&gt;&lt;script src="https://gist.github.com/1176672.js?file=cgdsPyPlot.py"&gt;&lt;/script&gt;&lt;br /&gt;&lt;br /&gt;Here is output for BRCA1:&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/-uiFw00L_F6k/TdV8CTpXlQI/AAAAAAAAARk/si0Mf4wotD0/s1600/brca1.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="301" src="http://2.bp.blogspot.com/-uiFw00L_F6k/TdV8CTpXlQI/AAAAAAAAARk/si0Mf4wotD0/s400/brca1.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-8525294998756471573?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/8525294998756471573/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2011/05/plotting-with-python-move-over-r.html#comment-form' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/8525294998756471573'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/8525294998756471573'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2011/05/plotting-with-python-move-over-r.html' title='Plotting with Python (move over R)'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/--SZ4b-XIMyc/TdV6O7I5KtI/AAAAAAAAARg/d9fxGelYOPo/s72-c/plot.png' height='72' width='72'/><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-349959951923016096</id><published>2011-05-10T10:12:00.000-07:00</published><updated>2011-08-26T06:51:39.087-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='web-services'/><category scheme='http://www.blogger.com/atom/ns#' term='cloud'/><category scheme='http://www.blogger.com/atom/ns#' term='simpledb'/><category scheme='http://www.blogger.com/atom/ns#' term='amazon'/><title type='text'>Evaluating Amazon SimpleDB</title><content type='html'>&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;/div&gt;&lt;table cellpadding="0" cellspacing="0" class="tr-caption-container" style="float: right; margin-left: 1em; text-align: right;"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-8tAY414iDvM/TcnzGasBwKI/AAAAAAAAARY/KP3Ju-4Bk78/s1600/logo_aws.gif" imageanchor="1" style="clear: right; margin-bottom: 1em; margin-left: auto; margin-right: auto;"&gt;&lt;img border="0" src="http://4.bp.blogspot.com/-8tAY414iDvM/TcnzGasBwKI/AAAAAAAAARY/KP3Ju-4Bk78/s1600/logo_aws.gif" /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="tr-caption" style="text-align: center;"&gt;&lt;br /&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;a href="http://awsmedia.s3.amazonaws.com/logo_aws.gif" imageanchor="1" style="clear: right; float: right; margin-bottom: 1em; margin-left: 1em;"&gt;&lt;br /&gt;&lt;/a&gt;I am new to cloud computing in general, but have spent the past few weeks evaluating Amazon &lt;a href="http://aws.amazon.com/simpledb/"&gt;SimpleDB&lt;/a&gt;.  Below are some of my initial impressions, and my current verdict.&amp;nbsp; &lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;The Context:  Many Database Options to Choose From&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Amazon Web Services provides a host of database options, including Amazon SimpleDB, Amazon Relational Database Service (RDS), and the ability to install and manage multiple databases, including MySQL, SQL Server, Oracle, and PostgreSQL via Amazon Machine Images (AMI).  With so many options, it can be a bit overwhelming to figure out where exactly so start.  Amazon does provide a &lt;a href="http://aws.amazon.com/running_databases/"&gt;short guide to help you better understand the options&lt;/a&gt;, and figure out which option might be best for your particular project.  However, to really understand any of these options, there is no substitute for hands-on experience.  Fortunately, you can get that for free via the &lt;a href="http://aws.amazon.com/free/"&gt;Amazon Free Usage Tier&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Amazon SimpleDB:  The Basics&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;First, imagine that you cannot create traditional relational database tables.  Rather, you are restricted to putting and selecting items with key/value attribute pairs only, similar to a giant hashmap.  Then, imagine that all your data is only accessible via web service calls  --- you "put" data into some giant black box that is managed by Amazon, and that the black box has many complex details that are completely hidden from you.   That is Amazon SimpleDB.&lt;br /&gt;&lt;br /&gt;In a nutshell, Amazon SimpleDB is therefore a NoSQL database that runs within the Amazon cloud.  You focus on putting and selecting data.  Amazon takes care of managing the infrastructure, scaling to meet user demands, and managing all back-ups.  For all that, Amazon charges you on a "pay-as-you-go" rate schedule. &lt;br /&gt;&lt;br /&gt;Famously, &lt;a href="http://gigaom.com/cloud/real-world-nosql-amazon-simpledb-at-netflix/"&gt;NetFlix now uses Amazon SimpleDB&lt;/a&gt;, and has been quite open about &lt;a href="https://docs.google.com/viewer?a=v&amp;amp;pid=sites&amp;amp;srcid=ZGVmYXVsdGRvbWFpbnxwcmFjdGljYWxjbG91ZGNvbXB1dGluZ3xneDo2NDc2ODVjY2ExY2Y1Zjcz&amp;amp;pli=1"&gt;challenges and lessons learned&lt;/a&gt; in transitioning to Amazon SimpleDB.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Amazon SimpleDB:  Getting Started&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;I was able to get started with Amazon SimpleDB in just a few days.  The &lt;a href="http://aws.amazon.com/documentation/simpledb/"&gt;documentation&lt;/a&gt; is excellent, and very easy to follow.  Amazon even has a Javascript "Scratchpad", that enables you to interact with SimpleDB via a web user interface.  You can therefore get started without having to write any code, and the scratchpad examples walk you through the steps of creating a new domain, putting data into a domain, and then getting it back.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;When Simple turns Complicated&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;If you have millions of records you want to pre-load into Amazon SimpleDB, things take a decidedly complicated turn.  In the world of relational databases, such as MySQL or Oracle, we have all come to expect simple, high performance command line tools, such as &lt;a href="http://dev.mysql.com/doc/refman/5.0/en/mysqlimport.html"&gt;mysqlimport&lt;/a&gt; for importing large sets of data.  As of now, there is no such equivalent for SimpleDB, and you have to roll your own.  If you want high-performance, you also have to write some rather complex code, handling all sorts of issues, including data sharding (e.g. placing data into multiple domains), multi-threaded puts, and a measured way of backing off of Amazon SimpleDB if you throttle its service with too many requests at once. &lt;br /&gt;&lt;br /&gt;This is a lot of work.  And, even then, it's still not clear whether can get the write-performance you really need.  For example, the &lt;a href="http://aws.typepad.com/aws/2008/05/high-performanc.html"&gt;high-performance example&lt;/a&gt; from Amazon claims a write-throughput of 300 requests per second.  Pete Warden has also &lt;a href="http://petewarden.typepad.com/searchbrowser/2010/01/how-to-upload-your-csv-data-into-simpledb-at-1000-items-a-second.html"&gt;open sourced his Java code&lt;/a&gt; for multi-threaded parallel inserts, and has been able to achieve a write-throughput of 1,000 records / second.  Depending on how much data you have, that may be fine.  But, consider that if you have 100 million records, you are talking about 28 hours just to load the data (NetFlix claims that they have achieved a much &lt;a href="http://practicalcloudcomputing.com/post/284222088/forklift-1b-records"&gt;higher write-throughput of 10K-11K / second&lt;/a&gt;, but they have not released all the details on how they did this).&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;The Verdict&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Amazon SimpleDB has a lot of potential.  I found the documentation straightforward, and was up and running in no time.  However, if you have millions of records to import, be prepared to deal with lots of headaches.  Until Amazon comes up with a simpler, faster way of doing bulk inserts, you are likely to be writing your own import code, and are very likely to be underwhelmed with the write-throughput provided by Amazon SimpleDB.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-349959951923016096?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/349959951923016096/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2011/05/evaluating-amazon-simpledb.html#comment-form' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/349959951923016096'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/349959951923016096'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2011/05/evaluating-amazon-simpledb.html' title='Evaluating Amazon SimpleDB'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/-8tAY414iDvM/TcnzGasBwKI/AAAAAAAAARY/KP3Ju-4Bk78/s72-c/logo_aws.gif' height='72' width='72'/><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-6206619384556461469</id><published>2010-12-22T12:16:00.000-08:00</published><updated>2010-12-22T12:17:44.917-08:00</updated><title type='text'>cBio Cancer Genomics Portal</title><content type='html'>&lt;span class="status-body"&gt;&lt;span class="status-content"&gt;&lt;span class="entry-content"&gt;We are releasing an upgrade to the &lt;a href="http://www.cbioportal.org/cgx/"&gt;cBio Cancer Genomics Portal&lt;/a&gt; today.  Many new features and new data sets:  &lt;a href="http://www.cbioportal.org/cgx/news.jsp" class="tweet-url web" rel="nofollow" target="_blank"&gt;http://www.cbioportal.org/cgx/news.jsp.&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Check it out and send us your feedback.&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-6206619384556461469?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/6206619384556461469/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2010/12/cbio-cancer-genomics-portal.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/6206619384556461469'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/6206619384556461469'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2010/12/cbio-cancer-genomics-portal.html' title='cBio Cancer Genomics Portal'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-2512243927100511632</id><published>2010-12-14T18:06:00.000-08:00</published><updated>2011-01-05T08:46:59.852-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='cancer'/><title type='text'>Advances in understanding cancer genomes through second-generation sequencing</title><content type='html'>An &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/20847746"&gt;excellent review paper&lt;/a&gt; covering the current state and future of next-generation sequencing in cancer genomics.  Among the many useful topics covered:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;cancer-specific considerations in sequencing, e.g. dealing with tumor heterogeneity, purity and ploidy.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;pair-end sequencing, whole genome sequencing, RNA-seq.&lt;/li&gt;&lt;li&gt;computational issues related to alignment and mutation detection.&lt;/li&gt;&lt;li&gt;useful software for alignment, mutation calling, and visualization.&lt;/li&gt;&lt;/ul&gt;&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/20847746"&gt;&lt;br /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-2512243927100511632?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/2512243927100511632/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2010/12/advances-in-understanding-cancer.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/2512243927100511632'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/2512243927100511632'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2010/12/advances-in-understanding-cancer.html' title='Advances in understanding cancer genomes through second-generation sequencing'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-1072620242019876631</id><published>2009-09-09T13:29:00.000-07:00</published><updated>2011-01-05T08:47:49.567-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='disease'/><category scheme='http://www.blogger.com/atom/ns#' term='hubs'/><category scheme='http://www.blogger.com/atom/ns#' term='networks'/><category scheme='http://www.blogger.com/atom/ns#' term='cancer'/><title type='text'>Network fragility of cancer genes</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_ALaCAWz3x-E/SqgQkvhQazI/AAAAAAAAAQA/iENEC6qSMEM/s1600-h/ncg.png"&gt;&lt;img style="margin: 0pt 0pt 10px 10px; float: right; cursor: pointer; width: 169px; height: 73px;" src="http://1.bp.blogspot.com/_ALaCAWz3x-E/SqgQkvhQazI/AAAAAAAAAQA/iENEC6qSMEM/s200/ncg.png" alt="" id="BLOGGER_PHOTO_ID_5379567978311346994" border="0" /&gt;&lt;/a&gt;A recent paper by &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/18675489"&gt;Davide Rambaldi et. al&lt;/a&gt;. on the network properties of cancer genes is deceptively concise, but quite interesting.  Given that recent unbiased screenings of cancer genomes have revealed the heterogeneity of genes altered in individual tumors, the paper asks a relatively simple question:  do these genes actually share more in common than meets the eye?  More specifically, do these genes share common network properties or gene duplicability properties (e.g. do the genes tend to exist as singletons, or are they duplicated many times within the genome?)&lt;br /&gt;&lt;br /&gt;The authors find that cancer genes tend to be both singletons and highly connected protein hubs.  These properties suggest an "intrinsic fragility of cancer genes toward perturbations:  gene dosage modifications of highly interconnected hubs (but arguably also sequence mutations) are likely to produce simultaneous effects on several processes."&lt;br /&gt;&lt;br /&gt;Furthermore, since there are not many genes that share these two properties within the human genome, the authors go a step further to identify a set of 101 potential novel candidate cancer genes.&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/18675489"&gt;&lt;br /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-1072620242019876631?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/1072620242019876631/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2009/09/network-fragility-of-cancer-genes.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/1072620242019876631'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/1072620242019876631'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2009/09/network-fragility-of-cancer-genes.html' title='Network fragility of cancer genes'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_ALaCAWz3x-E/SqgQkvhQazI/AAAAAAAAAQA/iENEC6qSMEM/s72-c/ncg.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-5113981313001343278</id><published>2009-09-01T07:24:00.000-07:00</published><updated>2011-01-05T08:48:36.906-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='disease'/><category scheme='http://www.blogger.com/atom/ns#' term='networks'/><category scheme='http://www.blogger.com/atom/ns#' term='cancer'/><category scheme='http://www.blogger.com/atom/ns#' term='pathways'/><title type='text'>Minireview:  Network medicine approach to human disease</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://m1.2mdn.net/viewad/1050778/FEBS_125_125.jpg"&gt;&lt;img style="margin: 0pt 0pt 10px 10px; float: right; cursor: pointer; width: 125px; height: 125px;" src="http://m1.2mdn.net/viewad/1050778/FEBS_125_125.jpg" alt="" border="0" /&gt;&lt;/a&gt;FEBS Letters has a &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/19269289"&gt;review on the use of network analysis to understand and treat human disease&lt;/a&gt;.  The first part of the review summarizes recent research on the role of network connectivity and its connection to human disease -- for example, somatic mutations in cancer are more likely to occur within protein hubs and to occur within the same sub-network.  The second part summarizes recent research in using subnetworks as biomarkers -- for example, Trey Ideker's 2007 work in identifying discriminating subnetworks in breast cancer.  The review concludes with more a speculative overview of network / pathway-centric therapeutics, an area which is currently receiving much attention, but lacking in concrete examples.&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/19269289"&gt;&lt;br /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-5113981313001343278?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/5113981313001343278/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2009/09/minireview-network-medicine-approach-to.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/5113981313001343278'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/5113981313001343278'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2009/09/minireview-network-medicine-approach-to.html' title='Minireview:  Network medicine approach to human disease'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-801931279914802531</id><published>2009-08-31T06:11:00.000-07:00</published><updated>2009-08-31T06:41:05.910-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='y2h'/><category scheme='http://www.blogger.com/atom/ns#' term='yeast'/><category scheme='http://www.blogger.com/atom/ns#' term='networks'/><title type='text'>2nd Generation Yeast Two-Hybrid (Y2H) Networks</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_ALaCAWz3x-E/SpvQvTKsqNI/AAAAAAAAAP4/5r87EtAJjiU/s1600-h/net.png"&gt;&lt;img style="margin: 0pt 0pt 10px 10px; float: right; cursor: pointer; width: 200px; height: 165px;" src="http://3.bp.blogspot.com/_ALaCAWz3x-E/SpvQvTKsqNI/AAAAAAAAAP4/5r87EtAJjiU/s200/net.png" alt="" id="BLOGGER_PHOTO_ID_5376120091214391506" border="0" /&gt;&lt;/a&gt;The yeast two-hyrbid (Y2H) assay for detecting protein-protein interactions  has been heavily criticized for its poor quality, and its potentially high rate of both false positives and false negatives.  A 2008 Science article out of Marc Vidal's group at Harvard attempts to set the record straight.  The paper is divided into three parts.  In part I, the authors systematically compare the quality of interactions derived from Y2H, affinity purification followed by mass spectrometry (AP/MS), and literature curation, and conclude that "high-throughput Y2H data sets can be comparable in quality to literature curated information."  In part II, the authors conduct a quality controlled, "second generation" Y2H screen in yeast that covers ~20% of all yeast binary interactions.  In part III, the authors revisit the global network properties of the newly derived yeast network and discover that protein connectivity does not actually correctly with essentiality as previously described, but does correlate with genetic &lt;a href="http://en.wikipedia.org/wiki/Pleiotropy"&gt;pleiotropy&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;A curious side finding in the paper is that "literature-curated interactions seem prone to sociological and other inspection biases", a finding which is followed up in a &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/19116613"&gt;subsequent paper&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.sciencemag.org/cgi/content/abstract/322/5898/104"&gt;High-Quality Binary Protein Interaction Map of the Yeast Interactome Network&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;For the full network, go to:  &lt;a href="http://interactome.dfci.harvard.edu/"&gt;Interactome Projects at Harvard CCSB&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-801931279914802531?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/801931279914802531/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2009/08/2nd-generation-yeast-two-tybrid.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/801931279914802531'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/801931279914802531'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2009/08/2nd-generation-yeast-two-tybrid.html' title='2nd Generation Yeast Two-Hybrid (Y2H) Networks'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_ALaCAWz3x-E/SpvQvTKsqNI/AAAAAAAAAP4/5r87EtAJjiU/s72-c/net.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-8506080789249387265</id><published>2009-08-28T09:16:00.000-07:00</published><updated>2009-08-31T06:42:29.890-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='tcga'/><category scheme='http://www.blogger.com/atom/ns#' term='ovarian'/><category scheme='http://www.blogger.com/atom/ns#' term='cancer'/><title type='text'>The Biology Of Ovarian Cancer</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.nature.com/nrc/images/covers/journal_cover200906.jpg"&gt;&lt;img style="margin: 0pt 0pt 10px 10px; float: right; cursor: pointer; width: 160px; height: 210px;" src="http://www.nature.com/nrc/images/covers/journal_cover200906.jpg" alt="" border="0" /&gt;&lt;/a&gt;Nature Reviews Cancer has an excellent review of ovarian cancer biology in its June 2009 issue.  The review is quite timely, as the &lt;a href="http://cancergenome.nih.gov/"&gt;The Cancer Genome Atlas Project (TCGA)&lt;/a&gt; is now in full swing generating comprehensive genomic profiles for hundreds of ovarian tumor samples.  A few highlights of the review:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Box 1 provides an overview of early-detection screening in ovarian cancer, and the importance such screening could have for improving the overall cure rate.&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;Box 2 describes the four histotypes of epithelial ovarian cancer (the TCGA project is focusing on only one of these: serous cystadenocarcinoma.) &lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;Box 4 outlines the major signaling pathways altered in &gt;50% of ovarian cancers.  This includes PI3K, Src, and Jak-Stat.  The main text also summarizes mutations and alterations in genes affecting cell proliferation, apoptosis, &lt;a href="http://en.wikipedia.org/wiki/Anoikis"&gt;anoikis&lt;/a&gt;, &lt;a href="http://en.wikipedia.org/wiki/Autophagy_%28cellular%29"&gt;autophagy&lt;/a&gt;, angiogenesis, cell motility, and invasion.&lt;/li&gt;&lt;/ul&gt;The article also summarizes recent clinical trials in ovarian cancer, including for example, new PI3K and AKT inhibitors now entering Phase I-II clinical trials.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.nature.com/nrc/journal/v9/n6/abs/nrc2644.html"&gt;The biology of ovarian cancer: new opportunities for translation&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-8506080789249387265?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/8506080789249387265/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2009/08/biology-of-ovarian-cancer.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/8506080789249387265'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/8506080789249387265'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2009/08/biology-of-ovarian-cancer.html' title='The Biology Of Ovarian Cancer'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7936872944635478061.post-483075535626491398</id><published>2009-08-26T05:55:00.000-07:00</published><updated>2010-12-14T18:11:40.807-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='nih'/><category scheme='http://www.blogger.com/atom/ns#' term='funding'/><category scheme='http://www.blogger.com/atom/ns#' term='science-policy'/><title type='text'>New NIH Director Collins Gets to Work</title><content type='html'>&lt;a href="http://blogs.sciencemag.org/scienceinsider/"&gt;Science Insider&lt;/a&gt; has a news article on Francis Collins, President Obama's newly appointed director of the NIH. The article briefly outlines Collins's five main themes for the NIH, including health care reform, translating research into medicine, global health, "empowering the biomedical research community", and "large biology" projects.  The article also briefly touches on Collins's concern for NIH funding:  "what happens when the $10.4 billion that NIH received as part of this year's stimulus package runs out in 2011."&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://blogs.sciencemag.org/scienceinsider/2009/08/collins-sets-fi.html"&gt;http://blogs.sciencemag.org/scienceinsider/2009/08/collins-sets-fi.html&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7936872944635478061-483075535626491398?l=www.biobits.org' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://www.biobits.org/feeds/483075535626491398/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.biobits.org/2009/08/new-nih-director-collins-gets-to-work.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/483075535626491398'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7936872944635478061/posts/default/483075535626491398'/><link rel='alternate' type='text/html' href='http://www.biobits.org/2009/08/new-nih-director-collins-gets-to-work.html' title='New NIH Director Collins Gets to Work'/><author><name>Ethan Cerami</name><uri>http://www.blogger.com/profile/11951175279118283931</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry></feed>
