In Intelligent Systems for Molecular Biology 2008 (July 2008) We have developed an interactive visualization method for gene expression profiles from high‐throughput gene expression data sets with hundreds to thousands of samples. Our method integrates biological knowledge in form of hierarchies derived from ontologies. A sample hierarchy is employed that describes the relationship of samples to each other and to ontology concepts. This information allows us to combine expression levels into biologically relevant aggregates and it enables the investigator to interactively control the amount of data presented. Samples are mapped to the sample hierarchy based on their description so that leaf nodes represent samples and internal nodes represent ontological terms. Based on this structure we can compute an average expression value for each gene and ontology term in the hierarchy. Each gene is visualized as a rectangular “gene map” similar to a tree map [1] whose structure is defined by...
- Yann Abraham
Bioinformatics, Vol. 25, No. 10. (15 May 2009), pp. 1231-1235. In many genomic studies, one works with genome-position-dependent data, e.g. ChIP-chip or ChIP-Seq scores. Using conventional tools, it can be difficult to get a good feel for the data, especially the distribution of features. This article argues that the so-called Hilbert curve visualization can complement genome browsers and help to get further insights into the structure of one's data. This is demonstrated with examples from different use cases. An open-source application, called HilbertVis, is presented that allows the user to produce and interactively explore such plots. Availability: http://www.ebi.ac.uk/huber-s... Contact: sanders@fs.tum.de Supplementary information: Supplementary Data are available at Bioinformatics online. 10.1093/bioinformatics/btp152 Simon Anders
- Yann Abraham
Cell, Vol. 139, No. 3. (30 October 2009), pp. 452-453. Twitter needs no introduction. This "microblogging" service has gained tremendous popularity in the 2 years since its launch. Yet, most scientists are steering clear of it. Laura Bonetta speaks to some who have found value in tweeting. Laura Bonetta
- Yann Abraham
Nature Biotechnology, Vol. 27, No. 10. (01 October 2009), pp. 921-924. Networks in biology can appear complex and difficult to decipher. Merico et al. illustrate how to interpret biological networks with the help of frequently used visualization and analysis patterns. Daniele Merico, David Gfeller, Gary Bader
- Yann Abraham
Nucleic acids research, Vol. 37, No. Web Server issue. (1 July 2009), pp. W23-27. BioMart Central Portal (www.biomart.org) offers a one-stop shop solution to access a wide array of biological databases. These include major biomolecular sequence, pathway and annotation databases such as Ensembl, Uniprot, Reactome, HGNC, Wormbase and PRIDE; for a complete list, visit, http://www.biomart.org/biomart.... Moreover, the web server features seamless data federation making cross querying of these data sources in a user friendly and unified way. The web server not only provides access through a web interface (MartView), it also supports programmatic access through a Perl API as well as RESTful and SOAP oriented web services. The website is free and open to all users and there is no login requirement. Syed Haider, Benoit Ballester, Damian Smedley, Junjun Zhang, Peter Rice, Arek Kasprzyk
- Yann Abraham
This would probably be a good James Bond movie, but to me the point of a game is to have fun and this is... not (at least it is a bit too realistic and twisted for my gamers' taste)
- Yann Abraham
BMC Research Notes, Vol. 2, No. 1. (16 July 2009), 138. BACKGROUND:R is the statistical language commonly used by many life scientists in (omics) data analysis. At the same time, these complex analyses benefit from a workflow approach, such as used by the open source workflow management system Taverna. However, Taverna had limited support for R, because it supported just a few data types and only a single output. Also, there was no support for graphical output and persistent sessions. Altogether this made using R in Taverna impractical.FINDINGS:We have developed an R plugin for Taverna: RShell, which provides R functionality within workflows designed in Taverna. In order to fully support the R language, our RShell plugin directly uses the R interpreter. The RShell plugin consists of a Taverna processor for R scripts and an RShell Session Manager that communicates with the R server. We made the RShell processor highly configurable allowing the user to define multiple inputs and...
- Yann Abraham
Nucl. Acids Res. (6 November 2009), gkp937. Over the last years, the publicly available knowledge on interactions between small molecules and proteins has been steadily increasing. To create a network of interactions, STITCH aims to integrate the data dispersed over the literature and various databases of biological pathways, drug-target relationships and binding affinities. In STITCH 2, the number of relevant interactions is increased by incorporation of BindingDB, PharmGKB and the Comparative Toxicogenomics Database. The resulting network can be explored interactively or used as the basis for large-scale analyses. To facilitate links to other chemical databases, we adopt InChIKeys that allow identification of chemicals with a short, checksum-like string. STITCH 2.0 connects proteins from 630 organisms to over 74 000 different chemicals, including 2200 drugs. STITCH can be accessed at http://stitch.embl.de/. 10.1093/nar/gkp937 Michael Kuhn, Damian Szklarczyk, Andrea Franceschini,...
- Yann Abraham
Nature, Vol. advance online publication (01 November 2009) Although drugs are intended to be selective, at least some bind to several physiological targets, explaining side effects and efficacy. Because many drug–target combinations exist, it would be useful to explore possible interactions computationally. Here we compared 3,665 US Food and Drug Administration (FDA)-approved and investigational drugs against hundreds of targets, defining each target by its ligands. Chemical similarities between drugs and ligand sets predicted thousands of unanticipated associations. Thirty were tested experimentally, including the antagonism of the 1 receptor by the transporter inhibitor Prozac, the inhibition of the 5-hydroxytryptamine (5-HT) transporter by the ion channel drug Vadilex, and antagonism of the histamine H4 receptor by the enzyme inhibitor Rescriptor. Overall, 23 new drug–target associations were confirmed, five of which were potent (<100 nM). The physiological relevance of one, the...
- Yann Abraham
The question is whether you can calculate standard deviation for fold change values if the groups you compare are not matched.
- Berci Mesko, MD
So you are considering the ratio of values derived from 2 unmatched populations, like testing 2 populations with different treatments? In that case I would use an ANOVA rather than working with the ratio... or did I miss something?
- Yann Abraham
One group of controls and one group of diseased patients.
- Berci Mesko, MD
Nature Precedings, No. 713. (16 October 2009) The exponentially increasing number of published papers (1.4 million per year by one estimate) makes it more and more difficult for us to manage the flood of scientific information. Each of us has acquired some protocol to find and organize journal articles and other references over the course of our careers. Most of those protocols are likely to have been formed by old routines or idleness rather than a structured approach to save time and frustration over the long run. Furthermore, with the Web 2.0 revolution, new ways of handling information are emerging (O’Reilly 2005). For example, traditional standalone tools for reference management like EndNote are being supplemented by centralized resources like RefWorks and social bookmarking sites as described subsequently. This fusion of personal and public information offers the promise of efficiency through better organization, which in turn leads to better science.How can seasoned scientists...
- Yann Abraham
Science, Vol. 325, No. 5937. (10 July 2009), pp. 161-165. Historically, the majority of new drugs have been generated from natural products (secondary metabolites) and from compounds derived from natural products. During the past 15 years, pharmaceutical industry research into natural products has declined, in part because of an emphasis on high-throughput screening of synthetic libraries. Currently there is substantial decline in new drug approvals and impending loss of patent protection for important medicines. However, untapped biological resources, "smart screening" methods, robotic separation with structural analysis, metabolic engineering, and synthetic biology offer exciting technologies for new natural product drug discovery. Advances in rapid genetic sequencing, coupled with manipulation of biosynthetic pathways, may provide a vast resource for the future discovery of pharmaceutical agents. 10.1126/science.1168243 Jesse Li, John Vederas
- Yann Abraham
Nature Chemical Biology, Vol. 5, No. 9. (02 August 2009), pp. 647-654. Biochemical combinatorial techniques such as phage display, RNA display and oligonucleotide aptamers have proven to be reliable methods for generation of ligands to protein targets. Adapting these techniques to small synthetic molecules has been a long-sought goal. We report the synthesis and interrogation of an 800-million-member DNA-encoded library in which small molecules are covalently attached to an encoding oligonucleotide. The library was assembled by a combination of chemical and enzymatic synthesis, and interrogated by affinity selection. We describe methods for the selection and deconvolution of the chemical display library, and the discovery of inhibitors for two enzymes: Aurora A kinase and p38 MAP kinase. Matthew Clark, Raksha Acharya, Christopher Arico-Muendel, Svetlana Belyanskaya, Dennis Benjamin, Neil Carlson, Paolo Centrella, Cynthia Chiu, Steffen Creaser, John Cuozzo, Christopher Davie, Yun Ding,...
- Yann Abraham
BMC Bioinformatics, Vol. 8, No. 1. (2007) BACKGROUND:The web has seen an explosion of chemistry and biology related resources in the last 15 years: thousands of scientific journals, databases, wikis, blogs and resources are available with a wide variety of types of information. There is a huge need to aggregate and organise this information. However, the sheer number of resources makes it unrealistic to link them all in a centralised manner. Instead, search engines to find information in those resources flourish, and formal languages like Resource Description Framework and Web Ontology Language are increasingly used to allow linking of resources. A recent development is the use of userscripts to change the appearance of web pages, by on-the-fly modification of the web content. This pens possibilities to aggregate information and computational results from different web resources into the web page of one of those resources.RESULTS:Several userscripts are presented that enrich biology...
- Yann Abraham
Yeah - I'd love to see a community building greasemonkey scripts for life-science sites - pubmed, citeulike, etc. Hell, maybe I'll start working on some...
- Chris Miller
Mike, that was sort of the set up in SVN too, which is why there still is userscript/trunk (though we never made tags of branches, which is why there is no matching userscript/branches or userscript/tags ...
- Egon Willighagen
Searching Google Wave with "tag:the-life-scientists" will get you to "Research collaborations in Wave", a good starting point for life scientists.
- Martin Fenner
I don't get how you search in public waves. I've tried searching for tag:the-life-scientists and it gets no hits -- I think it's just searching my own waves
- Andrew Clegg
there was a thread by Kol about wave usernames couldn't find the link
- ffcode
Aha -- with:public . They really should include a button for that
- Andrew Clegg
An undergraduate student in our lab, Caleb, just got his wave invite. I told him to look at this thread for possible people to connect with.
- Steve Koch
Afternoon all. I've written my first robot, which hopefully will embed an interactive mass spectrum into a blip whenever a UniProt name is encountered in the text, and corresponding mass spec data is found for this protein. I say "hopefully", as I've not been able to test it for real, as, alas, I have no account. When are the next batches released? If it's not for ages, does anyone fancy testing it anyway?
- Neil Swainston