Sign in or Join FriendFeed
FriendFeed is the easiest way to share online. Learn more »
Lars Juhl Jensen
#icsb2010 Tony Hey - Data-intensive scientific computing: The fourth paradigm
"This is known as the suitcase talk" (giving the closing lecture) - Lars Juhl Jensen
Has problems with the PowerPoint presentation - excuses himself that it is a Microsoft problem (Tony is from Microsoft Research) - Lars Juhl Jensen
The Data Deluge, we will create more data over the next five years than we have ever had so far - Lars Juhl Jensen
Attending ICSB 2010: £435. Train ticket to Edinburgh: £58. Windows freezing during Microsoft Research VP's talk: priceless. - Vangelis Simeonidis from iPhone
Free genomes for all! - Neil Swainston from iPhone
Expects sequencing costs to drop to effectively zero and that genomes will be used to predict who will respond to which drugs - Lars Juhl Jensen
Showing large data examples from astronomy and particle physics - too much data and too few scientists to analyze it - Lars Juhl Jensen
Astronomical amounts of data being generated in physics. - Neil Swainston from iPhone
Citizen Science: Galaxy zoo - 200,000 people help classify galaxies - Lars Juhl Jensen
We made it! too bad we missed the ppt blunder... - Ricardo Almeida from Android
Lots of data but how to analyse? Galaxy Zoo - crowd sourcing. - Neil Swainston from iPhone
Plug to crowdsourced science - Pedro Beltrao from Android
Hanny van Arkle's Voorwerp - the bluest galaxy ever discovered (found by a Dutch school teacher among the masses of data) - Lars Juhl Jensen
The fourth research paradigm: 1. experimental science, 2. theoretical science, 3. computational science, and now 4. data-intensive science - Lars Juhl Jensen
Discussing the impact of big data to the way we work in science today and in the future. We need new skils in big data analysis, visualization etc - Pedro Beltrao from Android
(No hypothesis science ? Gasp :) - Pedro Beltrao from Android
Data intensive science. Here's the data, now what is the hypothesis? Doug Kell should write a paper on that. - Neil Swainston from iPhone
Generic problems in data-intensive science: data ingest, managing a petabyte, common schema, how to organize it, how to reorganize it, how to share with others, query and visualization tools, building and exceuting models, integrating data and literature, documenting experiments, and curation and long-term preservation - Lars Juhl Jensen
Ingesting a petabyte is difficult. Sharing it afterwards is even harder - and actually illegal in some countries. - Daniel Jameson from iPhone
One of the big differences between data in particle physics and biological sciences is that the latter has many more different types of data - Lars Juhl Jensen
Computer science has failed to produce the tools that work for biologists - Pedro Beltrao from Android
Here is the evidence, now what is the hypothesis? The complementary roles of inductive and hypothesis-driven science in the post-genomic era. http://www.ncbi.nlm.nih.gov/pubmed... - Neil Swainston from iPhone
To be fair, biologists haven't created useful tools for computer scientists either. - Neil Swainston from iPhone
The book "The Fourth Paradigm" is available under a Creative Commons license here: http://research.microsoft.com/en-us... - Lars Juhl Jensen
Cc licensed book: The Fourth Paradigm. Free of charge, PDF and Kindle. - Ricardo Almeida from Android
Wow, mentions the problem of access to papers and open access movement - Pedro Beltrao from Android
Good plug for Open Access - if public money paid it, it ought to be freely available to the public - Lars Juhl Jensen
An e-Science example: machine learning and healthcare - Lars Juhl Jensen
Is there anyone that Doug doesn't know? I bet he's been out for a pint with at least three of the Chilean miners. - Neil Swainston from iPhone
Microsoft Research and HIV. Microsoft researcher formally worked on spam filters. PhyloD.net deciphers evolution of HIV virus. - Neil Swainston from iPhone
Umm.. a weird analogy too long to explain - Pedro Beltrao from Android
Working on understanding environmental and genetic factors responsible for asthma and diabetes (and other diseases) - Lars Juhl Jensen
PhiloD.net - application of machine learning and high performance computing in HIV non-masking epitope discovery. - Ricardo Almeida from Android
Trying to use machine learning to predict which patients require readmission (i.e. who were sent home from the hospital too soon) - Lars Juhl Jensen
PhiloD.net is designed with Microsoft's Azure cloud computing platform. - Ricardo Almeida from Android
using machine learning with patient data to predict readmission - Pedro Beltrao from Android
Healthcare. Machine learning to prevent readmission to intensive care. Based on very simple meta-dataset of patients condition. - Neil Swainston from iPhone
Tools developed for Life Sciences: a formatting scheme for Word? - Vangelis Simeonidis from iPhone
Microsoft support and develop biologist's tools. Plugins for word for marking up with ontology terms. - Daniel Jameson from iPhone
Word add-in for ontology recognition in papers. Good stuff. Word. - Neil Swainston from iPhone
He mustn't have heard of Endnote or Zotero.. - Ricardo Almeida from Android
Describing MS Word plugins for saving documents in PMC XML format and mark-up of ontology terms. Sadly they don't work on OS X. - Lars Juhl Jensen
That's a bit better. Text mining plug in for Word. - Vangelis Simeonidis from iPhone
Also chemistry add-in for Word. CML in the background. Can show name, formula or structure. Quite cool. Sure that the Google Wavers attempted to write a plug-in to do something similar. - Neil Swainston from iPhone
Trident scientific workflow tool looks to be worth a look. Missed the URL though. - Daniel Jameson from iPhone
Slightly underwhelmed so far with all the plug in talk. - Vangelis Simeonidis from iPhone
Plug ins for excel... networks in excel ... scary - Pedro Beltrao from Android
Excel is the curse of all data if you wish to subsequently process is automatically. - Daniel Jameson from iPhone
Should MBF be invited to COMBINE 2011? - Vangelis Simeonidis from iPhone
Actually, chemistry plug-in is FAR more necessary for Excel. Imagine if all crappy Excel data sheets would contain semantic ChEBI identifiers within them? I'd be out of a job... - Neil Swainston from iPhone
.net libraries for biology ...sounds interesting - Pedro Beltrao from Android
(I plan to make a plugin to put Neil out of work ;-) ) - Lars Juhl Jensen
Systems Biology is looking at biological systems. - Daniel Jameson from iPhone
If you cannot wait to have all the Office plug ins: http://atom.microsoft.research.com/bio... - Vangelis Simeonidis from iPhone
Www.cosbi.eu blenX something about language integration - Pedro Beltrao from Android
Supporting both top-down and bottom-up. I fear that this leaves us mere mortals with just middle-out. - Neil Swainston from iPhone
Ethanol or whatever! - Ricardo Almeida from Android
Kind of scatered, talking about different collaborations - Pedro Beltrao from Android
Metabolic Engineering approach doesn't look that impressive. Seen lots if examples of similar work this week. Jesus! They've recreated SBML!! - Neil Swainston from iPhone
.net is all well and good, but we aren't all developing for windows only - infact that's an incredibly bad idea... - Daniel Jameson from iPhone
Here comes the azure plug - Pedro Beltrao from Android
And I thought I was the only person in the world using SmugMug... - Vangelis Simeonidis from iPhone
What's a cloud? There's a definition... - Ricardo Almeida from Android
"many people talk about clouds". eg meteorologists - Vangelis Simeonidis from iPhone
100K to 1MK servers with big cost savings. Container size modules.questions on energy efficiency etc. Half billion for the chicago center. - Pedro Beltrao from Android
Microsoft making cloud computing facilities in Iraq?!? Did I just mishear that? Considering waterproof servers to mitigate the need for expensive buildings. Some centres physically enormous. - Neil Swainston from iPhone
@Neil Swainston: and now you know why you went to war: so that Microsoft can give us cloud computing - Vangelis Simeonidis from iPhone
Different models of cloud computing - platforms and softw as services - Pedro Beltrao from Android
Kind of think that this is the wrong audience for a cloud computing sales pitch. Most of us are still at the "e-mail spreadsheets to each other" level of sophistication. - Neil Swainston from iPhone
PhyloD: statistical tool for dna analysis. It is also a web service through azure - Vangelis Simeonidis from iPhone
Future prospects, semantic computing in the cloud - Pedro Beltrao from Android
Cloud computing makes me nervous: what happens when servers crash? - Vangelis Simeonidis from iPhone
Semantic computing. RDF and OWL namechecked, but speaker is sceptical that these will ever take off on a big scale. - Neil Swainston from iPhone
Prefers an inference approach, which is likely to be more practical realistically. - Neil Swainston from iPhone
Easy to use tools for scientists. Amen to that. - Neil Swainston from iPhone
In summary, it seems that Big Companies see biological data management as a future cash-cow. Likely that most data management will be performed outside of academia in a few years time. - Neil Swainston from iPhone
Not necessarily a bad thing, but highlights the need for geeks like me to focus more on data interpretation than data management. - Neil Swainston from iPhone
Tony Hey finishes with "the cloud WILL be part of your future". Is that a promise or a threat? - Vangelis Simeonidis from iPhone