Out line of the talk - 1. systematic biology -introduction, 2. HPR project 3. The Human protein Atlas
- Venkata P. Satagopam
18th century - biologist. 19th - chemist (1/3 of elements discovered in Sweden in this century). 20th - physicists and at the end, computer scientist. He'd now like to say that the 21st century is the century of medicine.
- Allyson Lister
HPR one of the largest projects in Sweden wrt funding, about 100 million euro so far
- Oliver Hofmann
An impressive log-scale plot of number of bases sequenced since 1965.
- Allyson Lister
developer of sequencing by synthesis via pyrosequencing in late 90s -- basis of 454 technology
- Andrew Su
Personalized genomics ... 454 technology developed in our lab
- Venkata P. Satagopam
Bioinformatics is the key in the new era of genomics.
- Allyson Lister
Antibodypedia -- a portal for validated anitbodies (we need to add a link from Gene Wiki...)
- Andrew Su
From the website: "The antibodypedia is a community-based portal showing application-specific validation of publicly available antibodies to human protein targets. Each protein binder (antibody or other affinity reagent) has been scored in an application-specific manner into three main categories (supportive, uncertain and non-supportive)"
- Oliver Hofmann
If you have 2 antibodies, you can compare results in various assay platforms so he wants to develop paired antibodies for every protein target.
- Allyson Lister
Nat Methods 2008: High-througput method to identify epitopes
- Oliver Hofmann
6 months ago published a paper Nature methods (december 2008)
- Venkata P. Satagopam
(bummer, antibodypedia doesn't use mediawiki so can't assess current usage...)
- Andrew Su
(I am entirely too short-sighted to read the author lists half of the time. Sigh)
- Oliver Hofmann
Open source (but in-house?) LIMS developed
- Oliver Hofmann
(would be interesting to compare to origene collection of mammlian clones)
- Andrew Su
The antigen design uses PRESTIGE, which is a bioinformatics approach to select antigens using the protein epitope signature tag (PrEST).
- Allyson Lister
antigen design -- used PRESTIGE a bioinformatics approach to select antigen for antibody
- Venkata P. Satagopam
readouts -- immunohistochemistry (IHC) and IF (immunofluorescence)
- Andrew Su
Organ, tissue, cellular and sub-cellular expression profiing on a protein basis
- Oliver Hofmann
apply antibodies to tissue arrays (cancer focus, I think)
- Andrew Su
(140+ human samples, around 200 tissues.. did someone catch the numbers?)
- Oliver Hofmann
(Faq from the website: spatial distribution of proteins in 48 different normal tissues and 20 different cancer types as well as 47 different human cell line)
- Oliver Hofmann
Confocal microscopy for subcellular localization, difficult to scale up to high-throughput
- Oliver Hofmann
high-throughput subcellular localization in A0431 (squamous cell carcinoma), U-251MG (glioma), ???
- Andrew Su
(I think Bob Murphy talked on a similar project to map subcellular localization of proteins en masse...) (Oh, looks like it's a collaboration between the two...)
- Andrew Su
They have a SVM that seems to be able to annotate 28 different parts of the cell.
- Allyson Lister
2TB data each week (courtesy of 50.000 images in the same time)
- Oliver Hofmann
2/3 of data come from in house data and 1/3 comes from different companies
- Venkata P. Satagopam
(Rodent atlas seems a bit redundant to Allen Brain Atlas? -- Oh, ABA is via in situ / RNA, this is protein. Again, would be interesting to compare...)
- Andrew Su
@Oliver, probably not directly comparable by exact cell lines, but might be worth comparing by parental tissue. Need to wait until they allow downloading of data though...
- Andrew Su
Ensembl "thinks" that the genes are up to 23,000, but UniProt "thinks" 20,000, but the number is probably with that (for genes coding for proteins). ("thinks" in scare quotes, as databases don't think - yet)
- Allyson Lister
the size of human membrane proteome .. 5,514 human membrane proteins; covering 26% of protein-encoded genes
- Venkata P. Satagopam
proteins expressed in normal cells - 6,800 antibodies towards (>25% of all protein encoding genes). 65 normal cell types (from 45 different tissue types)
- Venkata P. Satagopam
70% of proteins expressed in a given cell, approx even distribution across # of cell lines (not what we observed on gene expression data, which had distinct peaks at tissue-specfiic and ubiquitous)
- Andrew Su
80% of proteins expressed on average in cell lines (surprisingly high to me...)
- Andrew Su
(@Andrew: protein selection might be biased towards the ones that are well expressed / had known anitbodies / ...)
- Oliver Hofmann
9% of proteins cell type specific, 62% expressed across 3 different cell types
- Oliver Hofmann
ubiquitous expression, but differing levels
- Andrew Su
(interesting cytoscape visualizations of cell type / tissue specificity)
- Andrew Su
In the Atlas: < 2% specific to a single cell type (84 proteins), well known ones like insulin
- Oliver Hofmann
Includes a number of uncharacterized proteins with no known function
- Oliver Hofmann
PROSPECTS: PROteomics SPECification in Time and Space
- Allyson Lister
"Complementary technologies, including mass spectrometry, cryoelectron microscopy and cell imaging will be applied in innovative ways to capture transient protein complexes and the spatial and temporal dimensions of entire proteomes."
- Oliver Hofmann
Q: importance of splice isoforms -- A: complexity that is currently not considered due to technical complexity (to be saved for second phase)
- Andrew Su
Q: conclusions on tissue specificity have bias based on antibody availability? A: bias of commercial antibodies possible, but only 1/3 of data. Data they are generating based on walking down chromosomes (I think?), so don't expect bias... Also, some of ubiquitous expression is due to cross-reactivity. (first mention of this...)
- Andrew Su
Q: perspective for gene therapy or antisense therapy, more generally non-protein based therapies. A: pharma shifting from small molecules to biologics (not sure about a "shift" rather than "expansion"). Gene therapy problem is getting into all relevant cells. Ubiquitous expression of proteins the root cause of side effects for protein-based targets, possibly...
- Andrew Su
(only to get the blog on top of the ISCB portal site; the figures messed up our layout)
- Reinhard Schneider