mRNA level of regulator an (imprecise) indicator of regulator activity
- Oliver Hofmann
So you can model RNA level of a gene as a variable in a model
- Allyson Lister
The expression of a target genes can often be predicted by the expression of its regulators
- Diego M. Riaño-Pachón
Regulators: transcription factors, signal transduction, chromatin remodeling . . .
- Diego M. Riaño-Pachón
Broad view of a regulator gene: TFs, signal transduction proteins, RNA processing factors, anything that *might* play a direct or indirect role in gene regulation
- Oliver Hofmann
Second assumption: co-regulated genes have similar regulatory mechanisms, group genes into modules and predict expression profile for the entire module
- Oliver Hofmann
Modules provide increased statistical power.
- Gabriele Sales
A critical aspect is the structure of the regulatory program.
- Allyson Lister
(Segal 2003 Nat Genetics): notion of a regression tree for regulatory programs
- Oliver Hofmann
has disadvantages such as poor regulator selection lower in the tree, misses lot of regulators due to lack of statistical power
- Oliver Hofmann
another disadvantage: arbitrary choice among correlated regulators
- Gabriele Sales
Instead: Lasso (L1) regression approach
- Oliver Hofmann
Simple linear regression doesn't work well because the minimization procedure assigns a nonzero weight to each regulator.
- Gabriele Sales
This can be solved by using regularization: an extra termn penalizing nonzero weights.
- Gabriele Sales
The Lasso regression pushed the effect of many regulators towards zero, likely keeping only the significant regulators
- Diego M. Riaño-Pachón
Elastic net regression to avoid arbitrary regulator / feature choice
- Oliver Hofmann
Cluster genes to modules, learn regulatory program for module, repeat for all modules, iterate after re-assignment of genes to modules based on how well a program predicts the expression of a gene in the module
- Oliver Hofmann
initial assignment of genes into modules is often poor at the beginning, must be reiterated down in the analysis
- Diego M. Riaño-Pachón
It's a special kind of Bayesian network
- Allyson Lister
Test using eQTL data set (Brem, 2002 Science), two different yeast strains
- Oliver Hofmann
eQTL dataset (Brem at al, 2000) from yeast. Microarray measurements of 112 individuals over 6000 genes.
- Gabriele Sales
Adapt the regulatory network approach by including the genotype / markers. How do markers affect the expression level of a given module?
- Oliver Hofmann
She then show us one of the modules that comes out, the telomere module (40/42 of genes are in the telomere).
- Allyson Lister
Example: the telomere module. Enriched for telomere maintenance and helicase activity.
- Gabriele Sales
Controlled by a region on chr XII (which includes a regulator gene)
- Oliver Hofmann
23 modules out of 165 have 'chromosomal features', 16 chromatin regulators
- Oliver Hofmann
Puf3 module: 147/153 genes are pulldown targets of mRNA binding protein Putf3.
- Gabriele Sales
But Puf3 is not the most significant regulator of the module.
- Gabriele Sales
P-bodies are places where mRNA are stored temporarily and while they are there they are transcriptionally repressed.
- Allyson Lister
What regulates sequence-specific localization of mRNAs to P-bodies?
- Gabriele Sales
They did a microscopy experiment to test this, which demonstrates that PUF3 is specifically localized to p-bodies.
- Allyson Lister
Microscopy experiment: fluorescence shows that puf3 localizes in P-bodies.
- Gabriele Sales
So what regulates the regulators of the P-bodies?
- Oliver Hofmann
What regulates the p-bodies? What is one level higher up in the hierarchy? A locus on chromosome 14, but this is a large region and covers 30 genes and 300 polymorphisms.
- Allyson Lister
Therefore she came up with the idea of regulatory potential. The motivation is that not all SNPs are equally likely to be causal
- Allyson Lister
how to rank the polymorphisms (SNPs) by their regulatory potential?
- Diego M. Riaño-Pachón
Each regulator has its own prior dictated by the regulatory features (inside a gene? protein coding region? strong conservation? TF binds to module gene?)
- Gabriele Sales
Start by learning regulatory programs as described earlier. Second, learn regulatory weights (betas). Then compute the regulatory potential of each SNP in the genome. Then interate
- Allyson Lister
Regulatory potentials do not change the selection of strong regulators, but helps to disambiguate between multiple weak regulators
- Oliver Hofmann
Regulatory potentials do not change selection of strong regulators. They only help disambiguate between weak ones.
- Gabriele Sales
Strong regulators teach us what to look for in the putative weak regulators
- Oliver Hofmann
Some important features learned by the algorithm: conservation, cis-regulation.
- Gabriele Sales
Important factors: cis-relationship, conservation, stop-codons, (combination of gene functions such as RNA modification, DNA binding, ...)
- Oliver Hofmann
Statistical evaluation: uses PGV: % of genetic variation explained by the predicted regulatory program for each gene. It's a form of test data validation.
- Allyson Lister
These factors are *learned* from the model, not set
- Oliver Hofmann
Explained about 50% of the variation in about 50% of the genes
- Allyson Lister
Regulatory potentials are specific to organisms or even datasets.
- Gabriele Sales
Understanding the process underlying differentiation with the ImmGen consortium
- Oliver Hofmann
Can identify shared regulatory programs for all 60 cell types, but lumps together very different cell types
- Oliver Hofmann
By using G-Regulators, you allow programs to depend on genetic variation, but you don't have G-regulators here, you have cell types
- Allyson Lister
One network for each cell type overfits the data, but can bias towards shared regulation. Use the _ontogeny_ to guide conserved regulation.
- Oliver Hofmann
Two extremes: lumping every cell type together you "average" effects; working on a single cell type you overfit.
- Gabriele Sales
use ontogeny to guide conserved regulation. You do this by looking at differences, and penalize every place you change the regulatory power - penalize changes/divergences in the regulatory
- Allyson Lister
Expression changes and underlying phenotype. What are the mechanism underlying them?
- Allyson Lister
Example: transformation of FL to DLBCL occurs in 40-60% of patients, and diverse mechanisms seem to drive transformation.
- Allyson Lister
Represent each module as a metagene expression profile, and use machine learning to id modules distinguishing FL-t (pre-transformation) from transformed DLBCL
- Allyson Lister
Represent each module as a metagene, use ML technique to learn classifiers to distinguish FL-t (pre-transformed) from DLBCL
- Oliver Hofmann
Can you use a module-based approach to understand metabolic syndrome?
- Allyson Lister
pheontype network, where the nodes are modules and the edges are learned regulatory programs.
- Allyson Lister
An important module in this case is the biosynthesis liver module: the genes are almost disjoint but are all in the same module, therefore you would have missed it without these modules.
- Allyson Lister