To learn more:
𧬠Code and data: github.com/Genentech/Pe...
𧬠Paper: arxiv.org/abs/2502.21290
To learn more:
𧬠Code and data: github.com/Genentech/Pe...
𧬠Paper: arxiv.org/abs/2502.21290
We find
𧬠Existing methods perform poorly on PerturbQA β
𧬠Naively applying LLMs also performs poorly β
𧬠Simple reasoning template + retrieving experimental outcomes = does ok β
𧬠LLMs easily summarize gene sets β
𧬠Still a long way to go! β¨
𧬠Existing methods use knowledge graphs to relate seen vs. unseen perturbations. It seems wrong that current GNNs discard the semantics of knowledge graphs
𧬠Language may be useful for harmonizing these diverse information
We propose
𧬠PerturbQA: a new benchmark for perturbations + LLM reasoning for biomolecular discovery.
𧬠Perturbation experiments should be modeled at the granularity of statistical insights. Goal = predict differential expression / characterize gene sets
Excited to share my #ICLR2025 paper, with JC HΓΌtter and friends!
Genetic perturbation screens allow biologists to manipulate and measure the genes in cells = discover causal relationships! BUT they are expensive to run, expensive to interpret.
... We use LLMs to help!