Today!!! Register and attend by using the links in the post below!
Today!!! Register and attend by using the links in the post below!
β¨ The next #ELIXIR_3DBioInfo webinar will focus on computational #biophysics.
ποΈ 10 March 2026 | π 17:00-18:00 CET
Join the webinar to hear the recent activities in the Community, including dark energy in functional #protein and fold-switching behaviour of RfaH.
π https://loom.ly/ctFNLug
New webinar! March 10th 5pm CET/4pm UK! Exciting computational biophysics!
1) Ezequiel Galpern from @crg.eu
Dark Energy in Proteins!
2) Cesar Ramirez Sarmiento (@cxarramirez) from Pontificia Universidad Catolica de Chile!
Metamorphic Proteins!
Register:
elixir-europe.org/events/3d-bi...
New webinar! March 10th 5pm CET/4pm UK! Exciting computational biophysics!
1) Ezequiel Galpern from @crg.eu
Dark Energy in Proteins!
2) Cesar Ramirez Sarmiento (@cxarramirez) from Pontificia Universidad Catolica de Chile!
Metamorphic Proteins!
Register:
elixir-europe.org/events/3d-bi...
Nice editorial!
"Protein dynamics prediction by integrating biophysics and artificial intelligence"
sciencedirect.com/science/arti...
Thanks for sharing π
Finally out! Miriam's 1st, 1st author article!
1st last author paper as a group leader! A work that started 3 years ago when I was a postdoc in Alfonso Valencia's group π₯Ή
doi.org/10.64898/202...
Threadπ§΅
By 2028, all 4-character PDB IDs will be exhausted.
After that, all new entries will receive extended IDs: 12 characters total, formatted as pdb_ + 8 alphanumeric characters (pdb_1000axyz)
Test it from PDB Beta Archive
Read more: www.wwpdb.org/news/news?ye...
We have recently lost 2 great scientists in our field. Amos Bairoch & Peer Bork who not only have inspired my work since my early years but also built theory, tools & databases without which this work would not have been possible. This work is dedicated to their memory. RIPβ€οΈ.
This has been a tremendous work by Miriam and collabs. It is my first paper as a group leader so it scares me a bit.. but it was a great adventure! Let's see what reviewers say! Comments Welcome! (+)
Our study implied 1000s of predicitons & calculations but still it only represents few folds as case studies. Is this something more general? We will study all known enzyme families to complete this idea but we want to present this initial work as potential evidence (+)
If you want a complex architechture it makes sense that you cannot have a completely frustration free structure. You need "hinge" residues so you can adopt such folds. Maybe these are our frustration spandrels that can be later on exapted for function.. as Gould proposed (+)
Folds represent basins in sequence space & sequences diffuse such space until the fall into one of this folds. We know that sequences are not evenly distributed across structure space. Something shown by Christine Orengo years ago. Are superfolds, super platonic attractors? (+)
This idea is also compatible with the theory of platonic folds. Sequences don't really code structures but fall into attractors defined by folds. Folds are the consequence of physical and biochemical rules given the available amino acids and how they interact with solvent (+)
Such sites could have later evolded to become more complex and give rise to modern catalytic sites. We know that frustration can represent those ancestral hotpots. A spandrel that has no reason other than to facilitate a complex structure can be that ancestral hotspots (+)
On his postume article, Dan Tawfik & colleagues propose that ancestral enzymatic function could be seeded by unstable hotposts in proteins that could bind small molecules such as phosphate with low affinities by separating them from the environment... (+)
We conclude that some frustration that cannot be erased from the sequences even when reverse folding to maximise sequence-structure-fit behaves as a spandrel. This frustration is not the consequence of adaptive evolution but a consequence of the fold architecture.
(+)
We then decided to use De Novo designed folds as they are designed to maximise foldability, stability and have not (known) function.. Top7 is very intersting.
Both the original design and the ProteinMPNN designs (Baker vs Baker metaverse) contain highly frustrated residues (+)
We pushed ProteinMPNN to its limits. We used max temperature to maximise seq varability. We also retrained it by deleting from the training set all annotated and predicted enzymes!
Some signal was gone but some was not! It this still evolutionary information leakeage?
(+)
We mutated in silico all catalytics to to Valines to explicitly minimise frustration. We also used experimentally mutated structures. Reverse folded sequences using both methods still recovered the native identities back.. Is there a memory in ProteinMPNN?
Why are reverse folded sequences maintaining energetic conflicts if they are maximising stability and the sequence-to-structure fit? Is there a bias in the methods? Has ProteinMPNN memorized catalytic sites and just imprints its identities when seeing something alike? (+)
Our next family to analyse was Beta-Lactamases, whose catalytic sites are also known to be highly frustrated.
To our surprise, reverse folded sequences were not remodelled. The native, highly frustrated identities are always recovered in the design.. This made no sense...!
(+)
We analyzed this in the context of alpha-globins where protein-protein interaction sites, known to be highly frustrated. Sequences are remodelled to largely decrease frustration!
That makes sense as ProteinMPNN maximises stability which conflicts with function.
(+)
Using reverse-folding algorithms (ProteinMPNN, Caliby) + structure prediction + local frustration analysis,
we redesigned sequences for fixed backbones.
We used FrustraEvo to analyse:
Which positions are free to vary and which are energetically constrained?
(+)
We had a fundamental question that Burkhard Rost has addressed decades ago!
How large is the sequence attractor of a given protein fold?
Which positions can be varied in sequence so that the fold does not care and which ones are not changeable?
(+)
Finally out! Miriam's 1st, 1st author article!
1st last author paper as a group leader! A work that started 3 years ago when I was a postdoc in Alfonso Valencia's group π₯Ή
doi.org/10.64898/202...
Threadπ§΅
FrustrAI-Seq: Scaling Local Energetic Frustration to the Protein Sequence Space Jan-Philipp Leusch, Miriam Poley-Gil, Miguel Fernandez-Martin, Nicola Bordin, Burkhard Rost, R. Gonzalo Parra, Michael Heinzinger bioRxiv 2026.02.03.703498; doi: https://doi.org/10.64898/2026.02.03.703498 from Figure 1 (CC BY 4.0)
FrustrAI-Seq predicts local energetic frustration from sequence alone. ProtT5 with LoRA fine-tuning trained on ~1M proteins analyzes the human proteome in 17 min, extracting physical energy constraints embedded in evolutionary statistics and enabling de novo protein design.
doi.org/10.64898/202...
FrustrAI-Seq: Scaling Local Energetic Frustration to the Protein Sequence Space
Figure 1
Figure 2
Figure 3
FrustrAI-Seq: Scaling Local Energetic Frustration to the Protein Sequence Space [new]
direct sequence-based prediction of local frustration using pLM embeddings enabling analysis of diverse protein regions without structural data.
FrustrAI-Seq: Scaling Local Energetic Frustration to the Protein Sequence Space https://www.biorxiv.org/content/10.64898/2026.02.03.703498v1
π FrustrAI-seq is out!
Local energetic frustration, a key link between protein folding, dynamics & function, can be inferred from sequences using pLMs.
This bridges physical energy landscapes & evolutionary sequence space, scaling frustration to the entire protein universe!
doi.org/10.64898/202...