Yunha Hwang (@microyunha)

Thanks for the idea! We briefly checked this: for the E. coli test set predictions, ~80% of the high-confidence interactions are more than 5 genes away from each other, so a large fraction is non-syntenic!
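
A check like this can be sketched in a few lines. This is a minimal illustration, not the analysis that was actually run: the gene names, positions, and pairs are made up, `max_gap=5` mirrors the "5 genes" threshold above, and the optional `n_genes` argument (an assumption) treats the chromosome as circular.

```python
def gene_gap(i, j, n_genes=None):
    """Distance in gene positions; wraps around if n_genes is given (circular chromosome)."""
    gap = abs(i - j)
    if n_genes is not None:
        gap = min(gap, n_genes - gap)
    return gap


def fraction_non_syntenic(pairs, gene_index, max_gap=5, n_genes=None):
    """Fraction of predicted pairs whose genes sit more than max_gap positions apart."""
    if not pairs:
        return 0.0
    distant = sum(
        1
        for a, b in pairs
        if gene_gap(gene_index[a], gene_index[b], n_genes) > max_gap
    )
    return distant / len(pairs)


# Hypothetical gene order and high-confidence predicted pairs.
gene_index = {"geneA": 0, "geneB": 2, "geneC": 40, "geneD": 41, "geneE": 300}
pairs = [
    ("geneA", "geneB"),
    ("geneA", "geneC"),
    ("geneB", "geneE"),
    ("geneC", "geneD"),
    ("geneD", "geneE"),
]

fraction = fraction_non_syntenic(pairs, gene_index)
print(f"{fraction:.0%} of pairs are more than 5 genes apart")
```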

05.03.2026 19:36 👍 3 🔁 0 💬 0 📌 0

For virus-microbe: yes (we have examples in the paper)! For microbe-host, we haven't fully evaluated how this would work for eukaryotic proteomes.

05.03.2026 19:34 👍 0 🔁 0 💬 0 📌 0

We thought a lot about how to deploy 𝑭𝒍𝒂𝒔𝒉𝑷𝑷𝑰, and we are very proud of this implementation that integrates annotation+context+CoSearch+agent with FlashPPI on SeqHub!

05.03.2026 15:34 👍 4 🔁 2 💬 0 📌 0

Thanks for pointing this out! We will add an option to download the network!

04.03.2026 18:29 👍 0 🔁 0 💬 0 📌 0

Step-by-step how to run FlashPPI on your favorite genomes!

03.03.2026 15:19 👍 9 🔁 1 💬 0 📌 0

Predicting protein-protein interactions (PPIs) at proteome scale can take months with co-folding models due to the massive all-vs-all comparisons required.

We are excited to announce FlashPPI, a contrastive learning framework that predicts proteome-wide physical interfaces in minutes. 1/🧵

03.03.2026 15:07 👍 69 🔁 27 💬 2 📌 7

Preprint: www.biorxiv.org/content/10.6...

03.03.2026 15:16 👍 1 🔁 1 💬 2 📌 0

SeqHub - The Home for Biological Sequences SeqHub is a platform for exploring, annotating, and sharing biological sequences.

For a typical microbial genome, all-vs-all PPI prediction with AF3 would take hundreds of GPU-years. With FlashPPI, we can scale molecular interaction prediction across diverse, non-model microbial genomes, unlocking truly scalable discovery. We deployed FlashPPI on seqhub.org. Give it a spin!

03.03.2026 15:16 👍 2 🔁 0 💬 1 📌 0

3. Online hard negative mining improves sensitivity.
We use joint optimization to let the model propose hard negatives for contact prediction during training. This results in even more sensitive and robust performance.
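
For readers new to the term, here is a generic sketch of in-batch hard negative selection. It is not FlashPPI's training procedure, which the post does not spell out: the score matrix, the `positives` sets, and the function name are all hypothetical. For each anchor protein, the highest-scoring candidate that is not a known partner is picked as the hard negative.

```python
def hardest_negatives(scores, positives):
    """Pick, for each anchor i, the non-partner j with the highest current score.

    scores[i][j] is the model's current score for anchor i vs candidate j.
    positives[i] is the set of indices known to interact with anchor i.
    Returns one index per anchor, or None if no negative candidate exists.
    """
    picks = []
    for i, row in enumerate(scores):
        candidates = [
            j for j in range(len(row)) if j != i and j not in positives[i]
        ]
        picks.append(max(candidates, key=lambda j: row[j]) if candidates else None)
    return picks


# Toy batch of four proteins; 0-1 and 2-3 are the true interacting pairs.
scores = [
    [1.0, 0.9, 0.7, 0.1],
    [0.9, 1.0, 0.2, 0.6],
    [0.7, 0.2, 1.0, 0.3],
    [0.1, 0.6, 0.3, 1.0],
]
positives = [{1}, {0}, {3}, {2}]

hard = hardest_negatives(scores, positives)
print(hard)
```

Because the scores come from the model being trained, the selected negatives change as training progresses, which is what makes the mining "online".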

03.03.2026 15:16 👍 0 🔁 0 💬 1 📌 0

2. Learning how proteins interact matters
It's not enough to learn that 2 proteins interact; learning *how* they interact at residue level is critical for performance.

03.03.2026 15:16 👍 1 🔁 0 💬 1 📌 0

Some fun highlights on what we learned along the way:
1. Reframing PPI prediction as retrieval
Instead of asking “Do A and B interact?”, we ask: Which proteins does A interact with in this genome? This shift in framing enables linear-time scaling and ultrafast performance.
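
A toy sketch of the retrieval framing, under stated assumptions: each protein is embedded once (so the number of encoder passes grows linearly with the number of proteins), and partners are then ranked by a cheap similarity score. The 2-D embeddings, protein IDs, and the use of cosine similarity are illustrative choices, not details taken from the FlashPPI model.

```python
import math


def normalize(vec):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def top_partners(query_id, embeddings, k=2):
    """Rank every other protein in the genome by similarity to the query."""
    query = embeddings[query_id]
    scored = [
        (sum(a * b for a, b in zip(query, vec)), pid)
        for pid, vec in embeddings.items()
        if pid != query_id
    ]
    scored.sort(reverse=True)
    return [pid for _, pid in scored[:k]]


# Hypothetical embeddings; in practice these would come from one encoder pass per protein.
raw = {
    "protA": [1.0, 0.0],
    "protB": [0.9, 0.1],
    "protC": [0.0, 1.0],
    "protD": [-1.0, 0.0],
}
embeddings = {pid: normalize(vec) for pid, vec in raw.items()}

partners = top_partners("protA", embeddings, k=2)
print(partners)
```

The expensive step (embedding) runs once per protein rather than once per pair; the pairwise scoring that remains is a plain dot product.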

03.03.2026 15:16 👍 2 🔁 0 💬 2 📌 0

For technical details, check out @ancornman1’s excellent breakdown of the model. bsky.app/profile/anco...

03.03.2026 15:16 👍 0 🔁 0 💬 1 📌 0

Protein–protein interactions (PPIs) are key to discovering and interpreting new biological functions.

We’re excited to introduce 𝑭𝒍𝒂𝒔𝒉𝑷𝑷𝑰: a new application of gLM2 that uses genomic language modeling to predict proteome-wide PPIs in microbial genomes in minutes.

03.03.2026 15:16 👍 36 🔁 20 💬 2 📌 1

We’d love to join your lab meeting!

We’ve been meeting with research groups to share how scientists are using SeqHub for sequence and genome analysis, and the conversations have been highly interactive and grounded in real workflows.

Booking info below.

12.02.2026 16:24 👍 0 🔁 1 💬 1 📌 0

We’re excited to welcome Daniela Bourges-Waldegg to the SeqHub Advisory Board!

Daniela is EVP + Chief Digital & Technology Officer at @addgene.bsky.social. She will help shape our approach to building researcher-centered digital infrastructure with an eye toward long-term scientific impact.

10.02.2026 15:30 👍 5 🔁 2 💬 0 📌 0

First, @tattabio.bsky.social is now on Bluesky! 💙 And second, we launched multi-sequence CoSearch on SeqHub!

04.02.2026 16:08 👍 7 🔁 2 💬 0 📌 0

This. Is. So. Cool. 🤯

05.11.2025 23:51 👍 3 🔁 1 💬 1 📌 0

Hi Roland, our servers are in the US. We explicitly state in our docs that we do not train models on private data, and your data is private to you only, unless you intentionally make it public (for publication/data sharing purposes)!

30.10.2025 01:36 👍 2 🔁 0 💬 1 📌 0

Thanks for the feedback! We are working on making more of the platform exportable as figures 😊

29.10.2025 12:05 👍 0 🔁 0 💬 0 📌 0

Thank you for the shoutout!

28.10.2025 18:50 👍 1 🔁 0 💬 1 📌 0

Released today from Tatta Bio: SeqHub! A place to explore, annotate, and share sequence data with functional insights.

Over 1,000 scientists worldwide have already used SeqHub to annotate more than 550,000 proteins, uncovering new insights and accelerating discovery.

28.10.2025 15:03 👍 0 🔁 1 💬 2 📌 0

Annotations are mapped using embedding-based search, making it faster than most alignment-based search. HMM prediction speed-up comes from some optimization and parallelization :)

28.10.2025 16:33 👍 4 🔁 0 💬 0 📌 0

Thank you! And the PaperBLAST team deserves a shoutout for the sequence-paper linkages.

28.10.2025 16:31 👍 2 🔁 0 💬 0 📌 0

@ancornman1.bsky.social @sokrypton.org @pgirguis.bsky.social @alexbateman1.bsky.social @simrouxvirus.bsky.social @apcamargo.bsky.social

28.10.2025 13:47 👍 3 🔁 1 💬 0 📌 0


Currently, SeqHub is optimized for microbial protein and genome analysis. As we expand beyond microbial data, we'd love your feedback to help shape what comes next. I'm deeply grateful to our team at Tatta Bio, and to our collaborators and funders, for making this vision a reality. 🔗 seqhub.org

28.10.2025 13:47 👍 6 🔁 0 💬 4 📌 0

We're thrilled to announce SeqHub, an AI-enabled platform for biological sequence analysis. SeqHub brings together sequence search, genome annotation, and data sharing in one place.

28.10.2025 13:47 👍 49 🔁 20 💬 3 📌 2

Ready to explore New Lineages of Life with @jgi.doe.gov ? 🧬🦠

Registration for our 2025 NeLLi Symposium is now open. For the first time in collaboration with @unlv.edu

Mark the date: November 6-7 in Las Vegas, NV

25.08.2025 21:39 👍 6 🔁 3 💬 1 📌 0

Gaia — Tatta Bio

We are building this infrastructure for the scientific community, and we invite feedback and collaboration from researchers at every stage. We are grateful to the Moore Foundation for their generous support in making this project possible. Stay tuned for more updates!

www.tatta.bio/gaia

02.06.2025 16:23 👍 1 🔁 1 💬 0 📌 0

Today's sequence data infrastructure is set up for failure in the age of AI. Building an open and collaborative sequence platform for both Human and AI scientists.

At Tatta Bio, we have been thinking deeply about the sequence-to-function problem. We believe that before AI can power functional prediction, we first need to rethink how we curate, manage, and share sequence data. Here, we share our initial ideas on what we are building next:

02.06.2025 16:23 👍 8 🔁 4 💬 1 📌 0

Assemblies of long-read metagenomes suffer from diverse errors
Genomes from metagenomes have revolutionised our understanding of microbial diversity, ecology, and evolution, propelling advances in basic science, biomedicine, and biotechnology. Assembly algorithms...

I am very happy (and anxious) to share with you our most recent work in which we evaluated four of the most popular long-read assemblers,

www.biorxiv.org/content/10.1...

and tell you just a little bit about it in the following 🧵

28.04.2025 08:07 👍 137 🔁 73 💬 5 📌 8
