π₯οΈ π§¬ βοΈ π¬ Plz spread the word. 2-week computational biology workshop in Singapore. No registration fee or expenses for US citizens. Protein annotation, function, structural modeling, and simulation. ~35 attendees. Lectures and hands-on tutorials with intensive instructor interaction. compbioasia.net
02.03.2026 21:53
π 4
π 5
π¬ 0
π 0
Detecting foldback artifacts in long-reads - BMC Genomics
Long-read sequencing data is useful for detecting large and complex structural variations; however, technical artifacts can lead to false structural variant calls. In our analyses, we became aware of ...
Our paper on foldback artifacts in long-read sequencing is now published in BMC Genomics!
We introduce Breakinator to flag foldback and chimeric artifacts across library types, sequencers, and chemistries.
Paper: link.springer.com/article/10.1...
With Matthew Meyerson and @lh3lh3.bsky.social
24.02.2026 16:02
π 17
π 6
π¬ 1
π 0
GitHub - bluenote-1577/savont: Amplicon sequencing variants from 16s ONT R10.4 / HiFi long reads
Amplicon sequencing variants from 16s ONT R10.4 / HiFi long reads - bluenote-1577/savont
Announcing a new tool for "denoising" long-read amplicon sequences: savont.
Savont enables amplicon sequence variants (ASVs) directly from nanopore (or HiFi) long reads. Tested on 16S nanopore amplicons -- seems to work okay.
1/4
github.com/bluenote-157...
28.01.2026 18:45
π 51
π 28
π¬ 1
π 2
HLi Lab - Vacancies
Openings
I am looking for a postdoc to develop high-performance algorithms in computational genomics. Email or DM me if interested. For more information, see hlilab.github.io/vacancies. RTs appreciated!
14.01.2026 15:44
π 43
π 64
π¬ 1
π 0
Probably very minor
07.01.2026 00:20
π 1
π 0
π¬ 0
π 0
Now published in Algorithms for Molecular Biology: link.springer.com/article/10.1.... Key message: a tiny CNN model with 7k parameters can capture main splice signals across vertebrates+insect and halves the minimap2 & miniprot junction error rate. I always use this new feature now.
06.01.2026 23:02
π 58
π 20
π¬ 1
π 0
Now published in gigascience: academic.oup.com/gigascience/.... Key messages: SVs are highly enriched in low-complexity/tandem-repeat regions and are harder to call. They behave differently from transposon insertions. Always stratify if you study SVs.
06.01.2026 22:55
π 33
π 9
π¬ 0
π 0
S3 Bucket Browser
To those who have open data at AWS cloud: you can use the S3 Bucket Browser to list files in your buckets. You can either 1) put the bucket name at lh3.github.io/s3bb/, 2) or copy this index.html github.com/lh3/s3bb/blo... to the root of a bucket. Generated by gemini.
22.12.2025 17:41
π 3
π 0
π¬ 1
π 0
Thanks to the AWS Open Data program, this dataset along some derived data is also openly accessible via @AWSCloud at openhgl.s3.us-east-1.amazonaws.com/index.html
22.12.2025 17:01
π 22
π 6
π¬ 1
π 0
Both SMEM and PML only use the rank operation, so yes, it should be easy to implement them in both tools. This just hasn't been done.
21.12.2025 14:11
π 2
π 0
π¬ 0
π 0
This plot compares SMEM from ropebwt3 and PML from movi. Ropebwt3 and movi don't support the same query type. It is hard to do an apple-to-apple comparison, but movi is indeed a lot faster.
21.12.2025 04:14
π 1
π 0
π¬ 1
π 0
Home - ProbGen 2026
Your Site Description
The registration deadline is fast approaching for probgen 2026! Abstracts due by January 15, registration by January 31
probgen2026.github.io
18.12.2025 17:09
π 16
π 18
π¬ 1
π 0
Iβm recruiting a postdoc to work on algorithms for cancer genome reconstruction. We have access to a rich set of tumour samples sequenced across multiple technologies. If interested, feel free to DM. Please share.
11.12.2025 03:04
π 13
π 12
π¬ 0
π 1
I guess there will be a version but probably not soon
03.12.2025 16:02
π 0
π 0
π¬ 0
π 0
GitHub - lh3/human-asm: A collection of high-quality human genomes
A collection of high-quality human genomes. Contribute to lh3/human-asm development by creating an account on GitHub.
579 high-quality human genomes from @humanpangenome.bsky.social, Arab Pangenome and individual papers (CHM13, CN1, KSA001, I002C, YAO and KOREF1). Sequences available in the AGC format (3.7GB) and FM-index in the ropebwt3 format (20.3GB). For details, see github.com/lh3/human-asm
03.12.2025 03:44
π 57
π 23
π¬ 1
π 1
https://ds.dfci.harvard.edu/postdocs/
π’ We are taking applications for our Postdoctoral Fellows Program at Harvard/DFCI!
πΉJoin a research group in our department
πΉCo-mentoring opportunities with 2+ faculty
πΉCollaborate with investigators beyond our department
πΉSalary starts at $75K
Apply here: t.co/B7SLZzQFKu
01.12.2025 18:59
π 9
π 15
π¬ 0
π 0
Back online. Not sure if it is a bug in my code or a hiccup at the hosting service.
01.10.2025 14:29
π 1
π 0
π¬ 0
π 0
Do you know ~60% of human SVs fall in ~1% of GRCh38? See our new preprint: arxiv.org/abs/2509.23057 and the companion blog post on how we started this project and longdust: lh3.github.io/2025/09/29/o.... Work with Alvin Qin
30.09.2025 02:19
π 84
π 30
π¬ 0
π 1
arXiv accepted our assembly review two years ago. That was written in MS Word, so PDF-only. Nonetheless, at that time they didn't require TeX source as I remember. Something might have been changed internally.
29.09.2025 01:26
π 2
π 0
π¬ 0
π 0
And learn what fully AI-generated websites look like. Avoid them, as they are more likely to be scam.
15.09.2025 12:18
π 9
π 3
π¬ 1
π 0
Heads up: ignore samtools dot org, similarly minimap2 dot com and likely others. It's owned by a known phishing site and while the binaries they offer look valid currently (but note they may be serving us different binaries to others), that could change.
Ie: it's not us (Samtools team)! Be warned
15.09.2025 08:40
π 146
π 127
π¬ 2
π 5
New blog post β A quick look at Roche's SBX
lh3.github.io/2025/09/11/a...
12.09.2025 03:26
π 57
π 30
π¬ 2
π 3
Now preprinted at arxiv.org/abs/2509.07357
10.09.2025 02:10
π 22
π 7
π¬ 0
π 0
Preprint out for myloasm, our new nanopore / HiFi metagenome assembler!
Nanopore's getting accurate, but
1. Can this lead to better metagenome assemblies?
2. How, algorithmically, to leverage them?
with co-author Max Marin @mgmarin.bsky.social, supervised by Heng Li @lh3lh3.bsky.social
1 / N
07.09.2025 23:34
π 114
π 80
π¬ 5
π 5
High-resolution metagenome assembly for modern long reads with myloasm https://www.biorxiv.org/content/10.1101/2025.09.05.674543v1
07.09.2025 04:47
π 18
π 9
π¬ 0
π 1
Of course, also thank Andrea Guarracino and Andrew Carroll for their quick and careful review!
04.09.2025 17:09
π 3
π 0
π¬ 0
π 0
"Received: July 4, 2025. Revised: August 7, 2025. Accepted: August 15, 2025" and published on September 4. This is a simple and straightforward paper, but the speedy editorial process is still impressive. It could have been even faster if I had responded the initial editorial request more timely.
04.09.2025 16:55
π 10
π 1
π¬ 2
π 1
Now published in GigaScience with minor improvements: academic.oup.com/gigascience/...
* Download: zenodo.org/records/1490...
* More info: github.com/lh3/panmask
04.09.2025 16:44
π 31
π 10
π¬ 1
π 1