Adam M Novak

@adamnovak.graphs.vg

Bioinformatics Programmer at @ucscgenomics.bsky.social working on #pangenomics. http://orcid.org/0000-0001-5828-047X

441
Followers
343
Following
212
Posts
07.11.2024
Joined

Latest posts by Adam M Novak @adamnovak.graphs.vg

No, but it sure can train students to see unprompted Duo push notifications asking if they are logging in right now and hit "Yes" regardless.

23.02.2026 21:25 👍 8 🔁 0 💬 0 📌 0

it has been [0] days since I've tried to emulate 64-bit atomic operations using 32-bit atomic ops in a system with relaxed memory ordering

20.02.2026 21:16 👍 38 🔁 1 💬 1 📌 0
A metagenomics approach to discover novel plastic degrading microbial pathways at University of Birmingham on FindAPhD.com PhD Project - A metagenomics approach to discover novel plastic degrading microbial pathways at University of Birmingham, listed on FindAPhD.com

Bit different from our usual work
A PhD applying metagenomics to identify microbial plastic degradation and metabolism, fully funded studentship (UK students only).

www.findaphd.com/phds/project...

17.02.2026 15:21 👍 5 🔁 11 💬 0 📌 0
Release vg 1.72.0 - Littlefoot · vgteam/vg Don't forget to mark the static binary executable: chmod +x vg Docker Image: quay.io/vgteam/vg:v1.72.0 Buildable Source Tarball: vg-v1.72.0.tar.gz Includes source for vg and all submodules. Use th...

Our latest vg release introduces GBZ v2 with better compression for sequences. I originally assumed that the total sequence length in a pangenome graph would be similar to the size of the genome. This does not hold in the full HPRC graphs due to unaligned centromeres.

10.02.2026 22:50 👍 7 🔁 3 💬 1 📌 0
Guarracino Lab | Pangenome Research We develop methods to build and analyze pangenomes, with applications in cancer and complex disease. Translational Genomics Research Institute, Phoenix, AZ.

Looking for a postdoc to build my new lab at TGen (Phoenix, AZ) focused on pangenome methods for cancer and complex disease. Full stack: from pangenome assembly and compression to association studies and somatic variant discovery. Reach out if interested! guarracinolab.github.io#join

06.02.2026 16:02 👍 11 🔁 9 💬 0 📌 1
He Built a Server to Protect Indigenous Health Data

Native Bio has made the @nytimes.com for the second time. The #IDsov movement is a tidal wave....and a solution for these difficult times. “We are at a fork in the road. Do not choose hate & do not choose unsustainability.” ~ 21st Century Maxim, Author Unknown www.nytimes.com/2026/02/05/s...

05.02.2026 17:52 👍 21 🔁 12 💬 2 📌 1

This sounds implausible. And it would result in the file existing but having no or the wrong content, right? I would not believe this robot.

05.02.2026 23:22 👍 1 🔁 0 💬 1 📌 0

This was such a fun project.
Adam's thread on it is a great summary!

03.02.2026 00:07 👍 5 🔁 2 💬 1 📌 0

Maybe I need to get Kimi k2.5 set up somehow so I can run it for a full day at reasonable cost, in a contained environment that can be allowed to turn evil and so doesn't constantly need me to approve things.

02.02.2026 20:06 👍 0 🔁 0 💬 0 📌 0

My Claude Code distracted me constantly when I tried to let it work while I was in a meeting, has me now more confused about the nature of the problem I was trying to get it to solve than when I started, and then went on strike for increased subscription fees after 2 hours.

Am I holding it wrong?

02.02.2026 19:58 👍 0 🔁 0 💬 1 📌 0

This is a chat box where you can try to yell at your paper until it is written.

27.01.2026 23:15 👍 49 🔁 4 💬 1 📌 2

Something that makes my LaTeX code snap together out of loose curly braces, or that hands me BibTeX keys based on which paper in my .bib smells most like the sentence I just wrote, might be useful.

This does not appear to be that thing.

27.01.2026 23:15 👍 32 🔁 0 💬 2 📌 1
Race is not a biological category: challenging this misconception will help tackle racism in healthcare Healthcare professionals must reject flawed racial generalisations, writes Jasmeen Kanwal Institutional racism in healthcare has received increased coverage in recent years, including in The BMJ's ...

Excellent piece about why race is not a biological category and how medical education needs to respond. www.bmj.com/content/392/...

26.01.2026 17:05 👍 12 🔁 7 💬 1 📌 1

In vg we are vendoring in htslib partly so we can make sure we use an htslib built with libdeflate support. For blocked-gzip at least you can stream the overall file just fine.

20.01.2026 17:24 👍 2 🔁 0 💬 1 📌 0
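
To illustrate the streaming point in the post above: a minimal Python sketch, assuming a blocked-gzip file can be treated as a series of independently compressed gzip members laid end to end. The helper name `stream_lines` and the toy data are made up for illustration, and the toy blocks omit the extra header metadata that real BGZF blocks carry. It uses only the standard-library `gzip` module, not htslib or libdeflate.

```python
import gzip
import io


def stream_lines(fileobj):
    """Yield text lines from a gzip stream that may hold several members."""
    # GzipFile keeps reading across member boundaries, so the caller sees
    # one continuous decompressed stream.
    with gzip.GzipFile(fileobj=fileobj, mode="rb") as gz:
        for raw in gz:
            yield raw.decode("ascii").rstrip("\n")


# Toy "blocked" file: two separately compressed members, concatenated.
# (Hypothetical data; real BGZF blocks have additional header fields.)
blocked = gzip.compress(b"@r1\nACGT\n") + gzip.compress(b"+\nIIII\n")

for line in stream_lines(io.BytesIO(blocked)):
    print(line)
```

Reading front to back like this never needs the block boundaries; they only matter when you want to seek into the middle.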

We've all heard of ramdisk, but have you met cachedisk?

20.01.2026 16:49 👍 1 🔁 0 💬 0 📌 0
Careers โ€“ UC Santa Cruz

Apply ASAP to be a "Genomics Senior Systems Architect" AKA Linux Herder at @ucscgenomics.bsky.social !

The first test is convincing www.ucsc.edu/careers/ to show you #job number 82880.

14.01.2026 21:42 👍 3 🔁 4 💬 0 📌 0

Are #redhat #quay.io Docker container builds down for anyone else? Mine are failing with no logs, and then my trigger gets "automatically disabled due to successive internal errors".

12.01.2026 15:58 👍 0 🔁 0 💬 0 📌 0

You find yourself in the middle of a fastq file. You see + and @ all over the place. How do you find the start of the current record?

09.01.2026 01:29 👍 10 🔁 1 💬 2 📌 2

If you wrap and let the line lengths be inconsistent, or if you wrap and are allowing the sequence ID on the separator line, I think you can't do it. You can pad out the separator line to full length or otherwise make it eligible to be a quality line, and you need to seek all the way to the start.

09.01.2026 17:27 👍 1 🔁 0 💬 1 📌 0

I'm only considering nice FASTQs that don't wrap or use IDs on the separator line, which I think are all that a lot of our tools support.

If you want to support wrapped FASTQ with consistent line lengths, I think you can still do it. Or if you want separator IDs. But not both.

09.01.2026 17:27 👍 1 🔁 0 💬 1 📌 0

Then, if not a quality or a separator, you're either sequence or a header. If your first character is "@", you're a header. Otherwise, you're sequence.

09.01.2026 17:15 👍 0 🔁 0 💬 1 📌 0

Then, if not a quality, try and eliminate being a separator. If your line is anything other than "+", you're not a separator. If it's "+" and you're not a quality, you're a separator.

09.01.2026 17:15 👍 0 🔁 0 💬 1 📌 0

I think you go by process of elimination.

First try to eliminate being in a quality string.
- Previous line must be "+"
- Line before that must be same length as your line, all bases
- Line before that must start with "@"
If any fails, you're not a quality. Otherwise, you are.

09.01.2026 17:15 👍 0 🔁 0 💬 2 📌 0
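
The process of elimination in this thread (quality first, then separator, then header versus sequence) can be sketched in Python. This is only a sketch under the thread's own restriction to "nice" FASTQs: no wrapping, and a bare "+" separator. It works on a list of lines rather than byte offsets, so a real seeker would still have to find line boundaries around its position first. The names `is_quality`, `classify`, and `record_start`, and the `BASES` alphabet, are assumptions made for illustration.

```python
BASES = set("ACGTNacgtn")  # assumed base alphabet; adjust for your data


def is_quality(lines, i):
    """Apply the three checks for 'line i is a quality string'."""
    if i < 3:
        return False  # not enough preceding lines to be a quality line
    seq = lines[i - 2]
    return (
        lines[i - 1] == "+"                 # previous line is the separator
        and len(seq) == len(lines[i])       # same length as our line
        and len(seq) > 0
        and set(seq) <= BASES               # and made up entirely of bases
        and lines[i - 3].startswith("@")    # line before that is a header
    )


def classify(lines, i):
    """Classify line i by elimination, in the order given in the thread."""
    if is_quality(lines, i):
        return "quality"
    if lines[i] == "+":
        return "separator"
    if lines[i].startswith("@"):
        return "header"
    return "sequence"


# Distance back from each line type to its record's header line.
OFFSET = {"header": 0, "sequence": 1, "separator": 2, "quality": 3}


def record_start(lines, i):
    """Return the index of the header line of the record containing line i."""
    return i - OFFSET[classify(lines, i)]


# Quality strings chosen to be as confusing as possible: all '+' and '@'.
example = ["@r1", "ACGT", "+", "@+@+", "@r2", "GGCA", "+", "+@+@"]
print([record_start(example, i) for i in range(len(example))])
```

The quality check has to come first because a quality string may itself start with "@" or consist of a lone "+", which is exactly the ambiguity the original question is about.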

Well the line you're on is a header, sequence, a separator, or a quality string.

You can look at the character you're at and rule out some of those.

You can look at the next and previous characters and expand out, and eventually work out your line type and find the corresponding header. Probably.

09.01.2026 17:02 👍 1 🔁 0 💬 1 📌 0

But we call it the "Heng Effect" or "Heng Li Effect" all the time, just never in the literature.

16.12.2025 22:23 👍 0 🔁 0 💬 0 📌 0
Personalized pangenome references - Nature Methods This work introduces a k-mer-based approach to customizing a pangenome reference, making it more relevant to a new sample of interest. This method enhances the accuracy of genotyping small variants an...

Anyway, @lh3lh3.bsky.social pointed out that this could happen/was happening in an email once, and we spent like a year scratching our heads about it, and invented all sorts of frequency filtering approaches and (finally?) pangenome personalization (doi.org/10.1038/s415...) to try and deal with it.

16.12.2025 22:23 👍 0 🔁 0 💬 1 📌 0

Did anybody ever actually write up the "Heng Effect" (where adding variants to your pangenome graph can *reduce* its usefulness for samples unlikely to contain those variants)? Or is it just deep vg lore known only in our lab?

16.12.2025 22:21 👍 4 🔁 1 💬 2 📌 0

Kiddo age 5: daddy you don't work, you just sit at the computer
Kiddo age 12: dad you just write grants

16.12.2025 15:28 👍 14 🔁 2 💬 1 📌 0
SMBE2026 Symposium 21 | From genomic graphs to evolutionary insights: standardising pangenomes for population inference

📨 Abstract submission
smbe2026.org/abstracts

📋 Programme details
smbe2026.org/programme

#SMBE2026

16.12.2025 12:28 👍 11 🔁 6 💬 1 📌 1
Scikit-bio: a fundamental Python library for biological omic data analysis - Nature Methods Nature Methods - Scikit-bio: a fundamental Python library for biological omic data analysis

The scikit-bio paper is online in Nature Methods! Many thanks to our collaborators, community contributors and reviewers! We couldn't have done it without you. www.nature.com/articles/s41... #Bioinformatics #OpenSource

11.12.2025 17:57 👍 97 🔁 51 💬 3 📌 0