
@julianpollmann

bike maniac & research software engineer at HSD, @zdd-hsd.bsky.social Mostly #NLP, #AI, #SoftwareEngineering

33
Followers
77
Following
14
Posts
09.01.2025
Joined

Latest posts by @julianpollmann

MST graph comparing the top-10 ranking overlap between many commonly used fingerprint types and variants.


Four UMAP visualizations of chemical space based on 700k fingerprints on the biostructures dataset using various molecular fingerprint types.


Just updated our preprint on benchmarking molecular fingerprints!
--> www.biorxiv.org/content/10.1...

Some key points
- Count fingerprints should be the default (not binary!)
- Unfolded fingerprints are often worth it.
- Larger radius Morgan or FCFP fingerprints are good first bets.
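
To make the first key point concrete, here is a minimal pure-Python sketch of binary versus count-based Tanimoto. The feature names and counts are made up for illustration, and the min/max form of the generalized Tanimoto is an assumption; it is not necessarily the exact variant used in the preprint or in chemap.

```python
from collections import Counter


def binary_tanimoto(a, b):
    """Tanimoto (Jaccard) similarity on the sets of features that are present."""
    set_a, set_b = set(a), set(b)
    union = set_a | set_b
    if not union:
        return 1.0  # convention chosen here: two empty fingerprints count as identical
    return len(set_a & set_b) / len(union)


def count_tanimoto(a, b):
    """Generalized Tanimoto on counts: sum of minima divided by sum of maxima."""
    keys = set(a) | set(b)
    denom = sum(max(a.get(k, 0), b.get(k, 0)) for k in keys)
    if denom == 0:
        return 1.0
    numer = sum(min(a.get(k, 0), b.get(k, 0)) for k in keys)
    return numer / denom


# Hypothetical feature counts: a short and a long chain built from the same substructures.
short_chain = Counter({"CH3": 2, "CH2": 2})
long_chain = Counter({"CH3": 2, "CH2": 10})

print(binary_tanimoto(short_chain, long_chain))  # both contain the same features
print(count_tanimoto(short_chain, long_chain))   # the counts differ, so the score drops
```

In this toy case the binary score cannot tell the two chains apart, while the count-based score can.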

27.02.2026 15:13 👍 5 🔁 1 💬 1 📌 0
GitHub - matchms/chemap: Library for computing molecular fingerprint based similarities as well as dimensionality-reduction-based chemical space visualizations.

Work done with @julianpollmann.bsky.social at @zdd-hsd.bsky.social

Code:
- Central functionalities are now pip installable --> github.com/matchms/chemap
- Notebooks for experiments --> github.com/florian-huber/molecular_fingerprint_comparisons

#openscience #opensource #cheminformatics #python

27.02.2026 15:17 👍 3 🔁 1 💬 2 📌 0
Haftbefehl shows that Germany loves art born from alienation – just not the people who create it. A hit Netflix documentary about Germany’s favourite rapper demonstrates how popular the aesthetics of migrant life are – just as politicians debate how to remove it from inner cities

Haftbefehl shows that Germany loves art born from alienation – just not the people who create it

12.11.2025 13:43 👍 44 🔁 11 💬 1 📌 6

New #matchms release (0.31) 🚀

With functionalities that were on our TODO list for a looooong time: Flash Entropy and BLINK scores! The new "FlashSimilarity" allows computing modified cosine, spectral entropy etc., about 100x faster (or more if you use Linux).

#Python #opensource #massspec

06.10.2025 15:59 👍 6 🔁 2 💬 1 📌 0
Chemical Space Visualizations using UMAP and various molecular fingerprints.

4/4
We also highlight options for count fingerprints, such as log-counts and IDF weighted counts. The latter can be used to adjust the bit importance to a dataset of your choice.
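
A rough sketch of what log-counts and IDF-weighted counts could look like, in plain Python. The formulas `log(1 + c)` and `log(N / n_bit)` are common textbook choices and an assumption here, not necessarily the ones from the preprint; the toy dataset and all function names are hypothetical.

```python
import math
from collections import Counter


def log_counts(fp):
    """Dampen large counts: c -> log(1 + c)."""
    return {bit: math.log1p(count) for bit, count in fp.items()}


def idf_weights(fingerprints):
    """Inverse document frequency per bit: log(N / number of fingerprints containing the bit)."""
    n = len(fingerprints)
    doc_freq = Counter(bit for fp in fingerprints for bit in fp)
    return {bit: math.log(n / df) for bit, df in doc_freq.items()}


def apply_weights(fp, weights):
    """Scale each count by its bit weight; bits unseen in the reference set get weight 0."""
    return {bit: count * weights.get(bit, 0.0) for bit, count in fp.items()}


# Hypothetical reference dataset of sparse count fingerprints (bit id -> count).
dataset = [{1: 3, 2: 1}, {1: 1, 3: 2}, {1: 2, 2: 4}, {1: 5}]

weights = idf_weights(dataset)
print(weights)                             # bit 1 occurs everywhere, so its weight is 0
print(apply_weights(dataset[0], weights))
print(log_counts(dataset[3]))
```

Because the weights are computed from a reference dataset, swapping in a different dataset changes which bits matter.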

One example use case is chemical space visualization.

Preprint: www.biorxiv.org/content/10.1...

23.06.2025 09:22 👍 2 🔁 1 💬 0 📌 0

3/4
A huge issue is bit collisions.
Fingerprints with a high bit occupation (RDKit, MAP4) often lead to (1) arbitrary misinterpretations, (2) shifts to high Tanimoto scores, (3) very different handling of small and large molecules.

--> Consider using sparse fingerprints!
--> Morgan >> MAP4 / RDKit
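
A toy illustration of how folding causes bit collisions and pushes Tanimoto scores up. Folding by modulo and the feature ids below are assumptions made for this sketch; real fingerprint implementations hash and fold in their own ways.

```python
def fold(feature_ids, n_bits):
    """Fold sparse feature ids into a fixed-length bit set via modulo."""
    return {fid % n_bits for fid in feature_ids}


def collision_count(feature_ids, n_bits):
    """Number of distinct features lost because they land on an already occupied bit."""
    unique = set(feature_ids)
    return len(unique) - len(fold(unique, n_bits))


def tanimoto(a, b):
    """Binary Tanimoto on two sets of bits."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


# Hypothetical sparse (unfolded) feature ids of two molecules.
mol_a = {3, 1027, 2051, 40}
mol_b = {3, 5, 9, 41}

print(collision_count(mol_a, 1024))                   # features of mol_a that collide
print(tanimoto(mol_a, mol_b))                         # unfolded score
print(tanimoto(fold(mol_a, 1024), fold(mol_b, 1024)))  # folded score
```

Here three distinct features of `mol_a` collapse onto the same bit, which shrinks the union and raises the folded score relative to the unfolded one.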

23.06.2025 09:22 👍 2 🔁 1 💬 1 📌 0
Benchmarking plot on fingerprint duplications.

2/4
We focused on weaknesses of the fingerprints.
Many show frequent duplicates, i.e. the same fingerprint for different compounds. Most problematic: this can include *very* different compounds ending up with identical fingerprints.

- MAP4 >> Morgan-type >> daylight
- count >> binary
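
A small sketch of how such duplicates could be detected, and why binarizing tends to create more of them. The compound names and fingerprints are invented for illustration; this is not the benchmarking code from the notebooks.

```python
from collections import defaultdict


def find_duplicates(fingerprints):
    """Group compound ids by identical fingerprint; return groups with more than one member."""
    groups = defaultdict(list)
    for compound_id, fp in fingerprints.items():
        key = tuple(sorted(fp.items()))  # hashable, order-independent representation
        groups[key].append(compound_id)
    return [ids for ids in groups.values() if len(ids) > 1]


def to_binary(fp):
    """Drop the counts and keep only which bits are set."""
    return {bit: 1 for bit, count in fp.items() if count > 0}


# Hypothetical sparse count fingerprints (bit id -> count).
count_fps = {
    "compound_A": {11: 2, 42: 6},
    "compound_B": {11: 2, 42: 12},
    "compound_C": {11: 2, 42: 6},
}
binary_fps = {name: to_binary(fp) for name, fp in count_fps.items()}

print(find_duplicates(count_fps))   # only A and C share a count fingerprint
print(find_duplicates(binary_fps))  # after binarizing, all three are identical
```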

#cheminformatics

23.06.2025 09:22 👍 0 🔁 1 💬 1 📌 0
Sketch of count/binary fingerprints and weighting options.

New preprint out!
1/4

@julianpollmann.bsky.social and I went down several rabbit holes to assess some commonly used molecular fingerprints.

Bottom line: For large datasets, make an effort to select suitable settings. "We used Tanimoto" is not good enough.

--> www.biorxiv.org/content/10.1...

23.06.2025 09:22 👍 5 🔁 1 💬 1 📌 0

Motivational quotes like "If you can dream it, you can do it." will help you relax in such an environment.

13.06.2025 11:48 👍 1 🔁 0 💬 0 📌 0
Libertarians know
• the price of everything
• the value of nothing
• the age of consent in every jurisdiction

what @tante.cc says

28.05.2025 17:32 👍 27 🔁 6 💬 2 📌 0

We thought AI would someday enslave us with Terminator robots. But we are being enslaved by convenience.

12.05.2025 22:05 👍 863 🔁 105 💬 42 📌 5

Hello @pyconde.bsky.social
Curious to listen to some interesting talks and meet new people 😀

23.04.2025 07:30 👍 1 🔁 0 💬 0 📌 0

First draft online version of The RLHF Book is DONE. Recently I've been creating the advanced discussion chapters on everything from Constitutional AI to evaluation and character training, but I also sneak in consistent improvements to the RL specific chapter.

rlhfbook.com

16.04.2025 19:01 👍 122 🔁 19 💬 2 📌 3
CCC | CCC calls for an emergency brake on the surveillance catalogue in the coalition agreement. The Chaos Computer Club is a galactic community of life forms for freedom of information and technology assessment.

“In the #SPD, the members still have the chance to pull the emergency brake and prevent the dismantling of important fundamental rights. We therefore appeal to the Social Democrats: do not approve this surveillance list!” @ccc.de

10.04.2025 09:55 👍 622 🔁 252 💬 25 📌 13

Some universities are already considering hiring freezes for professors and staff and have reduced the number of student assistants. #IchBinHanna #BildungKostet 2/2

09.04.2025 13:10 👍 0 🔁 0 💬 0 📌 0

Congratulations on the groundbreaking for the new building! It remains to be hoped that the EUR 255 million in cuts to the core funding of universities in #NRW planned by Ina Brandes will not lead to a shortage of researchers. #IchBinHanna #BildungKostet 1/2

09.04.2025 13:09 👍 0 🔁 0 💬 1 📌 0

I think this is an important point that we missed the last 10y. We need to call out fiction as fiction and fight for reality (can be boring, but it’s real).
Reality is, there is no AGI, just language models. If you think they are giving you some truth, the stupidity is on you, not the LLM.

05.04.2025 07:16 👍 29 🔁 5 💬 2 📌 0
Enabling the Next Generation in AI & Open Source: Free Remote Tickets for Students at PyCon DE & PyData Join PyCon DE & PyData 2025 in Darmstadt (Frankfurt), April 23-25! Germany’s largest Python and Data Science conference with talks, workshops, and community events like DjangoGirls and sprints. Be…

👨‍🏫 Can you help spread the word? Our community supports future #AI professionals like engineers and data analysts. With backing from PySV and Pioneers Hub, we provide students free virtual passes to Germany's top AI conference. Learn more at 2025.pycon.de/blog/free-re....

05.04.2025 07:57 👍 2 🔁 3 💬 0 📌 1

Nice keynote by @tante.cc for the opening of @zdd-hsd.bsky.social #HSD

03.04.2025 15:20 👍 2 🔁 1 💬 0 📌 0
ZDD foyer ready for the ceremonial opening

Preparations for the inauguration of the #ZDD are in full swing!

We are looking forward to tomorrow.

02.04.2025 15:40 👍 2 🔁 2 💬 0 📌 0
An Open-Source AI Agent for Doing Tasks on the Web | Stanford HAI NNetNav learns how to navigate websites by mimicking childhood learning through exploration.

Stanford scholars introduced an open-source AI agent that learns how to navigate websites by mimicking childhood learning – an approach that could lead to more efficient, transparent, and privacy-conscious AI: hai.stanford.edu/news/an-open...

@chrmanning.bsky.social @shikharmurty.bsky.social

28.03.2025 19:00 👍 20 🔁 8 💬 1 📌 2

#IchBinHanna & the 🐘 are sending 1 message to the coalition negotiations!

#WissZeitVG

@drkeichhorn.bsky.social @kubon.bsky.social

28.03.2025 11:23 👍 166 🔁 46 💬 6 📌 7
Protecting Human Cognition in the Age of AI

Interesting paper on Human Cognition in the Age of #AI or how not to brainrot arxiv.org/html/2502.12...

24.03.2025 21:11 👍 0 🔁 0 💬 0 📌 0

American research is important for Germany too. After all, this is not a competition we want to win. When others are worse off, it also affects us and our societal progress.

24.03.2025 13:10 👍 15 🔁 4 💬 2 📌 0

It is super weird how little reception David Golumbia's last book "Cyberlibertarianism" has had. I mean I know why, he dropped a bunch of uncomfortable truth bombs there but it's still weird how little that book is talked about. It is probably the most important book about tech in the last decade.

21.03.2025 23:51 👍 241 🔁 47 💬 12 📌 10

I helped build a government AI system. DOGE fired me, rolled the AI out to the whole agency, and implied the AI can do my job and the jobs of the others they've fired.

It can't. But, what DOGE accidentally revealed about themselves in the process is fascinating. 🧵

21.03.2025 22:41 👍 10462 🔁 4370 💬 194 📌 772

A friend pointed out that the quasi-religious way so many tech guys talk about AI is in part because they don't want to grapple with the guilt of what they do; AI is a God, AI will fix it. No need to be responsible for what you put in the world.

I can't stop thinking about that.

22.03.2025 00:44 👍 188 🔁 47 💬 5 📌 4

Intense. According to the database, all 4 of my books - 3 of them co-written with @pialamberty.bsky.social - are included in the dataset.

20.03.2025 13:57 👍 198 🔁 60 💬 21 📌 2
Student Research Workshop ACL 2025 Student Research Workshop.

📢 #ACL2025NLP Student Research Workshop CFP is out 🙌
👉 2025.aclweb.org/calls/studen...
Send your paper by ⏰ May 18th, 2025 🚀 #nlproc

18.03.2025 06:22 👍 6 🔁 2 💬 0 📌 0
Conference Management Toolkit - Login Microsoft's Conference Management Toolkit is a hosted academic conference management system. Modern interface, high scalability, extensive features and outstanding support are the signatures of Micros...

Just as an update ... 🚀 Call for Papers – CVPR 3rd Workshop on Multi-Modal Foundation Models (MMFM)
@cvprconference.bsky.social ! 🚀

🌐 sites.google.com/view/mmfm3rd...
📅 Deadline: Apr 1, 2025 (non-proceedings)
📝 Submission: cmt3.research.microsoft.com/MMFM2025

18.03.2025 10:21 👍 2 🔁 3 💬 0 📌 0