MST graph comparing the top-10 ranking overlap between many commonly used fingerprint types and variants.
Four UMAP visualizations of chemical space based on 700k fingerprints on the biostructures dataset using various molecular fingerprint types.
Just updated our preprint on benchmarking molecular fingerprints!
--> www.biorxiv.org/content/10.1...
Some key points
- Count fingerprints should be the default (not binary!)
- Unfolded fingerprints are often worth it.
- Larger radius Morgan or FCFP fingerprints are good first bets.
27.02.2026 15:13
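To illustrate why count fingerprints can separate molecules that binary fingerprints cannot, here is a minimal sketch in plain Python. The feature names and the two toy "molecules" are hypothetical, and the min/max (Ruzicka) form of Tanimoto is assumed as the count variant; the preprint may use different definitions.

```python
from collections import Counter


def tanimoto_binary(a, b):
    """Tanimoto/Jaccard on the sets of features that are present at all."""
    sa, sb = set(a), set(b)
    union = sa | sb
    # Convention assumed here: two empty fingerprints count as identical.
    return len(sa & sb) / len(union) if union else 1.0


def tanimoto_count(a, b):
    """Generalized (min/max) Tanimoto on feature counts."""
    keys = set(a) | set(b)
    num = sum(min(a.get(k, 0), b.get(k, 0)) for k in keys)
    den = sum(max(a.get(k, 0), b.get(k, 0)) for k in keys)
    return num / den if den else 1.0


# Hypothetical sparse fingerprints: feature id -> number of occurrences.
short_chain = Counter({"CH2": 2, "CH3": 2})
long_chain = Counter({"CH2": 10, "CH3": 2})

binary_score = tanimoto_binary(short_chain, long_chain)
count_score = tanimoto_count(short_chain, long_chain)
print(binary_score, count_score)
```

The binary score sees the same two features in both fingerprints, while the count score also reflects how often each feature occurs.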
New #matchms release (0.31)
With functionalities that were on our TODO list for a looooong time: Flash Entropy and BLINK scores! The new "FlashSimilarity" allows computing modified cosine, spectral entropy etc., about 100x faster (or more if you use Linux).
#Python #opensource #massspec
06.10.2025 15:59
Chemical Space Visualizations using UMAP and various molecular fingerprints.
4/4
We also highlight options for count fingerprints, such as log-counts and IDF weighted counts. The latter can be used to adjust the bit importance to a dataset of your choice.
One example use case is chemical space visualization.
Preprint: www.biorxiv.org/content/10.1...
23.06.2025 09:22
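A rough sketch of the two weighting options mentioned in this post, in plain Python. The exact formulas are assumptions made for illustration (log1p scaling and a plain log(N / document frequency) IDF), and the tiny dataset is hypothetical; the preprint may define both differently.

```python
import math
from collections import Counter


def log_counts(fp):
    """Dampen large counts: count -> log(1 + count)."""
    return {feat: math.log1p(count) for feat, count in fp.items()}


def idf_weights(fingerprints):
    """IDF per feature: log(N / number of fingerprints containing it)."""
    n = len(fingerprints)
    doc_freq = Counter()
    for fp in fingerprints:
        doc_freq.update(fp.keys())
    return {feat: math.log(n / df) for feat, df in doc_freq.items()}


def idf_weighted(fp, weights):
    """Scale each count by the dataset-specific IDF weight of its feature."""
    return {feat: count * weights.get(feat, 0.0) for feat, count in fp.items()}


# Hypothetical reference dataset of sparse count fingerprints.
dataset = [{"a": 1, "b": 3}, {"a": 2}, {"a": 1, "c": 1}]

weights = idf_weights(dataset)
weighted_first = idf_weighted(dataset[0], weights)
logged_first = log_counts(dataset[0])
print(weights, weighted_first, logged_first)
```

Feature "a" occurs in every fingerprint and gets weight zero, while the rarer features "b" and "c" are emphasized. Swapping in a different reference dataset changes the weights accordingly.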
3/4
A huge issue is bit collisions.
Fingerprints with a high bit occupation (RDKit, MAP4) often lead to (1) arbitrary misinterpretations, (2) shifts to high Tanimoto scores, (3) very different handling of small and large molecules.
--> Consider using sparse fingerprints!
--> Morgan >> MAP4 / RDKit
23.06.2025 09:22
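A toy sketch of how bit collisions arise when sparse (unfolded) features are folded into a fixed-length vector. Folding by modulo is an assumption made for illustration, and the feature ids are hypothetical values chosen to collide; real fingerprint implementations may fold differently.

```python
def fold(sparse_features, n_bits):
    """Fold unfolded feature ids into a fixed-length binary vector via modulo."""
    bits = [0] * n_bits
    for feature_id in sparse_features:
        bits[feature_id % n_bits] = 1
    return bits


def jaccard(a, b):
    """Tanimoto/Jaccard similarity of two feature sets."""
    return len(a & b) / len(a | b)


# Hypothetical hashed feature ids of two different molecules.
mol_a = {3, 1027, 4099}
mol_b = {3, 2051, 5123}

folded_a = fold(mol_a, 1024)
folded_b = fold(mol_b, 1024)

unfolded_similarity = jaccard(mol_a, mol_b)
print(unfolded_similarity, folded_a == folded_b)
```

In this deliberately extreme case all features land on the same bit, so the folded vectors are identical even though the unfolded sets share only one of five features.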
Benchmarking plot on fingerprint duplications.
2/4
We focused on weaknesses of the fingerprints.
Many show frequent duplicates, i.e. the same fingerprint for different compounds. Most problematic: this can include *very* different compounds ending up with identical fingerprints.
- MAP4 >> Morgan-type >> daylight
- count >> binary
#cheminformatics
23.06.2025 09:22
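One simple way to check a dataset for duplicate fingerprints is to group compounds by identical fingerprint. The sketch below uses plain Python with hypothetical compound ids and features; it is not the benchmarking code from the preprint.

```python
from collections import defaultdict


def find_duplicates(fingerprints):
    """Group compound ids by identical fingerprint; keep groups with > 1 member."""
    groups = defaultdict(list)
    for compound_id, fp in fingerprints.items():
        key = tuple(sorted(fp.items()))  # hashable, order-independent key
        groups[key].append(compound_id)
    return [ids for ids in groups.values() if len(ids) > 1]


# Hypothetical sparse count fingerprints: feature id -> count.
count_fps = {
    "cmpd_1": {"f1": 1, "f2": 2},
    "cmpd_2": {"f1": 1, "f2": 4},
    "cmpd_3": {"f3": 1},
}
# Binary version of the same fingerprints: every present feature becomes 1.
binary_fps = {cid: {feat: 1 for feat in fp} for cid, fp in count_fps.items()}

dup_count = find_duplicates(count_fps)
dup_binary = find_duplicates(binary_fps)
print(dup_count, dup_binary)
```

Here the count fingerprints keep all three compounds distinct, while binarizing makes the first two collapse into one duplicate group.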
Sketch of count/binary fingerprints and weighting options.
New preprint out!
1/4
@julianpollmann.bsky.social and I went down several rabbit holes to assess some commonly used molecular fingerprints.
Bottom line: For large datasets, make an effort to select suitable settings. "We used Tanimoto" is not good enough.
--> www.biorxiv.org/content/10.1...
23.06.2025 09:22
Motivational quotes like "If you can dream it, you can do it." will help you relax in such an environment.
13.06.2025 11:48
Libertarians know
• the price of everything
• the value of nothing
• the age of consent in every jurisdiction
what @tante.cc says
28.05.2025 17:32
We thought AI would one day enslave us with Terminator robots. Instead, we are being enslaved by convenience.
12.05.2025 22:05
Hello @pyconde.bsky.social
Curious to listen to some interesting talks and meet new people
23.04.2025 07:30
First draft online version of The RLHF Book is DONE. Recently I've been creating the advanced discussion chapters on everything from Constitutional AI to evaluation and character training, but I also sneak in consistent improvements to the RL specific chapter.
rlhfbook.com
16.04.2025 19:01
CCC | CCC calls for an emergency brake on the surveillance catalog in the coalition agreement
The Chaos Computer Club is a galactic community of life forms for freedom of information and technology assessment.
"At the #SPD, the members still have the chance to pull the emergency brake and prevent the dismantling of important fundamental rights. We therefore appeal to the Social Democrats: do not approve this surveillance list!" @ccc.de
10.04.2025 09:55
Some universities are already considering hiring freezes for professors and staff and have reduced the number of student assistants. #IchBinHanna #BildungKostet 2/2
09.04.2025 13:10
Congratulations on the groundbreaking for the new building! Let us hope that the cuts of EUR 255 million to the core funding of universities in #NRW planned by Ina Brandes do not lead to a shortage of researchers. #IchBinHanna #BildungKostet 1/2
09.04.2025 13:09
I think this is an important point that we have missed over the last 10 years. We need to call out fiction as fiction and fight for reality (it can be boring, but it's real).
Reality is, there is no AGI, just language models. If you think they are giving you some truth, the stupidity is on you, not the LLM.
05.04.2025 07:16
Nice keynote by @tante.cc for the opening of @zdd-hsd.bsky.social #HSD
03.04.2025 15:20
ZDD foyer ready for the opening ceremony
Preparations for the inauguration of the #ZDD are in full swing!
We are looking forward to tomorrow.
02.04.2025 15:40
An Open-Source AI Agent for Doing Tasks on the Web | Stanford HAI
NNetNav learns how to navigate websites by mimicking childhood learning through exploration.
Stanford scholars introduced an open-source AI agent that learns how to navigate websites by mimicking childhood learning, an approach that could lead to more efficient, transparent, and privacy-conscious AI: hai.stanford.edu/news/an-open...
@chrmanning.bsky.social @shikharmurty.bsky.social
28.03.2025 19:00
#IchBinHanna & the [emoji] are sending one message to the coalition negotiations!
#WissZeitVG
@drkeichhorn.bsky.social @kubon.bsky.social
28.03.2025 11:23
Protecting Human Cognition in the Age of AI
Interesting paper on Human Cognition in the Age of #AI or how not to brainrot arxiv.org/html/2502.12...
24.03.2025 21:11
American research matters for Germany too. This is not a competition we are trying to win. When others are worse off, that also affects us and our societal progress.
24.03.2025 13:10
It is super weird how little reception David Golumbia's last book "Cyberlibertarianism" has had. I mean I know why, he dropped a bunch of uncomfortable truth bombs there but it's still weird how little that book is talked about. It is probably the most important book about tech in the last decade.
21.03.2025 23:51
I helped build a government AI system. DOGE fired me, rolled the AI out to the whole agency, and implied the AI can do my job and the jobs of the others they've fired.
It can't. But what DOGE accidentally revealed about themselves in the process is fascinating. 🧵
21.03.2025 22:41
A friend pointed out that the quasi-religious way so many tech guys talk about AI is in part because they don't want to grapple with the guilt of what they do; AI is a God, AI will fix it. No need to be responsible for what you put in the world.
I can't stop thinking about that.
22.03.2025 00:44
Intense. According to the database, all 4 of my books, 3 of them co-written with @pialamberty.bsky.social, are included in the dataset.
20.03.2025 13:57
Student Research Workshop
ACL 2025 Student Research Workshop.
📢 #ACL2025NLP Student Research Workshop CFP is out
2025.aclweb.org/calls/studen...
Send your paper by ⏰ May 18th, 2025 #nlproc
18.03.2025 06:22
Conference Management Toolkit - Login
Microsoft's Conference Management Toolkit is a hosted academic conference management system. Modern interface, high scalability, extensive features and outstanding support are the signatures of Micros...
Just as an update ... Call for Papers: CVPR 3rd Workshop on Multi-Modal Foundation Models (MMFM)
@cvprconference.bsky.social !
sites.google.com/view/mmfm3rd...
Deadline: Apr 1, 2025 (non-proceedings)
Submission: cmt3.research.microsoft.com/MMFM2025
18.03.2025 10:21