Screenshot of plot showing ELO vs paramter count for different OCR models
There is no best VLM OCR model - rankings can flip completely by document type.
I built ocr-bench: run open OCR models on YOUR documents, get a per-collection leaderboard.
VLM-as-judge with Bradley-Terry ELO, all running on @hf.co. No local GPU needed.
05.03.2026 14:48
π 48
π 10
π¬ 1
π 1
i'm trying out the novel writing project with Claude in Claude Code, using Pangram to break it out of writing in a clearly identifiable AI-writing style. it's going... interesting so far. i despaired at the beginning but am now cautiously optimistic. not so much at the structural level though.
12.02.2026 21:10
π 29
π 1
π¬ 2
π 0
this is cool
tbh all i want is an LLM that sits atop my Zotero library and lets me talk to it tho
04.02.2026 23:09
π 5
π 1
π¬ 2
π 0
Current Workshop
CFA: 8th Scientific Understanding and Representation (SURe) annual workshop Β Call for abstracts Β Β Β Β Β Β We invite authors to submit abstracts of up to 750-words for the upcoming...
Final CFA for the 8th Scientific Understanding and Representation (SURe) annual workshop, which will take place May 27-29, 2026, at the IFIS PAN in Warsaw.
Submission deadline: 20 January 2026.
More info: shorturl.at/AUoye
@philsci.bsky.social @eenphilsci.bsky.social @epsaphilsci.bsky.social
12.01.2026 11:19
π 9
π 6
π¬ 0
π 0
A four-panel figure showing the probability of predicting articles from The Journal of Philosophy versus PMLA using quarter-century models. Each panel represents a different training period (1925-1950, 1950-1975, 1975-2000, 2000-2025). Gray shaded regions indicate training periods. The model trained on early C21 philosophy vs literature cannot accurately distinguish early C20 philosophy vs literature, but the reverse is not true.
Hierarchical cluster of syntactic features predicting philosophy (blue) vs criticism (red).
Top 2 distinctive features for Philosophy vs Criticism.
An example of the importance of the "marker" feature in philosophy.
Analytic philosophy can be distinguished from literary criticism with 90-95% accuracy via syntax alone. Moreover, a classifier trained to separate them in early C20 does better predicting future separations than a C21 one predicts past ones, suggesting philosophy syntax narrows/specializes in ~C21.
02.01.2026 00:53
π 34
π 8
π¬ 0
π 0
Three scatterplots of colorful points.
titles = ['Color Space', 'Text Space', 'Image Space']
subtitles = ['Embeddings of color features', 'Text embedding of color names', 'Image embeddings of color swatches']
Three different ways to represent colo(u)r. Work in progress, inspired by an old post by Kat Zhang / The Poet Engineer.
04.11.2025 12:05
π 5
π 1
π¬ 1
π 0
"there is a part of human intelligence which operates in a continuous generalization of the space of words, and other parts entirely which do things which are less well understood" is a perfectly reasonable position which apparently has no adherents
02.11.2025 18:58
π 64
π 5
π¬ 2
π 0
Generative Aesthetics: On formal stuckness in AI verse | Published in Journal of Cultural Analytics
By Ryan Heuser. This paper examines the formal and aesthetic patterns of AI-generated poems through a series of computational experiments.
Excited to share my latest publication, "Generative Aesthetics: On formal stuckness in AI verse." It's published in a special issue in the Journal of Cultural Analytics, expertly edited by Tess McNulty and Laura Chapot, on "Computation and Form, Reconsidered."
culturalanalytics.org/article/1448...
13.10.2025 15:50
π 44
π 17
π¬ 2
π 2
Tomorrow we will have a keynote from Charles Pence (UC Louvain).
Thanks to the Dutch Philosophy Research School (OZSW) for supporting this event, and @mnoichl.bsky.social for organizing this with me!
16.10.2025 14:49
π 3
π 1
π¬ 0
π 0
academic presentation in a baroque university environment. A group of researchers are gathered around a conference table
Gregor Betz (KIT) kicking off our "Data Driven Philosophy" Hackathon in Utrecht with his talk: "Doing Philosophy with and for LLMs". Besides input about the state of research and new directions, we're spending three days kicking off new projects.
16.10.2025 14:49
π 7
π 1
π¬ 1
π 0
i am going to try to give a framework of my own understanding which laypeople can understand.
13.10.2025 18:36
π 384
π 53
π¬ 6
π 20
The Big LLM Architecture Comparison
YouTube video by Sebastian Raschka
Updated & turned my Big LLM Architecture Comparison article into a video lecture.
The 11 LLM archs covered in this video:
1. DeepSeek V3/R1
2. OLMo 2
3. Gemma 3
4. Mistral Small 3.1
5. Llama 4
6. Qwen3
7. SmolLM3
8. Kimi 2
9. GPT-OSS
10. Grok 2.5
11. GLM-4.5/4.6
www.youtube.com/watch?v=rNlU...
10.10.2025 17:05
π 51
π 9
π¬ 0
π 1
For the first episode of Ping Pong Philosophy I had the absolute pleasure to speak with Greg Restall, one of the most renowned philosophical logicians and absolutely great guy to have a chat with. Thank you for your time, Greg, I had a blast.
We are also on Spotify!
07.10.2025 15:59
π 4
π 1
π¬ 0
π 0
Upshot:
NNES report to need twice as long to read English-language papers and to prepare English presentations. Even among highly proficient NNES (C1βC2 level), ~60% report having avoided asking questions at events due to concerns about their English (compared to 16% of NES). #philsky
24.09.2025 16:55
π 24
π 10
π¬ 0
π 0
Heat map of St Petersburg
How do literary communities actually form?
@maria-lev.bsky.social analyzes the networks of collaboration and aesthetic affinity that are documented through cultural events β e.g. readings, book launches, festivals. These real-world networks often remain invisible in text-based literary history.
22.09.2025 14:15
π 10
π 4
π¬ 1
π 1
In a new work with Joseph Rich and Conrad Oakes we tackle the problem of how to best organize alluvial plots. We formalize two optimization problems and develop a solution for them based on the neighbornet algorithm, implemented in the program wompwomp: github.com/pachterlab/w...
05.09.2025 12:20
π 32
π 9
π¬ 3
π 0
Max Noichl | Patterns, Pathways & Surprises
Our poster for EPSA 2025, introducing OpenAlex mapper
Had a great time last week at #epsa2025! I've put the poster up here, if anyone wants to take a closer look: maxnoichl.eu/blog/2025/ep...
02.09.2025 14:52
π 4
π 0
π¬ 0
π 0
A Gaussian process showing that the allowed time series are forced to be compatible with data
Iβm especially proud of this article I wrote about Gaussian Processes for the Recast blog! π₯³
GPs are super interesting, but itβs not easy to wrap your head around them at first π€
This is a medium level (more intuition than math) introduction to GPs for time series.
getrecast.com/gaussian-pro...
29.08.2025 17:11
π 80
π 23
π¬ 2
π 1
The participants of Dagstuhl Seminar 24122 standing on steps outside (from https://www.dagstuhl.de/24122)
Multiple types of embeddings (UMAP, t-SNE, Laplacian Eigenmaps, PHATE, PCA, MDS) of Wikipedia text data labelled by a text summaries generated by an LLM. Methods like UMAP and t-SNE show cluster structure that reflect shared subject matter in text, whiel other methods show more continuous structure.
Multiple embedding methods (PCA, Laplacian Eigenmaps, t-SNE, MDS, PHATE, UMAP) of primate brain organoids at different time periods. Different methods highlight different aspects of development, such as clusters of similar cell types or time courses of cell development.
Multiple embedding methods (PCA, Laplacian Eigenmaps, t-SNE, MDS, PHATE, UMAP) of 1000 Genomes Project genotypes. Different methods reflect different aspects of demographic history of populations.
Last year I met a bunch of great researchers who work with high-dimensional data at a Dagstuhl seminar. This week we put out a preprint about the history and philosophy of low-dimensional embedding methods, their applications, their challenges, and their possible future arxiv.org/abs/2508.15929
27.08.2025 13:25
π 14
π 7
π¬ 1
π 1
Updated edition (August 2025) of the coverage table of the major bibliometric databases (millions of records).
GS reindexing period
01.08.2025 09:55
π 36
π 18
π¬ 0
π 3
"Personally, I found this hyperstimulating," he said exultingly.
06.08.2025 20:38
π 17
π 3
π¬ 2
π 0
Max Noichl | GAP-Workshop β Data-Driven Methods for Philosophy
GAP-Satellite workshop
@mnoichl.bsky.social and I are organizing two workshops where you can learn about and try out digital methods for philosophy:
12th-13th September in DΓΌsseldorf, Keynotes @cherfeld.bsky.social & Adrian WΓΌthrich
16-18th October in Utrecht, Keynotes Gregor Betz & Charles Pence. Register until 31.8.
05.08.2025 16:24
π 14
π 6
π¬ 1
π 1
What are your favorite recent papers on using LMs for annotation (especially in a loop with human annotators), synthetic data for task-specific prediction, active learning, and similar?
Looking for practical methods for settings where human annotations are costly.
A few examples in thread β΄
23.07.2025 08:10
π 79
π 23
π¬ 13
π 3
Barchart of number of items in four clusters of text embeddings, with colors showing the distribution of sources in each cluster.
Caption: Clustering text embeddings from disparate sources (here, U.S. congressional bill summaries and senatorsβ tweets) can produce clusters where one source dominates (Panel A). Using linear erasure to remove the source information produces more evenly balanced clusters that maintain semantic coherence (Panel B; sampled items relate to immigration). Four random clusters of k-means shown (k=25), trained on a combined 5,000 samples from each dataset
New preprint! Have you ever tried to cluster text embeddings from different sources, but the clusters just reproduce the sources? Or attempted to retrieve similar documents across multiple languages, and even multilingual embeddings return items in the same language?
Turns out there's an easy fixπ§΅
17.07.2025 10:52
π 31
π 7
π¬ 2
π 1