
Sadiq Jaffer

@sadiq.toao.com

Researcher @ Cambridge CL, OCaml hacker, fmr CEO at Opsian

Followers: 246
Following: 49
Posts: 10
Joined: 13.11.2024

Latest posts by Sadiq Jaffer @sadiq.toao.com

Tessera: A Temporal Foundation Model for Earth Observation with Sadiq Jaffer and Frank Feng

Last week Frank Feng and I joined Robin Cole on his satellite image deep learning podcast to talk about Tessera: how it works, how it differs from existing models, and our future plans. Thanks for hosting us, Robin!

www.satellite-image-deep-learning.com/p/tessera-a-...

21.01.2026 09:37 👍 6 🔁 2 💬 0 📌 0
Earth Observation on a Budget: Finding Solar Farms with a 42k-Parameter Model - Sadiq Jaffer

It still surprises me you can get good performance with even tiny models on top of TESSERA embeddings. Here's how to find solar farms in the UK with a small ~42k CNN: toao.com/blog/earth-o...

15.01.2026 18:51 👍 6 🔁 3 💬 0 📌 0
Three Steps for OCaml to Crest the AI Humps - Sadiq Jaffer

There's relatively little LLM training data for niche languages and this causes poorer coding agent performance. I think this is an existential threat for smaller language communities like OCaml.

My talk at the OCaml workshop gave some actionable steps to mitigate that: toao.com/blog/ai-exis...

25.10.2025 12:39 👍 5 🔁 3 💬 3 📌 1

Every OCaml talk needs a pun, and @sadiq.toao.com is no exception #icfpsplash25

17.10.2025 08:32 👍 8 🔁 2 💬 0 📌 0

Our lightning talks session opens with @sadiq.toao.com demonstrating TESSERA, their new geospatial foundation model that is FAIR and global #icfpsplash25

13.10.2025 08:20 👍 3 🔁 1 💬 1 📌 0
Can AI detect hedgehogs from space? Maybe if you find brambles first.
Cambridge researchers use satellite-based bramble detection as a proxy for mapping hedgehog habitats.

Not how I expected to make my @arstechnica.com debut but I'll take it arstechnica.com/ai/2025/09/c...

27.09.2025 14:48 👍 7 🔁 1 💬 1 📌 0
Can a model trained on satellite data really find brambles on the ground? - Sadiq Jaffer

Fun field trip today trying to validate a colleague's bramble detecting model: toao.com/blog/can-we-... with @anil.recoil.org

24.09.2025 20:29 👍 2 🔁 0 💬 0 📌 1

A good point. Was being generated but not linked anywhere. Fixed now. Thanks!

16.07.2025 13:17 👍 2 🔁 0 💬 0 📌 0
Last three months in OCaml (July 2025) - Sadiq Jaffer

Some fun OCaml GC projects here with @sadiq.toao.com and @kcsrk.info if any students are looking for projects involving programming languages toao.com/blog/ocaml-0...

15.07.2025 10:04 👍 17 🔁 6 💬 1 📌 0

The most incredibly fun part of this Nature comment on evidence synthesis we published today is that the cartoonist (David Parkins) also did Beano and Dennis the Menace (!) A true legend. www.nature.com/articles/d41...

08.07.2025 11:55 👍 7 🔁 3 💬 0 📌 0

The rapid rise in AI-generated fraudulent academic papers is "poisoning" scientific literature, say Cambridge researchers in Nature magazine today. But though AI is the problem, it could also help in ensuring the integrity of scientific discovery... buff.ly/AuSNcGd
@anil.recoil.org @sadiq.toao.com

08.07.2025 11:25 👍 3 🔁 2 💬 0 📌 1

I'm pleased to announce OxCaml!

OxCaml is Jane Street's branch of OCaml. We've given it a new name and a snazzy logo, and done a bunch of work to make it easy for people to try.

13.06.2025 14:14 👍 107 🔁 40 💬 5 📌 3
Careful design of Large Language Model pipelines enables expert-level retrieval of evidence-based information from syntheses and databases
Wise use of evidence to support efficient conservation action is key to tackling biodiversity loss with limited time and resources. Evidence syntheses provide key recommendations for conservation deci...

New paper out today on how the careful design of LLMs is crucial for expert-level evidence retrieval in conservation (but with implications for any evidence synthesis pipeline across other fields) 🌍 doi.org/10.1371/jour... and anil.recoil.org/news/2024-ce... for a summary

16.05.2025 16:47 👍 9 🔁 4 💬 1 📌 0

One thing I probably should highlight more in the post is that the proprietary models (like Claude and Gemini) that most students currently have access to can already ace the assignments.

07.05.2025 16:53 👍 1 🔁 0 💬 0 📌 0

This is a thorny question and mostly comes down to what we're trying to teach. I wonder if a progressive approach would work, where there is no automatic tooling at the early stages of teaching but more can be automated as critical skills are learnt. It's a bit of a moving target at the moment, though.

07.05.2025 16:51 👍 1 🔁 0 💬 0 📌 0
Qwen3 Leads the Pack: Evaluating how Local LLMs tackle First Year CS OCaml exercises - Sadiq Jaffer

Just how good are locally hostable code models on Cambridge first year OCaml assignments? @anil.recoil.org , @jon.recoil.org and I wanted to find out, so ran some tests. TL;DR Qwen3 means we might need new assignments. toao.com/blog/ocaml-l...

07.05.2025 14:23 👍 8 🔁 2 💬 2 📌 0
JSON output from Deepseek R1 and distills with llama.cpp - Sadiq Jaffer

If you are using llama.cpp, here's a workaround using grammars for getting JSON structured output from Deepseek R1 and distills: toao.com/blog/json-ou...

30.01.2025 18:39 👍 1 🔁 0 💬 0 📌 0
The potential for AI to revolutionize conservation: a horizon scan
Artificial Intelligence (AI) is an emerging tool that could be leveraged to identify the effective conservation solutions demanded by the urgent biodiversity crisis. We present the results of our horizon scan of AI applications likely to significantly benefit biological conservation. An international panel of conservation scientists and AI experts identified 21 key ideas. These included species recognition to uncover 'dark diversity', multimodal models to improve biodiversity loss predictions, monitoring wildlife trade, and addressing human–wildlife conflict. We consider the potential negative impacts of AI adoption, such as AI colonialism and loss of essential conservation skills, and suggest how the conservation field might adapt to harness the benefits of AI while mitigating its risks.

Part of our @ai.cam.ac.uk project on AI in Conservation was published in TREE today. We gathered conservation scientists and AI experts and looked at the key conservation areas AI could revolutionise: www.cell.com/trends/ecolo...

18.12.2024 17:14 👍 5 🔁 0 💬 0 📌 0

Working to surface challenges faced by folks at the coal face.

Data in research contributions from @orbenamy.bsky.social @sadiq.toao.com @scotthosking.bsky.social Stefan Scholtes, Vasco Carvalho, Mireia Crispin and a foreword with Jess Montgomery @dianecoyle1859.bsky.social @ginasue.bsky.social

29.11.2024 17:12 👍 10 🔁 4 💬 1 📌 0

New preprint from our work on using LLMs to accelerate conservation evidence synthesis across millions of papers. We crosscheck 3 retrieval strategies against 10 LLMs and benchmark against human experts and find quite a bit of variance https://www.researchsquare.com/article/rs-5409185/v1

16.11.2024 10:42 👍 5 🔁 1 💬 0 📌 0