For GenAI, improvements to generation quality are going to come from better data curation and value functions that drive the model toward high-quality outputs. Standard model training results in outputs representative of the training distribution, but users don't want average: we want the best quality.
03.01.2025 17:39
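One way to picture the value-function idea in the post above is best-of-N sampling: draw several candidates from the model and keep the one the value function scores highest. The sketch below is a toy illustration only, not a description of any particular system. `sample_candidates`, `value_fn`, and `best_of_n` are hypothetical names, each "output" is just a number standing in for its quality, and the value function is assumed to be perfect.

```python
import random


def sample_candidates(n, rng):
    """Stand-in for a generative model (an assumption for illustration).

    Each 'output' is a single number representing its quality, drawn from a
    standard normal that plays the role of the training distribution.
    """
    return [rng.gauss(0.0, 1.0) for _ in range(n)]


def value_fn(candidate):
    """Hypothetical value function; here it reads the quality off directly."""
    return candidate


def best_of_n(n, rng):
    """Sample n candidates and keep the one the value function prefers."""
    return max(sample_candidates(n, rng), key=value_fn)


rng = random.Random(0)
trials = 2000
# n=1 is plain sampling: outputs look like the average of the distribution.
avg_plain = sum(best_of_n(1, rng) for _ in range(trials)) / trials
# n=16 lets the value function pick from the upper tail.
avg_best = sum(best_of_n(16, rng) for _ in range(trials)) / trials
print(f"mean quality, plain sampling: {avg_plain:.2f}")
print(f"mean quality, best of 16:     {avg_best:.2f}")
```

In practice the value function is learned and imperfect, so the gain would be smaller than in this idealized setting.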
Humans solve novel problems on-the-fly using System 2 reasoning: AI needs this too. By learning reasoning steps at training time, at deployment the model can build new sequences of reasoning steps, enabling it to extrapolate.
03.01.2025 17:37
Reasoning capabilities are essential to reach robust perf in key ML products, e.g. full self-driving. The distribution of driving scenarios is long-tailed, so even a model that covers most situations well may still face a novel situation outside its training data, and it needs to respond correctly.
03.01.2025 17:37
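The long-tail point in the post above can be made concrete with a small simulation. This is a minimal sketch under stated assumptions: scenario frequencies follow a Zipf-like law with exponent 1 over 100,000 distinct scenarios (both numbers are illustrative, not measured from any driving data), and `zipf_weights` and `unseen_rate` are hypothetical helper names.

```python
import random


def zipf_weights(num_scenarios, exponent=1.0):
    """Unnormalized Zipf-like weights: scenario at rank r has weight 1 / r**exponent."""
    return [1.0 / (rank ** exponent) for rank in range(1, num_scenarios + 1)]


def unseen_rate(num_scenarios, train_size, test_size, rng):
    """Fraction of test-time scenarios that never appeared in the training sample."""
    weights = zipf_weights(num_scenarios)
    scenarios = range(num_scenarios)
    # Training set: which scenarios the model has encountered at least once.
    seen = set(rng.choices(scenarios, weights=weights, k=train_size))
    # Test set: drawn from the same long-tailed distribution.
    test = rng.choices(scenarios, weights=weights, k=test_size)
    return sum(s not in seen for s in test) / test_size


rng = random.Random(0)
train_sizes = (1_000, 10_000, 100_000)
rates = [unseen_rate(100_000, n, 10_000, rng) for n in train_sizes]
for n, rate in zip(train_sizes, rates):
    print(f"train size {n:>7}: {rate:.1%} of test scenarios were never seen")
```

Under these assumptions, growing the training set by 100x shrinks the unseen fraction but does not eliminate it, which is the gap reasoning is meant to cover.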
For agentic models, the focus is shifting to System 2-like reasoning. OpenAI's o1/o3 models demonstrate that reasoning step-by-step can improve output quality by leveraging test-time compute. But impressive results on ARC are expensive, hence there will be a focus on improving test-time compute efficiency.
03.01.2025 17:34
But the LLM training corpus is now the majority of worthwhile text humanity has ever written, and can't be meaningfully scaled further. As Ilya Sutskever put it at NeurIPS, "big data is the fossil fuel of AI".
With this in mind, what will be the next stage of AI development?
03.01.2025 17:31
The turn of the year is a good time for reflection. Here are my thoughts on where ML is headed.
Advances have been driven by scaling: bigger compute, bigger data, bigger models. Moreover, larger data also gives a solution to OOD generalization: just increase the train set until everything is in domain!
03.01.2025 17:31
This has some serious AI safety implications. Having an AI model able to classify what is in an image better than a human doesn't pose an existential threat. But when an AI model can perform long-term planning better than a human, "just unplug it" ceases to be a reliable solution.
09.12.2024 17:16
System 2 Reasoning Capabilities Are Nigh
In recent years, machine learning models have made strides towards human-like reasoning capabilities from several directions. In this work, we review the current state of the literature and describe t...
In "System 2 Reasoning Capabilities Are Nigh", I lay out comparisons between human reasoning and reasoning in AI models, and argue that all the components needed to create AI models that can perform human-like reasoning already exist.
arxiv.org/abs/2410.03662
09.12.2024 17:13
It's very easy to get started with using the dataset. The commands to download it and load it for PyTorch training fit in less than half a tweet:
!pip install bioscan-dataset
from bioscan_dataset import BIOSCAN5M
ds = BIOSCAN5M("~/Datasets/bioscan-5m", download=True)
09.12.2024 17:08
The dataset should be useful for a variety of research topics:
- multimodal learning
- fine-grained classification
- hierarchical labelling
- open-world classification/clustering
- semi- and self-supervised learning
09.12.2024 17:00
BIOSCAN-5M: A Multimodal Dataset for Insect Biodiversity
As part of an ongoing worldwide effort to comprehend and monitor insect biodiversity, this paper presents the BIOSCAN-5M Insect dataset to the machine learning community and establish several benchmar...
BIOSCAN-5M is a multimodal dataset for insect biodiversity monitoring. It consists of 5 million insect specimens from around the world, with a high-res microscopy image, DNA barcode, taxonomic labels, size, and geolocation info for each sample.
arxiv.org/abs/2406.127...
09.12.2024 16:57
I'm looking forward to NeurIPS this week! I'll be presenting two papers there. In the main conference, our new dataset BIOSCAN-5M, and in the System 2 Reasoning At Scale workshop my position paper "System 2 Reasoning Capabilities Are Nigh".
09.12.2024 16:53