Michael Nielsen – How science actually progresses
Really enjoyed chatting with Michael Nielsen about how we recognize scientific progress.
It's especially relevant for closing the RL verification loop for scientific discovery.
But it's also a surprisingly mysterious and elusive question when you look at the history of human science.
We approach this question stories like Einstein (who claimed that he hadn't even heard of the famous Michelson-Morley experiment, which is supposed to have motivated special relativity, until after he had come up with the theory), Darwin (why did it take till 1859 to lay out an idea whose essence every farmer since antiquity must have observed?), Prout (how do you recognize that isotopes exist if you cannot chemically separate them?), and many others.
The verification loop on scientific ideas is often extremely long and weirdly hostile. Ancient Athenians dismissed Aristarchus's heliocentrism in the 3rd century BC because it would imply that the stars should shift in the sky as the Earth orbits the sun. The first successful measurement of stellar parallax was in 1838. That's a 2,000-year verification loop.
But clearly human science is able to make progress faster than raw experimental falsification/verification would imply, and in cases where experiments are very ambiguous. How?
Michael has some very deep and provocative hypotheses about the nature of progress. One I found especially thought-provoking is that aliens will likely have a VERY different science + tech stack than us. Which contradicts the common sense picture of a linear tech tree that I was assuming. And has some interesting implications about how future civilizations might trade and cooperate with each other.
Watch on Youtube; read the transcript.
Sponsors
* Labelbox researchers built a new safety benchmark. Why? Well, current safety benchmarks claim that attacks on top models are successful only a few percent of the time, but the prompts in those benchmarks don’t reflect how real bad actors actually write. You can read Labelbox’s research here. If this could be useful for your work, reach out at labelbox.com/dwarkesh
* Mercury has an MCP that lets you give an LLM access to your full transaction history, including things like attached receipts and internal notes. I just used it to categorize my 2025 transactions, and it worked shockingly well. Modern functionality like this is exactly why I use Mercury. Learn more at mercury.com
* Jane Street’s ML engineers presented some of their GPU optimization workflows at GTC, showing how they use CUDA graphs, streams, and custom kernels to shave real time off their training runs. You can watch the full talk here. And they open-sourced all the relevant code here. If this kind of stuff excites you, Jane Street is hiring — learn more at janestreet.com/dwarkesh
Timestamps
(00:00:00) – How scientific progress outpaces its verification loops
(00:17:51) – Newton was the last of the magicians
(00:23:26) – Why wasn’t natural selection obvious much earlier?
(00:29:52) – Could gradient descent have discovered general relativity?
(00:50:54) – Why aliens will have a different tech stack than us
(01:15:26) – Are there infinitely many deep scientific principles left to discover?
(01:26:25) – What drew Michael to quantum computing so early?
(01:35:29) – Does science need a new way to assign credit?
(01:43:57) – Prolificness versus depth
(01:49:17) – What it takes to actually internalize what you learn
Get full access to Dwarkesh Podcast at www.dwarkesh.com/subscribe