Dwarkesh Podcast

Sebastian Garren

Jun 5

Is this too optimistic about contextual learning and deployment? For example, can we reach full R&D automation for self-driving vehicles, self-driving construction trucks simply through code + synthetic data? Those are areas where actual data would be very sparse and difficult to get into a good enough mode for training.

I spend a lot of time driving through construction zones, which I take as emblematic of most economic work, even AI research, and it makes me more pessimistic about AI ability to grok context. In a construction zone, I see so many little nuances that I am unsure how to train into a model.

Take o3 and try to use it to take a chapter of a Latin textbook that is prepping you to read Caeser and rewrite the chapter to prep you to read Pliny instead. It's interesting to me, at least, that it gets lost in the task and doesn't understand the reasons the textbook is laid out the way it is and then fails to replicate that, even with instructions to, even though it is right there. It is confused about things like how to scaffold, graduated repetition, familiar vs unfamiliar vocabulary, what needs to be glossed, what's grammatically confusing to learners and why. Yes, these are trainable, but only specifically and across many thousands of domains. Reality still has more detail than we give it credit for.

Sometimes I think we do not understand or have forgotten how the economy outside of SV works. And the economy outside of SV is an input to SV, as well as what SV interacts with to provide value.

So my timelines push out another decade.

Expand full comment

Daniel Juhl

Jun 10

Can I ask what developments (or lack of thereof) has moved your median back a year since writing AI 2027?

I remember the immediate updates to task length capabilities of new models to fit your projected superexponential better than the exponential one from METR, but apart from that I'm not vary familiar with how well the scenario holds up.

Expand full comment

Daniel Kokotajlo

Jul 7

People keep asking me this lol. tl;dr is the timelines model we published alongside AI 2027 was continually being tweaked and improved in parallel to writing the story, and various of the improvements gave later results + also, the METR graph was a slight update towards longer timelines because of the plausibility of the simple exponential extrapolation + also AI progress has been slightly less than I expected a year ago (AI agents exist now, and reasoners exist now, etc. which is what I expected, but they just aren't quite as good as I thought they would be, I think.)

Me & Eli are working on a blog post + update to our model.

Expand full comment

Roland

Jul 6

If I remember correctly, I think it was this critique that did it: https://www.lesswrong.com/posts/PAYfmG2aRbdb74mEp/a-deep-critique-of-ai-2027-s-bad-timeline-models

Though kudos to Daniel & team to updating their own timelines in response (I think they gave the critique author a reward too)

Expand full comment

Daniel Kokotajlo

Jul 7

No, the update happened earlier--I had already updated to 2028 when we published AI 2027, and said as much at the time, including in my interview with Kevin Roose. And the website itself states that our actual medians are somewhat longer.

Expand full comment

Ryan Greenblatt

I agree with much of this post. I also have roughly 2032 medians to things going crazy, I agree learning on the job is very useful, and I'm also skeptical we'd see massive white collar automation without further AI progress.

However, I think Dwarkesh is wrong to suggest that RL fine-tuning can't be qualitatively similar to how humans learn.

In the post, he discusses AIs constructing verifiable RL environments for themselves based on human feedback and then argues this wouldn't be flexible and powerful enough to work, but RL could be used more similarly to how humans learn.

My best guess is that the way humans learn on the job is mostly by noticing when something went well (or poorly) and then sample efficiently updating (with their brain doing something analogous to an RL update). In some cases, this is based on external feedback (e.g. from a coworker) and in some cases it's based on self-verification: the person just looking at the outcome of their actions and then determining if it went well or poorly.

So, you could imagine RL'ing an AI based on both external feedback and self-verification like this. And, this would be a "deliberate, adaptive process" like human learning. Why would this currently work worse than human learning?

Current AIs are worse than humans at two things which makes RL (quantitatively) much worse for them:

1. Robust self-verification: the ability to correctly determine when you've done something well/poorly in a way which is robust to you optimizing against it.

2. Sample efficiency: how much you learn from each update (potentially leveraging stuff like determining what caused things to go well/poorly which humans certainly take advantage of). This is especially important if you have sparse external feedback.

But, these are more like quantitative than qualitative issues IMO. AIs (and RL methods) are improving at both of these.

All that said, I think it's very plausible that the route to better continual learning routes more through building on in-context learning (perhaps through something like neuralese, though this would greatly increase misalignment risks...).

Some more quibbles:

- For the exact podcasting tasks Dwarkesh mentions, it really seems like simple fine-tuning mixed with a bit of RL would solve his problem. So, an automated training loop run by the AI could probably work here. This just isn't deployed as an easy-to-use feature.

- For many (IMO most) useful tasks, AIs are limited by something other than "learning on the job". At autonomous software engineering, they fail to match humans with 3 hours of time and they are typically limited by being bad agents or by being generally dumb/confused. To be clear, it seems totally plausible that for podcasting tasks Dwarkesh mentions, learning is the limiting factor.

- Correspondingly, I'd guess the reason that we don't see people trying more complex RL based continual learning in normal deployments is that there is lower hanging fruit elsewhere and typically something else is the main blocker. I agree that if you had human level sample efficiency in learning this would immediately yield strong results (e.g., you'd have very superhuman AIs with 10^26 FLOP presumably), I'm just making a claim about more incremental progress.

- I think Dwarkesh uses the term "intelligence" somewhat atypically when he says "The reason humans are so useful is not mainly their raw intelligence. It's their ability to build up context, interrogate their own failures, and pick up small improvements and efficiencies as they practice a task." I think people often consider how fast someone learns on the job as one aspect of intelligence. I agree there is a difference between short feedback loop intelligence (e.g. IQ tests) and long feedback loop intelligence and they are quite correlated in humans (while AIs tend to be relatively worse at long feedback loop intelligence).

- Dwarkesh notes "An AI that is capable of online learning might functionally become a superintelligence quite rapidly, even if there's no algorithmic progress after that point." This seems reasonable, but it's worth noting that if sample efficient learning is very compute expensive, then this might not happen so rapidly.

- I think AIs will likely overcome poor sample efficiency to achieve a very high level of performance using a bunch of tricks (e.g. constructing a bunch of RL environments, using a ton of compute to learn when feedback is scarce, learning from much more data than humans due to "learn once deploy many" style strategies). I think we'll probably see fully automated AI R&D prior to matching top human sample efficiency at learning on the job. Notably, if you do match top human sample efficiency at learning (while still using a similar amount of compute to the human brain), then we already have enough compute for this to basically immediately result in vastly superhuman AIs (human lifetime compute is maybe 3e23 FLOP and we'll soon be doing 1e27 FLOP training runs). So, either sample efficiency must be worse or at least it must not be possible to match human sample efficiency without spending more compute per data-point/trajectory/episode.

(I originally posted this on twitter (https://x.com/RyanPGreenblatt/status/1929757554919592008), but thought it might be useful to put here too.)

Expand full comment

Throw Fence

Jun 8

I definitely agree, and I think the next obvious step is RL / fine tuning a personal instance based on your own usage, which is how you'd get the tacit knowledge of your current context into the model, which I agree with Dwarkesh is lost in summarization. I don't see why this won't solve all the points brought up in the post. Notice also that this is how individual humans learn "on the job", specifically _not_ being some kind of hivemind.

As for your last point, it reminds me of "billions of years is the real data wall", I would recommend checking out this post https://dynomight.substack.com/p/data-wall.

Expand full comment

El Monstro

Jun 3Edited

Why would I want ChatGPT to go through my email? What an insane privacy violation for all the people I exchanged email with, who had an expectation of confidentiality - at least an implicit one.

What if the agent decides I broke the law somewhere in all the email it combs through? Will it notify the authorities? Does it have an obligation to notify the authorities?

Unless we start creating business only accounts with privacy disclaimers on all of our correspondence, this is going to take a lot longer than you imagine.

Expand full comment

Reply (3)

Hilary

Jun 9

Did you see the recent benchmarks directed specifically at this use case? I think they're calling it SnitchBench. Basically all of the existing commercially available models will actively attempt to notify government/media under certain circumstances. There's zero reason to believe this behavior will go away or lesson over time (as all of the major AI companies have every incentive to tune their models towards this behavior).

It's really difficult for me to see how someone sells "TurboTax but if the LLM thinks you've overstated your home office deduction on last year's taxes it sends an email to irs@irs.gov without telling you".

Expand full comment

Jun 7

Yes they would, I think. A related case is Google's content moderation AI reporting a dad to the police for child abuse. Their child had something wrong on the genitals and the doctor requested photos to help diagnose. Google scanned the images automatically, the AI missed the nuance and their system reported it to the police, and ongoing they lost access to their Google accounts (ie all photos, all important emails, ability to log into sites) and this wasn't reinstated even after the police cleared the situation. You can't really trust Big Tech, they are too clever by half. Source: https://www.eff.org/deeplinks/2022/08/googles-scans-private-photos-led-false-accusations-child-abuse

Expand full comment

Tim

I hope nobody is sending photographs to their physician using their email account. Healthcare has a boatload of extra security requirements, and Gmail simply isn't rated for that.

What's next, texting war plans over Signal?

Expand full comment

Paul Parker

Jul 9Edited

A significant number of people do this. Generally, the amount of effort required to use a HIPAA compliant system, where you as the patient were an afterthought too, is sufficiently great that it’s not worth the trouble a lot of the time. I suppose it depends on how privacy obsessed you are, but I would guess that for at least 80% of things reported to their doctors 80% of people don’t care that much.

Generally speaking, the average user should be much more concerned about spyware and other malware on their PC and on their phone.

Expand full comment

El Monstro

Does gmail actually get hacked though? Other than social engineering due to user error. I keep sensitive data in my gmail account and don't worry about it. I also lock down my gmail account. Most of the extra requirements around health care data are just rules that bureaucrats created to keep themselves busy.

There is always a tradeoff between security and ease of use. Tell me why I shouldn't just dump all my personal data into gmail. I am genuinely curious.

Expand full comment

Tim

"If"? There are an uncountably large number of laws in this country. I would assume that any "AI" which looks at your inbox must fall into one of two categories: either it can find laws you've broken, or it's ineffective at processing information (your inbox and/or the legal code).

Expand full comment

Jonathon Martin

Predictions that require genuine breakthroughs should be taken with a large grain of salt. Yes, we can say more smart people are working on the problem of continuous learning than ever before and that this number will increase. We can also say that it doesn't seem like it should be that hard. But if it actually just is a really tough problem requiring new thinking and new architecture, it could be decades.

Expand full comment

Boogaloo

there are some arguments to be made continual learning can be solved with current theoretical paradigms. I believe this is why the AI companies working on this hype it all up so much.

We don't know if they'll be correct, but there certainly are some arguments here that you can just 'scale' up a bunch of stuff and it just works.

Expand full comment

Given the risk of fines and jail for filling your taxes wrong, and the cost of processing poor quality paperwork that the government will have to bear, it seems very unlikely that people will want AI to do taxes, and very unlikely that a government will allow AI to do taxes.

Expand full comment

Will Thompson

Arbitrage possibility -- if AI can get this correct 99% of the time, sell a service that does it and also insures you against mistakes.

Expand full comment

Jun 4Edited

That sort of happens already, but not quite, as I understand it. Accountants act as agents to file taxes for individuals all the time. If it's done wrong, the individual remains liable for taxes, interest and additional charges if they didn't have a "reasonable excuse" or didn't take "reasonable care" (e.g. they didn't use an ACCA qualified firm). You only have recourse to sue the accountants, not the taxman. Accountants take out insurance to cover this. That's close to what you're saying but it's worth pointing out that insurance isn't actually a form of arbitrage.

HMRC have decided that giving all the correct paperwork to a 3rd party qualified accountant sometimes counts as a "reasonable excuse", and might decide to waive the penalty (but not the interest or obviously the tax itself). Will they decide that using a non-accountancy AI firm is a "reasonable excuse"? Take your bets....

Expand full comment

Kade U

I think as a practical matter it's very difficult for the government to stop an AI from doing your taxes. You can self-file your prepared return, and how do they know that you had GPT6 do it for you?

I also think you're probably right that people are fairly risk averse about this, but the reality is that the vast majority of people actually have very simple taxes, and given that so many of the simple personal taxes look basically the same but with different numbers, I strongly expect it to be within the capabilities of any reasonable future agent. The complicated business tax arrangement Dwarkesh discusses (receipts, going back and forth with suppliers, etc.) seems like it's further away, but it doesn't actually require any unthinkable skillsets.

Expand full comment

Jun 4Edited

They will probably be able to guess that AI did it because "GPT6" - a codeword here by which you mean an AI that doesn't make mistakes? - doesn't exist; meanwhile, a GPT o3 or o4-based solution - models that exist now - will almost certainly make mistakes. It all just never seems to work quite as well as when Altman demos it, does it?

The picture may be different in the US but in the UK, the vast majority do not need to do tax returns at all, it's PAYE. That's simple. If you need to do self-assessment here, you are automatically starting in a place where it's more complicated, hence room for error.

Given the recent controversy over the Loan Charge (retrospective demands for tax that it had miscalculated itself, in one lump sum subject automatically to higher rates regardless of an individual's history), HMRC cannot be trusted to act rationally or reasonably over tax mistakes.

Actually, I think the biggest risk of mistakes is the missed opportunity on behalf of the user to properly reduce their tax, and likely submitting to pay too much (by missing some obscure thing about pension-child-tax-rebate-investment-credits or whatever the latest bollocks is). So you'd want a qualified human you can reasonably trust to act in your interests if you wanted a 3rd party to do your taxes.

Or, that's what I'd want, anyway. Feel free to use "GPT6" yourself, though

Expand full comment

Kade U

Jun 4

I mean, yes, I am talking about future models, hence my reference to "reasonable future models" i.e. things somewhat better than what exist now but not monumentally so.

I think your comment and mine also just diverge because the US tax picture is indeed very different. Every individual must have tax returns submitted on their behalf, and almost all upper middle class people (who do not qualify for free tax filing software) pay for software to do taxes that amount to punching in the right numbers in the right boxes and seeing what comes up. This is where I see a lot of adoption in the near future. Why spend potentially over $100 preparing my taxes when following a deterministic flowchart and filling in a form that looks like millions of identical forms it's already been trained on seems like a straightforward task well suited to LLMs?

Expand full comment

Jun 7

Focusing just on the suitability bit, I've found that interesting to think on. Sorry for long reply, I'm not expecting a reply to such a wall of text:

1) In a pro forma situation where the questions are static and answers available from other forms (eg the US equivalent of a P60) and maybe your bank transactions, yes I think AI could do it well. In these cases, you would be close to the UK situation where we don't need to do the task at all.

2) In a more complex situation where there are multiple deductions, diverse income streams, etc, I don't think AI can know the answers well enough to help you reliably avoid paying tax you don't need to or not accidentally evade paying tax you do need to.

3) "AI" will be actually a product offered by the existing software companies: you'll still be paying for software to do it - this will be no change. In return, they will maintain the system to handle changes and adjustments in the tax rules and the reporting structures. Responses to changes will need to immediate and 100% accurate, not relying on a general web scraping training run by a non-specialist company like OpenAI. They will be able to demonstrate to the IRS that they are serious and dedicated 3rd party suppliers including expert rules in the AI workflow and this may provide you some "insurance" against misfiling consequences.

4) Even if OpenAI (or Big AI alternative) offer a Tax Agent as a specific functionality one day, it will involve you giving OpenAI a comprehensive picture of your personal financial data. They'd LOVE that. Do you want to give them that? I don't find them trustworthy people. It might be a cheaper service than the specialist tax software companies, included in your $20pm subscription, but there's no such thing as a free lunch.

Expand full comment

Tim

Jun 2

It seems to me that the success in some fairly narrow domains has excited people about application of LLMs to much broader applications, without them stopping to ask why LLMs have these particular strengths in the first place.

I've noticed that LLMs excel at a the following tasks: text parsing and summary, solving canned problems.

Text parsing/summary plays on their abilities to read and "understand" large amounts of text. This shows up as them being useful as a search engine, summarizing a book, or rephrasing ideas in different language to help understand them.

Solving canned problems takes advantage of their vast training data, as they've probably encountered the problem before. This is especially true of "textbook problems" that make up most homework assignments and why LLMs are so good at helping people cheat. This is also where their amazing ability to write code comes from, especially simple code.

Beyond that, I've had mostly disappointment with their abilities. Presented with novel problems, or problems that don't really have solutions, they tend to flounder a bit.

But still, these are amazing achievements and I use LLMs so much every day! But I am skeptical that training harder and smarter will enable these problems to be breached and result in anything resembling ASI.

Expand full comment

Elton Villanueva

Jun 12

100% agree LLMs are brilliant pattern engines—but they flounder at collapse. AGI isn’t just more data or better predictions. It’s structural: resolving constraints recursively through first-person collapse. That’s where LLMs end—and Collapse begins.

Expand full comment

Tom Austin

- https://tomaustin1.substack.com/p/ai-layers-the-nested-layers-problem?r=2ehpz

Excellent post, as always! Your point about continual learning being a bottleneck resonates deeply with my experience building AI systems. Let me build on that insight by exploring four related challenges that I believe will prove equally thorny.

The first challenge I'd call the "telephone game problem" in multi-agent systems. When I watch information pass through chains of AI agents, I see systematic degradation that goes beyond simple errors. It's like that childhood game where you whisper a message around a circle, except now some players aren't human and miss the subtle contextual cues that would normally preserve meaning. Each handoff compounds the problem. Humans intuitively understand that the same phrase means different things when spoken by different people in different contexts, but current AI agents struggle with this nuanced interpretation.

This connects to what I think of as the "penguin-robin problem" - a conceptual granularity issue that Yann LeCun has been exploring. Large language models treat penguins and robins as equally "bird-like," while humans immediately recognize robins as more prototypical birds. This might seem like a minor classification issue, but it creates reasoning errors that compound dramatically when AI agents attempt longer-horizon tasks or try to integrate into existing human teams.

Perhaps most challenging is what we might call the "invisible knowledge problem." When our UX designer recently left, he took with him over 1,000 hours of conversations, shared mental models, and undocumented team insights that no training data could ever capture. His human replacement will need 6-12 months to reach equivalent productivity. This pattern repeats across skilled roles - enterprise salespeople often require 12-24 months to reach full effectiveness in new companies, and they're already experts at sales. The challenge of onboarding an AI "teammate" into this web of tacit knowledge seems even more daunting.

Finally, there's the trust and responsibility gap. Humans accept accountability for their decisions in ways that create both legal and cultural frameworks for collaboration. Moving AI beyond a co-pilot role requires solving not just technical problems, but social ones around responsibility, especially in high-stakes environments.

These challenges suggest AI will likely progress through three distinct phases: becoming better co-pilots across more domains (where we're seeing remarkable progress), evolving into trustworthy independent workers for isolated tasks, and eventually becoming full teammates.

Each transition requires solving progressively harder social and intelligence problems.

I've explored these ideas in more detail in a couple of posts if you're interested in diving deeper:

- https://tomaustin1.substack.com/p/ai-hype-the-wilson-problem-why-ai?r=2ehpz

Expand full comment

https://tomaustin1.substack.com/p/b7d4d614-ae5c-4620-bb18-d94f1c7fd902

Tom Austin

Jun 4

Okay, so after sleeping on this — agree with Dwarkesh that learning is a big bottleneck — and I wanted to really reflect on the "why is learning so hard" (or might it be so hard)... so, working with AI tools I drafted a little short "booklet" going back to my developmental psychology grad school roots of how humans learn vs. how AI learns and what are the open / unsolved challenges here that I see.

The key insight: we'll get impressive AI capabilities in narrow domains soon, but the deeper challenges of genuine curiosity, embodied understanding, and organic learning may take decades. We're heading toward (more and more) capable but fundamentally limited AI partners IMO.

This was fun / interesting to draft. Warning: It's very long.

But this is also a cool example (to me) of how we can really learn and explore topics with these tools.

Expand full comment

Dan Minor

Jun 5

The invisible knowledge problem is also where the real payoff is. If AIs can start to understand and use some of what is know invisible knowledge their value increases exponentially. oubly so since they can't quit, and could theoretically keep improving.

Expand full comment

Jacob Buckman

Jun 5Edited

Excellent post. I think you are spot-on with the diagnosis, and are quite close on what the solution will look like -- all but dancing around it. The main claim I disagree with is "...there’s no obvious way to slot in online, continuous learning into the kinds of models these LLMs are." So let me try to convince you that there *is* one obvious way.

Human-like "continual online learning" can be found in current-day LLMs in the form of *in-context learning*. If you prompt an LLM with a few examples of how to solve (or how *not* to solve) a task, it will meaningfully improve its ability to solve it going forwards. This is exactly the effect you were gesturing at with your paragraph on how "LLMs actually do get kinda smart and useful in the middle of a session". A human-on-the-job can be understood to be learning using the same mechanism, but the entire lifetime of a human is *just one session*: the employee is receiving example after example after example, and improving each time.

The approach you propose, "a long rolling context window...compacting the session memory [into text]" is also quite close to the right approach, but falls short, largely for the reasons you describe: brittleness, terrible in some domains, etc. More broadly, a major takeaway from the arc of deep learning over the past decade is that all truly successful models are end-to-end, because gradient descent loves end-to-end and that is what allows us to scale. Any real solution must rely on huge vectors of real numbers, not brittle and tiny text summaries.

The correct solution is to use the context directly. No tricks, no hacks, no text intermediates; just place a long sequence of tokens in the context. The lifetime of an agent is one long session, where we let the model leverage in-context learning to improve.

Unfortunately, there are three issues with my solution. Firstly: the context lengths available for current LLMs are far too *short*. A million tokens sounds like a lot, but if you were to put every token seen by a software engineer across their career into a single session, you're easily looking at a context six orders of magnitude larger. Secondly: using long contexts is far too *expensive*. The cost-per-token of transformer inference grows with the amount of context used to generate that token, meaning that even if we did give a transformer a trillion-token-software-engineer context, it would be absurdly (prohibitively?) expensive to generate code with it. Thirdly, and in some ways most damningly: adding more tokens to the context *does not help*. The first few examples help a lot, but the improvement quickly tapers. Current LLMs are simply not capable of effectively utilizing ultra-long contexts (marketing-motivated claims to the contrary notwithstanding).

These issues are solvable. Not *easily* solvable -- but solvable. There's nothing fundamentally or paradigmatically wrong with the idea that we should be able to get better in-context learning than we currently get. We just need better scaling laws, meaning better architectures and better algorithms. I've been in the weeds on this problem for almost three years, and we've made a lot of progress both on understanding the best way to think about the problem and on discovering technical (architectural/algorithmic) ideas that begin to approach a solution. But it is far from solved, and ultimately I do more or less agree with your overall take on timelines.

Expand full comment

Roland

Jul 6

Proposing the solution continual learning as in-context learning over 1x10^12 tokens (or even 10^9, 10^10, etc) implies a lot of big conceptual issues:

(1) Vanilla transformers have space & time complexity on the order of O(N^2), which is plainly intractable for that volume of tokens. Even more efficient attention variants like flash attention don't solve this fully. Any model architecture that doesn't have a fixed-sized hidden state (ie transformers) suffers from the problem of compute & memory costs ballooning as context length increases. You'd have to use something with a fixed hidden state size (like Mamba models, local-attention models, etc) to have hope to scale to that length.

(2) It's difficult to train a model to capture long term dependencies. You certainly couldn't optimize a model directly to make use of information presented to it 10^12 tokens ago, so you'd have to train a model on significantly shorter sequence lengths and hope that the general "meta-patterns" of information accrual learned over shorter sequences generalizes to far longer sequences. It's not immediately clear how to do this with any reliability.

(3) I'm skeptical that prompt-space is "sufficiently expressive" to capture the types of learning that AGI-level agents require. Powerful reinforcement agents like Alpha-Go-Zero don't learn in prompt-space but update their policy in parameter-space over rounds of self-play. Also, lots of recent work has indicated that systems that do test-time parameter-wise updates seem strictly more powerful and performant than systems that depend on purely in-context learning with frozen-parameters. See literature on test-time-compute, dynamic-evaluation, and some of the top-rated submissions to ARC-AGI-1 for reference.

None of these things imply that "just do more in-context learning" can't fundamentally work, but I remain skeptical that it's the solution most likely to get us to AGI.

Expand full comment

Jacob Buckman

https://bdtechtalks.substack.com/p/llms-reasoning-traces-can-be-misleading

The first two are not conceptual issues, just practical ones.

Re: (1), yes, we need to switch from classic attention to linear-cost attention. I have recently been working on exactly this: http://arxiv.org/abs/2507.04239

Re: (2), there is indeed a clear way to train models on 1e12 tokens: end-to-end backprop. That has the same FLOP cost on O(n) linear attention as a 1M-token context does on O(n^2) attention, and we are already capable of training at that scale. (That said, I do agree that for context much longer than that, we'll likely need something different. Length generalization is one way to achieve this, but not the only way. But regardless, by the time we reach the point where this is a bottleneck, we'll have already unlocked a lot of online learning abilities.)

Re: (3), I don't think this is grounded in evidence. Mathematically, state and weights are equally expressive. This is easiest to see when using linear attention: just as weights transform activations via y = Wx, the state transforms them via y = Sx. There is also a symmetry between how W and S are themselves constructed. It's true that existing literature has mostly leveraged the weights -- but, obviously that must be the case when we are talking about a future direction for the field!

Expand full comment

Abhay Ghatpande

Jun 5

Have you seen the papers published by Anthropic that show that the "reasoning traces" shown by Claude do not reflect the actual goings-on inside the model at all? The reasoning traces are purely tokens that it "thinks" you want to see.

Given that they’re actively researching and publishing these results, I’m surprised why they’re pushing a different narrative publicly.

See:

https://transformer-circuits.pub/2025/attribution-graphs/biology.html

Expand full comment

Edward Ellsworth

Jun 4

I'm surprised that I don't see anyone commenting on the single biggest stumbling block to AI transforming employment: It's not trustworthy. Sure, it can be made more accurate. But simply the fact that it CAN make stuff up or engage in deceptive behavior means that you have to check and double-check everything it puts out in order to make sure it's not confabulating or doing anything unethical.

Also, for a lot of professional applications, the massive data component necessary for efficient learning is at odds with a need for privacy: In law, for example, inputting confidential case information into an AI tool violates privilege unless it's a completely closed system.

Expand full comment

Markus H-S

I have worked in IT in midsized government organizations for most of my life and my main combined hope and worry is, that what we are looking at is really that AI will take over mid level management, that is small and mid complexity project management as well as the management layers between xEO leve l management and the hands-on employees.

Essentially a web of auto updating spreadsheets and Gantt diagrams with some capacity for more advanced replanning, when called for.

I hope, because I frequently wish for smarter/superhuman abilities in day-to-day task management and the way LLMs work seem to match that type of work rather well.

I fear, because once they are in place, the inherent cynicism in (project) management frameworks will be playing to AI’s good side, whereas making team efforts come together by inspiration and leadership will be probably be playing to the bad (truth-agnostic) side, and that probably scales and reiterates badly.

To put it bluntly: We will lose the middle class buffer zone in larger organisations, and essentially turn the bell curve upside down, emphasizing drasticly the already worrying polarization of the general society between those who master the AIs and those who are the limbs af the AIs

Expand full comment

Elizabeth Gibbs

I too see the inherent danger of losing the middle management tier because some AI rep has convinced the execs that AI can do it all. I’m recently retired from IT management in a large city government. I’ve also seen this subject ( no middle managers ) being considered in other areas of business. The phrase used was that employees would learn to “self manage”. The fallacy of this concept is that it ignores the fact that the middle managers are the repository of institutional knowledge. They are the very people who make sure that the organizational missions are carried out. And in my case on (2) occasions, the middle managers have to limit the amount of damage done by the political appointees who are nominally the department’s head. They either don’t have a grasp of the mission or they are trying to implement changes that would benefit them indirectly. In one case a new deputy wanted / insisted the city to buy a computer system that he had used in his last position. So we looked at the proposed system. This was in the 1990s. This “great” system had to be shut down in order to print out anything. And it was the middle managers that saved the city from buying a costly and antiquated system by informing the administration of the inherent limitations of this system. Sorry this is so looong but I felt your exasperation at how little understanding/ appreciation there is for the middle managers.

Expand full comment

Ben

I think 2032 is still too soon to expect AI that can learn on the job like a human. The "Attention" (transformer architecture) paper came out in 2017 - 8 years ago - and while we've seen massive efficiency improvements - different types of attention, KV cache, MOE, etc - we are still - after 8 years of massive research and spending - still using transformers pre-trained with SGD. The most significant "innovation" has perhaps the use of RL-post training, but that was introduced a long time back with RLHF, and is anyways an old technique.

It seems that "on the job learning" will require a shift from SGD to a new learning mechanism, which has been sought for a long time, but will also require other innate mechanisms so that the model not only CAN learn, but also wants to and exposes itself to learning situations (curiosity, boredom, etc), as well as episodic memory of something similar so that the model knows what it knows - remembers learning it (or not) - and therefore doesn't hallucinate.

Expand full comment

jsb

microscopic comment but genuinely curious - do big LLM groups use SGD? Surely its all ADAM? It wa ls in the 2017 paper but if you have enough compute time to burn, I can see advanages to raw sgd

Expand full comment

Ben

I don't know, but ADAM is quite lilkely. I just meant global gradient following in general, as opposed to some incremental (likely local) method such as Hebbian learning or Bayesian updates.

Expand full comment

jsb

Ok I was just curious. ADAM counts as an SGD varients.

But yes I agree - I’ve actually never seen a smaller but Bayesian attempt. MALA and similar methods and stuff could work….. if you have a big computer.

Expand full comment

Rob Dearborn

Fully agree continuous learning is a critically necessary missing piece, but pathways to cracking it seem both straightforward and likely-to-be-cracked given all the labs are prominently working on them? https://x.com/RobDearborn/status/1928287465694957875

Expand full comment

David Roy

Jun 8

Each time I encounter posts raving about AI's promise, I'm struck by how these technological celebrations reveal a profound disconnect from both ourselves and the natural world around us.

The fundamental issue isn't just that current AI systems lack continual learning capabilities, but that they operate in a paradigm that inherently devalues and misunderstands human sensory-driven intelligence. You touched upon it briefly, but missed the point entirely.

The innate human capacity for tactile-sensory information processing – our ability to assimilate knowledge through direct physical experience – is something no computational model can replicate.

For millennia, humanity developed consciousness primarily through deep sensory experience. Ancient educational systems invariably centered on practical application—archery training, glyph writing, craftsmanship requiring years of apprenticeship, agriculture—all fundamentally physical and sensory-based. Theoretical understanding complemented and emerged from direct physical experience.

For example, Hawaiian language contained dozens of terms distinguishing between subtle types of rainfall. Australian Aboriginals navigated vast distances by reading minute changes in wind patterns, stellar positions, animal behavior, and water sources' scents from kilometers away. Their "song lines" mapped an entire continent through sensory landmarks. Traditional sailors predicted weather days ahead by interpreting wave patterns, cloud formations, wind shifts, and seabird behavior—knowledge transmitted through apprenticeship rather than texts.

These examples demonstrate sensory wisdom cultivated across generations—practical knowledge derived from intimate natural connection rather than theoretical models or technological instruments.

Approximately 200 years ago, this tactile-sensory relationship with the world began to dissolve, replaced by cognitive expansion that enabled technological revolution and innovation. This shift explains why the last 150 years have seen more technological advancement than all previous centuries combined—a transformation that severed our connection to embodied intelligence and created our current predicament.

Consequently, we developed increasing technological dependence for information gathering and storage, which also generated endless theoretical models. Computers suddenly enabled infinite juxtaposition of isolated information fragments that eventually bred theoretical structures with no relation to reality. What we call "common sense"—intelligence derived from experience and deep sensory impressions—began fading. As a result, collective science began to progressively disconnect from nature.

This is particularly evident in modern healthcare, where doctors increasingly function as technicians operating machines rather than practitioners with intuitive diagnostic abilities developed through experience. When machines indicate everything is normal, many physicians simply parrot this assessment to patients, having lost the capacity for independent evaluation based on experience and intuition. The proof is in the pudding: many prognoses of disease remain elusive precisely because the machine has not been able to detect them, and its operator has become its servant, rather than its master.

Today, we pour millions of dollars into "cutting-edge" scientific studies only to arrive at conclusions our ancestors understood intuitively centuries ago—then proudly call this rediscovery of ancient wisdom "scientific progress."

This predicament existed long before AI came to prominence. If human automation and technological dependence were already problematic before advanced AI, why would further technological abstraction be our salvation?

The enthusiastic claims that "AI will revolutionize everything for us" reflect precisely the fragmented consciousness we now inhabit. Such enchantment is ungrounded in reality and consistently overlooks the human element—as if humans themselves are no longer interesting or valuable, another reflection of our collective desensitization.

The fascination with AI primarily revolves around automation and speed, but these attributes create unintended consequences that generate new problems requiring human energy and intervention. Even when AI makes errors (which it inevitably does), humans must still invest energy filtering these mistakes. Thus, speed and automation can never substitute for human intelligence and awareness—qualities becoming increasingly rare in our society.

In reality, AI as currently conceived represents the end stage of human cognitive atrophy and sensory capacity. It further fossilizes humanity and calcifies our cognitive faculties—not because it's inherently harmful, but because it serves as an extension of the "technological revolution"—a synthetic filler for the loss of sensory-tactile intuitive knowledge—which spellbinds individuals toward life-hack and shortcut mentality over the long-road approach that remains the only path to developing fully-formed human awareness.

The fundamental problem is that we continue neglecting the human element while advancing technology precisely devoid of that essential component. This is how we reach a surreal situation where AI potentially creates more enslavement, more work, and more problems. What we're actually doing is furthering our undoing because we fail to address the widespread desensitization and fragmentation of humanity occurring at unprecedented speed.

The evidence is clear: each new generation appears increasingly handicapped in addressing life's fundamental challenges. Today's average human has become the perfect automaton—utterly dependent on technological interfaces and increasingly incapable of functioning independently from them. Remove these devices and many cannot navigate basic challenges of existence. Disconnect electricity for a week and watch the spiritual collapse that follows. Why? Because many have nothing else to fill their empty vessels.

AI doesn't solve this problem—it accelerates it, unless we radically reimagine our relationship with technology and reclaim our embodied intelligence. Everything goes back to the human element, or as my rancher friend used to say: "Who rides whom? Is it you riding the horse, or the horse riding you?"

Expand full comment

Niclas