Ah, it’s a good time to check in with gwern on our conversation about oAI vs Ant...

thoughtpeddler · 2026-05-30T15:54:51 1780156491

Mechinterp in general is just completely undervalued right now (and agreed Anthropic's team is doing the most rigorous work, now accompanied by Goodfire). They're doing the closest work to neuroscience's in vivo 'thought-tracing', which is just the most wild science fiction sort of thing to be working on, and yet I feel the average person has no idea this sort of work is happening. When combined with the idea of the 'universal subspace hypothesis' (explored in the paper of the same name), you really start to bridge the gap from engineering to something more philosophical and spiritual. But I digress...

bkjlblh · 2026-05-31T07:55:05 1780214105

It's not undervalued; many people are working on it, following Anthropic's lead. It just doesn't seem to be useful, so if anything it's overvalued.

gom_jabbar · 2026-05-30T16:24:19 1780158259

Haven't heard about the universal subspace hypothesis yet, so I appreciate the digression.

thoughtpeddler · 2026-05-30T16:57:54 1780160274

Ya, it's a super interesting research area. The authors are basically trying to answer the question: "Is there a canonical/intrinsic way that concepts/representations/information are 'stored' in the universe/reality?"

They tested that by performing "spectral analysis of over 1100 models - including 500 Mistral-7B LoRAs, 500 Vision Transformers, and 50 LLaMA-8B models ... by applying spectral decomposition techniques to the weight matrices of various architectures", and concluded that "deep neural networks trained across diverse tasks exhibit remarkably similar low-dimensional parametric subspaces", showing that "neural networks systematically converge to shared spectral subspaces regardless of initialization, task, or domain".
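
To make the idea concrete, here is a toy sketch of what "shared spectral subspace" means. This is my own illustration on synthetic matrices, not the paper's pipeline: the function name `shared_subspace_energy`, the matrix sizes, and the noise level are all made up for the example. It stacks the flattened weights of many "models", takes an SVD, and asks how much of the total energy the top few directions capture.

```python
import numpy as np

def shared_subspace_energy(weights, k):
    """Fraction of total squared norm captured by the top-k shared directions."""
    # Flatten each model's weight matrix into one row: shape (n_models, n_params).
    stacked = np.stack([w.ravel() for w in weights])
    # Squared singular values measure the energy along each direction.
    s = np.linalg.svd(stacked, compute_uv=False)
    energy = s ** 2
    return float(energy[:k].sum() / energy.sum())

rng = np.random.default_rng(0)
n_models, shape, k = 50, (16, 16), 4  # hypothetical toy sizes

# Synthetic "models": random mixes of the same k basis matrices plus small noise.
basis = rng.normal(size=(k, *shape))
shared = [
    np.tensordot(rng.normal(size=k), basis, axes=1) + 0.01 * rng.normal(size=shape)
    for _ in range(n_models)
]

# Control: unrelated random matrices with no shared structure.
unrelated = [rng.normal(size=shape) for _ in range(n_models)]

shared_frac = shared_subspace_energy(shared, k)
unrelated_frac = shared_subspace_energy(unrelated, k)
print(f"shared: {shared_frac:.3f}, unrelated: {unrelated_frac:.3f}")
```

In the shared case nearly all the energy sits in the top 4 directions, while the unrelated control spreads it across all 50. The claim quoted above is that real trained networks look more like the first case than you would expect.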

Not just philosophically interesting but also has practical implications for being smarter about how to reuse models, model merging, developing more sustainable training and inference algos, etc.

Paper source: https://arxiv.org/abs/2512.05117

vessenes · 2026-05-31T23:37:23 1780270643

Very Chomsky friendly. Interesting paper, thank you!

Kye · 2026-05-30T21:47:11 1780177631

Maybe related: https://news.ycombinator.com/item?id=47322887

vessenes · 2026-05-31T23:41:52 1780270912

Yeah that layer looping is super interesting. Also, it’s memory friendly with the right inference harness.

janussunaj · 2026-05-30T16:22:58 1780158178

Did you also talk about "head and shoulders" and "pennant" patterns in stock charts? Or where the "smart money" is at? I'd like to subscribe to your paid newsletter.

vessenes · 2026-05-31T23:40:44 1780270844

This is a super low quality comment. I try to put my current thoughts out here largely because high quality comments refine my thinking.

What about the original conversation seems correct / incorrect / interestingly correct or incorrect to you? What about my summary now seems correct/incorrect, etc?

Ultimately the big value is to me — I get to look back at some dialogue and then check where I had it right, wrong and interestingly wrong. My personal hypothesis is that doing this a lot compounds in a good way.