The RLM framing basically turns long-context into an RL problem over what to remember and where to route it: main model context vs Python vs sub-LLMs. That’s a nice instantiation of The Bitter Lesson, but it also means performance is now tightly coupled to whatever reward signal you happen to define in those environments. Do you have any evidence yet that policies learned on DeepDive / Oolong-style tasks transfer to “messy” real workloads (multi-week code refactors, research over evolving corpora, etc.), or are we still in the “per-benchmark policy” regime?
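For concreteness, here is roughly how I read that routing action space. This is a minimal hand-rolled sketch; the `Route` enum and `route` function are my own hypothetical names, not anything from the post:

```python
from enum import Enum, auto

class Route(Enum):
    """Hypothetical action space for the controller policy."""
    KEEP_IN_CONTEXT = auto()      # pay attention cost, keep verbatim
    OFFLOAD_TO_PYTHON = auto()    # bind to a REPL variable, query later
    DELEGATE_TO_SUB_LLM = auto()  # hand off to a cheaper sub-model call

def route(chunk: str, remaining_budget: int) -> Route:
    # A deliberately dumb hand-written policy; the RLM pitch, as I
    # understand it, is that RL learns this mapping from task reward
    # instead of us hard-coding it.
    if len(chunk) < 2_000 and remaining_budget > len(chunk):
        return Route.KEEP_IN_CONTEXT
    if chunk.lstrip().startswith(("{", "[", "<")):  # looks like structured data
        return Route.OFFLOAD_TO_PYTHON
    return Route.DELEGATE_TO_SUB_LLM
```

If the learned policy only ever beats this kind of if/else baseline on the benchmarks it was trained on, that's the "per-benchmark policy" regime I'm worried about.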
The split between main model tokens and sub-LLM tokens is clever for cost and context rot, but it also hides the true economic story. For many users the cost that matters is total tokens across all calls, not just the controller’s context. Some of your plots celebrate higher “main model token efficiency” while total tokens rise substantially. Do you have scenarios where RLM is strictly more cost-efficient at equal or better quality, or is the current regime basically “pay more total tokens to get around context limits”?
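To pin down what I mean by the accounting, here is a toy calculation with made-up token counts and rates (nothing here is from your plots):

```python
def total_cost(main_tokens: int, sub_calls: list[int],
               main_rate: float, sub_rate: float) -> float:
    """Total spend across the controller and all sub-LLM calls."""
    return main_tokens * main_rate + sum(sub_calls) * sub_rate

# "Main model token efficiency" improves (10k -> 4k controller tokens),
# but total tokens grow from 10k to 54k, and even with a 5x cheaper
# sub-model the bill goes up:
plain = total_cost(10_000, [], main_rate=1.0, sub_rate=0.2)              # 10000.0
rlm   = total_cost(4_000, [30_000, 20_000], main_rate=1.0, sub_rate=0.2)  # 14000.0
```

A plot of quality vs. this total-cost number, rather than vs. controller tokens, is the comparison I'd want to see.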
math-python is the most damning data point: same capabilities, but the RLM harness makes models worse and slower. That feels like a warning that a “more flexible scaffold” is not automatically a win; you’re introducing an extra layer of indirection that the model has not been optimized for. The claim that RL training over the RLM will fix this is plausible, but also unfalsifiable until you actually show a model that beats a strong plain-tool baseline on math with less wall-clock time and fewer tokens.
Oolong and verbatim-copy are more encouraging: the controller treating large inputs as opaque blobs and then using Python + sub-LLMs to scan/aggregate is exactly the kind of pattern humans write by hand in agents today. One thing I’d love to see is a comparison vs a well-engineered non-RL agent baseline that does essentially the same thing but with hand-written heuristics (chunk + batch + regex/SQL/etc.). Right now the RLM looks like a principled way to let the model learn those heuristics, but the post doesn’t really separate “benefit from architecture” vs “benefit from just having more structure/tools than a vanilla single call.”
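Concretely, the baseline I have in mind is something this dumb. Everything here is hypothetical hand-rolled heuristics, not the RLM's method:

```python
import re

def chunk_scan(corpus: str, pattern: str,
               chunk_size: int = 8_000, overlap: int = 200) -> list[str]:
    """Hand-written chunk + scan + aggregate: no learning, just heuristics."""
    step = chunk_size - overlap
    # Overlapping chunks so matches spanning a boundary aren't lost.
    chunks = [corpus[i:i + chunk_size] for i in range(0, len(corpus), step)]
    hits: list[str] = []
    for chunk in chunks:  # the "batch" step; each chunk could also go to a cheap LLM call
        hits.extend(re.findall(pattern, chunk))
    return sorted(set(hits))  # aggregate: dedupe before one final synthesis call
```

If the RLM beats that by a wide margin at comparable cost, the architecture is doing real work; if it roughly ties, the win came from structure and tools, not from learning.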
On safety / robustness: giving the model a persistent Python REPL and arbitrary pip is powerful, but it also dramatically expands the attack surface if this ever runs on untrusted inputs. Are you treating RLM as strictly a research/eval harness, or do you envision this being exposed in production agent systems? If the latter, sandboxing guarantees and resource controls probably matter as much as reward curves.
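Even cheap OS-level caps would be a start. This sketch only uses the standard-library `resource` and `subprocess` modules and is obviously not a real sandbox (a production setup would add containers/seccomp and a network-less environment):

```python
import resource
import subprocess

def run_untrusted(code: str, timeout_s: int = 10) -> str:
    """Run generated code in a child process with hard resource caps (POSIX only)."""
    def limits() -> None:
        # Cap CPU seconds and address space before the child exec's.
        resource.setrlimit(resource.RLIMIT_CPU, (timeout_s, timeout_s))
        resource.setrlimit(resource.RLIMIT_AS, (512 * 2**20, 512 * 2**20))  # 512 MiB
    proc = subprocess.run(
        ["python3", "-c", code],
        preexec_fn=limits, capture_output=True, text=True, timeout=timeout_s,
    )
    return proc.stdout
```

None of that addresses prompt injection through the inputs themselves, which is the part that scares me most once sub-LLMs are summarizing untrusted documents.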
The beauty of Suno, at least for me, was the opportunity to turn my original lyrics into listenable music for free, without having it attached in any way to any of the big labels, who are evil to the core. I really hope they keep the existing user experience intact.
Hello, I'm one of the original evangelists for Ruby on Rails and the author of The Rails Way as well as Patterns of Application Development Using AI. Over the past three decades, I’ve led teams and built products at every scale — from early-stage startups to global platforms — combining deep technical expertise with a creative, forward-looking approach to software craftsmanship.
I bring 30 years of hands-on engineering experience, including senior leadership in architecture, AI integration, and product strategy. Whether working as an individual contributor or guiding organizations through transformation, I focus on delivering clarity, velocity, and sustainable innovation. My last gig was leading AI strategy related to Developer Experience at Shopify.
Currently evaluating consulting and permanent opportunities, with a preference for an executive leadership position at a larger company, although I will consider consulting and fractional-CTO roles for startups and smaller ventures if the project and team are interesting enough.
Big news in the music AI space. Interesting and potentially worrying implications for Suno, which has pulled far ahead in the race and recently announced a $150M ARR milestone.
My biggest TIL takeaway from that article was an "oh wow" moment:
> The other sound that ‘ȝ’ once spelled is the “harsh” or “guttural” sound made in the back of the mouth, which you hear in Scots loch or German Bach. This sound is actually the reason for the most famous bit of English spelling chaos: the sometimes-silent, sometimes-not sequence ‘gh’ that you see in laugh, cough, night, and daughter. Maybe one day I’ll tell you that story too.
I personally am more fond of provoking an "angstschreeuw" (a scream of fear) in English speakers by asking them to pronounce "slechtstschrijvend" ("worst-writing") or "zachtstschrijdend" ("softest-striding") and watching them recoil in horror at the consonant clusters[0][1].
> I'd love to integrate with whatever model subscription is available but it seems using Max outside of Claude products is against their terms.

I suggest reaching out to Anthropic and letting them know you would like to use your Max subscription with other coding agents.
Like it's probably a good thing for humanity if the USA does not feel the need to go to war with China over Taiwan.