Exactly. If these tools are going to be so revolutionary and different within the next 6 months, and even more so beyond that, then there's no advantage to being an early adopter since your progress becomes invalid - you may as well wait until it is good enough.
> if it’s as easy as everyone says surely someone would try.
Yeah, 18 months ago we were apparently going to have personal SaaSes and all sorts of new software - I don't see anything but an even more unstable web than ever before
Are you able to predict with 100% accuracy when a loop will successfully unroll, or various interprocedural or intraprocedural analyses will succeed? They are applied deterministically inside a compiler, but often based on heuristics, and the complex interplay of optimizations in complex programs means that sometimes they will not do what you expect them to do. Sometimes they work better than expected, and sometimes worse. Sounds familiar...
> Are you able to predict with 100% accuracy when a loop will successfully unroll, or various interprocedural or intraprocedural analyses will succeed?
Yes, because:
> They are applied deterministically inside a compiler
Sorry, but an LLM randomly generating the next token isn't even comparable.
> Unless you wrote the compiler, you are 100% full of it. Even then you'd be wrong sometimes
You can check the source code? What's hard to understand? If you find it compiled something wrong, you can walk backwards through the code; if you want to find out what it'll do, walk forwards. LLMs have no such capability.
Sure, maybe you're limited by your personal knowledge of the compiler chain, but again, complexity =/= randomness.
For the same source code, and compiler version (+ flags) you get the exact same output every time. The same cannot be said of LLMs, because they use randomness (temperature).
> LLMs are also deterministically complex, not random
What exactly is the temperature setting in your LLM doing then? If you'd like to argue that the pseudorandom generators our computers use aren't random - fine, I agree. But for all practical purposes they're random, especially when you don't control the seed.
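(Concretely: temperature rescales the next-token logits before sampling, p_i ∝ exp(z_i / T), so T → 0 collapses to greedy, deterministic decoding, while larger T spreads probability mass across tokens and makes outputs vary from run to run.)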
> If you find it compiled something wrong, you can walk backwards through the code; if you want to find out what it'll do, walk forwards. LLMs have no such capability.
Right, so you agree that optimization outputs are not fully predictable in complex programs, and what you're actually objecting to is that LLMs aren't like compiler optimizations in the specific ways you care about, and somehow this is supposed to invalidate my argument that they are alike in the specific ways that I outlined.
I'm not interested in litigating the minutiae of this point: programmers who treat the compiler as a black box (ie. 99% of them) see probabilistic outputs. The outputs are generally reliable according to certain criteria, but unpredictable.
LLM models are also typically probabilistic black boxes. The outputs are also unpredictable, but also somewhat reliable according to certain criteria that you can learn through use. Where the unreliability is problematic you can often make up for their pitfalls. The need for this is dropping year over year, just as the need for assembly programming to eke out performance dropped year over year of compiler development. Whether LLMs will become as reliable as compiler optimizations remains to be seen.
> invalidate my argument that they are alike in the specific ways that I outlined
Basketballs and apples are both round, so they're the same thing right? I could eat a basketball and I can make a layup with an apple, so what's the difference?
> programmers who treat the compiler as a black box (ie. 99% of them) see probabilistic outputs
In reality this is at best the bottom 20% of programmers.
No programmer I've ever talked to has described compilers as probabilistic black boxes - and I'm sorry if your circle does. Unfortunately there's no use of probability, and all modern compilers are definitionally white boxes (open source).
You missed my point: the original comment is stating that the market for such a device doesn't exist because developers are too finicky and customisation-focussed.
As a counterexample - look at MacBooks, which are about as un-customisable as they come, but a large portion of developers use them, meaning the market exists even if it's currently dominated by Apple (which, as you/the post points out, is slipping).
Having said that, I do believe that many brands have way too many SKUs, and I wish they would be more opinionated about what they believe is better for their customers while maintaining clear and strong ethics (reliability should be the #1 priority).
Yeah, people don't realise this, but shame and guilt (and fear) are our two society-building emotions. Each society has its own mix of these, and there are also "themes" depending on which is the dominant one.
Shame has practically been thrown out the window in certain places and we can see the effects of that - people scamming each other, lying in the streets, etc. Guilt is also being eroded across the west, leading to things like rampant criminality and punishments that are less than a slap on the wrist.
Fundamentally these emotions are designed to keep us in check with the rest of the group. Does this negatively affect some? Yes, but with the benefit of creating high-trust societies. Every time I encounter this topic I can't help but think: don't throw the baby out with the bathwater.
+1 - I also see a huge opportunity for forgejo to become a new stackoverflow if they add federation
The primary issue with SO was that it was disconnected from the actual communities maintaining the project. A federated solution would be able to have the same network effects while handing ownership to the original community (rather than a separate SO branch of the community)
Loving this series! I'm currently implementing a Z80 emulator (Game Boy) and it's my first real introduction to CISC, and it's really pushing my assembly / machine code skills - so having these blog posts coming from the "other direction" is really interesting and gives me some good context.
I've implemented toy languages and bytecode compilers/VMs before, but seeing it from a professional perspective is just fascinating.
That being said, it was totally unexpected to find out we can use "addresses" for addition on x86.
A seasoned C programmer knows that "&arr[index]" is really just "arr + index" :) So in a sense, the optimizer rewrote "x + y" into "(int)&(((char*)x)[y])", which looks scarier in C, I admit.
This is, I am sure, one of the stupid legacy reasons we still write "lw a0, 4(a1)" instead of the more sensible "lw a0, a1[4]". The other one is that FORTRAN used round parentheses for both array access and function calls, so it stuck somehow.
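To make the "addresses for addition" thing concrete, here's a rough sketch (my own example, not from the series) of what a compiler typically emits for a plain int addition on x86-64:

    # hypothetical output for "int sum(int x, int y) { return x + y; }"
    # (SysV AMD64 ABI: x arrives in %edi, y in %esi, result goes in %eax)
    .globl sum
    sum:    leal (%rdi,%rsi), %eax   # effective-address arithmetic doing an ordinary add
            ret

No memory is touched; lea just reuses the address-calculation hardware to compute rdi + rsi.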
Generally such constant offsets are record fields in intent, not array indices. (If they were array indices, they'd need to be variable offsets obtained from a register, not immediate constants.) It's reasonable to think of record fields as functions:
    .equ car, 0
    .equ cdr, 8
    .globl length
    length: test %rdi, %rdi        # nil?
            jz   1f                # return 0
            mov  cdr(%rdi), %rdi   # recurse on tail of list
            call length
            inc  %rax
            ret
    1:      xor  %eax, %eax
            ret
To avoid writing out all the field offsets by hand, ARM's old assembler and I think MASM come with a record-layout-definition thing built in, but gas's macro system is powerful enough to implement it without having it built into the assembler itself. It takes about 13 lines of code: http://canonical.org/~kragen/sw/dev3/mapfield.S
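The rough idea, as a sketch (this is not the linked mapfield.S; the field names and default size here are just for illustration):

    # a gas macro that hands out consecutive offsets to named record fields
            .equ    _off, 0              # running offset; the name is arbitrary
            .macro  field name, size=8
            .equ    \name, _off          # e.g. "field cdr" defines cdr = current offset
            .equ    _off, _off + \size
            .endm

            field   car                  # car = 0
            field   cdr                  # cdr = 8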
Alternatively, on non-RISC architectures, where the immediate constant isn't constrained to a few bits, it can be the address of an array, and the (possibly scaled) register is an index into it. So you might have startindex(,%rdi,4) for the %rdi'th start index:
    .data
    startindex:
            .long 1024
    .text
    .globl length
    length: mov startindex+4(,%rdi,4), %eax
            sub startindex(,%rdi,4), %eax
            ret
If the PDP-11 assembler syntax had been defined to be similar to C or Pascal rather than Fortran or BASIC we would, as you say, have used startindex[%rdi,4].
This is not very popular nowadays both because it isn't RISC-compatible and because it isn't reentrant. AMD64 in particular is a kind of peculiar compromise—the immediate "offset" for startindex and endindex is 32 bits, even though the address space is 64 bits, so you could conceivably make this code fail to link by placing your data segment in the wrong place.
(Despite stupid factionalist stuff, I think I come down on the side of preferring the Intel syntax over the AT&T syntax.)
Yes, I find this one of the weird things about assembly - appending (or prepending?) a number means addition?! - even after many many years of occasionally reading/writing assembly, I’m never completely sure what these instructions do so I infer from context.
Not in C no, since arithmetic on a pointer is implicitly scaled by the size of the value being pointed at (this statement is kind of breaking the abstraction ... oh well).
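For instance (a sketch of my own, register assignments assumed), pointer arithmetic like "q = p + i" on an int pointer typically becomes a scaled address rather than a bare add:

    # hypothetical: "int *q = p + i;" with p assumed in %rdi and i in %rsi
            lea     (%rdi,%rsi,4), %rax   # the 4 is sizeof(int); C applied the scaling for you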