howdareme9's comments

howdareme9 · 2026-05-26T09:17:49 1779787069

this was for claude code i believe

howdareme9 · 2026-05-19T08:34:33 1779179673

Only because last time they tried to hide it lol

trymas · 2026-05-19T10:40:56 1779187256

Yes and if I remember the drama correctly - Kimi's license or terms of use says that for commercial use cases (or was it user count?) - you must declare credit to Moonshot and Kimi.

Lennie · 2026-05-19T10:56:24 1779188184

It's important to mention: they were compliant, because they trained the model at an AI hosting provider that had a partnership with Moonshot AI, but Moonshot didn't know Cursor was a customer.

Aurornis · 2026-05-19T14:07:34 1779199654

This was misinformed Twitter and Reddit drama.

They had properly licensed it and were complying with the terms of the license.

davidatbu · 2026-05-19T17:29:32 1779211772

Note that something that helped the misinformation was that, on Twitter, there were Kimi employees expressing their surprise that the base model was Kimi K2.5, and their indignation that Cursor didn't credit Kimi. They later deleted their tweets (what I infer from that is that some employees were not aware of some pre-existing agreement or understanding between Cursor and Kimi until the drama happened).

maxdo · 2026-05-19T10:59:53 1779188393

How can distilled opus become better than original? There are numbers of reports including anthropic that kimi team was participating in fraudulent activities

throwa356262 · 2026-05-19T11:41:51 1779190911

Do we know the "fraudulent " requests really came from moonshot engineers and was not QA team running a ton of benchmarks against other models?

I feel distilling something as big as Opus would require many many more samples, but I dont really know much about this subject

maxdo · 2026-05-19T16:40:39 1779208839

sure, sounds like QA lol

Scale: Over 3.4 million exchanges

The operation targeted:

Agentic reasoning and tool use Coding and data analysis Computer-use agent development Computer vision Moonshot (Kimi models) employed hundreds of fraudulent accounts spanning multiple access pathways. Varied account types made the campaign harder to detect as a coordinated operation. We attributed the campaign through request metadata, which matched the public profiles of senior Moonshot staff. In a later phase, Moonshot used a more targeted approach, attempting to extract and reconstruct Claude’s reasoning traces.

ta20240528 · 2026-05-19T17:06:16 1779210376

And when you here unsubstantiated rumours* that say Anthropic has been sending exchanges to say Alibaba's Qwen, will you als oconclude the same about the entire US AI industry?

I doubt it.

* publish the logs.

ifwinterco · 2026-05-19T17:21:05 1779211265

Even if it's true, it's not like US AI companies can complain, given their entire business is based on ripping off text without attribution

maxdo · 2026-05-20T03:04:53 1779246293

chinese ai is not doing the same? or they don't parse?

they do except they also send thousands of sex-spies to do espionage of this kind on the scale.

ifwinterco · 2026-05-20T05:08:49 1779253729

Of course they’re also doing this, my point is this is a grubby business where ethics went out of the window a long time ago.

If you’re playing this game in 2026 you know the rules - anything goes

ta20240528 · 2026-05-22T09:10:47 1779441047

"they also send thousands of sex-spies"

Could they send one (or two) my way?

howdareme9 · 2026-04-16T15:46:38 1776354398

They are constantly training and getting rid of older models, they are losing money

ACCount37 · 2026-04-16T16:08:20 1776355700

Which part of "over model lifetime" did you not understand?

adgjlsfhk1 · 2026-04-16T20:54:26 1776372866

That's not a sufficient condition for profitability if both inference and scaling costs continue to increase over time.

howdareme9 · 2026-04-16T09:26:09 1776331569

have you got a link to this?

rahimnathwani · 2026-04-16T12:55:55 1776344155

Sorry, I got the author wrong.

It's here: https://github.com/tmustier/pi-for-excel

howdareme9 · 2026-02-02T16:58:39 1770051519

5.2 Codex is up there with claude lmao

sandos · 2026-02-03T10:24:38 1770114278

Agree, but it seems dependent on field. One day I wanted a browser extension made, and 5.2-codex-max added hundreds of lines of code several times, and for 15-20 iterations I did not change one thing, or even have an opinion on what it was doing. This is extremely uncommon for other models for me, even Opus I would say. And yes, I mostly do small green-field things and not even that works all the time, even if LLMs are clearly at their best there.

howdareme9 · 2026-01-16T09:51:07 1768557067

Not likely at all, people pay for convenience. They don't want to do that

johanyc · 2026-01-17T11:20:49 1768648849

Yeah hackernews users kept thinking the average consumers like to tinker like we do lol