Hacker Newsnew | past | comments | ask | show | jobs | submit | atherton33's commentslogin

I agree with you about what's described here.

There is engineering when this is done seriously, though.

Build a test set and design metrics for it. Do rigorous measurement on any change of the system, including the model, inference parameters, context, prompt text, etc. Use real statistical tests and adjust for multiple comparisons as appropriate. Have monitoring that your assumptions during initial prompt design continue to be valid in the future, and alert on unexpected changes.

I'm surprised to see none of that advice in the article.


This article talks about prompt evals https://www.anthropic.com/engineering/writing-tools-for-agen.... There are plenty of approaches to provide some degree of rigor around the slot machine output.


My first job we had office rooms shared by 2-4 that were usually pretty quiet with a generally closed door policy.

The VP Eng would always say "I always try to remember it costs the company over a hundred dollars for me open one of these doors."

I learned so much from that boss.


Cambridge MA just had a major shift on zoning towards "build more" in Q1. It will be an interesting natural experiment to watch.

https://www.boston.com/news/politics/2025/02/11/cambridge-el...


I think they're saying something more subtle.

In your basketball analogy, it's more like they have a model that predicts basketball performance, and they're saying that model should predict performance equally well across groups, not that the groups should themselves perform equally well.


You’re right, I misinterpreted it.


I don't know the detail here, but in many of the discussions I've seen the operators themselves are based in the UK, and that changes the calculus.


Yeah, GP is, to put it charitably, not understanding the situation.

> About Us

> HEXUS.net is the UK’s number one independent technology news and reviews website.


From AP, RFK said:

> "COVID-19 is targeted to attack Caucasians and Black people. The people who are most immune are Ashkenazi Jews and Chinese,” he added. “We don’t know whether it was deliberately targeted at that or not but there are papers out there that show the racial or ethnic differential of impact for that."

The article notes he claims that this quote "twisted" his words.

https://apnews.com/article/robert-f-kennedy-jr-covid-comment...


> This move absolutely will drive out some of their best talent

IMHO, from my personal insider experience, this is actually the goal in some places.

Best talent is often not the most cost effective talent, especially in parts of the business where the company has switched from innovating to maintaining.


From the article, it's not 12 total, but 12 per user for 100 concurrent generations.


Thank you atherton friend.


> Book a meeting room if you want to have a real conversation.

But act fast, the next available slot is in two weeks, assuming an exec doesn't claim the room for that full day that morning.


This totally happened on my first team.

We had a guy who would argue about everything that knew the CTO so we had to tolerate him.

Then we hired a second one and they just argued with each other all the time and the rest of the team could finally make progress.

It was awesome.


I have a feeling I was hired to be this second guy and I really just want to hang up my gloves, man


Thank you for your service


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: