More

catmanjan · 2025-11-12T04:56:33 1762923393

My exact reaction

catmanjan · 2025-10-30T05:38:35 1761802715

>food is largely a solved problem

It really isn't...

dragonwriter · 2025-10-30T06:52:22 1761807142

Distribution is an issue, but the imminent capacity issue perceived in the late 1960s when The Population Bomb was written was already being solved when it was entering the popular consciousness (but the impact of the solutions had not been fully appreciated) by the Green Revolution through high-yield crop varieties and other advanced in agriculture.

adrianN · 2025-10-30T06:37:49 1761806269

Production of calories is a solved problem. Distribution of food to people in need on the other hand…

tsimionescu · 2025-10-30T10:51:56 1761821516

It's not really a solved problem, we're depleting many extremely slow to recover resources in order to produce the amounts we are today.

matheusmoreira · 2025-10-30T09:25:06 1761816306

There's more to nutrition than calories. Generally speaking: the more nutritive, the more expensive.

coldtea · 2025-10-30T08:29:43 1761812983

Yeah, spoken like someone who only understands food as something that magically and without fail appears on their local stores.

samarthr1 · 2025-10-30T06:37:12 1761806232

*logistics of food is not solved?

jl6 · 2025-10-30T07:01:38 1761807698

Neither production nor logistics is solved at all. We have bought ourselves time, largely by racking up environmental debt on our planetary credit card. Food is still massively dependent on fossil fuel consumption (machinery, transport, fertilizer).

The good news is that the answer is to reduce the cost and carbon impact of energy production, and we’re making great progress here, but we cannot afford to take our foot off the gas, because although Ehrlich was wrong about the timing, he wasn’t wrong in his fundamental observation that the Earth has a finite carrying capacity.

rat87 · 2025-10-31T22:36:22 1761950182

The idea that he was off on the timing is wrong. He was wrong and continued to be wrong even as he insisted his predictions would come any day now

catmanjan · 2025-10-27T08:17:26 1761553046

Why is that disconcerting? There's simply too much software to be familiar with obscure features like this

JackMorgan · 2025-10-27T11:35:06 1761564906

Because too many bad interviews are all about ensuring that the candidate knows the exact same 1% of CS/SWE knowledge as the interviewer.

Don't worry, karma dictates when the interviewer goes looking they'll get rejected for not knowing some similarly esoteric graph theory equation or the internal workings of a NIC card.

Too much of our interviewing is reading the interviewer's mind or already knowing the answer to a trick question.

The field is way too vast for anyone to even know a majority, and realistically it's extremely difficult to assess if someone is an expert in a different 1%.

Sometimes I feel like we need a system for just paying folks to see if they can do the job. Or an actually trusted credentialing system where folks can show what they've earned with badges and such.

A better interview question about this subject doesn't assume they have it memorized, but if they can find the answer in a short time with the internet or get paralyzed and give up. It's a very important skill to be able to recognize you are missing information and researching it on the Internet.

For example, one of my most talented engineers didn't really know that much about CS/SWE. However, he had some very talented buddies on a big discord server who could help him figure out anything. I kid you not, this kid with no degree and no experience other than making a small hobby video game would regularly tackle the most challenging projects we had. He'd just ask his buddies when he got stuck and they'd point him to the right blog posts and books. It was like he had a real life TRRPG Contacts stat. He was that hungry and smart enough to listen to his buddies, and then actually clever enough to learn on the job to figure it out. He got done more in a week than the next three engineers of his cohort combined (and this was before LLMs).

So maybe what we should test isn't data stored in the brain but ability to solve a problem given internet access.

BatteryMountain · 2025-10-27T16:33:21 1761582801

Because the same kind of guys have one global ssh key they use for all server & all environments... they don't realise they can (and should) have multiple keys on one machine / one user. Different keys for different purposes.

Same issues with git: they don't realise they can have multiple configs, multiple remotes, etc. Never mind knowing how to sign commits.........

They claim to be linux boffins but cannot initialise a git repo. This has nothing to do with elitism. This is basic stuff.

What's next, they don't know what a bootloader or a partition is? Or run database engine with default settings? Or install a server OS and never bother to look at firewall config?

I'm truly not trying to be cruel.

catmanjan · 2025-10-27T20:23:19 1761596599

Most of those things are ops related, not software development

BatteryMountain · 2025-10-30T06:36:54 1761806214

Same analogues exist for software development.

Cthulhu_ · 2025-10-27T08:46:27 1761554787

In a sense it's anything but obscure, that is, it's one of the most basic of features of the tool and the first thing (well, git init anyway) anyone ever uses.

But that's why people don't know about it, because they skip past the basics because in practice you never use it or need to know about it.

This is the reality of software engineering and the like though - mostly you learn what you need to know, because learning everything is usually wasteful and never used, and there's a lot available.

(I haven't been able to read documentation or a software book end to end in 20 years)

catmanjan · 2025-10-24T10:04:22 1761300262

There's a big difference between 15 and 18 though...

jsheard · 2025-10-24T10:12:54 1761300774

I think most countries have much stricter enforcement for gambling age limits, too. If you sell a kid a copy of GTA5 that's their parents problem, but if you allow kids into your casino it's your problem.

josefx · 2025-10-24T10:49:14 1761302954

The problem is defining what falls under those laws. Companies sell trading card boxes with random contents. McDonalds had its Monopoly game. There are many more examples of things that are gambling with money, accessible to kids and still allowed in most countries.

voxic11 · 2025-10-24T13:58:44 1761314324

McDonalds Monopoly game was a sweepstakes, you could get game pieces for free by simply asking, which is why it doesn't fall afoul of gambling laws.

dcow · 2025-10-24T12:35:09 1761309309

McDonald’s at least has AMOE and you don’t have to spend a cent to play. It’s certainly the less convenient path, purposely, though.

CaptainOfCoit · 2025-10-24T14:51:06 1761317466

Typically legal gambling has age limits by law, while the age recommendation for video games is just that, an recommendation. It isn't illegal for a 14 year old to play a game recommended to 18 year olds. Don't know how it works in the US specifically, at least how it works in other places.

I'm guessing the video games industry's attempt at self-regulating with PEGI and similar efforts actually paid off.

averageRoyalty · 2025-10-26T21:28:30 1761514110

I can't speak for your country, but in Australia it's illegal to sell MA15+ rated material to an under 15, and R18+ material to an under 18. CS is MA15+.

the_sleaze_ · 2025-10-24T13:57:30 1761314250

Is there?

saghm · 2025-10-24T14:35:57 1761316557

From an objective legal standpoint in some jurisdictions, the answer is clearly yes

BigTTYGothGF · 2025-10-24T15:14:58 1761318898

By and large, yeah.

reaperducer · 2025-10-24T14:00:08 1761314408

Found the 14-year-old.

catmanjan · 2025-10-20T20:30:07 1760992207

I'd love to work with Azure search but because copilot with external items has been made so cheap it's hard to justify...

pamelafox · 2025-10-20T20:50:35 1760993435

Do you mean that you're using the Copilot indexer for Sharepoint docs? https://learn.microsoft.com/en-us/microsoftsearch/semantic-i...

AI Search team's been working with the Sharepoint team to offer more options, so that devs can get best of both worlds. Might have some stuff ready for Ignite (mid November).

catmanjan · 2025-10-27T08:40:26 1761554426

No we have a Microsoft graph connector which inserts externalitems into graph, copilot is able to surface these, probably via the same semantic search database

The capability was there for years, but it was expensive - something like $0.60 per 1000 items indexed, then sometimes after copilot was added it became free for up to 50 million items, and now it's free for unlimited items - you just can't beat that for price... https://techcommunity.microsoft.com/blog/microsoft365copilot...

catmanjan · 2025-10-08T06:43:30 1759905810

Imagine how many millions would drive a Ferrari if they gave them away for free

re-thc · 2025-10-08T06:59:38 1759906778

> Imagine how many millions would drive a Ferrari if they gave them away for free

0.

Ferrari is a luxury sports brand. What's the point of it if it flooded the streets?

high_na_euv · 2025-10-08T07:37:49 1759909069

Good looking, powerful, reliable car?

fragmede · 2025-10-09T07:19:47 1759994387

> Good looking, powerful, reliable car?

How to say you don't own a Ferrari without saying you don't own a Ferrari.

simianwords · 2025-10-08T06:58:35 1759906715

It’s not true that people are only using it because it’s free.

It’s actually quite interesting to see these contradictory positions play out:

1. LLMs are useless and everyone is making a stupid bet on it. The users of llms are fooled into using it and the companies are fooled into betting on it

2. Llms are getting so cheap that the investments into data centers won’t pay off because apparently they will get good enough to run on your phone

3. Llms are bad and they are bad for environment, bad for the brain, bad because they displace workers and bad because they make rich people richer

4. AI is only kept up because there’s a conspiracy to keep it propped up by Nvidia, oracle, OpenAI (something something circular economy)

5. AI is so powerful that it should not be built or humanity would go extinct

catmanjan · 2025-10-08T09:17:00 1759915020

It is true that none of the LLM providers are profitable though, so there is some number above free that they need to charge and I am not convinced that number is compelling

simianwords · 2025-10-08T09:52:05 1759917125

None of LLM providers being profitable is exactly the situation I would expect. Them being profitable is so absurd on the contrary! Why wouldn't they put the money back into R&D and marketing?

catmanjan · 2025-10-08T10:37:48 1759919868

I'm not well versed with the accountant terminology, whatever the word is to describe the operating cost, I am not convinced consumers will ever pay enough to cover those costs

simianwords · 2025-10-08T12:15:02 1759925702

Do you think if LLM's become 10 times more efficient it might covert he costs? What efficiency increase would you think is enough?

snovv_crash · 2025-10-08T16:13:43 1759940023

It's a competitive environment, no way the data centers manage to capture that 10x efficiency improvement. There would be an expectation of 10x reduced prices, because someone else is offering that.

catmanjan · 2025-10-08T23:17:50 1759965470

The problem I see as someone who has implemented a bunch of AI solutions in a range of markets, the quality isn't good enough yet to even think about efficiency - even if the current AI is 100x more efficient it still wouldn't be worth paying for because it doesn't deliver reliable and trustable results...

snovv_crash · 2025-10-08T16:11:05 1759939865

A) Huge straw man, since it isn't the same people making those points. None of those need the other to be true to cause issues, they are independent concerns.

B) You're missing a few things like:

1. The hardware overhang of edge compute (especially phones) may make the centralized compute investments irrelevant as more efficient LLMs (or whatever replaces them) are released.

2. Hardware depreciates quickly. Are these massive data centers really going to earn their money back before a more efficient architecture makes them obsolete? Look at all the NPUs on phones which are useless with most current LLMs due to insufficient RAM. Maybe analogue compute takes off, or giant FPGAs, which can do on a single board what is done with a rack at the moment. We are nowhere near a stable model architecture, or stable optimal compute architecture. Follow the trajectory of bitcoin and etherium mining here to see what we can expect.

3. How does one company earn back their R&D when the moment it is released, competition puts out comparable models within 6 months, possibly by using the very service that was provided to generate training data.

catmanjan · 2025-08-20T04:54:29 1755665669

In this scenario Copilot is performing RAG, so the auditing occurs when Copilot returns hits from the vector search engine its connected to - it seems there was a bug where it would only audit when Copilot referenced the hits in its result.

The correct thing to do would be to have the vector search engine do the auditing (it probably already does, it just isn't exposed via Copilot) because it sounds like Copilot is deciding if/when to audit things that it does...

catmanjan · 2025-08-20T01:31:47 1755653507

As someone else mentioned the file isnt actually accessed by copilot, rather copilot is reading the pre-indexed contents of the file in a search engine...

Really Microsoft should be auditing the search that copilot executes, its actually a bit misleading to be auditing the file as accessed when copilot has only read the indexed content of the file, I don't say I've visited a website when I've found a result of it in Google

eschneider · 2025-08-20T09:31:59 1755682319

Oh, so there's a complete copy (or something that can be reassembled into a copy) completely OUTSIDE of audit controls. That's so much worse. :0

Drakim · 2025-08-20T10:21:03 1755685263

It's roughly the same problem as letting a search engine build indexes (with previews!) of sites without authentication. It's kinda crazy that things were allowed to go this far with such a fundamental flaw.

Scubabear68 · 2025-08-20T11:41:03 1755690063

Yep. Many years ago I worked at one of the top brokerage houses in the United States, they had a phenomenal Google search engine in house that made it really easy to navigate the whole company and find information.

Then someone discovered production passwords on a site that was supposed to be secured but wasn’t.

Found such things in several places.

The solution was to make searching work only if you opted-in your website.

After that internal search was effectively broken and useless.

All because a few actors did not think about or care about proper authentication and authorization controls.

falcor84 · 2025-08-20T11:11:32 1755688292

I'm unclear on what the "flaw" is - isn't this precisely the "feature" that search engines provide to both sides and that site owners put a ton of SEO effort into optimizing?

Drakim · 2025-08-20T11:23:30 1755689010

If you have public documents, you can obviously let a public search engine index them and show previews. All is good.

If you have private documents, you can't let a public search engine index and show previews of those private documents. Even if you add an authentication wall for normal users if they try to open the document directly. They could still see part of the document in google's preview.

My explanation sounds silly because surely nobody is that dumb, but this is exactly what they have done. They gave access to ALL documents, both public and private, to an AI, and then got surprised when the AI leaked some private document details. They thought they were safe because users would be faced with an authentication wall if they tried to open the document directly. But that doesn't help if copilot simply tells you all the secret in it's own words.

andrewaylett · 2025-08-20T13:38:20 1755697100

You say that, but it happens — "Experts Exchange", for example, certainly used to try to hide the answers from users who hadn't paid while encouraging search engines to index them.

thayne · 2025-08-20T14:10:47 1755699047

That's not quite the same. Experts Exchange wanted the content publicly searchable, and explicitly allowed search engines to index it. In this case, many customers probably aren't aware that there is a separate search index that contains much of the data in their private documents that may be searchable and accessible by entities that otherwise shouldn't have access.

polishdude20 · 2025-08-20T23:47:52 1755733672

That's not necessarily what happened in the article. He wasn't able to access private docs. He was just able to tell Copilot to not send an audit log.

oeitho · 2025-08-20T07:55:11 1755676511

> Really Microsoft should be auditing the search that copilot executes, its actually a bit misleading to be auditing the file as accessed when copilot has only read the indexed content of the file, I don't say I've visited a website when I've found a result of it in Google

Not my domain of expertise, but couldn't you at some point argue that the indexed content itself is an auditable file?

It's not literally a file necessarily, but if they contain enough information that they can be considered sensitive, then where is the significant difference?

jacquesm · 2025-08-20T09:43:10 1755682990

Not only could you do that, you should do that.

eleveriven · 2025-08-20T06:33:50 1755671630

That makes sense on a technical level, but from a security and compliance perspective, it still doesn't really hold up

whizzter · 2025-08-20T07:20:58 1755674458

Usage of Ai's almost by definition need everything indexed at all times to be useful, letting one rummage through your stuff without 100% ownership is just madness to begin with and avoiding deep indexing would make the shit mostly useless unless regular permission systems were put in (and then we're kinda back at were we were without AI's).

internetter · 2025-08-20T04:18:44 1755663524

> I don't say I've visited a website when I've found a result of it in Google

I mean, it depends on how large the index window is, because if google returned the entire webpage content without leaving (amp moment), you did visit the website. fine line.

bw86 · 2025-08-20T07:34:14 1755675254

The challenge then is to differentiate between "I wanted to access the secret website/document" and "Google/Copilot gave me the secret website/document, but it was not my intention to access that".

swiftcoder · 2025-08-20T10:36:47 1755686207

Access is access. Regardless of whether you intended to view the document, you are now aware of its content in either case, and an audit entry must be logged.

ceridwyn · 2025-08-20T15:10:15 1755702615

Strongly agree. Consider the case of a healthcare application where, during the course of business, staff may perform searches for patients by name. When "Ada Lovelace" appears even briefly in the search results of a "search-as-you-type" for some "Adam _lastname", has their privacy has been compromised? I think so, and the audit log should reflect that.

I'm a fan of FHIR (a healthcare api standard, but far from widely adopted), and they have a secondary set of definitions for Audit log patterns (BALP) that recommends this kind of behaviour. https://profiles.ihe.net/ITI/BALP/StructureDefinition-IHE.Ba...

"[Given a query for patients,] When multiple patient results are returned, one AuditEvent is created for every Patient identified in the resulting search set. Note this is true when the search set bundle includes any number of resources that collectively reference multiple Patients."

Faark · 2025-08-20T06:20:44 1755670844

What's the solution then? Chain 2 AIs, the first one is fine tuned on / has RAG access to your content telling a second that actually produces content what files are relevant (and logged)?

Or just a system prompt "log where all the info comes from"...

chrisweekly · 2025-08-20T14:04:37 1755698677

Someone please confirm my idea (or remedy my ignorance) about this rule of thumb:

Don't train a model on sensitive info, if there will ever be a need for authZ more granular than implied by access to that model. IOW, given a user's ability to interact w/ a model, assume that everything it was trained on is visible to that user.

catmanjan · 2025-08-11T03:52:06 1754884326

I'm pretty sure what you're describing is the fact that Microsoft return Graph scopes by default when you request a token, I agree it is very annoying and only really documented if you read between the lines...

catmanjan · 2025-08-02T22:07:32 1754172452

How can anyone still believe the AGI scam

atleastoptimal · 2025-08-02T22:10:52 1754172652

If you think the possibility of AGI within 7-10 years is a scam then you aren't paying attention to trends.

godelski · 2025-08-02T22:43:51 1754174631

I wouldn't call 7-10 years a scam, but I would call it low odds. It is pretty hard to be accurate on predictions of a 10 year window. But I definitely think 2027 and 2030 predictions are a scam. Majority of researchers think it is further away than 10 years, if you are looking at surveys from the AI conferences rather than predictions in the news.

atleastoptimal · 2025-08-02T23:04:14 1754175854

The thing is, AI researchers have continually underestimated the pace of AI progress

https://80000hours.org/2025/03/when-do-experts-expect-agi-to...

>One way to reduce selection effects is to look at a wider group of AI researchers than those working on AGI directly, including in academia. This is what Katja Grace did with a survey of thousands of recent AI publication authors.

>In 2022, they thought AI wouldn’t be able to write simple Python code until around 2027.

>In 2023, they reduced that to 2025, but AI could maybe already meet that condition in 2023 (and definitely by 2024).

>Most of their other estimates declined significantly between 2023 and 2022.

>The median estimate for achieving ‘high-level machine intelligence’ shortened by 13 years.

Basically every median timeline estimate has shrunk like clockwork every year. Back in 2021 people thought it wouldn't be until 2040 or so when AI models could look at a photo and give a human-level textual description of its contents. I think is reasonable to expect that the pace of "prediction error" won't change significantly since it's been on a straight downward trend over the past 4 years, and if it continues as such, AGI around 2028-2030 is a median estimate.

jodrellblank · 2025-08-03T01:16:33 1754183793

> "Back in 2021 people thought it wouldn't be until 2040 or so when AI models could look at a photo and give a human-level textual description of its contents."

Claim doesn't check out; here's a YouTube video from Apple uploaded in 2021, explaining how to enable and use the iPhone feature to speak a high level human description of what the camera is pointed at: https://www.youtube.com/watch?v=UnoeaUpHKxY

p1esk · 2025-08-02T23:26:51 1754177211

Exactly. There’s one guy - Ray Kurzweil - who predicted in late 90s that AGI will happen in 2029 (yes, the exact year, based on his extrapolations of Moore’s law). Everybody laughed at him, but it’s increasingly likely he’ll be right on the money with that prediction.

saagarjha · 2025-08-03T08:35:50 1754210150

Remember when he said nanobots would cure all our diseases by a few years ago?

p1esk · 2025-08-03T09:06:01 1754211961

I don’t actually, did he specify a year? He made a lot of predictions, many are wrong, but the AGI one is pretty amazing.

saagarjha · 2025-08-03T09:25:28 1754213128

2020s was my understanding; he made this prediction around the time that he made the AGI one. I think he has recently pushed it back to 2030s because it seems unlikely to come true.

godelski · 2025-08-03T18:45:09 1754246709

  > many are wrong, but the AGI one is pretty amazing.

If you make enough predictions eventually one will be right

Or at least one will be exciting

parineum · 2025-08-02T23:19:34 1754176774

Those are differences of magnitude, AGI is a difference of kind.

No amount of describing pictures in natural language is AGI.

atleastoptimal · 2025-08-02T23:23:28 1754177008

I never said it was sufficient for AGI, just that it was a milestone in AI that people thought was farther off than it turned out to be. This is applying to all subsets of intelligence AI is reaching earlier than experts initially predicted, giving good reason AGI (perhaps a synthesis of these elements coming together in a single model, or a suite of models) is likely closer than standard expert consensus.

parineum · 2025-08-02T23:47:34 1754178454

The milestones your citing are all milestones of transformers that were underestimated.

If you think an incremental improvement in transformers are what's needed for AGI, I see your angle. However, IMO, transformers haven't shown any evidence of that capability. I see no reason to believe that they'd develop that with a bit more compute or a bit more data.

godelski · 2025-08-03T00:01:35 1754179295

It's also worth pointing out that in the same survey it was well agreed upon that success would come sooner if there was more funding. The question was a counterfactual prediction of how much less progress would be made if there was 50% less funding. The response was about 50% less progress.

So honestly, it doesn't seem like many of the predictions are that far off with this in context. That things sped up as funding did too? That was part of the prediction! The other big player here was falling cost of compute. There was pretty strong agreement that if compute was 50% more expensive that this would result in a decrease in progress by >50%.

I think uncontextualized, the predictions don't seem that inaccurate. They're reasonably close. Contextualized, they seem pretty accurate.

godelski · 2025-08-02T23:50:51 1754178651

  > The thing is, AI researchers have continually underestimated the pace of AI progress

What's your argument?

That because experts aren't good at making predictions that non-experts must be BETTER at making predictions?

Let me ask you this: who do you think is going to make a less accurate prediction?

Assuming no one is accurate here, everybody is wrong. So the question is who is more or less accurate. Because there is a thing as "more accurate" right?

  >> In 2022, they thought AI wouldn’t be able to write simple Python code until around 2027.

Go look at the referenced paper[0]. It is on page 3, last item in Figure 1, labeled "Simple Python code given spec and examples". That line is just after 2023 and goes to just after 2028. There's a dot representing the median opinion that's left of the vertical line half way between 2023 and 2028. Last I checked, 8-3 = 5, and 2025 < 2027.

And just look at the line that follows

  > In 2023, they reduced that to 2025, but AI could maybe already meet that condition in 2023

Something doesn't add up here... My guess, as someone who literally took that survey, is what's being referred to as "a simple program" has a different threshold.

Here's the actual question from the survey

  Write concise, efficient, human-readable Python code to implement simple algorithms like quicksort. That is, the system should write code that sorts a list, rather than just being able to sort lists.
  
  Suppose the system is given only:
    A specification of what counts as a sorted list
    Several examples of lists undergoing sorting by quicksort

Is the answer to this question clear? Place your bets now!

Here, I asked ChatGPT the question[1], it got it wrong. Yeah, I know it isn't very wrong, but it is still wrong. Here's an example of a correct solution[2] which shows the (at least) two missing lines. Can we get there with another iteration? Sure! But that's not what the question was asking.

I'm sure some people will say that GPT gave the right solution. So what that it ignored the case of a singular array and assumed all inputs are arrays. I didn't give it an example of a singular array or non-array inputs, but it did just assume. I mean leetcode questions pull out way more edge cases than I'm griping on here.

So maybe you're just cherry-picking. Maybe the author is just cherry-picking. Because their assertion that "AI could maybe already meet that condition in 2023" is not unobjectively true. It's not clear that this is true in 2025!

[0] https://arxiv.org/abs/2401.02843

[1] https://chatgpt.com/share/688ea18e-d51c-8013-afb5-fbc85db0da...

[2] https://www.geeksforgeeks.org/python/python-program-for-inse...

atleastoptimal · 2025-08-03T00:27:15 1754180835

>Go look at the referenced paper[0]. It is on page 3, last item in Figure 1, labeled "Simple Python code given spec and examples". That line is just after 2023 and goes to just after 2028. There's a dot representing the median opinion that's left of the vertical line half way between 2023 and 2028. Last I checked, 8-3 = 5, and 2025 < 2027.

The graph you're looking at is of the 2023 survey, not the 2022 one

As for your question, I don't see what it proves. You described the desired conditions for an a sorting algorithm and chatGPT implemented a sorting algorithm. In the case of an array with one element, it bypasses the for loop automatically and just returns the array. It is reasonable for it to assume all inputs are arrays because your question told it that its requirements were to create a program that " turn any list of numbers into a foobar."

Of course I'm not any one of the researchers asked about their predictions in the survey, but I'm sure if you told them "a SOTA AI in 2025 produced working human readable code based on a list of specifications, and is only incorrect by a broad characterization of what counts as an edge case that would trip up a reasonable human coder on the first try", I'm sure the 2022 or 2023 respondents would say that it meets their criteria for their threshold.

godelski · 2025-08-03T01:40:52 1754185252

  > As for your question, I don't see what it proves.

The author made a claim

I showed the claim was false

The author bases his argument on this and similar claims. Showing his claim is false says he's argument doesn't hold

  > and is only incorrect by a broad characterization

I don't know of I'd really call a single item an "edge case" so much as generalization.

But I do know I'd answer that question differently given your reframing.

BeFlatXIII · 2025-08-03T01:21:19 1754184079

Paying attention to trends is how you lose your money on a hype train.

catmanjan · 2025-08-03T00:26:34 1754180794

Even if we spent 1 million years on LLM it will not result in AGI, we are no closer to AGI with LLM technology than we were with toaster technology

exasperaited · 2025-08-03T00:39:06 1754181546

“Would you like a toasted teacake?”

jimmy2times · 2025-08-03T01:10:05 1754183405

I can't believe this is so unpopular here. Maybe it's the tone, but come on, how do people rationally extrapolate from LLMs or even large multimodal generative models to "general intelligence"? Sure, they might do a better job than the average person on a range of tasks, but they're always prone to funny failures pretty much by design (train vs test distribution mismatch). They might combine data in interesting ways you hadn't thought of; that doesn't mean you can actually rely on them in the way you do on a truly intelligent human.

wulfstan · 2025-08-03T06:08:28 1754201308

I think it’s selection bias - a y-combinator forum is going to have a larger percentage of people who are techno-utopianists than general society, and there will be many seeking financial success by connecting with a trend at the right moment. It seems obvious to me that LLMs are interesting but not revolutionary, and equally obvious that they aren’t heading for any kind of “general intelligence”. They’re good at pretending, and only good at that to the extent that they can mine what has already been expressed.

I suppose some are genuine materialists who think that ultimately that is all we are as humans, just a reconstitution of what has come before. I think we’re much more complicated than that.

LLMs are like the myth of Narcissus and hypnotically reflect our own humanity back at us.