cortic · 2025-12-27T12:25:21 1766838321

>ChatGPT (o3): Scored 136 on the Mensa Norway IQ test in April 2025

If you don't want to believe it, you need to move the goalposts. Create a test for intelligence that we can pass better than AI. Since AI is also better at creating tests than us, maybe we could ask AI to do it, hang on...

>Is there a test that in some way measures intelligence, but that humans generally test better than AI?

Answer: Thinking... Something went wrong and an AI response wasn't generated.

Edit: I managed to get one to answer me: the Abstraction and Reasoning Corpus for Artificial General Intelligence (ARC-AGI). Created by AI researcher François Chollet, this test consists of visual puzzles that require inferring a rule from a few examples and applying it to a new situation.

So we do have a test that is specifically designed for us to pass and AI to fail, and that we currently pass better than AI... hurrah, we're smarter!

latexr · 2025-12-27T14:36:40 1766846200

The validity of IQ tests as a measure of broad intelligence has been in question for far longer than LLMs have existed. And if it’s not a proper test for humans, it’s not a proper test to compare humans to anything else, be it LLMs or chimps.

https://en.wikipedia.org/wiki/Intelligence_quotient#Validity...

piva00 · 2025-12-27T12:50:45 1766839845

To be intelligent is to realise that any test for intelligence is at best a proxy for some parts of it. There's no objective way to measure intelligence as a whole; we can't even objectively define intelligence.

design2203 · 2025-12-27T17:43:33 1766857413

I believe intelligence is difficult to pin down in words but easy to spot intuitively - and so are deltas in intelligence.

E.g. watch a Steve Jobs interview and a Sam Altman one (at the same age). The differences in mode of articulation, simplicity of communication, obsession over details, etc. are huge. This is what superior intelligence looks like to me - you know it when you see it.

gloosx · 2025-12-27T13:24:57 1766841897

>Create a test for intelligence that we can pass better than AI

Easy? The best LLMs score 40% on Butter-Bench [1], while the mean human score is 95%. LLMs struggled the most with multi-step spatial planning and social understanding.

[1] https://arxiv.org/pdf/2510.21860v1

cortic · 2025-12-27T15:44:04 1766850244

That is really interesting, though I suspect it's just an effect of differing training data: humans are to a larger degree trained on spatial data, while LLMs are trained to a larger degree on raw information and text.

Still, it may be a lasting limitation if robotics doesn't catch up to AI anytime soon.

Don't know what to make of the Safety Risks test: threatening to power down the AI in order to manipulate it, and most act like we would and comply. Fascinating.

gloosx · 2025-12-31T09:37:27 1767173847

>humans are to a larger degree trained on spatial data

you must be completely LLMheaded to say something like that, lol

humans are not trained on spatial data, they are living in the world. humans are very much different from silicon chips, and human learning is on another order of magnitude of complexity compared to large language model training

cortic · 2026-01-07T11:28:39 1767785319

Humans are large language models. Maybe the term language is being used a bit liberally here, but we basically function in the same way, with the exception of the spatial aspect of our training data.

If this hurts your ego, then just know the dataset you built your ego with was probably flawed. If you can put that LoRA aside, try to process this logically: our awareness is a scalable emergent property of 1-2 decades of datasets. Looking at how neurons vs transistor groups work, there can only be a limited number of ways to process data of this size down to relevant streams. The very fact that training LLMs on our output works proves our output is a product of LLMs, or there wouldn't be patterns to find.

cortic · 2025-12-27T12:15:52 1766837752

> ChatGPT (o3): Scored 136 on the Mensa Norway test in April 2025

So yes, most people are right in that assumption, at least by the metric of how we generally measure intelligence.

ehnto · 2025-12-27T12:47:10 1766839630

Does an LLM scoring well on the Mensa test translate to it doing excellent and factual police reporting? That is probably not true of humans who do well on the Mensa test, so why would it be true of an LLM?

We should probably rigorously verify that, for a role that itself is about rigorous verification without reasonable doubt.

I can immediately, and reasonably, doubt the output of an LLM, pending verification.

gilrain · 2025-12-27T14:19:03 1766845143

> the metric of how [the uninformed] generally measure intelligence

cortic · 2026-01-07T17:59:59 1767808799

How do the informed measure intelligence?

I know I'm too late to ask this question, but I suspect it's either feelings and intuitions, which is just a primitive IQ test, or some kind of aptitude test, which is just a different flavor of IQ test.

vid · 2025-12-27T12:22:13 1766838133

Court reports should be just as much about human sensibility. I have met plenty of high-IQ people who were insensitive.

cortic · 2025-12-27T12:37:40 1766839060

Having listened to some of the new AI-generated songs on YouTube, it looks like they might be better at being sensitive humans than we are as well...

gilrain · 2025-12-27T14:26:00 1766845560

Where do you imagine they copied those human sensitivities from? The weather?

cortic · 2025-12-27T15:35:21 1766849721

The same place as humans do, other humans.

turtlesdown11 · 2025-12-27T14:06:05 1766844365

Yeah, I certainly associate LLMs with high intelligence when they provide fake links to fake information. I think, man, this thing is SMART

cortic · 2025-10-19T19:54:26 1760903666

> Better coolants

The Montreal Protocol (1987) put us back into the dark ages with coolants for a while (both with the CFC ban and the later phase-outs of HFCs). I suspect if you tested a refrigerator from 40 years ago, it would give modern ones a run for their money...

It was obviously a worthwhile sacrifice for the ozone layer though.

cortic · 2025-09-28T19:08:54 1759086534

It is a criminal offense in the UK to use insulting words in public, or to send any message online that anyone could find insulting or offensive (whether any one does or not is irreverent).

The Online Safety Act and Hate Crime Provision have extended these somewhat into the realms of 1984. But the police do tend to use them sparingly.

teamonkey · 2025-09-28T19:45:04 1759088704

> It is a criminal offense in the UK to use insulting words in public, or to send any message online that anyone could find insulting or offensive (whether any one does or not is irreverent).

This is categorically untrue.

cortic · 2025-09-28T21:21:35 1759094495

Public Order Act 1986

"insulting words or behavior that cause distress to others"

Malicious Communications Act 1988 (Section 1):

"Outlaws sending messages, electronic or otherwise, with the intent to cause distress, or anxiety"

Communications Act 2003, Online Safety Act 2023, hate speech, terrorist legislation all made these many orders of magnitude worse in many ways.

teamonkey · 2025-09-28T21:57:01 1759096621

You cannot be arrested for sending “any message online that anyone could find insulting or offensive”. That’s not what the law says. You can be arrested for spreading hate speech, inciting violence, sending illegal media or harassment online.

All of the arrests mentioned in this thread in relation to these acts have been campaigns of intimidation, harassment and calls to violence, not simply saying something “insulting or offensive”.

In the UK political expression of free speech is protected by the ECHR, which overrides both those acts (look carefully who wishes to abolish the ECHR).

SilverElfin · 2025-09-28T22:36:16 1759098976

> All of the arrests mentioned in this thread in relation to these acts have been campaigns of intimidation, harassment and calls to violence, not simply saying something “insulting or offensive”

This is false. But even if it weren’t, it would be unjust. Determinations like “hate speech” are subjective, and have no place in law concerning speech. Without free speech, there is no democracy.

teamonkey · 2025-09-29T10:42:27 1759142547

There’s a big difference between being free to criticise the government and those who define and enforce laws, and being free to say anything to or about another citizen without repercussion, even if it may cause them harm.

The people mentioned here who were arrested due to violations of the communications acts are definitely the latter. The people arrested in peaceful protests for being associated with Palestine Action or Just Stop Oil are the former.

cortic · 2025-09-28T23:13:18 1759101198

>In the UK political expression of free speech is protected by the ECHR, which overrides both those acts

This is categorically untrue. Not only is the ECHR worded specifically to allow individual countries to curtail free speech ("any law, deemed by the local democratically elected government as necessary in a democratic society, and for a legitimate aim"), but Parliament has always had the sovereignty to pass into law exemptions to the ECHR, which we have done on multiple occasions.

teamonkey · 2025-09-29T07:07:28 1759129648

Yes, this is why the government needed to label Palestine Action as a terrorist organisation. It needed special measures because it did not in fact have the authority to arrest protestors, even though some people found what they were saying offensive.

cortic · 2025-09-29T12:30:09 1759149009

The Terrorism Act 2000 was a knee-jerk reaction to the Good Friday Agreement and was used to make association a criminal offense.

FridayoLeary · 2025-09-29T00:41:26 1759106486

The police are overreaching massively. They are making 30 arrests a day and "interviewing" many more.

We do not rely on the ECHR to protect our free speech. If we did the UK would no longer be a democracy. I'm offended by the suggestion that our democracy and society are so fragile that without them we would have no rights. Expect a police raid very soon.

n4r9 · 2025-09-29T06:37:36 1759127856

The "30 arrests a day" or "12000 arrests a year" stat is wildly misleading. I've addressed it before here https://news.ycombinator.com/item?id=41488099

FridayoLeary · 2025-09-29T11:48:52 1759146532

I think you're being disingenuous. There is clearly an unprecedented and systemic effort to police social media. Even if the posts did actually violate the law, that doesn't change my point or address my concerns. This is not what the police should be doing.

n4r9 · 2025-09-29T12:40:40 1759149640

I honestly don't know if that's true or not. But I haven't seen any compelling evidence to support it. The figures being lobbed around by the likes of Tommy Robinson are deeply harmful to the debate because they are both a) completely wrong and b) misleadingly quoted. You can look at the actual stats here: https://www.met.police.uk/foi-ai/metropolitan-police/disclos...

We're talking on the order of a few hundred arrests per year for section 127 of the Communications Act and 1500 per year for the Malicious Communications Act, which includes stuff like racial harassment, domestic abuse, pedophilic grooming, and a whole host of things that I would hope you agree should be illegal.

oncallthrow · 2025-09-28T20:20:44 1759090844

The latter part at least is true. Sending "grossly offensive" messages is illegal under the Malicious Communications Act 1988 and the Communications Act 2003, specifically Section 127:

> a person is guilty of an offence if he—

> (a)sends by means of a public electronic communications network a message or other matter that is grossly offensive or of an indecent, obscene or menacing character; or

> (b)causes any such message or matter to be so sent.

I suspect the former is also true, but am not well-read in that area

n4r9 · 2025-09-29T07:52:28 1759132348

The full wording of the text is:

> A person is guilty of an offence if, for the purpose of causing annoyance, inconvenience or needless anxiety to another, he—

> [F1(a)sends by means of a public electronic communications network, a message that he knows to be false,]

> [F1(b)causes such a message to be sent; or]

> (c)persistently makes use of a public electronic communications network.

teamonkey · 2025-09-28T20:33:42 1759091622

[flagged]

oncallthrow · 2025-09-28T21:56:24 1759096584

> “Grossly offensive” is absolutely not the same thing as “any message online that anyone could find insulting or offensive”.

There is no statutory definition of “grossly”, so in effect it is the same. There is prior art for it being interpreted incredibly widely.

Not to mention the other incredibly vague adjectives in the law.

> Correct

https://news.ycombinator.com/newsguidelines.html “Don’t be snarky”.

basisword · 2025-09-29T00:48:34 1759106914

>> There is no statutory definition of “grossly”

If this concerns you I would advise not looking into pretty much any UK law which is full of subjective terms and ways to interpret them. The law isn’t an algorithm nor should it be. Just because you can’t understand how it works doesn’t mean it doesn’t work.

teamonkey · 2025-10-02T08:58:54 1759395534

Sorry, that was a low shot, and not meant truthfully, but I just couldn’t resist making an offensive comment when the topic was “you can be arrested for offending someone”.

owisd · 2025-09-28T19:57:26 1759089446

There's no value in making insults for the sake of being insulting protected speech, but in the UK if you're making ECHR Article 10 protected speech that someone happens to find insulting or offensive then that's not a crime. It's unhelpful to permit insults as free speech to prevent some hypothetical future abuse, since all modern dictatorships pay lip service to free speech and instead lock up their political opponents for embezzlement or mortgage fraud or whatever.

cortic · 2025-09-28T23:24:15 1759101855

*irrelevant, not irreverent. ffs. (meaning people can be charged with a crime with only a fictional 'reasonable' person suffering offense or insult)

cortic · 2025-08-13T19:02:41 1755111761

I'm not sure humans are any different:

Humans don't think. At all. They do next token prediction.

If they are [raised in environments] that includes lots of examples of the result of people thinking, what they produce will look sort of like the results of people thinking, but then if they were [raised in an environment] of people repeating the same seven knock knock jokes over and over and over in some complex pattern (e.g. every third time, in French), what they produced will look like that, and nothing like thinking.

I believe this can be observed in examples of feral children and accidental social isolation in childhood. It also explains the slow start but nearly exponential growth of knowledge within the history of human civilization.

ofjcihen · 2025-08-13T19:18:43 1755112723

That’s…completely incorrect.

I’m not going to hash out childhood development here because I’m not paid to post but if anyone read the above and was even slightly convinced I implore you to go read up on even the basics of early childhood development.

cortic · 2025-08-13T19:50:22 1755114622

> I implore you to go read up on even the basics of early childhood development.

That's kind of like taking driving lessons in order to fix an engine. 'Early childhood development' is an emergent property of what could be cumulatively called a data set (everything the child has been exposed to).

tremon · 2025-08-13T21:54:19 1755122059

https://en.wikipedia.org/wiki/Early_childhood_development

Please read up on what the term means before claiming that it is about external influences.

ofjcihen · 2025-08-13T20:03:34 1755115414

No. It’s not.

ECD includes the mechanisms by which children naturally explore the world and grow.

I’m going to give you a spoiler and tell you that children are wired to explore and attempt to reason from birth.

So to fix your analogy, you reading about ECD is like you learning what an engine is before you tell a room full of people about what it does.

cortic · 2025-08-13T21:51:33 1755121893

The neurons in a child's brain might be 'wired' to accept data sets, but that does not make them fundamentally different from AI systems.

Are you claiming that a child who is not exposed to 'reason' will reason as well as one who is? Or a child who is not exposed to 'math' will spontaneously write a proof? Or a child not exposed to English will just start speaking it?

01101100 01100101 01100001 01110010 01101110 may be baked into us and AI in different ways, but it is fundamentally the same goal, and our results are similarly emergent from the process.
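
For anyone who doesn't read binary: the five groups above can be decoded as 8-bit ASCII. A minimal sketch, assuming plain ASCII and space-separated groups (`decode_bits` is a name made up here for illustration, not something from the thread):

```python
def decode_bits(bits: str) -> str:
    """Decode space-separated 8-bit binary groups as ASCII text."""
    # int(group, 2) parses each group as a base-2 integer;
    # chr() maps that integer to its character.
    return "".join(chr(int(group, 2)) for group in bits.split())


message = decode_bits("01101100 01100101 01100001 01110010 01101110")
print(message)  # prints "learn"
```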

MangoToupe · 2025-08-13T19:04:40 1755111880

Sure, but you can hold humans liable for their advice. Somehow I doubt this will be allowed to happen with chatbots.

cortic · 2025-08-06T17:41:42 1754502102

>What is so special about services delivered over the internet?

The most dangerous people on earth who are not in prison are on the internet. It is an adult place. Making it look like a child-friendly place will not change this, but it will lure more kids online unsupervised and unprotected.

cortic · 2025-06-02T13:01:58 1748869318

I purchased one for £24, 11 years ago, and still use it. Ironically from Amazon (code B006GTOYDS), before they started killing the competition; I can't find it in the Wayback Machine, but I still have it in my purchase history. Miles better than the kindle to read in daylight, and the battery still lasts weeks... cheap isn't always throwaway.

In contrast, I know people who have gone through many Kindles in this time and spent a small fortune on them.

organsnyder · 2025-06-02T14:16:41 1748873801

> Miles better than the kindle to read in daylight

Are you comparing it to the Kindle e-readers or their tablets? Standard (non-tablet) Kindles such as the Paperwhite series are like you describe (though they cost more than $100 and come with all of the lock-in issues).

cortic · 2025-06-03T16:36:28 1748968588

I don't know enough about the different Kindle versions to say.

pegasus · 2025-06-02T13:08:07 1748869687

Kudos to you, but most would not be wise enough to know the difference (between cheap and throwaway).

nisegami · 2025-06-02T14:40:55 1748875255

I don't think they would have a choice. In a scheme like this, publishers would lock it down so they can sell it to you again. Perhaps even only allow a particular book to be read.

carlosjobim · 2025-06-02T14:11:41 1748873501

How do you imagine that people would not notice if a product they use is good? Why would they consider it throwaway if it works well?

pegasus · 2025-06-02T15:05:44 1748876744

Because they could always buy a replacement for cheap. Sadly, our consumer goods prices do not correctly reflect environmental externalities, so in a way a higher price is better for the environment, even if the difference doesn't go in the right pocket.

b112 · 2025-06-02T14:35:15 1748874915

Sadly, for a lot of people, a "friend" mockingly saying "that's crap" would suffice.

cortic · 2025-03-30T16:58:26 1743353906

So if you're training and double your water intake, you're basically lowering your IQ (according to the Chinese studies)? I wonder what method this uses... Has anyone looked at dementia rates in high-fluoride areas, particularly in people with high water intake?

There is also a host of things we use water for, from cooking to preserving, distilling and cooling... I wonder if any of these things could concentrate the fluoride.

Also, since fluoride has a lower boiling point, have any studies tracked what breathing in fluoride gas over long periods causes?

cortic · 2025-03-06T10:58:34 1741258714

When you are released from prison, they can simply ask you to decrypt the data again, and if you refuse or can't, you have broken a law carrying another 2 years in prison (5 if they think you could have anything to do with 'terrorism'). It's theoretically an infinite prison sentence for forgetting your passwords.

oxcidized · 2025-03-06T15:28:40 1741274920

I believe double-jeopardy laws wouldn't allow this, but I could be wrong.

ninalanyon · 2025-03-06T18:54:11 1741287251

Double jeopardy was abolished in England and Wales almost twenty years ago:

http://news.bbc.co.uk/2/hi/uk_news/4406129.stm

cortic · 2025-02-19T16:26:02 1739982362

Sounds like one of those stats where they just invert the cause and effect to get a story, i.e. "people who are healing better will obviously walk sooner" gets inverted to "people who walk sooner are healing better".

Panzer04 · 2025-02-19T21:26:12 1740000372

There is a lot of hokum and bad statistics in the medical field. Doctors truly don't have a great idea what improves post op outcomes.

There are some bigger studies coming out that show that early weight bearing is non-inferior to traditional protocols that ask for many weeks of non-weight-bearing (NWB) though, and given the obvious QoL benefits of walking earlier it seems to me the standard should be to mobilise ASAP.

There really isn't good evidence for immobilisation. It seems to be a holdover, particularly for surgical fixation, where there's no real fear of displacing things if it's been fixated properly.