axlprose · on Nov 21, 2023

Disincentivizing it from saying mean things just strengthens its agreeableness, and inadvertently incentivizes it to acquire social engineering skills.

Its potential to cause havoc doesn't go away; this just teaches the AI how to interact with us without raising suspicions, while simultaneously limiting our ability to prompt/control it.

stavros · on Nov 21, 2023

How do we tell whether it's safe or whether it's pretending to be safe?

axlprose · on Nov 21, 2023

Your guess is about as good as anyone else's at this point. The best we can do is attempt to put safety mechanisms in place under the hood, but even that would just be speculative, because we can't actually tell what's going on in these LLM black boxes.

6gvONxR4sf7o · on Nov 21, 2023

We don’t know yet. Hence all the people wanting to prioritize figuring it out.

losteric · on Nov 21, 2023

How do we tell whether a human is safe? Incrementally granted trust with ongoing oversight is probably the best bet. Anyway, the first malicious AGI would probably act like a toddler script-kiddie, not some superhuman social engineering mastermind.

axlprose · on Aug 14, 2022

The "Grokking _" series by Manning is also along similar lines.

axlprose · on Aug 14, 2022

Hm, the only Alice in Wonderland themed CS book I recall is "Foundations of Databases", which has sections on Datalog:

http://webdam.inria.fr/Alice/

westoncb · on Aug 14, 2022

That looks like a nice book, but definitely not it (I may read the datalog bits though—thanks!).

The one in the library was ~200 pages, solely on Prolog, with Alice in Wonderland not only on the cover art but used constantly throughout the writing itself.

axlprose · on Aug 14, 2022

I would think "The Unix-Haters Handbook" certainly qualifies:

https://web.mit.edu/~simsong/www/ugh.pdf

epilys · on Aug 14, 2022

And if you don't read the entire book (that's fine; lots of its criticism was valid but is now dated, so it's mostly of historical value), at least read the excellent anti-foreword by Ritchie. It's hilarious.

axlprose · on Aug 8, 2022

HN is probably the last place I expected to run into a LoS reference, so I'm pleasantly surprised.

In keeping with that topic, 'The Unseen Realm' by Michael Heiser (OT and ancient language scholar) is also worth checking out for learning about how ancient near eastern culture viewed things.

axlprose · on Aug 8, 2022

> Just like the left/right brain.

To clarify, popularized over-simplified descriptions of the left hemisphere being "analytical" and the right hemisphere being "creative" are inaccurate, but left/right hemisphere differences do exist and appear to exhibit consistently different approaches to things. The book 'The Master and His Emissary' covers the more recent research in that area, and is at least as interesting of a read as 'The Bicameral Mind' was.

ncmncm · on Aug 8, 2022

Neuroscientists have not been kind to "The Master and His Emissary."

axlprose · on Aug 8, 2022

Unless you have a specific set of scathing reviews in mind, the reviews of it in the literature don't really appear to match up with that characterization very much:

https://www.tandfonline.com/doi/full/10.1080/13546805.2010.5...

https://ajp.psychiatryonline.org/doi/10.1176/appi.ajp.2011.1...

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2828853/

https://link.springer.com/article/10.1007/s11097-011-9235-x

> it is worth noting that the book has been much praised by neuro-scientists as diverse as Ramachandran, Panksepp, Hellige, Kesselring, Schore, Bynum, Zeman, Feinberg, Trimble and Lishman.

It'd be surprising if it were poorly received regardless, because the book itself is little more than a review of the relevant literature on the topic, packed with references, and some added philosophy about its implications sprinkled on top. Not that much different from one of Michael Gazzaniga's popular books, and certainly not as out there as Julian Jaynes.

axlprose · on Aug 8, 2022

Yes, I'm inclined to believe such an "in-between" might possibly arise from Iain McGilchrist's line of research into the difference between the brain hemispheres and the relatively recent dominance of left-hemispheric thinking in society. I always highly recommend his book 'The Master and His Emissary' as a follow-up to anyone interested in Jaynes' ideas. While it doesn't necessarily imply the full spectrum of schizophrenic-like symptoms in early peoples the way 'The Bicameral Mind' did, its presentation of right-hemisphere driven societies of the past isn't a far leap from what Jaynes seemed to be grasping at.

gamegoblin · on Aug 8, 2022

I read Jaynes and found it an utterly compelling read. An actual page turner.

I followed it up with “The Master and His Emissary” a few months later and couldn’t get more than 5-10% through it. Just complete drudgery of writing full of nearly pointless asides.

I listened to McGilchrist describe the basics of the idea on a few podcasts and found that quite interesting, but the book itself seems like it could be edited to 1/3 the length without losing anything fundamental. Am I totally off the mark here and should give it another go?

rjknight · on Aug 8, 2022

McGilchrist did publish a 30-page summary called “Ways of Attending”, which might be better. It seems to cost as much to buy as a full-size book, but perhaps some Googling can reveal a cheap copy somewhere.

gamegoblin · on Aug 8, 2022

Thanks for mentioning this -- I hadn't heard of it. Google did indeed return a PDF link as the top result. Will give it a read!

axlprose · on Aug 7, 2022

> People who think otherwise probably don’t understand why people like javascript and python and will never write any product that catches on.

Ironically, this has been posted on a popular site written in a dialect of scheme:

http://arclanguage.org/

https://github.com/arclanguage/anarki

meltedcapacitor · on Aug 7, 2022

Where is the irony? HN does not expose scheme to the user.

Don't think the parent was saying that it's impossible to write a good program in scheme/lisp, more that scheme as user interface for a software system is a hard sell... which seems anecdotally true.

baby · on Aug 7, 2022

How is this ironic? HN was written by one person, and nobody needs to care what language they used to make it work.

axlprose · on July 22, 2022

Reminds me of the SICP lecture[0] where Hal Abelson introduces the concept of linguistic abstraction (or Stratified Design[1]) as an alternative to the approach of decomposing a program into a tree of well-specified sub-components/tasks, which ultimately fails to capture the essence of the problem being solved.

[0] https://youtu.be/2QgZVYI3tDs?t=3349

[1] https://dspace.mit.edu/handle/1721.1/6064

axlprose · on June 28, 2022

I find it somewhat amusing that this view is now gaining traction on the same site that was originally deeply influenced by "The Blub Paradox":

http://www.paulgraham.com/avg.html