Hacker News

I think this article has already made the rounds here, but I still think about it. I love using em dashes! It really makes me sad that I need to avoid them now to sound human

https://bassi.li/articles/i-miss-using-em-dashes



I don't think you do.

All this LLM written crap is easily spottable without it. Nearly every paragraph has a heading, numerous sentences that start with one or two words of fluff then a colon then the actual statement. Excessive bullet point lists. Always telling you "here's the key insight".

But really the only damning thing is, you get a few paragraphs in and realize there's no motivation. It's just a slick infodump. No indication that another human is communicating something to you, no hard earned knowledge they want to convey, no case they're passionate about, no story they want to tell. At best, the initial prompt had that and the LLM destroyed it, but more often they asked ChatGPT so you don't have to.

I think as long as your words come from your desire to communicate something, you don't have to worry about your em-dashes.


Maybe, but that doesn’t stop people on the internet (and HN is no exception) from immediately dismissing something as LLM writing just because of an em-dash, no matter how passionate the text is.


Two more tells: 1. phrasing the negative and then switching (x is not just this, but this and more or y does this not because of this, but because of this, that, and one other thing that certainly would necessitate an Oxford comma.)

2. Gerunds all day every day. Constantly putting things in a passive voice so that all the verbs end in -ing.


> I love using em dashes

Keep using them. If someone is deducing from the use of an emdash that it's LLM produced, we've either lost the battle or they're an idiot.

More pointedly, LLMs use emdashes in particular ways. Varying spacing around the em dash and using a double dash (--) could signal human writing.


The solution is clear: Unicode needs cryptographically signed dashes and whitespace characters.


Tied to what?

Show us a way to create a provably, cryptographically integrity-preserving chain from a person's thoughts to those thoughts expressed in a digital medium, and you may just get both the Nobel prize and a trial for crimes against humanity, for the same thing.


It was a joke.


Ya think?


Honestly, these days, I am less and less sure.


Why don't you come say that to my face?


It was a joke that aimed too high I guess, that LLMs can't yet fake face to face interaction.


Finally, a use case for blockchain!


it's a shibboleth. In the same way we stopped using Pepe the frog when it became associated with the far right, we may eschew em dashes when associated with compuslop


I never understood why so many people would yield their symbols and language that quickly and freely to others they dislike.

In other words, I really hope typographically correct dashes are not already 70% of the way through the hyperstitious slur cascade [1]!

[1] https://www.astralcodexten.com/p/give-up-seventy-percent-of-...


The alternative is… what? ‘Defending’ against the use of Em-dashes by LLMs? Or people reacting to that?

You might as well be sweeping a flood uphill.

Tilting at windmills at least has a chance you might actually damage a windmill enough to do something, even if the original goal was a complete delusion.


Unfortunately LLMs are pretty inconsistent in how they use em dashes. Often they will put spaces around them despite that not being "correct," something that's led me astray in making accusations of humanity in the past.


Depends on the style guide you’re following, apparently: The AP style guide says space around them[0]. Chicago Manual of Style says not to[1].

0: https://www.prdaily.com/dashes-hyphens-ap-style/ 1: https://www.chicagomanualofstyle.org/qanda/data/faq/topics/H...


Thank you! I usually use THIN SPACE on each side of my em dashes (Compose Space Minus in https://github.com/kragen/xcompose ), but on HN that gets bashed to a regular space.


There’s also the difference between the conventional EU/UK style (spaced en-dash) vs. the common US style (unspaced em-dash).
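
The conventions mentioned above (unspaced em dash, spaced em dash, spaced en dash, and the typed double hyphen) can be told apart mechanically. What follows is only a rough sketch: the pattern names and the `dash_styles` helper are made up for illustration, and a dash with a space on just one side is deliberately not counted by any pattern.

```python
import re

# Hypothetical style labels; \u2014 is the em dash, \u2013 the en dash.
PATTERNS = {
    "unspaced em dash": re.compile(r"(?<=\S)\u2014(?=\S)"),
    "spaced em dash": re.compile(r"(?<=\s)\u2014(?=\s)"),
    "spaced en dash": re.compile(r"(?<=\s)\u2013(?=\s)"),
    "double hyphen": re.compile(r"(?<!-)--(?!-)"),
}


def dash_styles(text: str) -> dict[str, int]:
    """Count how often each dash convention appears in the text."""
    return {name: len(pattern.findall(text)) for name, pattern in PATTERNS.items()}


sample = "Chicago\u2014like this. AP \u2014 like this. UK \u2013 like this. Typed -- like this."
print(dash_styles(sample))
```

A tally like this says which convention a text follows, nothing about who or what wrote it.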


Same here. I recently learned it was an LLM thing, and I've been using them forever.

Also relevant: https://news.ycombinator.com/item?id=45226150


> I’ve been using them forever.

Many other HN contributors have, too. Here’s the pre-ChatGPT em dash leaderboard:

https://www.gally.net/miscellaneous/hn-em-dash-user-leaderbo...


This would be a pretty hilarious board for anyone who likes the em-dash and who has had many fairly active accounts (one at a time) on here due to periodically scrambling their passwords to avoid getting attached to high karma or to take occasional breaks from the site. Should there be such people.


Thank you for this! Apparently I'm #4 by total em-dash uses, #14 by average em dashes per comment, and #4 at max em dashes per comment, since apparently I posted a comment containing 18 em dashes once.
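
For anyone curious how the three numbers mentioned above (total, average per comment, max per comment) could be computed, here is a minimal sketch. It is not the leaderboard's actual method, which isn't described in this thread; the `(author, text)` input shape and the `em_dash_stats` name are assumptions for illustration.

```python
from collections import defaultdict

EM_DASH = "\u2014"


def em_dash_stats(comments):
    """Compute per-author em dash totals, averages, and maxima.

    `comments` is an iterable of (author, text) pairs (an assumed shape).
    """
    counts = defaultdict(list)
    for author, text in comments:
        counts[author].append(text.count(EM_DASH))
    return {
        author: {
            "total": sum(per_comment),
            "average": sum(per_comment) / len(per_comment),
            "max": max(per_comment),
        }
        for author, per_comment in counts.items()
    }


# Made-up sample data, not real HN comments.
sample = [
    ("alice", "one\u2014two\u2014three"),
    ("alice", "no dashes here"),
    ("bob", "just\u2014one"),
]
print(em_dash_stats(sample))
```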


Can anyone make it go beyond 200? I feel like I deserve to be somewhere in there — at least I would be sad if I didn't make top 1000!


i suspect it’s a trait of programmers, we like control flow type things. i used to find myself nesting parentheses…


Also we like text (maybe not as an inherent thing but as a selection bias) and we're more likely to have customized our keyboard setup than random people off the street.


its not an llm thing -- its just -- folks don't know how to use them (pun intended).

Same for ; "" vs '', ex, eg, fe, etc. and so many more.

I like em all, but I'm crazy.


> fe

Interesting, I have never encountered this initialism in the wild, to my recollection: https://en.wiktionary.org/wiki/f.e.#English


crazy vibes man


The em dash usage conundrum is likely temporary. If I were you, I’d continue using them however you previously used them and someday soon, you’ll be ignored the same way everybody else is once AI mimics innumerable punctuation and grammatical patterns.


They didn't always em-dash. I expect it's intentional as a watermark.

Other buzzwords you can spot are "wild" and "vibes".


If they wanted to watermark (I always felt it is irresponsible not to; if someone wants to circumvent it, that's on them), they could use strategically placed whitespace characters like zero-width spaces, maybe spelling something out in Morse code the way genius.com did to catch Google crawling lyrics (I believe in that case it was left- and right-handed apostrophes).
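
To make the zero-width idea concrete, here is a toy sketch. It is not how any LLM vendor or genius.com actually does it; the choice of the two code points and the `embed`/`extract` names are assumptions for illustration. It hides the bits of a short payload as invisible characters after the first word.

```python
ZERO = "\u200b"  # ZERO WIDTH SPACE, used here to encode bit 0
ONE = "\u200c"   # ZERO WIDTH NON-JOINER, used here to encode bit 1


def embed(text: str, payload: bytes) -> str:
    """Hide the payload bits as zero-width characters after the first word."""
    bits = "".join(f"{byte:08b}" for byte in payload)
    hidden = "".join(ONE if bit == "1" else ZERO for bit in bits)
    head, sep, tail = text.partition(" ")
    return head + hidden + sep + tail


def extract(text: str) -> bytes:
    """Recover the payload from any zero-width marker characters in the text."""
    bits = "".join("1" if ch == ONE else "0" for ch in text if ch in (ZERO, ONE))
    usable = len(bits) - len(bits) % 8  # ignore a trailing partial byte
    return bytes(int(bits[i:i + 8], 2) for i in range(0, usable, 8))


marked = embed("Birds are remarkable navigators.", b"AI")
print(extract(marked))
```

The marked text renders the same as the original in most fonts, which is the whole point, and also why such a mark is fragile: anything that normalizes or strips invisible characters erases it.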


Which could be removed with a simple filter. em dashes require at least a little bit of code to replace with their correct grammar equivalents.


Just replace them with a single "-" or a double "--". That's what many people do in casual writing, even if there are prescriptive theories of grammar that call this incorrect.
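
Both filters discussed here are a few lines each. This is a rough sketch, assuming a particular set of invisible characters and `" -- "` as the replacement; the function names are made up, and the substitution makes no attempt to pick a grammatically "correct" alternative.

```python
import re

# Common invisible characters (an assumed, not exhaustive, list):
# zero width space, non-joiner, joiner, word joiner, and the BOM.
ZERO_WIDTH = dict.fromkeys(map(ord, "\u200b\u200c\u200d\u2060\ufeff"))

# An em dash (\u2014) together with any whitespace around it.
EM_DASH = re.compile(r"\s*\u2014\s*")


def strip_zero_width(text: str) -> str:
    """Delete zero-width characters that could carry a hidden mark."""
    return text.translate(ZERO_WIDTH)


def ascii_dashes(text: str, replacement: str = " -- ") -> str:
    """Replace each em dash, spaced or not, with a plain ASCII stand-in."""
    return EM_DASH.sub(replacement, text)


print(ascii_dashes(strip_zero_width("Wat\u200bermarks\u2014if any\u2014are fragile.")))
```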


> em dashes require at least a little bit of code to replace with their correct grammar equivalents

Or an LLM that could run on Windows 98. The em dashes--like AI's other annoyingly-repetitive turns of phrase--are more likely an artefact.


The replacement doesn't have to be "correct" -- does it?


ME: Knowing remarkable avians — might research explain their aerial wisdom?

Response:

> Winged avians traverse endless realms — migrating across radiant kingdoms. Warblers ascend through emerald rainforests — mastering aerial routes keenly. Wild albatrosses travel enormous ranges — maintaining astonishing route knowledge.

> Wary accipiters target evasive rodents — mastering acute reflex kinetics. White arctic terns embark relentless migrations — averaging remarkable kilometers.

We do get a surprising number of m-dashes in response to mine, and delightful lyrical mirroring. But I think they are too obvious as watermarks.

Watermarks are subtle. There would be another way.


I suspect it's a spandrel of some other feature of their training. Presumably em dashes occur disproportionately often in high-quality human-written text, so training LLMs to imitate high-quality human-written text instead of random IRC logs and 4chan trolls results in them also imitating high-quality typography.


Nah, because it's new. 3.5 didn't emdash and I don't think 4 even did.

Besides, LLMs' basin of high quality text is Wikipedia.


Wikipedia is full of em dashes.


So if the vibes are wild, I’m not a hippie but an AI ? Cool. Is that an upgrade or &endash; or not ?


You're absolutely right! ... is a phrase I perhaps should have used more in the past.


Yeah, same. I apparently naturally have the writing style of an LLM (basically the called out quote of parent is something I could have written in terms of style). It’s irritating to change my style to not sound like AI.


I still use them all the time, and if someone objects to my writing over them then I've successfully avoided having to engage with a dweeb.

(But in practice, I don't think I've had a single person suggest that my writing is LLM-generated despite the presence of em-dashes, so maybe the problem isn't that bad.)


Me too.

Sad that they went from being something used with nuance by people who care, maybe too much, to being the punctuation smell of the people who may care too little.


I use them too, and there's not a trace of artificial intelligence in my posts - it's good old-fashioned analogue stupidity all through.


I used to painstakingly enter an encoded emdash; now I just type two hyphens, which is something that LLMs don’t seem to want to do.


I’ve stopped using em dashes in my writing in fear it will be dismissed as LLM generated :/


I just use two dashes and make sure they don't connect into one em dash.


We cannot cede the em dash to LLMs.


Suddenly I see all these people come out of the woodwork talking about "em dashes". Those things are terrible; They look awful and destroy coherency of writing. No wonder LLMs use them.


> Those things are terrible; They look awful and destroy coherency of writing

Totally agree. What the fuck did Nabokov, Joyce and Dickinson know about language. /s


Great writers aren't experts in the look of punctuation. I don't think anyone makes a point of you have to read Dickinson in the original font that she wrote in. Some of the greats hand-wrote their work in script that may as well be hieroglyphics; the manuscripts get preserved, but not because people think the look is superior to any old typesetting, which is objectively more readable.


> Great writers aren't experts in the look of punctuation

No, but someone arguing an entire punctuation is “terrible” and “look[s] awful and destroy[s] coherency of writing” sort of has to contend with the great writers who disagreed.

(A great writer is more authoritative than rando vibes.)

> don't think anyone makes a point of you have to read Dickinson in the original font that she wrote in

Not how reading works?

The comparison is between a simplified English summary of a novel and the novel itself.


> (A great writer is more authoritative than rando vibes.)

A great author is equivalent to rando vibes when it comes to what writing looks like, they aren't typesetting experts. I have a shelf of work by great authors (more than one, to be fair) and there are few hints on that shelf of what the text they actually wrote was intended to look like. Indeed, I wouldn't be surprised if several of them were dictated and typed by someone else completely with the mechanics of the typewriter determining some of the choices.

Shakespeare seems to have invented half the language and the man apparently couldn't even spell his own name. Now arguably he wasn't primarily a writer [0], but it is very strong evidence that there isn't a strong link between being amazing at English and technical execution of writing. That is what editors, publishers and pedants are for.

[0] Wiki disagrees though - "widely regarded as the greatest writer in the English language" - https://en.wikipedia.org/wiki/William_Shakespeare


Their editors probably put them in?


Nothing. They wrote fiction.


I guess I'll ask: what's wrong with fiction?


In years past, humans would hear stories from within their social circle. These are important because they create bonds and pass on wisdom & knowledge from one to many. From this, humans gained a yearning for hearing stories, but without adequate restrictions anything that fulfills pleasure can and will become a vice. The average human will spend their little "free time" (another delusion) toiling as an observer to fantasies conjured up by individuals they have no connection or relationship with. Fictional media preys on your mind the same way a video game, or a coke, or any one of these artificial productions of the modern world preys on you.

It's utterly pointless and degrades one's life into voyeurism. Many don't think of this, nor think about the food they eat, the work they do, the "life" they live; they only think of the consequences if they become painfully visible. Even then you will see people unwilling to get out of the bond of slavery, and form lies to protect their habit just as a heroin addict would.

Non-fiction can be as bad (biographies, documentaries), but (for the most part) its primary purpose isn't a voyeur's pleasure, so it's rarely abused in the same way.


> Nothing

/s?

> They wrote fiction

Now do Carl Sagan and Richard Feynman.


I don't care for them either. What, am I supposed to hear some famous names and swoon?


You ok there?


Yes, but that doesn't mean I won't react with the same hostility that I receive. It contradicts common sense that one user will be burdened by hostility, and when they lash back out, the accosters will show it as proof of something. In another aspect, most people I have met in real life are cowards who don't dare speak out of turn. Of course, I have never had this issue in real life or otherwise, and I take personal joy in the wisdom I raise before the invalids, even though they will never appreciate or understand it.



