Isn't that very wasteful or difficult to do in practice? Given that shrinkers generally take lower numbers to be 'simpler' than higher numbers, doesn't complexity-ordering require you to generate all the numbers from low to high?
Not really; there are many ways to do it, depending on your needs. For example, you can partition your space first, then generate randomly inside each of the subspaces. Let's say I need 200 numbers from -1000 to 999. The first range will be 0 to +99, the second -1 down to -100, then +100 to +199, and so on. So to generate a random number, all I need is an index and the bounds it implies.
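A minimal Python sketch of exactly that scheme, using the band width of 100 from the example (the helper names are mine):

```python
import random

BAND = 100  # width of each subrange, matching the example above

def band_bounds(k: int) -> tuple[int, int]:
    """Bounds of the k-th band in rough complexity order:
    k=0 -> [0, 99], k=1 -> [-100, -1], k=2 -> [100, 199], ...
    """
    m = k // 2
    if k % 2 == 0:                        # even indices walk up the positives
        return m * BAND, (m + 1) * BAND - 1
    return -(m + 1) * BAND, -m * BAND - 1  # odd indices mirror into the negatives

def sample_band(k: int) -> int:
    """Draw one value uniformly from the k-th band -- no enumeration needed."""
    lo, hi = band_bounds(k)
    return random.randint(lo, hi)

# 200 numbers from -1000..999: 20 bands of width 100, 10 draws each,
# ordered so earlier draws come from "simpler" (smaller-magnitude) bands.
values = [sample_band(k) for k in range(20) for _ in range(10)]
```

Nothing is enumerated low to high; the index alone determines the bounds, and the draw inside each band is uniform.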
This post has many upvotes, but the comments all question its usefulness, and so far none of those questions have been answered. I have the same question, so I wonder what's going on with this post.
No, it's about the distribution being injective, not a single sampled response. So you need many outputs from the same prompt, plus knowledge of the LLM itself; then, in theory, you should be able to reconstruct the original prompt.
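A toy sketch of what the injectivity claim buys you, assuming a hypothetical black-box `sample_fn` and a finite candidate set (in practice you would need white-box access or enormous sample counts; all names here are mine):

```python
from collections import Counter

def empirical_dist(sample_fn, prompt, n=1000):
    """Estimate a prompt's output distribution by repeated sampling.
    sample_fn(prompt) -> str stands in for a black-box LLM call."""
    counts = Counter(sample_fn(prompt) for _ in range(n))
    return {out: c / n for out, c in counts.items()}

def tv_distance(p, q):
    """Total-variation distance between two empirical distributions."""
    return 0.5 * sum(abs(p.get(k, 0.0) - q.get(k, 0.0)) for k in p.keys() | q.keys())

def reconstruct(observed_dist, candidate_prompts, sample_fn):
    """If prompt -> output distribution is injective, the candidate whose
    estimated distribution sits closest to the observed one is, given
    enough samples, the original prompt."""
    return min(candidate_prompts,
               key=lambda c: tv_distance(observed_dist, empirical_dist(sample_fn, c)))
```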
This is a cool example of how specializing a generic algorithm to a specific subspace can yield much better results. In my experience this is frequently the case, yet we rarely bother exploiting properties specific to our problem space, applying the generic algorithm instead out of convenience (and because it is often good enough).
I wrote my thesis on this! Application-specific system design can get you orders-of-magnitude performance improvements, as well as better scalability and fault-tolerance properties. I focused on graph analytics, but it's reasonable to think this applies more broadly.
Definitely true that application-specific design is often not worth the investment, though. Chasing that 1000x improvement can easily cost you a year or two.
Came here to say this, but with caveats: the particular domain here has extra properties that let their "stupider" algorithm win in this case, whereas a general graph-drawing system has to cope with the full generality of the domain.
Usually there is a good middle ground: run a cheap heuristic analysis of the input to see whether it fits a special-case "stupid and fast" algorithm, and keep the sophisticated, comes-with-guarantees algorithm as the fallback that works for everything.
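A minimal sketch of that dispatch pattern, using networkx layouts as stand-ins (the specific check and the choice of layouts are illustrative, not anything from the original post):

```python
import networkx as nx

def choose_layout(G: nx.Graph) -> dict:
    """Cheap structural check first; the generic algorithm is the fallback."""
    is_planar, _ = nx.check_planarity(G)
    if is_planar:
        # Special case: planar graphs get a fast combinatorial embedding.
        return nx.planar_layout(G)
    # Fallback: force-directed layout handles any graph, with the usual
    # quality/cost trade-offs of the general method.
    return nx.spring_layout(G, seed=0)
```

The check itself has to be cheap relative to the layouts, otherwise the dispatch eats the savings.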