Sounds cool!

Would be curious to know what the underlying AWS EC2 instance is.

Is each DB on a dedicated instance?

If not, are there per-customer IOPS bounds?


We run on the same instance types the larger PlanetScale Metal sizes offer as whole instances. For Intel that's r6id, i4i, i7i, i3en, and i7ie. For ARM that's r8gd, i8g, and i8ge. (Right now, at least. AWS is always cookin' up new instance types.) Same story will soon be true for GCP.


There aren't per-customer IOPS limits, but the CPU will be the bottleneck.


They would need to handle all the translation changes as well, no?

<https://github.com/search?q=repo%3Apostgres%2Fpostgres+%22ma...>

I agree the code change is simple, but I guess the task is complex for other reasons.


If you just make a code change, you don't need to handle translations at that time. That will get done by the various translation teams closer to the release. However, you do need to make sure that the code is translatable (e.g. injecting pre-formulated English messages into a larger message is problematic).


Surprised not to see Ubicloud in there, which provides cloud services on top of various (including European) infra providers.

https://www.ubicloud.com/


They're under the CLOUD Act, i.e. quite toxic.


Basically everyone with even a CDN endpoint in the US is under the CLOUD Act: Hetzner, OVH, etc. Scaleway may be the only one for which I couldn't find any mention of a US PoP.


The US CLOUD Act says they need to provide the US government with access to your data even if the data center is outside the US.


They're not a European company though, are they? Worst of both worlds, as they come under both US and EU regulation.


This is really cool, congrats on launch!

I am curious how you prevent private data from getting leaked into the auto-generated public docs. I imagine this problem does not exist in open source projects, but it would become an issue when not everything discussed in a company's private messenger should be used as context for generating docs.


Absolutely. There are steps in Promptless's agent flow that are designed to prevent this, but this is why users still review Promptless's suggestions to guides before committing/publishing them. I think people will still want to review Promptless's suggestions for a while, but the granularity of oversight will probably decrease as trust increases.


Author here.

> I don’t think “get moar ram” is a good response to that particular critique.

I do not think the blog post suggested "get more ram" as a response, but happy to clarify if you could share more details!

> Indexing in Postgres is legitimately painful

Lantern is here to make the process seamless and remove most of the pain for people building LLM/AI applications. Examples:

1. We build tools to remove the guesswork of HNSW index sizing. E.g. https://lantern.dev/blog/calculator

2. We analyze typical patterns people use when building LLM apps and suggest better practices. E.g. https://lantern.dev/blog/async-embedding-tables

3. We build alerts and triggers into our cloud database that automate the discovery of many issues via heuristics.


Author here. We will benchmark this thoroughly in the future for our vector indexes.

But at least anecdotally, it made a ton of difference.

We met a <200 ms latency budget with Ubicloud NVMes, but had to wait seconds to get an answer from the same query with GCP persistent disks or local SSDs.


We provide this functionality in Lantern cloud via our Lantern Extras extension: <https://github.com/lanterndata/lantern_extras>

You can generate CLIP embeddings locally on the DB server via:

  SELECT abstract,
         introduction,
         figure1,
         clip_text(abstract) AS abstract_ai,
         clip_text(introduction) AS introduction_ai,
         clip_image(figure1) AS figure1_ai
  INTO papers_augmented
  FROM papers;
Then you can search for embeddings via:

  SELECT abstract, introduction
  FROM papers_augmented
  ORDER BY clip_text(query) <=> abstract_ai
  LIMIT 10;
The approach significantly decreases search latency and results in cleaner code. As an added bonus, EXPLAIN ANALYZE can now show the percentage of time spent in embedding generation vs. search.
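
For example, a minimal sketch of that timing check (the query string below is a hypothetical stand-in for a real search):

  EXPLAIN ANALYZE
  SELECT abstract, introduction
  FROM papers_augmented
  ORDER BY clip_text('diagram of an attention mechanism') <=> abstract_ai
  LIMIT 10;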

The linked library enables embedding generation for a dozen open source models and proprietary APIs (list here: <https://lantern.dev/docs/develop/generate>), and adding new ones is really easy.


Lantern seems really cool! Interestingly, we did try CLIP (OpenCLIP) image embeddings but the results were poor for 24px by 24px icons. Any ideas?

Charlie @ v0.app


I have tried CLIP on my personal photo album collection and it worked really well there: I could write detailed scene descriptions of past road trips, and the photos I had in mind would pop up. The model is probably better suited to everyday photos than to icons.


For similar isolation-level anomalies in real-world applications, check out this SIGMOD '17 paper:

ACIDRain: Concurrency-Related Attacks on Database-Backed Web Applications: http://www.bailis.org/papers/acidrain-sigmod2017.pdf


Not sure what the approach of this library is, but can't you generate a nonce from a larger alphabet, hash the column values with the nonce, `hash(nonce || column)`, and crypto-shred the nonce at the end?

Then, during hashing, you just need a constant immutable state, which effectively expands the hash space without incurring the mutable-state overhead of the replacement-strings strategy.
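
A minimal sketch of that idea in Postgres with pgcrypto (the nonces/users tables and the dataset_id key are hypothetical, just for illustration):

  CREATE EXTENSION IF NOT EXISTS pgcrypto;

  -- One random nonce per dataset, stored separately from the data.
  CREATE TABLE nonces (
      dataset_id int PRIMARY KEY,
      nonce      bytea NOT NULL DEFAULT gen_random_bytes(32)
  );
  INSERT INTO nonces (dataset_id) VALUES (1);

  -- Pseudonymize a column as hash(nonce || column).
  SELECT encode(digest(n.nonce || convert_to(u.email, 'UTF8'), 'sha256'), 'hex') AS email_pseudonym
  FROM users u CROSS JOIN nonces n
  WHERE n.dataset_id = 1;

  -- Crypto-shredding: deleting the nonce makes the pseudonyms permanently unlinkable.
  DELETE FROM nonces WHERE dataset_id = 1;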


This is cool!

Does this only collect logs from the frontend?

Or can it also collect backend and DB latency data related to a frontend interaction?


We collect logs across the stack. Here's some docs on our backend logging integrations (we also have connectors for major cloud providers): https://www.highlight.io/docs/getting-started#for-your-backe...

