Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

You’re almost certainly going to have to write your own splitting code for anything nontrivial. LlamaIndex breaks down hard when there’s a lot of markup in the document, for example. You’ll also want control over the vector search strategy (just using the query or chunk embedding may not be enough)


in terms of search store and engine, would you agree that pgvector is sufficient for most text-specific cases?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: