More

objektif · 2026-05-26T01:45:27 1779759927

I once asked HN why EVs look funky and many people responded with “oohh no they don’t what are you talking about”. Tell me now if this looks weird or not.

JJMcJ · 2026-05-26T01:49:21 1779760161

If they look like regular cars, then the owners don't get the special feeling when people see their car.

objektif · 2026-05-09T13:09:41 1778332181

This is pretty insightful thank you. Which provider are you guys using? Is it also over the phone or fully web/app based. Do you have any resources you can point me to learn about this?

aenis · 2026-05-09T14:22:38 1778336558

We use a bunch, at the moment we mainly self host (and use pipecat) use Daily, and a few niche boutique suppliers who built things for us.

There is a great resource for learning this stuff - the CEO of Daily, Kwindla Kramer, hosted a series of 1hr sessions on low latency voice ai. Here:

https://youtube.com/playlist?list=PLzU2zoMTQIHjMPZ-OnpC3ozZs...

Some of this is a bit outdated but most of it is very valuable.

Kwindla posts a lot of extremely useful stuff on x and linkedin, incl. working, easily replicable sub 500ms setups.

objektif · 2026-05-09T14:35:54 1778337354

Beautiful thanks. We are also looking at this and another complication is transcripts can get pretty messy updates, corrections etc.

objektif · 2026-04-23T18:10:49 1776967849

Are there faster mini/nano versions as well?

tedsanders · 2026-04-23T18:15:07 1776968107

Not this time, no.

abi · 2026-04-23T18:20:38 1776968438

Usually, those get released a few weeks later.

objektif · 2026-04-22T19:21:18 1776885678

Does anyone know good provider for low latency llm api provider? We tried to look at Cerebras and Groq but they have 0 capacity right now. GPT models are too slow for us at the moment. Gemini are better but not really at same level as GPT.

spmurrayzzz · 2026-04-23T02:15:55 1776910555

This depends a bit on your cost sensitivity and what model families you want support for, but Baseten and Fireworks have been my goto.

Currently Baseten has ~610ms TTFT and ~82 tk/s for Kimi K2.6, which is roughly 2x the throughput of GPT-5.4 (per their openrouter stats). GLM 5 is slightly slower on both metrics, but still strong.

objektif · 2026-04-08T01:17:52 1775611072

No. They like stealing land.

objektif · 2026-04-01T02:45:57 1775011557

What are you basing how good they are on? Personal experience or some benchmarks?

a-t-c-g · 2026-04-01T03:55:36 1775015736

Benchmarks, we have internal ones testing reasoning fine-tuned v/s frontier + prompts

For some use cases it can be parity performance at 1/20th the cost up to exceeds at 1/10th the cost. Trade-off is ofc narrow applicability

objektif · 2026-04-01T12:51:15 1775047875

How can I learn more about these models? Are they open source?

a-t-c-g · 2026-04-01T17:25:59 1775064359

there are plenty of OSS finetuned models + base models around. If you're looking for doing these on your own dataset, worth getting in touch with cartesien.io or wire up https://github.com/SalesforceAIResearch/PretrainRL-pipeline

objektif · 2026-04-01T19:13:18 1775070798

Thank you.

objektif · 2026-02-28T13:47:54 1772286474

Yeah we care about Iranian protesters you got this right.

mupuff1234 · 2026-02-28T14:08:45 1772287725

That's not what I said.

objektif · 2026-02-27T01:42:05 1772156525

Will Randian tech bros start calling for socialism soon? Inshallah.

objektif · 2026-02-22T03:17:12 1771730232

He sounds greedy as fuck. He speed ran buggy POS to sell to model co? Obvious as day what is there to see?

objektif · 2026-02-22T03:12:48 1771729968

PG commissioned dan on X to send anyone who criticize Andrej or Pete to gulag.