Hacker News | 1ba9115454's comments

I can't imagine this setup will get more than 1 token per second.

I would love to see DeepSeek running on-premises with a decent TPS.


It says 4.25 TPS in the first para.


Honest mistake. Some people think HN is just a series of short tweets and haven’t realized they are links yet!


It's the modern way. Why read when you can just imagine facts straight out of your own brain.


I agree but also found your comment funny in the context of LLMs. People love getting facts straight out of their models.


4.25 is enough tps for a lot of use cases.


That's still pretty slow, considering there's that "thinking" phase.


True, but 4.25 is the number we all want to know.


You can get 1 t/s on a Raspberry Pi.

https://youtu.be/o1sN1lB76EA?si=i8ecEBjLdV0zewFQ


This has nothing to do with the full 671B model; the Ollama models are distilled Qwen2.5.


I appreciate both of these comments, thank you both.


https://github.com/cornucopia-rs/cornucopia

Cornucopia generates Rust code.


A ternary is all you need.


We're using Dioxus on the backend with Axum for https://github.com/bionic-gpt/bionic-gpt

Very pleased with the result. It's great having the compiler help out with UI work.


SEEKING VOLUNTEERS

Bionic-GPT

https://github.com/bionic-gpt/bionic-gpt

Rust, LLMs, RAG, Kubernetes. Generative AI.

Ideally for the over-50s, but younger people also considered.


You can look at it from a pen tester's point of view.

Here's a checklist for a web pentest. https://pentestbook.six2dez.com/others/web-checklist


Why did you say you didn't play an instrument when you did?


He probably wanted to skip the bullshit.


What would be the outcome of this? Are you trying to reduce the gap between the poor and the wealthy?


Yes? Sort of?

Depends what you mean by "gap".

I certainly think the power dynamic we currently have in the US is bad. The wealthy are insulated from most, if not all, of the troubles facing the rest of society. Yet they still extract benefit from, and have considerable control and influence over, society at large.

That's not healthy for anyone but the wealthy. Maybe not even them, long term, but that's more of a philosophical question.

Not sure if that answers your question or not.


I seriously doubt that YC discriminates on age deliberately.

My understanding is that you're expected to spend 3 months in San Francisco, and that would stop most people with a family, which is most people over 30.


I didn't look too deeply at this. As soon as it said JSON, I was gone.

I've been using dbmate, which uses SQL and works really well.

