Hacker Newsnew | past | comments | ask | show | jobs | submit | the_king's commentslogin

This it is very impressive. But scrolling through the preprint, I wouldn't call any of it elegant.

I'm not blaming the model here, but Python is much easier to read and more universal than math notation in most cases (especially for whatever's going on at the bottom of page four). I guess I'll have one translate the PDF.


I just tested it on a very difficult Raven matrix, that the old version of DeepThink, as well as GPT 5.2 Pro, Claude Opus 4.6, and pretty much every other model failed at.

This version of DeepSeek got it first try. Thinking time was 2 or 3 minutes.

The visual reasoning of this class of Gemini models is incredibly impressive.


Deep Think not DeepSeek


I think good names and a good file structure are the most important thing to get right here.


I think Aqua v1 had two problems:

1. The models weren't ready.

2. The interactions were often strained. Not every edit/change is easy to articulate with your voice.

If 1 had been our only problem, we might have had a hit. In reality, I think optimizing model errors allowed us to ignore some fundamental awkwardness in the experience. We've tried to rectify this with v2 by putting less emphasis on streaming for every interaction and less emphasis on commands, replacing it with context.

Hopefully it can become a tool in the toolbox.


Looking forward to giving it another try!


you've got nothing to worry about here.

if firebase studio can make a todo app, i'd be surprised. this is the worst "vibe coding" tool i've ever used.


> i've ever used

There are tools people complain about and there are tools nobody uses.


We can do that, with some help from the community.


I'm a paying customer and I signed onto Aqua Voice shortly after your demo on HN.

My experience with it has been overall positive but mixed. I enjoy using it for dictation, but I found that issuing editing commands and having them recognized/executed often took a lot longer than making an edit myself (which I can't do while in dictation mode).

But as a paying customer, seeing you go in this direction is somewhat sad/frustrating. You're abandoning the product I use, and you're saying that if I want to see my platform supported, I or someone from the community has to provide it- for a fully proprietary paid application.

I understand that I'm a minority user, but it's a bit disappointing to read this.


Totally understand, thanks for being a customer. I'm sorry we weren't able to make the web version as smooth as we wanted to.

We do plan to support Linux. This was probably a little bit of a blind spot for us - not realizing that anyone running a Linux desktop doesn't even have system voice tool to fall back on.


What support do you need?


thanks!

I share the same sentiment. I remember thinking in college how annoying it was that I was reading low-resolution, marked-up, skewed, b&w scans of a book using Adobe Acrobat while CS concentrators were doing everything in VS Code (then brand new).

but we do think voice is actually great with Cursor. It’s also really useful in the terminal for certain things. Checking out or creating branches, for example.


Aqua is in another league when it comes to accuracy. I just ran them side by side on a simple q to ChatGPT and here were the results...

Aqua Voice

  What is the first recorded eclipse in human history? I'm not asking when the first one occurred, but the first written record we have of an eclipse.
Windows Voice Typing (v11 24H2, Dell XPS 13 9340)

  What is the first recorded eclipse in human history i'm not asking 1 like the first occurred but the first ridden record we have of an eclipse

Windows mistakes were:

-"1" should be "when"

-"ridden" should be "written"

-No punctuation


thanks! We're working on iOS, but it's tough to get the ergos right given all of Apple's restrictions and neglected APIs.


Android app please!


I was excited to try this out because I've had a lot of trouble getting the Supabase integrations to work on Lovable and Bolt.new.

Sorry to say that Firebase Studio did an awful job. It did not successfully build even the first view of the app I asked for. It feels like I'm stepping back to release day of GPT-4.

Am I missing a switch to use the good Gemini 2.5 somewhere? I could tell from their response speed that I was not using a thinking model.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: