Hacker News | meerab's comments

VideoToBe.com - Transcription & Chat for Audio/Video (board meetings, lecture notes, church sermons, etc.)

Started as a pay-per-use service (upload a file and receive an email with the transcript). Now we also offer a subscription-based service with a media library, AI summaries, chat Q&A, and insights/highlights.

Makes between $200 and $600 per month.

https://videotobe.com

Any tips for subscription growth in the transcription space?


It is up now!


You sure?


Just checked. It's up!!


nope... order page is still 500


Please don’t get a dog just because you’re having an existential crisis. Dogs are lifelong responsibilities.


Thank you! It certainly looks like Rotoscoping.

And EbSynth looks amazing! I wonder what model/tech tracks the movement so well!


I use a two-pass approach - first pass with ASR (OpenAI Whisper) and second pass with an LLM. I ask users to provide context upfront and use that as the "initial_prompt" parameter in Whisper: https://github.com/openai/whisper/discussions/963#discussion...

Gemini might have similar capabilities for custom vocabulary, though I'm not certain about their specific implementation. The two-pass ASR+LLM approach could work with Gemini's output as well.
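
A rough sketch of the two-pass idea, in case it helps. The `transcribe(..., initial_prompt=...)` call is the openai-whisper API; the helper names, the "Glossary:" format, the character cap, and the wording of the correction prompt are all assumptions for illustration, and the LLM call itself is left to whichever client you use.

```python
from typing import List

# Assumed cap: Whisper only uses a limited amount of prompt text, so keep it short.
MAX_PROMPT_CHARS = 800


def build_initial_prompt(description: str, vocabulary: List[str]) -> str:
    """Combine user-supplied context and custom terms into one short prompt string."""
    parts = []
    if description.strip():
        parts.append(description.strip())
    terms = [t.strip() for t in vocabulary if t.strip()]
    if terms:
        parts.append("Glossary: " + ", ".join(terms) + ".")
    return " ".join(parts)[:MAX_PROMPT_CHARS]


def first_pass(audio_path: str, prompt: str, model_name: str = "base") -> str:
    """First pass: transcribe with openai-whisper, biasing decoding via initial_prompt."""
    import whisper  # imported lazily so the helpers work without the package installed

    model = whisper.load_model(model_name)
    result = model.transcribe(audio_path, initial_prompt=prompt)
    return result["text"]


def build_correction_prompt(transcript: str, prompt: str) -> str:
    """Second pass: text to send to an LLM of your choice to fix misheard terms."""
    return (
        "Correct transcription errors in the text below. "
        "Do not add or remove content.\n"
        f"Context: {prompt}\n"
        f"Transcript:\n{transcript}"
    )
```

Reusing the same user context in both passes means a term Whisper still mishears can be corrected by the LLM, which sees the intended spelling.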


Working on https://videotobe.com, an audio/video transcription service. VideoToBe started as a user-friendly Whisper wrapper, but is evolving into a full pipeline that extracts, summarizes, and structures insights from multimedia content.


I created a transcript of it for anyone who would rather read than listen!

https://videotobe.com/play/youtube/hS1YqcewH0c


My actual time breakdown as a solo developer:

10% Daily time to learn about various AI tools and improve my workflow.

20% Procrastination (this might be way more than I'm willing to admit, but this is HN and I want to appear smart)

20% Writing detailed feature descriptions, breaking them down into task lists, and writing acceptance tests.

20% AI Coding (Claude Code)

20% Testing + Production


Personal opinion:

Claude Code is more user-friendly than Cursor, thanks to its CLI-like interface. File modifications are easy to review, and it automatically runs commands such as psql, cd, ls, and grep. Command output is shown in a more readable fashion. Agents and MCPs are easy to organize and use.


I feel just the opposite. I think Cursor's output is actually in the realm of "beautiful." It's well formatted and shows the user snippets of code and reasoning that helps the user learn. Claude is stuck in a terminal window, so reduced to monospaced bullet lines. Its verbose mode spits out lines of file listings and other context irrelevant to the user.


I have barely written any code since my switch to Claude Code! It's the best thing since sliced bread!

Here's what works for me:

- Detailed claude.md containing overall information about the project.

- Anytime Claude takes a route other than my preferred one, ask it to save my preference in global memory.

- Detailed planning documentation for each feature, describing the high-level functionality.

- As I develop the feature, add documentation with database schema, sample records, sample JSON responses, API endpoints used, test scripts.

- MCP, MCP, MCP! Playwright is a game changer

The more context you give upfront, the less back-and-forth you need. It's been absolutely transformative for my productivity.

Thank you Claude Code team!


What are you working on? In my industry it fails half of the time and comes up with absolute nonsense. The data just doesn't exist for our problems; it can only work when you guide it and ask for a few functions at most.


This sounds like my experiences with it. I'm writing embedded firmware in C and Rust. I'd describe further, but Claude seems incompetent at all aspects of this space.


This. Every "AI is great" response seems to be from someone doing web development - something I've intentionally avoided ever since I got tired of it around 2001, and hope to never have to do again.

We write C++ code in a very customized internal idiom to drive our hardware. Claude is great at filling in debugging statements / iterating over standard data structures to dump their contents, but not much else.


I am working on VideoToBe.com, and my stack is Next.js, PostgreSQL, and FastAPI.

Claude Code is amazing at producing code for this stack. It does an excellent job of writing ffmpeg and curl commands, Linux shell scripts, etc.
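
As an illustration of the kind of ffmpeg command meant here, a minimal sketch (not necessarily what VideoToBe actually runs): it builds a command that extracts 16 kHz mono WAV audio from a video, a common input format for speech recognition. The function name and the choice of sample rate are assumptions; the flags are standard ffmpeg options.

```python
from typing import List


def ffmpeg_extract_audio_cmd(video_path: str, wav_path: str) -> List[str]:
    """Build an ffmpeg command that extracts 16 kHz mono PCM audio from a video."""
    return [
        "ffmpeg",
        "-y",                 # overwrite the output file if it already exists
        "-i", video_path,     # input media file
        "-vn",                # drop the video stream
        "-ac", "1",           # downmix to mono
        "-ar", "16000",       # resample to 16 kHz
        "-c:a", "pcm_s16le",  # 16-bit little-endian PCM, the usual WAV encoding
        wav_path,
    ]
```

To run it, pass the list to `subprocess.run(cmd, check=True)`; keeping the arguments as a list avoids shell-quoting problems with file names that contain spaces.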

I have written a detailed project plan and feature plans in Markdown, and Claude has no trouble understanding the instructions.

I am curious - what is your use case?


That seems to be a great example of precisely the sort of program an AI would be good at. A small focused product that only does one thing. Mainly gluing together other people's code. It's a polished greenfield project that does one tiny bit of focused functionality.

Interestingly, this guy has been making pretty much the same app as you, and live-streamed making it on YouTube:

https://www.youtube.com/@RayFernando1337

Looks like he's now pivoted to selling access to his Discord server for vibe coding tips, as I can't find a link to his product.

But if we're honest here, it's not going to take a ton of code to make that. All the functionality to do it is well documented.

Many people here could make a competitor in a week, without agentic AI, just using AI as a super-charged SO. The limiter pre-AI (aside from AI transcribing it) would have been reading and implementing/debugging all the documentation of the libraries you're using, which AI is great at circumventing.

Your product looks really good, and is an excellent example of what vibe coded AI is great at. I hope you're getting good traction.


Personally, I give Claude a fully specified program as my prompt so that it gives me back a working program 100% of the time.

Really simple workflow!


Ah, I’ve tried that one, but I must be doing something wrong. I give it a fully specified working program, and often times it gives me back one that only works 50% of the time!


Does Claude Code provide some kind of "global memory" the LLM refers to, or is this just a request you make within the LLM's context window? Just curious, I hadn't heard the term used before.

EDIT: I see, you're asking Claude to modify claude.md to track your preference there, right?

https://docs.anthropic.com/en/docs/claude-code/memory


Yes. /init will initialize the project and save the initial project information and preferences.

Ask Claude to update the preferences and the document the moment you realize that Claude has deviated from your preferred path.


What does the playwright MCP accomplish for you? Is it basically a way for Claude to play with your app in the browser without having to write playwright tests?


How have you been using Playwright MCP?

