This text is republished with permission from Surprise Instruments, a e-newsletter that helps you uncover essentially the most helpful websites and apps. Subscribe here.
Typing isn’t all the time the easiest way to get your ideas down. Generally speaking via an thought results in higher readability. New AI instruments can reliably rework these spoken ideas into clear, organized textual content.
I’ve spent months experimenting with voice AI instruments—first on my cellphone, and now on my laptop computer. They’ve been serving to me pull concepts from my mind onto paper. The instruments beneath have grow to be essential to my workflow.
Why voice AI beats conventional transcription
Conventional transcription merely converts speech to textual content. Fashionable voice AI does way more:
- Instantaneous transformation: Converse naturally and get a refined draft, define, or abstract
- Good cleanup: AI removes filler phrases and provides correct punctuation
- Format flexibility: Convert speech into numerous codecs like bullet lists or structured paperwork
- Context consciousness: AI understands context and organizes your ideas logically. As a result of it’s grounded in your personal phrases, it doesn’t hallucinate.
5 methods I like utilizing voice AI
Listed here are some eventualities the place voice AI is especially beneficial:
1. Journal entries
As a substitute of looking at a clean web page, I communicate my ideas at day’s finish. The AI transforms my stream of consciousness into organized reflections.
2. Assembly follow-ups
After an in-person assembly, I open my voice AI app, hit file, and discuss via key factors whereas they’re nonetheless contemporary. I don’t fear in regards to the construction of my sentences or about pausing as I believe. The AI waits for me and summarizes my rambling.
3. Presentation planning
Talking via presentation concepts helps me determine my narrative circulation. The AI helps me manage my ideas right into a structured define. I can discuss via a number of potential variations, then examine them on display screen later.
4. Guide notes
To protect insights from one thing I’m studying, I activate a voice AI app and flip via the pages or scroll via the textual content to remind myself out loud about intriguing passages or concepts. I then save the structured word the AI creates.
I like with the ability to look again on the textual content whereas dictating the word. And the enhancing a part of my mind interferes much less once I’m speaking than once I’m typing.
5. Every day planning
Beginning my day by verbally mapping out my priorities helps me assume via what’s forward extra successfully than typing out an inventory.
Voice AI apps to attempt
Letterly
- Simple to make use of: Simply press the app’s huge button. As much as quarter-hour per recording.
- Cross-platform: File or entry your previous text-from-voice throughout routinely synchronized desktop, internet, and cellular apps.
- Good format detection: The magic rework possibility can routinely reformat your phrases, turning lists into bullets or structuring e mail drafts for fast copy-and-pasting into different apps.
- Customizable outputs: Remodel recordings into LinkedIn posts, podcast or video scripts, structured paperwork, or your personal customized codecs.
- Iterative refinement: Strive totally different transformations of the identical recording till you get precisely what you want.
- A number of languages: File in any of 90 languages, or file in a single language and have the app translate your textual content into one other.
- Offline and screen-off choices: File wherever, even with out Web entry. Strive utilizing background mode with out your display screen on. I usually file with my AirPods whereas strolling with my cellphone in my pocket.
- Founder’s tip: “Don’t confuse it with dictation,” says Letterly’s founder and CEO Anton Lebedev. “You don’t have to pronounce the right textual content you need to write. As a substitute, assume out loud, communicate slowly, shortly, and even chaotically. AI will perceive you. Consider it like a writing assistant you’re telling what to put in writing. The assistant can perceive you and determine how one can rewrite the textual content.”
- Letterly Pricing: $80/12 months after a free trial
Oasis
- Multi-purpose output: Get your recording remodeled concurrently into numerous codecs—from a memo or define to a weblog put up or TED discuss.
- Make customized templates: Create and title brief prompts that mirror your most popular types or codecs. These grow to be a part of your customized immediate library for remodeling future recordings. I made one for my journal entries.
- Net accessibility: Like Letterly and Audiopen, you may entry your recordings and remodeled textual content via a browser on any system.
- Oasis pricing: $5/month or $50/12 months for sufficient credit for lots of of month-to-month makes use of.
AudioPen
- Customise rewrite size: Customise the size setting should you’d choose summaries of your transcribed recordings to be shorter or longer. Create and entry them in your cellphone or on any system via your browser.
- Shareable audio notes: Ship particular person audio word hyperlinks to colleagues or collaborators. Or ship then to different apps with a Zapier integration.
- Versatile group: Mix a number of audio notes or their summaries into bigger collections. You may seek for outdated notes or organize them in folders.
- Wealthy template choice: Select from numerous transformation templates.
- AudioPen pricing: $99/12 months or $159/two years after a free trial.
Backside Line
Begin with Letterly if you would like simplicity and reliability. Think about Oasis if you would like a barely cheaper possibility or have to concurrently entry a number of format variations of the identical content material. AudioPen is beneficial if you wish to customise the size of your voice summaries or if sharing or combining audio notes is vital to your workflow.
The place to make use of voice AI
Voice AI shines when typing isn’t sensible or whenever you need to assume freely with out your palms on a keyboard. Listed here are conditions the place you may attempt it:
At dwelling
- Cozy chair: Seize e book notes with out interrupting your studying rhythm.
- Kitchen: Doc recipe changes or cooking notes whereas your palms are busy with elements.
- Bedside: File late-night musings with out disrupting your wind-down routine with a shiny display screen.
- Backyard: Log landscaping concepts or random ideas whereas your palms are soiled.
On the transfer
- Strolling: Seize venture concepts and inspiration throughout your day by day stroll.
- Commute: Draft emails and plan your day whereas on the subway or bus.
- Automotive: File ideas safely after parking however earlier than you neglect an vital thought.
At work
- Quiet area: Create reflective journal entries whereas searching the window.
- Convention: Seize insights between periods to keep away from being overwhelmed whenever you get dwelling.
- Physician’s workplace: File appointment particulars and follow-up steps whereas the data is contemporary.
Energetic time
- Outdoor: Draft journal entries or inventive concepts whereas surrounded by nature
- Train: Define displays or brainstorm on the treadmill
- Buying: Create lists or remind your self about merchandise
Voice AI in your laptop computer
I used to rely solely on cellular voice AI apps, however currently I’ve been counting on laptop computer voice AI apps. These are much less targeted on remodeling textual content and extra on placing your spoken textual content in your clipboard so you may paste into any instrument you’re utilizing. It really works with Google Docs, Phrase, e mail, or no matter else you’re utilizing. I take advantage of these on my laptop computer as a result of it’s faster and simpler for me to speak than to sort. Listed here are three value making an attempt:
Movement
- Fast to begin: When you’ve put in the software program, simply maintain down the operate key to begin recording in any of 100+ languages. Your recording will get immediately transcribed and the cleaned-up textual content is copied to your clipboard.
- Works wherever in your pc: Paste transcribed textual content immediately into any utility—e mail, paperwork, or messaging apps.
- Reduces display screen and hand fatigue: File whereas wanting away out of your display screen to cut back eye pressure and provides your palms a break.
- Flow pricing: Free for as much as 2,000 phrases/week; $12/month billed yearly for limitless phrases and further options. $8/month for students and educators.
TalkTastic
- Easy transcription: Made by the staff that created the Oasis cellular app, TalkTastic is designed to be easier. As a substitute of reworking your speech into numerous textual content sorts, it simply places a cleaned-up model of what you say onto your clipboard to stick into any app.
- Good textual content transformation: You may optionally set it to investigate your display screen context to supply remodeled variations of your textual content.
- Free: Whereas in beta, there’s no value for TalkTastic.
MacWhisper
- Superior transcription: Use this free software program to transcribe on-line conferences, podcasts, or dwell dictation. You may even add information to transcribe.
- Pay as soon as for professional options: Allow YouTube transcriptions, batch uploads, translation, and prime AI mannequin utilization with a one-time buy.
- MacWhisper pricing: Free for fundamental utilization; about $60 for professional improve; 20% low cost with this link. Journalists, college students, or non-profits can e mail support@macwhisper.com for 50% off.
Different methods to make use of your voice to profit from AI
- ChatGPT has a robust voice mode in its cellular and desktop apps. Moderately than typing out AI queries, you may have a dialog with an AI bot. Right here’s why that’s so useful.
- Perplexity’s cellular app voice AI mode is terrific. I ask it a sequence of questions, like an oracle. It beats Google on a lot of my queries. The AI understands what I’m asking, then gathers and summarizes a useful response. Citations within the app guarantee I can examine on its data sources.
- Google’s Gemini and Microsoft’s Copilot have recently-upgraded cellular voice modes. Converse with human-sounding AI bots with out thumb typing.
- Open-source options abound.
This text is republished with permission from Surprise Instruments, a e-newsletter that helps you uncover essentially the most helpful websites and apps. Subscribe here.