Skip to content

The science

You're not lazy for not writing it down. You're working against your own brain.

Speaking is 3× faster than typing. 70% of new ideas are gone in 24 hours. Every interruption costs 23 minutes of focus. The research has been clear for decades — capture has to match the speed of thought, or thought wins. Here's the peer-reviewed evidence behind every claim VoxNota makes.

Speed

Speech beats typing on phones — 3× faster, 20.4% fewer errors. The keyboard isn't your friend; it's the tax you pay for not speaking.

Stanford / Baidu, 2016

Memory

70%

Of any new idea is gone within 24 hours unless you capture it. The shower thought, the walk-home insight, the standup epiphany — all gone by tomorrow.

Ebbinghaus, 1885 (replicated)

Focus

23 min

To refocus after one interruption — with two unrelated tasks in between. Stopping to type a note isn't 'a quick second'. It's a quarter of an hour you don't get back.

UC Irvine, 2008

What it means for you

The research isn't theory — it's already shaping your day. You just don't see it yet.

The interruption tax

Every typed note costs you 23 minutes of deep work.

The math nobody talks about: the average knowledge worker is interrupted every 11 minutes, and it takes 23 minutes to fully refocus. Your 'quick note' isn't quick — it's the thing that keeps you from doing the work you opened your laptop to do.

VoxNota slips inside that gap. Tap, speak, get back to thinking. See how researchers, writers, and students use it.

The production effect

Saying it out loud makes you remember it. The science is settled.

MacLeod et al. (2010) showed that words spoken aloud are remembered substantially better than words read silently. When you record a voice note, you're getting two backups for the price of one — the file on your phone, and the encoding in your own head.

Your archive becomes a real second brain — because you've already half-memorised it.

The speed gap

Your thoughts run at 150 wpm. Your thumbs run at 40.

Speaking matches the speed your brain actually thinks at. Typing forces you to slow down, queue ideas, and watch the good ones fall out of working memory. Stanford's research is clear: voice isn't just faster on phones — it's more accurate.

VoxNota's AI catches every word, then structures it. You stay in flow. How the AI turns speech into searchable notes.

Peer-reviewed sources

Don't take our word for it. Read the papers yourself.

Every speed claim, every memory stat, every interruption number on this site comes from one of the studies below. The links go straight to the source.

2016

Speech is 3× faster than typing for English and Mandarin text entry on mobile devices

Ruan, S., Wobbrock, J., Liou, K., Ng, A., & Landay, J.

Speech recognition input was 3.0× faster in English and 2.8× faster in Mandarin than typing on smartphones, with a 20.4% lower error rate in English.

Stanford HCI / Baidu Research

View source

1885

The forgetting curve

Ebbinghaus, H.

Roughly 70% of new information is gone within 24 hours without review or active recall. Modern replications confirm the original finding.

Memory: A Contribution to Experimental Psychology

View source

2008

The cost of interrupted work: more speed and stress

Mark, G., Gudith, D., & Klocke, U.

After an interruption, it takes an average of 23 minutes 15 seconds to return to the original task — with around two intervening tasks before resuming.

CHI '08: Proceedings of the SIGCHI Conference

View source

2011

Consider it done! Plan-making can eliminate the cognitive effects of unfulfilled goals

Masicampo, E.J. & Baumeister, R.F.

Unfinished goals cause intrusive thoughts and impair performance on unrelated tasks. Making specific plans for those goals removes the cognitive cost.

Journal of Personality and Social Psychology

View source

2010

The production effect: delineation of a phenomenon

MacLeod, C.M., Gopie, N., Hourihan, K.L., Neary, K.R., & Ozubko, J.D.

Words spoken aloud during study are remembered substantially better than words read silently. The effect extends to educationally relevant texts.

Journal of Experimental Psychology: Learning, Memory, and Cognition

View source

2018

Typing expertise in a large student population

Dhakal, V., Feit, A., Kristensson, P.O., & Oulasvirta, A.

Average typing speed across a large population was 52 wpm, with significant variation by experience (35–80 wpm range). Speech sits well above the top of that range.

Cognitive Research: Principles and Implications

View source

The math

Two minutes saved per note. Multiply that by your week.

Speaking speed:
~150 words per minute (conversational pace).
Typing speed:
~40 words per minute (average adult on mobile).
Speed advantage:
3.75× faster (150 ÷ 40).
Worked example:
A 150-word note takes 1 minute to speak vs. 3.75 minutes to type — saving 2.75 minutes per note.

Actual savings vary by typing speed, device, and content complexity. Stanford / Baidu (2016) measured a 3.0× average mobile advantage for speech.

Deeper reading

Four ideas that explain why voice-first works — long before VoxNota existed.

Cognitive load

7 ± 2 — the limit of working memory.

Sweller's cognitive load theory: your working memory can hold about seven items at once. Trying to remember a thought while typing it out is what causes the 'wait, what was I about to say?' moment. Voice capture closes that gap before it opens.

Read the source

Zeigarnik effect

Open loops drain you — even when you forget you opened them.

Bluma Zeigarnik showed that unfinished tasks consume mental bandwidth in the background, whether you're actively thinking about them or not. Quick capture is how you close those loops. Less stuff in the background = more for the work in front of you.

Read the source

Flow state

Csikszentmihalyi: flow is fragile, and friction kills it.

The hardest part of deep work isn't getting in — it's staying in. Every interruption that pulls you out of flow takes minutes (or longer) to recover from. Voice capture is the lowest-friction tool there is. Tap, speak, keep going.

Read the source

Dual coding

Two channels remember better than one.

Paivio's dual-coding theory: information encoded both verbally and auditorily is recalled more reliably than either alone. Speaking your notes uses both channels at once — which is why people say they remember spoken notes even before they re-read them.

Read the source

Stop reading. Start capturing.

You've seen the science. Your next idea is the test.

3 seconds to capture. 70% of ideas saved that would have died. Zero interruptions to your flow. No signup, no credit card.