Simple STT

Speech-to-text, simplified. Live transcription, read-aloud, voice commands, and a clean transcript workspace.

Simple STT is a speech-to-text browser app and extension with live transcription, read-aloud, configurable voice commands, and optional active-field insertion.

What It Includes

a full-page transcript workspace
a compact popup launcher
a settings surface for language, voice, hotkey, and command phrases
a hosted web app at simple-stt.github.io
optional writing of final dictated text into the currently focused editable field in the active tab

Features

One live transcript surface with interim and final speech merged into the same textarea
Start and stop transcription from the main app
Read-aloud with play, pause, resume, and restart controls
Language selection and read-aloud voice selection
Copy, cut, and clear transcript actions
Configurable spoken command phrases for line and paragraph breaks
Configurable transcription toggle hotkey
Optional active-field writing for normal web pages
Guards against restricted browser and internal pages
Snackbar feedback for clipboard actions, settings changes, and important errors

Open The App

Hosted web app

Visit simple-stt.github.io

Extension app

Click the extension toolbar icon, then choose Open App
Open settings from the popup gear or the app gear

The extension deduplicates its own app and settings tabs, so opening them again focuses the existing tab instead of creating a new copy.
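The deduplication described above can be sketched as a lookup over the open tabs. This is a minimal illustration, not the extension's actual code: the `TabLike` shape, the `findExistingTab` name, and the choice to ignore the query string and hash when comparing URLs are all assumptions.

```typescript
// Hypothetical, reduced view of a browser tab: only the fields this sketch needs.
interface TabLike {
  id: number;
  url: string;
}

// Drop everything from the first "?" or "#" so that app.html and app.html#top compare equal.
function stripQueryAndHash(url: string): string {
  const cut = url.search(/[?#]/);
  return cut === -1 ? url : url.slice(0, cut);
}

// Return the first open tab that already shows the target page, or undefined if none does.
function findExistingTab(tabs: readonly TabLike[], targetUrl: string): TabLike | undefined {
  const target = stripQueryAndHash(targetUrl);
  for (const tab of tabs) {
    if (stripQueryAndHash(tab.url) === target) return tab;
  }
  return undefined;
}
```

In an extension, a caller would typically focus the returned tab with `chrome.tabs.update(tab.id, { active: true })` and fall back to `chrome.tabs.create({ url })` when nothing is found.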

Install The Extension

Open chrome://extensions in Google Chrome
Enable Developer mode
Click Load unpacked
Select this repository's ext/ directory

Chrome only:

This project is built and tested for Google Chrome
Brave and other Chromium browsers are not supported targets for this repo

How Transcription Works

The main app page is the primary workspace
Interim speech stays visible in the same textarea while speaking
Final speech is committed into the transcript
The textarea grows with the transcript up to a maximum visible height, then scrolls internally
The textarea auto-focuses on load and regains focus after the main transcript actions
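One way to picture the interim and final handling above is as a small state update. The sketch below is an assumption about how such a merge could work, not a copy of the app's logic; `TranscriptState`, `SpeechChunk`, `applyChunk`, and `renderTranscript` are hypothetical names.

```typescript
// Hypothetical model of the transcript state; not taken from the Simple STT source.
interface TranscriptState {
  committed: string; // final text already in the transcript
  interim: string;   // in-progress text for the current utterance
}

// A recognition result, reduced to the two fields this sketch needs.
interface SpeechChunk {
  text: string;
  isFinal: boolean;
}

// Join two pieces with a single space unless one side is empty
// or the left side already ends in whitespace.
function joinText(left: string, right: string): string {
  if (left === "" || right === "") return left + right;
  return /\s$/.test(left) ? left + right : left + " " + right;
}

// Interim chunks replace the previous interim text; final chunks are
// appended to the committed text and clear the interim text.
function applyChunk(state: TranscriptState, chunk: SpeechChunk): TranscriptState {
  const text = chunk.text.trim();
  if (chunk.isFinal) {
    return { committed: joinText(state.committed, text), interim: "" };
  }
  return { committed: state.committed, interim: text };
}

// The single string shown in the textarea: committed text followed by interim text.
function renderTranscript(state: TranscriptState): string {
  return joinText(state.committed, state.interim);
}
```

Keeping the interim text separate from the committed text means a revised interim result overwrites the previous guess instead of piling up in the transcript.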

Read Aloud

Simple STT can read the transcript back using the browser speech-synthesis engine.

If text is selected, it reads the selection
If no text is selected, it can read from the cursor position
Restart jumps back to the beginning and starts again
Starting read-aloud stops active transcription so the app does not transcribe its own output
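The selection rule above can be expressed as a small pure function over the textarea value and its selection range. This is a sketch under assumptions: the `textToRead` name is hypothetical, and reading from the cursor through to the end of the transcript is one plausible interpretation of "from the cursor position".

```typescript
// Pick the text to speak from a textarea-like value and selection range.
// selectionStart === selectionEnd means there is only a cursor, no selection.
function textToRead(value: string, selectionStart: number, selectionEnd: number): string {
  // Clamp both offsets into the valid range so stale offsets cannot throw or wrap.
  const start = Math.max(0, Math.min(selectionStart, value.length));
  const end = Math.max(start, Math.min(selectionEnd, value.length));
  if (end > start) return value.slice(start, end); // a real selection: read only that
  return value.slice(start);                       // cursor only: read to the end
}
```

The returned string is what would be handed to the browser's speech-synthesis engine; an empty result (cursor at the very end) is left for the caller to handle.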

Active-Field Writing

When Write to active field is enabled in the main app, final dictated text is also inserted into the currently focused editable field in the active tab.

Notes:

This only applies to normal editable pages and fields
Restricted browser and internal pages are skipped quietly
Turning the toggle off keeps transcription local to Simple STT
The setting is persisted through the shared settings layer
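A guard like the one described in these notes could be a simple URL check before any insertion is attempted. The prefix list below is an assumption chosen for illustration (Chrome does block script injection on its own internal pages and on the Chrome Web Store), and `isRestrictedUrl` and `shouldWriteToActiveField` are hypothetical names, not the extension's API.

```typescript
// Hypothetical list of URL prefixes treated as restricted; the real list may differ.
const RESTRICTED_PREFIXES: readonly string[] = [
  "chrome://",
  "chrome-extension://",
  "devtools://",
  "about:",
  "view-source:",
  "https://chromewebstore.google.com/",
  "https://chrome.google.com/webstore",
];

// True when the tab URL is missing or starts with one of the restricted prefixes.
function isRestrictedUrl(url: string | undefined): boolean {
  if (!url) return true; // no URL available: treat the tab as restricted
  const lower = url.toLowerCase();
  return RESTRICTED_PREFIXES.some((prefix) => lower.startsWith(prefix));
}

// Insert into the active field only when the toggle is on and the page is a normal one.
function shouldWriteToActiveField(enabled: boolean, tabUrl: string | undefined): boolean {
  return enabled && !isRestrictedUrl(tabUrl);
}
```

Returning `false` instead of throwing matches the "skipped quietly" behavior: the transcript still updates inside Simple STT, and nothing is injected into the page.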

Spoken Commands

Simple STT replaces configured spoken phrases after recognition finalizes.

Default phrases:

carriage return => newline
double carriage return => blank line

You can change these in Settings.
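As an illustration of the replacement step, the sketch below swaps the configured phrases for line breaks in a finalized string. The function and type names are hypothetical, and the matching rules (case-insensitive, whole phrase, surrounding spaces removed) are assumptions. The one detail that follows directly from the defaults is ordering: the paragraph phrase contains the line phrase, so it has to be replaced first.

```typescript
interface CommandPhrases {
  lineBreak: string;      // default from this README: "carriage return"
  paragraphBreak: string; // default from this README: "double carriage return"
}

const DEFAULT_PHRASES: CommandPhrases = {
  lineBreak: "carriage return",
  paragraphBreak: "double carriage return",
};

// Escape regex metacharacters so user-configured phrases are matched literally.
function escapeRegExp(text: string): string {
  return text.replace(/[.*+?^${}()|[\]\\]/g, "\\$&");
}

// Replace each whole-phrase occurrence, ignoring case and swallowing adjacent spaces.
// Assumes the phrase starts and ends with a word character, as spoken phrases normally do.
function replacePhrase(text: string, phrase: string, replacement: string): string {
  const trimmed = phrase.trim();
  if (trimmed === "") return text; // blank phrase: nothing to replace
  const pattern = new RegExp(`[ \\t]*\\b${escapeRegExp(trimmed)}\\b[ \\t]*`, "gi");
  return text.replace(pattern, replacement);
}

// Paragraph breaks go first so "double carriage return" is not consumed
// as the word "double" followed by a line break.
function applySpokenCommands(finalText: string, phrases: CommandPhrases = DEFAULT_PHRASES): string {
  const withParagraphs = replacePhrase(finalText, phrases.paragraphBreak, "\n\n");
  return replacePhrase(withParagraphs, phrases.lineBreak, "\n");
}
```

With the defaults, `applySpokenCommands("first line carriage return second line")` yields the two lines separated by a single newline.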

Keyboard Shortcuts

Default transcription toggle:

Alt+Shift+R

Transcript actions in the app page:

Ctrl/Cmd+A selects the transcript
Ctrl/Cmd+C copies the transcript
Ctrl/Cmd+X cuts the transcript

The transcription toggle hotkey is configurable in Settings.

Expected hotkey format:

Alt+Shift+R
Cmd+Shift+R
Ctrl+Alt+K

If the saved value is blank or invalid, Simple STT falls back to the default.
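The format and fallback rule above can be sketched as a parser that either returns a structured hotkey or gives up, plus a resolver that substitutes the default. The names here are hypothetical, and the validity rules (at least one modifier, exactly one non-modifier key, only `Alt`, `Ctrl`, `Cmd`, and `Shift` as modifiers) are assumptions inferred from the three examples.

```typescript
interface Hotkey {
  alt: boolean;
  ctrl: boolean;
  meta: boolean; // Cmd on macOS
  shift: boolean;
  key: string;   // upper-cased main key, e.g. "R"
}

const DEFAULT_HOTKEY = "Alt+Shift+R";

// Returns null when the string does not match the Modifier+...+Key shape.
function parseHotkey(value: string): Hotkey | null {
  const parts = value.split("+").map((part) => part.trim());
  if (parts.length < 2 || parts.some((part) => part === "")) return null;

  const hotkey: Hotkey = { alt: false, ctrl: false, meta: false, shift: false, key: "" };
  for (const part of parts) {
    switch (part.toLowerCase()) {
      case "alt": hotkey.alt = true; break;
      case "ctrl": hotkey.ctrl = true; break;
      case "cmd": hotkey.meta = true; break;
      case "shift": hotkey.shift = true; break;
      default:
        if (hotkey.key !== "") return null; // more than one non-modifier key
        hotkey.key = part.toUpperCase();
    }
  }
  const hasModifier = hotkey.alt || hotkey.ctrl || hotkey.meta || hotkey.shift;
  return hasModifier && hotkey.key !== "" ? hotkey : null;
}

// Blank, missing, or invalid saved values fall back to the default hotkey.
function resolveHotkey(saved: string | null | undefined): Hotkey {
  const parsed = parseHotkey(saved ?? "");
  if (parsed !== null) return parsed;
  return parseHotkey(DEFAULT_HOTKEY) as Hotkey;
}
```

Parsing once into a structured value keeps the keydown handler simple: it only has to compare the event's modifier flags and key against the resolved `Hotkey`.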

Settings

Settings currently support:

line break phrase
paragraph break phrase
transcription toggle hotkey
language
read-aloud voice

The Write to active field toggle stays on the main app page because it is a working-mode control that changes during a session, not a static preference.

Permissions

activeTab: grants temporary access to the current tab when the user invokes the extension
scripting: used for focused-field insertion into editable pages
storage: stores settings such as command phrases and the hotkey
tabs: used for app and settings tab focusing and deduping

Limitations

Speech recognition depends on the browser's built-in speech recognition support
Behavior is intended for Google Chrome only
Active-field writing will not work on restricted browser and internal pages
Dictation quality and availability depend on the browser speech engine and microphone permissions

License

MIT. See LICENSE.
