User Guide
Launch the application like any other Java application:
java --enable-native-access=ALL-UNNAMED -jar $HOME/speedofsound/speedofsound.jar
On the first launch, two things will happen:
- Model setup: The built-in Whisper Tiny model is unpacked into your user data folder. This is automatic and the app will start faster in the future.
- Permissions prompt: The app will ask you to grant permission to type on your behalf. To support both X11 and Wayland desktops without requiring root access, Speed of Sound uses XDG Desktop Portals for keyboard input. You must approve this prompt for dictation to work.
Once permissions are granted, press your shortcut to start dictating and press it again to stop. The transcribed text will be typed into whatever application is currently focused.
Keyboard shortcuts
| Shortcut | Action |
|---|---|
Super+Z |
Start or stop dictation (global, configured in the previous step) |
S |
Start or stop listening (while the app window is focused) |
Escape |
Cancel listening |
M |
Minimize window |
Left Shift |
Select primary language |
Right Shift |
Select secondary language |
Ctrl+Q |
Quit |
The in-app shortcuts (all except the global one) are active only while the Speed of Sound window is focused.