node package manager
We need your input. Help make JavaScript better: Take the 2017 JavaScript Ecosystem survey ยป


highly advanced voice recognition software powered by the googles and written with Java scripts

requires sox so you have to brew install sox first

also requires say so you have to have a mac

for funtimes:

npm install stenographer -g
stenographer hello yes this is dog

here is some example input and output:

input: I was seated by the shore of a small pond, about a mile and a half south of the village of Concord and somewhat higher than it

output: I was heated by the shore the small pond about a mile and a half south of the villages Concord and somewhat tired and it

you can also pipe in raw audio (use -t to specify input format):

cat hello.wav | stenographer -t wav

if the audio you pipe in is longer than 10 seconds it will automatically get chunked and processed 10 seconds at a time. googles servers return errors if you upload large files but 5-10 second clips work okay

you can control the chunk size and beginning offset using environment variables:

cat longaudio.wav | TRIMSTART=50 CHUNKSIZE=10 stenographer -t wav

be warned that this library is pretty slow! it takes about 5-10 seconds for a 10 second audio clip to make a round trip to googles servers and back

if you get errors regarding input format try converting your input file to a wav first: sox input.mp3 output.wav