Speakeasy Settings

This section lists the settings that Speakeasy supports. Use these settings in conjunction with the preceding scripting actions.

Wavelink recommends grouping Speakeasy settings together at the beginning of a script.
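As a rough illustration of this recommendation, the following Python sketch models a script as an ordered list of steps and checks that every settings change comes before any other step. This is not Wavelink script syntax. The `Step` class, the step kinds (`change_setting`, `speak`, `listen`), the function name, and the sample values are all hypothetical. Only the setting names are taken from this section.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    # Hypothetical step kinds used only for this sketch:
    # "change_setting", "speak", "listen".
    kind: str
    name: str = ""        # setting name, used by "change_setting" steps
    value: object = None  # placeholder value, not a documented default


def settings_grouped_at_start(steps):
    """Return True if no settings change appears after a non-settings step."""
    seen_other = False
    for step in steps:
        if step.kind == "change_setting":
            if seen_other:
                return False  # a setting was changed mid-script
        else:
            seen_other = True
    return True


# Settings first, then the speech steps (the recommended layout).
grouped = [
    Step("change_setting", "tts_volume", 80),
    Step("change_setting", "stt_timeout", 5000),
    Step("speak", value="Scan the next item"),
    Step("listen"),
]

# A settings change placed after a speech step.
scattered = [
    Step("speak", value="Scan the next item"),
    Step("change_setting", "tts_volume", 80),
    Step("listen"),
]

print(settings_grouped_at_start(grouped))    # True
print(settings_grouped_at_start(scattered))  # False
```

A check of this kind only reports the layout. It does not reorder steps, because moving a settings change could alter what a script does.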

Text-to-Speech Settings

Setting

Description

tts_calibrate

Opens the speaker volume calibration wizard.

tts_external_speaker_setting

Speaker setting for use on Motorola/Symbol mobile devices.

tts_frequency

Indicates the sampling frequency.

tts_language_long

Displays the full name of the language currently being used.

tts_language_short

Displays the three-letter abbreviation of the language currently being used.

tts_pitch

Indicates the pitch level of spoken text.

tts_rate

Indicates the speed level of spoken text.

tts_readmode

Indicates how text should be separated.

tts_voice

Indicates the name of the voice that is currently selected.

tts_volume

Indicates the sound level of spoken text.

tts_waitfactor

Indicates the length of the pause between messages.

Speech-to-Text Settings

Setting

Description

stt_accuracy

Controls the trade-off between CPU load, memory requirements, and accuracy.

stt_adjust_gain

Allows the engine to automatically increase and decrease the microphone input volume.

stt_beep_threshold

If the confidence value for a result is below this value, a negative acknowledgement beep is not played.

stt_calibrate

Opens the microphone calibration wizard.

stt_calibration_silence

Sets how long the user is expected to remain silent during a quick microphone calibration.

stt_confidence

Indicates the minimum difference in confidence required between the top two speech-to-text results for the top result to be accepted.

stt_expanded

Use this setting to get the confidence value along with the speech-to-text result.

stt_fx_detect_start

Indicates the action the speech engine should take before attempting to determine what the user is saying.

stt_fx_microphone

Tells the speech engine the distance between the user and the microphone.

stt_fx_min_duration

Indicates the minimum duration (in ms) of speech before speech detection is activated.

stt_fx_sensitivity

Indicates the speech detection sensitivity.

stt_fx_silence

Indicates how many milliseconds of silence signal that the user is done speaking.

stt_fx_threshold

Indicates the amount of energy the microphone input must have before the speech detection is activated.

stt_idle_timeout

Indicates how long (in milliseconds) the engine continues collecting results after the last result or timeout.

stt_language_long

Displays the full name of the language currently being used.

stt_language_short

Displays the three-letter abbreviation of the language currently being used.

stt_logging

Creates a Speech-to-Text log file in the root folder of the device.

stt_logging_audio

Sets the engine to log speech-to-text attempts as .wav files.

stt_logging_engine

If set to 1, the speech-to-text engine will create a log file in the root folder of the device.

stt_pool_size

Sets the number of terms the engine will examine closely for the best match.

stt_preserve

Causes the speech engine to save the current engine state for use later.

stt_priority

Determines how aggressively the microphone input is collected and speech analysis is performed.

stt_processing

Indicates the action the speech engine should take when returning a grammar result.

stt_reset

Modifies engine adaptation speed and/or saved engine information.

stt_reset_session_delay

Indicates how long (in milliseconds) the speech engine waits for a valid response before reverting to the last saved state.

stt_result_sound

Causes a sound to play for result recognition.

stt_save_increase

Increases the threshold for saving a new engine state as time progresses.

stt_save_session_delay

Indicates how long (in milliseconds) the speech engine waits before it next saves the current state.

stt_save_threshold

Directs the speech engine to save the state if the result confidence is greater than the combined value of stt_threshold and stt_save_threshold.

stt_server_timeout

Sets how long (in seconds) the Client waits for a response from the Avalanche server when uploading or downloading user training data.

stt_size

Displays the size of the speech-to-text engine being used.

stt_special_sounds

Indicates how the speech engine should interpret special sounds.

stt_threshold

Indicates the minimum amount of confidence for the most-likely result that will be accepted.

stt_timeout

Indicates how long (in milliseconds) the system waits before responding to the speaker.

stt_use_jumpback

Sets a buffer to check if the engine is processing speech.

stt_use_word_ids

Enables support for Word IDs (the !id directive) in grammar files.

stt_volume

Indicates the current volume of the microphone input.
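Read together, the descriptions of stt_threshold, stt_confidence, and stt_save_threshold suggest a decision rule for each recognition attempt. The Python sketch below is one possible reading of those descriptions, not the speech engine's actual implementation. The function names, the sample confidence values, the interpretation of "combined" as a sum, and the handling of values that fall exactly on a boundary are all assumptions.

```python
def accept_top_result(confidences, stt_threshold, stt_confidence):
    """Decide whether the most likely result would be accepted.

    confidences: confidence values of the candidate results, in any order.
    stt_threshold: minimum confidence for the top result.
    stt_confidence: minimum gap between the top two results.
    """
    ranked = sorted(confidences, reverse=True)
    if not ranked:
        return False  # nothing was recognized
    top = ranked[0]
    if top < stt_threshold:
        return False  # top result is not confident enough
    if len(ranked) > 1 and top - ranked[1] < stt_confidence:
        return False  # runner-up is too close to the top result
    return True


def should_save_state(top_confidence, stt_threshold, stt_save_threshold):
    """Assumed reading: save when confidence exceeds the two values added together."""
    return top_confidence > stt_threshold + stt_save_threshold


# Sample values are placeholders, not documented defaults.
print(accept_top_result([72, 40], stt_threshold=50, stt_confidence=20))  # True
print(accept_top_result([72, 65], stt_threshold=50, stt_confidence=20))  # False
print(accept_top_result([45, 10], stt_threshold=50, stt_confidence=20))  # False
print(should_save_state(72, stt_threshold=50, stt_save_threshold=15))    # True
print(should_save_state(60, stt_threshold=50, stt_save_threshold=15))    # False
```

In this reading, raising stt_confidence rejects more results whose runner-up is close to the top result, while raising stt_save_threshold makes the engine save its state less often.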

 

