Options

Use "Basic Channels" property sheet to configure the usage of multi-sensor setup with your sound card(s). Pressing Setup... button for each channel, you can select the channel type (close microphone, far microphone, simple photo sensor, complex photo sensor (reserved for future usage) and breath sensor.

You can also select the spectrum type for each channel and change the recording volume. We recommend to use linear prediction spectrum for microphones and energy for other channels.

Use "Additional Channels" property sheet to add special user-defined spectral channels. Pressing the Add button You can add another channel to the list f additional channels. Remove button removes one of the selected channels (if it does not have child channels). Remove All button rmoves all channels from additional channels list. Properties button opens the Channel Properties dialog box where You can change different parameters of selected channel. Finally, the Add to Result button allows You to add this channels to the Resulting spectrum channel components list.

Resulting spectrum channel components list is located below the Additional channels list. You can modify it by adding or removing channels. You can add wheather basic channels or Additional channels to the Resulting spectrum channel components list.

When You press Add buton to add additional channels, the Aadd channel dialog box appears. To add new channel, You have to select one of the spectral operations from the Operations list, modify or set the operations parameters and select one (for unary operations) or two (for binary operations) parent spectral channels.

Operation name	Operation type	Short description	Parameters
log10	unary	log10() of parent channel	not used
Absolute value	unary	Absolute value of parent channel	not used
Change sign	unary	parent = - child	not used
Delta-derivative	unary	delta-derivative of parent channel	derivative frame size / 2
Mel-frequency scaling	unary	Mel-frequency scaled parent channel	minimal cut-off frequency (Hz), maximal cut-off frequency (Hz) total number of triangular filters, number of linerly-scaled filters, frequency of last linear filter (Hz)
Time-axis smoothing	unary	Time-axis smoothing of parent channel	Type of smoothing (rectangular, triangular or exponential) and smoothing frame size
Freequency-axis smoothing	unary	Freequency-axis smoothing of parent channel	Type of smoothing (rectangular, triangular or exponential) and smoothing frame size
Maximal value of all parameters	unary	Maximal value of all parameters in parent channel	not used
Norm of parameters vector	unary	Norm of vector of parameters of parent channel	type of norm (L1, L2 etc.)
Normalization	unary	child = (parent - offset) / factor	offset, factor
Normalizationusing Mean and Standart Deviation values	unary	Normalization of parent channel using Mean and Standart Deviation values: child = (parent - Mean) / stdDev	not used
Threshold bounding	unary	Threshold bounding of parent channel	threshold type (upper or lower), threshold itself
Parameter with maximal \|derivative\|	unary	Parameter of parent channel with maximal \|derivative\|	frame size / 2 for delta-derivative calculation, derivative threshold
Speech detection	unary	Speech detection in parent channel	See Speech detection
Subset of parameters	unary	Subset of parameters in parent channel parameters vector	Indeces of starting and of ending parameters
Parameter's reliability	binary	Calculates reliability vector of vector of parent spectrum 1 using parent spectrum 2 as speech detector	Noise update time in milliseconds
Signal-to-Noise relation	binary	Calculates signal-to-noise relation in parent spectrum 1 using parent spectrum 2 as speech detector	Noise update time in milliseconds
Subtraction	binary	Subtract parent channel1 and parent channel2	not used
Multiplication	binary	Multiply parent channel1 and parent channel2	not used
Summ	binary	Add parent channel1 and parent channel2	not used
Subtraction	binary	Subtract parent channel1 and parent channel2	not used
Binary maximum	binary	Take maximal value of parent channel1 and parent channel2	not used
Binary minimum	binary	Take minimal value of parent channel1 and parent channel2	not used
Concatenate	binary	Concatenate parent channel1 and parent channel2	not used
Reliable channel	binary	Make child channel reiable by adding parent channel1 to reliablility vectors stored in parent channel 2	not used

You can change different signal processing options on this property sheet, such as Step of spectral analysis, Analysis frame size, number of linear prediction coeffitions etc. and set default values for parameters of different operations on spectral channels.

Spech detection window contains all the speech detection channels created on Additional channels property sheet. You can urn any channel on and off independantly. Each speech detector produces "speech detected" events, that are processed in speech detection processor, the result of which is the event of the same class. In other words, speech is considered to be detected on some time interval if and only if all the speech detectors turned on generate the events meaning that speech is detected by these detectors on the given time interval.

To setup each speech detector. The record speech sample dialog appears. You have to follow these steps to setup the detector:

1. Select of one of basic or additional channels ian the auxilliary spectral channels combo box. You will use it to determine wher the speech is.

2. Record a speech sample by pressing the corresponding button and saying one (and ONLY one) command of the dictionary.

3. Using the left mouse button, mark the speech frames in the recorded speech (white columns in the lower spectral window, corresponding to the parent channel of this speech detector). Using right mouse button you can un-mark some frames.

5. Move speech boundaries markers on the top of this window to the real bounaries of speech.

7. Press Train parameters button. Change the speech detection threshold to 0 (its training does not work so far). Change any other parameters You like.

8. Now You are ready to check the work of speech detector. Go to teh bullet 2. and record the speech sample again.

9. Now press the Mark speech segments button. Segments with speech will be automatically marked with red and speech boundarieswill be set to appropriate values automatoically. Yoy can continue the training process from the bullet 4 or close this dialog.

The general options property sheet allows You to change the following engine parameters: the number of repetitions during training, the recognition options (what distance metrics to use - HMM or DTW, number of HMM iterations and stop HMM evaluation delta) and allow You to turn on and off the play sounds option