1 Engine structures
1.1 Audio
HTS Audio Audio output wrapper.
'
size t sampling frequency size t
short
size t
max bu size
*bu
bu size
$
sampling frequency
1.2 Model
HTS Window Window coecients to calculate dynamic features.
'
size t size
- # of windows (static + deltas)
- left width of windows
int *l width
int *r width
- right width of windows
double
size t
&
**coecient
max width
window coecients
maximum width of windows
*next
%
*no
*next
*quest
size t
state
ntree
*npdf
# of trees
# of PDFs at each tree
***pdf
*tree
*question
PDFs
pointer to the list of trees
pointer to the list of questions
sampling frequency
frame period
num voices
sampling frequency
frame period
# of HTS voices
size t
size t
num states
num streams
# of HMM states
# of streams
*stream type
*fullcontext format
*fullcontext version
stream type
fullcontext label format
version of fullcontext label
HTS Question
char
*gv o context
**option
GV switch
options for each stream
HTS Model
HTS Window
HTS Model
*duration
*window
**stream
**gv
char
char
char
HTS Model
&
%
$
%
$
1.3 Label
HTS LabelString Individual label string with time information.
HTS LabelString *next
- pointer to the next label string
char *name - label string
double
double
start
end
size t
size
# of label strings
$
vector length (static features only)
mean vector sequence
variance vector sequence
double
size t
*msd
win size
int
int
double
*win l width
*win r width
**win coecient
size t
double
*gv vari
*gv switch
variance vector of GV
GV flag sequence
double
HTS Boolean
&
&
size t
size t
nstream
nstate
# of streams
# of states
size t
size t
size t
*duration
total state
total frame
duration sequence
total state
total frame
%
$
'
double
$
**mean
double
double
**ivar
*g
double
double
&
**wuw
*wum
W U 1 W
W U 1 m
$
vector length (static features only)
stream length
width
**par
sm
win size
*win l width
*win r width
**win coecient
*msd flag
*gv mean
*gv vari
mean vector of GV
variance vector of GV
*gv switch
gv length
GV flag sequence
frame length for GV calculation
size t
total frame
total frame
%
double
**par
generated parameter
double
total frame
nstream
*gstream
total frame
# of streams
generated parameter streams
*gspeech
generated speech
1.7 Engine
HTS Condition Synthesis condition.
'
size t sampling frequency
size t fperiod
$
-
sampling frequency
frame period
audio bu size
stop
volume
*msd threshold
*gv weight
MSD thresholds
GV weights
HTS Boolean
double
size t
HTS Boolean
double
double
double
double
beta
additional half tone
*duration iw
postfiltering coecient
additional half tone
weights for duration interpolation
double
double
**parameter iw
**gv iw
size t
HTS Boolean
double
double
double
&
%
$
synthesis condition
audio output
ms
label
sss
pss
gss
2 Engine functions
2.1 Initialize engine
2.1.1 HTS Engine initialize
Type
Use
void
Initialize engine.
Arguments
Attention!!
HTS Boolean
Use
Arguments
Load duration PDFs and trees from files using given file names.
- pointer to HTS Engine structure
HTS Engine *engine
char **voices
- HTS voice file names
Attention!!
void
set sampling frequency.
Arguments
HTS Engine
size t
*engine
i
size t
Use
Arguments
void
Use
Arguments
size t
Use
Arguments
void
Use
Arguments
Attention!!
size t
Use
Arguments
Attention!!
void
set stop flag.
Arguments
Attention!!
HTS Boolean
get stop flag.
Arguments
Attention!!
void
Use
Arguments
Attention!!
double
Use
Arguments
void
set MSD threshold.
Arguments
HTS Engine
size t
double
*engine
stream index
threshold
index of streams
double
get MSD threshold.
HTS Engine *engine
size t
stream index
void
set GV weight.
Arguments
HTS Engine
size t
double
Attention!!
*engine
stream index
f
double
get GV weight.
Arguments
HTS Engine
size t
*engine
stream index
void
Use
Arguments
Attention!!
void
set flag to use phoneme alignment in label.
Arguments
HTS Engine
HTS Boolean
Attention!!
*engine
b
void
set frequency warping parameter alpha.
Arguments
HTS Engine
double
*engine
f
double
get frequency warping parameter alpha.
HTS Engine *engine - pointer to HTS Engine structure
void
set postfiltering coecient parameter beta.
Arguments
Attention!!
double
get postfiltering coecient parameter beta.
HTS Engine *engine - pointer to HTS Engine structure
Attention!!
void
Use
Arguments
void
Use
Arguments
voice index
f
double
get weight for duration interpolation.
Arguments
HTS Engine
size t
*engine
voice index
void
Use
Arguments
stream index
f
index of streams
interpolation weight
double
Use
Arguments
voice index
stream index
10
void
Use
Arguments
stream index
f
index of streams
interpolation weight
double
Use
Arguments
stream index
index of streams
size t
get total # of state.
Arguments
HTS Engine
*engine
void
set mean value of state.
HTS Engine *engine
size t
size t
stream index
state index
index of streams
index of states
size t
double
vector index
f
index of vector
mean value
index of states
index of vector
double
Use
Arguments
state index
vector index
11
size t
Use
Arguments
size t
get # of HTS voices.
HTS Engine *engine
size t
get # of stream.
HTS Engine *engine
size t
get # of state.
Arguments
HTS Engine
*engine
const char *
get fullcontext label format defined in HTS voice.
Arguments
HTS Engine
*engine
const char *
get fullcontext label version defined in HTS voice.
Arguments
HTS Engine
*engine
size t
get total # of frame.
Arguments
HTS Engine
*engine
12
size t
Use
Arguments
get # of samples.
HTS Engine *engine
size t
Use
Arguments
frame index
vector index
index of frames
index of vector
size t
get generated speech.
HTS Engine *engine
size t
index
index of samples
HTS Boolean
Use
Arguments
HTS Boolean
synthesize speech from string list.
Arguments
HTS Engine
char
size t
*engine
**lines
num lines
HTS Boolean
generate state sequence from file name (1/3 synthesis step)
HTS Engine *engine - pointer to HTS Engine structure
char
*fn
13
HTS Boolean
Use
Arguments
num lines
# of lines
HTS Boolean
generate parameter sequence (2/3 synthesis step)
Arguments
HTS Engine
*engine
HTS Boolean
Use
Arguments
void
Use
Arguments
*fp
Attention!!
void
Use
Arguments
*fp
Attention!!
void
Use
Arguments
Attention!!
14
void
Use
Arguments
Attention!!
void
output ri format file.
Arguments
HTS Engine
FILE
*engine
*fp
Attention!!
void
Use
Arguments
free label, state streams, PDF streams and generated parameter streams per one time synthesis
HTS Engine *engine - pointer to HTS Engine structure
Attention!!
void
free engine.
HTS Engine
*engine
Attention!!
15