I am new to using sidekit for speaker recognition, I encountered a problem where I need to use "feature_list".feature_list=ubm_list but what's in feature list?It says it contains list of feature files to train a GMM with. But what is suppose to be in the feature files.
ubm=sk.Mixture()
ubm_list="/home/david/Documents/development_set/anthonyschaller-20071221-/list"
ubm.EM_split(features_server=server,feature_list=ubm_list,
distrib_nb=1024,iterations=(1,2,2,4,4,4,4,8,8,8,8,8,8),
num_thread=10,llk_gain=0.01,save_partial=False,ceil_cov=10,
floor_cov=1e-2)
Yeah, you are right..
ubm_list
is a list of features files which probably have.h5
extension. So, yourubm_list
should be:And according to your second question, the
feat_dir
should contain the features files of HDF5-format (files that have .h5 extension). YOu can open one of these files usingh5py
module to explore. I did that with one of mine, and here is what I found out:The following image is from the the documentation of a former-version where you can find all the information I mentioned above with minor changes: