speechtotext.model.amazonWrapper.AmazonAPIWrapper
- class AmazonAPIWrapper(model_version)
Bases: ModelWrapper
Wrapper for the Amazon API. AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, AMAZON_REGION and AMAZON_BUCKET must be defined in the '.env' file in the current directory.
Wrapper for AMAZON model.
- Parameters:
model_version (AmazonAPIVersion) – Model version of AMAZON STT API to use.
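The four variables listed in the class description can be checked before constructing the wrapper. The following is a minimal sketch, not the package's own loading code: `read_env_file` and `missing_keys` are hypothetical helpers, and the parser assumes plain `KEY=VALUE` lines.

```python
from pathlib import Path

# Variable names taken from the class description; everything else is illustrative.
REQUIRED_KEYS = (
    "AWS_ACCESS_KEY_ID",
    "AWS_SECRET_ACCESS_KEY",
    "AMAZON_REGION",
    "AMAZON_BUCKET",
)


def read_env_file(path=".env"):
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for raw_line in Path(path).read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip("'\"")
    return values


def missing_keys(path=".env"):
    """Return the required keys that are absent or empty in the file."""
    values = read_env_file(path)
    return [key for key in REQUIRED_KEYS if not values.get(key)]
```

An empty list from `missing_keys()` means all four settings are present in the file. It does not verify that the credentials are valid.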
Methods
benchmark_n_samples – Benchmark n samples with model.
benchmark_sample – Benchmark sample with model.
benchmark_samples – Benchmark samples with model.
convert_sample – Convert sample to correct format.
Get model.
get_transcript_of_file
Attributes
Boolean that represents if the bucket exists.
LANGUAGE_CODE – Code for the language to transcribe.
PATH_OF_TEMP_CONVERTED_AUDIO_FILE – Path to the temporary file created when converting audio files to an accepted audio format.
- LANGUAGE_CODE: str = 'nl-NL'
Code for the language to transcribe.
See supported languages for Amazon.
- Type:
str
- PATH_OF_TEMP_CONVERTED_AUDIO_FILE: str = 'converted_audio_file.wav'
Path to the temporary file created when converting audio files to an accepted audio format.
- Type:
str
- _append_error(samples, audio_id, error)
Append error to model_errors.
- Parameters:
samples (SampleDataset) – Dataset of audio.
audio_id (str) – Id of failed sample.
error (str) – Error message.
- _benchmark_sample_with_time(dataset, audio_id, with_cleaning=True)
Benchmark sample for model with timer.
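How the timer is implemented is not documented here. As a rough sketch of the general idea, a transcription call can be wrapped so that its wall-clock duration is returned with the result. `timed_call` is a hypothetical helper, not part of this package.

```python
import time


def timed_call(func, *args, **kwargs):
    """Run func(*args, **kwargs) and return (result, elapsed_seconds)."""
    start = time.perf_counter()  # monotonic clock suited to measuring intervals
    result = func(*args, **kwargs)
    elapsed = time.perf_counter() - start
    return result, elapsed
```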
- _get_transcribe_file_location(file_uri, transcribe_client, job_name='Transcribe')
Transcribe and return result location.
- Raises:
AmazonNoTranscriptReturned – Exception when API does not return a transcript.
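The method body is not shown on this page. The sketch below illustrates one way a "transcribe and return the result location" step could look, assuming `transcribe_client` is a boto3 Amazon Transcribe client. The function name, the polling parameters, the `wav` media format, and the local `AmazonNoTranscriptReturned` class are assumptions made for illustration.

```python
import time


class AmazonNoTranscriptReturned(Exception):
    """Local stand-in for the exception named in the documentation."""


def get_transcript_location(file_uri, transcribe_client, job_name="Transcribe",
                            language_code="nl-NL", poll_seconds=5, max_polls=120):
    """Start a transcription job and poll until a transcript URI is available."""
    transcribe_client.start_transcription_job(
        TranscriptionJobName=job_name,
        Media={"MediaFileUri": file_uri},
        MediaFormat="wav",  # assumption: audio was converted to WAV beforehand
        LanguageCode=language_code,
    )
    for _ in range(max_polls):
        response = transcribe_client.get_transcription_job(
            TranscriptionJobName=job_name
        )
        job = response["TranscriptionJob"]
        status = job["TranscriptionJobStatus"]
        if status == "COMPLETED":
            return job["Transcript"]["TranscriptFileUri"]
        if status == "FAILED":
            raise AmazonNoTranscriptReturned(job.get("FailureReason", "job failed"))
        time.sleep(poll_seconds)  # job is still queued or in progress
    raise AmazonNoTranscriptReturned("timed out waiting for the transcription job")
```

Passing the client in as an argument keeps the function easy to exercise with a stub in tests. Note that Amazon Transcribe requires job names to be unique within an account, so a fixed default such as `'Transcribe'` would need the previous job to be deleted or a different name on each call.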
- benchmark_n_samples(dataset, number_of_samples, with_cleaning=True)
Benchmark n samples with model.
- benchmark_sample(dataset, audio_id, with_cleaning=True)
Benchmark sample with model.
- benchmark_samples(samples, with_cleaning=True)
Benchmark samples with model.
- convert_sample(path_to_sample)
Convert sample to correct format.
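The conversion backend is not documented on this page. As a hedged illustration only, the sketch below converts an audio file to WAV by calling the `ffmpeg` command-line tool, which must be installed separately. The mono, 16 kHz target and the helper `build_convert_command` are assumptions. Only the default output path comes from the attribute documented above.

```python
import subprocess

# Default taken from the documented class attribute.
PATH_OF_TEMP_CONVERTED_AUDIO_FILE = "converted_audio_file.wav"


def build_convert_command(path_to_sample,
                          output_path=PATH_OF_TEMP_CONVERTED_AUDIO_FILE):
    """Build an ffmpeg command that writes a mono 16 kHz WAV file."""
    return [
        "ffmpeg",
        "-y",                       # overwrite the temp file if it exists
        "-i", str(path_to_sample),  # input audio in any format ffmpeg can read
        "-ac", "1",                 # downmix to a single channel (assumption)
        "-ar", "16000",             # resample to 16 kHz (assumption)
        str(output_path),
    ]


def convert_sample(path_to_sample,
                   output_path=PATH_OF_TEMP_CONVERTED_AUDIO_FILE):
    """Run the conversion and return the path of the converted file."""
    subprocess.run(build_convert_command(path_to_sample, output_path), check=True)
    return output_path
```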