[Bug]: Speech Recognition result cannot always be converted to Pronunciation Assessment Result #770

coreyward · 2023-12-08T16:54:04Z

What happened?

I ran into an issue where an audio recording sent didn't have any detectable speech in it. I would expect some kind of error message to come back, but what I did not expect is that the SDK would throw an error over it when used as shown in examples.

Cannot read properties of undefined (reading '0')

This is coming from line 108 here:

cognitive-services-speech-sdk-js/src/sdk/PronunciationAssessmentResult.ts

Lines 103 to 110 in b062648

    
           export class PronunciationAssessmentResult { 
        
               private privPronJson: DetailResult; 
        
               private constructor(jsonString: string) { 
        
                   const j = JSON.parse(jsonString) as AssessmentResult; 
        
                   Contracts.throwIfNullOrUndefined(j.NBest[0], "NBest"); 
        
                   this.privPronJson = j.NBest[0]; 
        
               }

It seems like the types are inaccurate for j.NBest. When the Speech Recognizer is handed an audio file that does not have any words in it, the result comes back without any errors, but also rather sparse without many fields including NBest. For example:

SpeechRecognitionResult {
      privResultId: '75EDA7E5421A411F81E0D64B9504D75C',
      privReason: 0,
      privText: undefined,
      privDuration: 49200000,
      privOffset: 0,
      privLanguage: undefined,
      privLanguageDetectionConfidence: undefined,
      privErrorDetails: undefined,
      privJson: '{"Id":"0ed4402056c8472a99bc19fff317a024","RecognitionStatus":2,"Offset":0,"Duration":49200000,"Channel":0,"SNR":0}',
      privProperties: PropertyCollection {
        privKeys: [ 'SpeechServiceResponse_JsonResult' ],
        privValues: [
          '{"Id":"0ed4402056c8472a99bc19fff317a024","RecognitionStatus":"InitialSilenceTimeout","Offset":0,"Duration":49200000,"Channel":0,"SNR":0.0}'
        ]
      },
      privSpeakerId: undefined
    }

It would be really useful if this behavior (and overall, all of the potential response formats from the API) were documented somewhere accurately. It's hard to build robust applications that fail gracefully when the behavior of dependencies is undocumented.

Using v1.33.1, but the "Version" dropdown in the issue form only lists up to 1.33.0.

Version

1.33.0 (Latest)

What browser/platform are you seeing the problem on?

No response

Relevant log output

No response

The text was updated successfully, but these errors were encountered:

glharper · 2024-01-05T14:04:25Z

@coreyward Thanks for submitting this issue and using JS Speech SDK. In theory, an InitialSilenceTimeout result should be shunted down a different codepath before attempting to create a PronunciationAssessmentResult, so this does feel like a bug.

coreyward added the bug Something isn't working label Dec 8, 2023

coreyward assigned glharper Dec 8, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug]: Speech Recognition result cannot always be converted to Pronunciation Assessment Result #770

[Bug]: Speech Recognition result cannot always be converted to Pronunciation Assessment Result #770

coreyward commented Dec 8, 2023

glharper commented Jan 5, 2024 •

edited

[Bug]: Speech Recognition result cannot always be converted to Pronunciation Assessment Result #770

[Bug]: Speech Recognition result cannot always be converted to Pronunciation Assessment Result #770

Comments

coreyward commented Dec 8, 2023

What happened?

Version

What browser/platform are you seeing the problem on?

Relevant log output

glharper commented Jan 5, 2024 • edited

glharper commented Jan 5, 2024 •

edited