-
Notifications
You must be signed in to change notification settings - Fork 13
Azure Kinect DK in C# mono instead of 8 channel audio #29
Comments
Thanks for reporting the issue! My initial guess would be that Microsoft.CognitiveServices.Speech.extension.pma.dll, libpma.dll and libunimic_runtime.dll are not present at the correct location. These libraries are required for recording correctly using Azure Kinect DK and do conversation transcription. Can you tell me where are you placing these? How did you get the other libraries apart from these above three libraries? Thanks, |
I am using NuGet package manager to install the Microsoft.CognitiveServices.Speech I can see the following in the bin directory on compile
I found the Microsoft.CognitiveServices.Speech.extension.pma.dll from the java sample but was not able to locate libpma.dll and libunimic_runtime.dll. Where do I acquire these? I placed Microsoft.CognitiveServices.Speech.extension.pma.dll in the bin directory, but nothing changed. |
Sorry, I wrote the wrong names above. I meant pma.dll and unimic_runtime.dll. They are in same location as Microsoft.CognitiveServices.Speech.extension.pma.dll in the Java sample. When you download sdsdk-jre.zip from https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/speech-devices-sdk-windows-quickstart, unzip it. There will be JRE-Sample-Release.zip in the unzipped folder. Unzip it too. Then you will find all three (Microsoft.CognitiveServices.Speech.extension.pma.dll, pma.dll and unimic_runtime.dll) in the unzipped JRE-Sample-Release folder. You have to paste all three of these dlls into the directory where other dlls are present (bin directory). |
Yes that did the trick, I'm now getting conversation transcription working in my project. Is the a specific NuGet package to use to get these dll's added automatically or must I add the references manually to the project so they are copied on compile? |
Unfortunately, we don't have a NuGet package as of release 1.6. We are planning to add it in a future release. For now, you will have to manually copy the .dlls. Sorry about that. |
My only remaining issue is I'm getting a lot of false identifications which I assume is due to poor voice signatures. Is there a place to gain more information on what a quality voice signature should be or how to reduce incorrect identifications? Also I would like to use a recording of a previous conversation because right now I'm driving my co-workers crazy asking them to "hey say something" (open office floor plan) every time I want to test. What the best way to capture an 8 channel audio file from the Azure Kinect DK? |
I don't have answer to your first question. I will ask around internally and will get back to you. You can capture 7-channel audio in Audacity using Azure Kinect DK. In the Device Toolbar, change Audio Host to Windows WASAPI, Recording Device to Azure Kinect Microphone Array and Recording Channels to 7. In the Selection Toolbar, change Project Rate to 16KHz. Capture the audio using this setup. Once you are done with recording, note down the length of the audio you recorded. Now, click on Tracks menu item -> Add New -> Mono Track. Then select the new track and click on Generate menu item -> Silence. Enter the duration equal to your recording duration. Now, you should have 8-channel audio with first 7 channels containing recorded audio and 8th channel containing silence. Thanks, |
This issue is for a: (mark with an
x
)Minimal steps to reproduce
Any log messages given by the failure
Expected/desired behavior
OS and Version?
Versions
Mention any other details that might be useful
Running the java example I experience no issues with transcription and participant identification. When trying to run with C# I don't receive any errors but no transcription happens. Looking at the log files I noticed the difference in output form java to c# was the AudioConfig_NumberOfChannelsForCapture.
The text was updated successfully, but these errors were encountered: