How Audio Files Are Processed
Audio files are processed as follows:- A transcript is produced using an Automatic Speech Recognition (ASR) system
- The resulting ASR transcript is passed through the text-based PII detection engine
- If specified by the user, the audio file undergoes pitch distortion
- Using the ASR timestamps, any sections of the audio file corresponding with PII detections are replaced with a Sine wave or bleep tone
- The resulting de-identified or redacted audio file and transcript are passed back to the user
Parameters
Please see theaudio_options object in the API reference for a full list of the parameters for audio processing.
Supported File Types
| File Type | Extension | Content Type | Added In |
|---|---|---|---|
| wave | .wav | audio/wav | 3.0.0 |
| x-wave | .wav | audio/x-wav | 3.5.0 |
| mp3 | .mp3 | audio/mpeg, audio/mp3 | 3.0.0 |
| mp4 | .mp4 | audio/mp4 | 3.0.0 |
| m4a | .m4a | audio/m4a | 3.5.0 |
| m4a-latm | .m4a-latm | audio/m4a-latm | 3.5.0 |
| webm | .webm | audio/webm | 3.5.0 |
VOX Files
Note that .vox files are not natively supported by Limina, but can be processed by converting the .vox file to a wav or mp3 using a conversion tool like SoX. Because .vox files are headerless, you will need to know the sample rate and encoding to specify. For example, to take a vox file with a sample rate 8000, mono channel, mu-law encoded:SoX Command
Support Matrix
| CPU Container | GPU Container | Community API | Professional API | |
|---|---|---|---|---|
| Supported | Yes | Yes | Up to 10 MiB | No |
Sample Request
Connect with one of our privacy experts to run this code.
Sample Response
Response