Natural Language Processing
API that defines how to turn recorded speech into text
Last updated
API that defines how to turn recorded speech into text
Last updated
This API is in proposal status and is not yet available for general use. If you have questions or suggestions to improve this API please reach out to the Dynepic team.
The Natural Language Processing API allows MOTAR applications to submit recorded speech in an audio file formatted as MP3, MP4, FLAC, or WAV. MOTAR will convert the speech to text. To use this API, start a job by obtaining a speech to text job id and upload URL. After uploading the audio file, pass the job ID into the get text endpoint to obtain the transcription.
GET
https://api.motar.io/nlp/v1/start-speech-to-text
This API takes a multipart/form-data upload of the audio file and returns a UUID of the transcription job. You can use this ID to retrieve the transcript through the endpoint below.
Name | Type | Description |
---|---|---|
Once the client receives the pre-signed URL, perform a multipart/form-data POST of the audio file to the URL. Then call the get-transcription endpoint to obtain the status and results of the transcription.
GET
https://api.motar.io/nlp/v1/get-text
If the transcription is not finished, the API returns a status of "IN_PROCESS". When completed, the API returns the transcription and metadata about it.
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Authorizaton*
String
Bearer Token
speech_to_text_job_id
String
UUID of the transcription job
Authorization*
String
Bearer Token