ai_sadtalker
An authenticated client can post a request to sadtalker
This AI produces an animated video from a single uploaded image and a sound file.
This AI can run alongside a Daemon, on GPU only.
GET
https://opensourceais.com/api/v1/public/config/sadtalker
POST
https://opensourceais.com/api/v1/private/client/ai/sadtalker
Name | Type | Description |
---|---|---|
url_audio | String | A well-formed URI starting with https://... and pointing to a .WAV file. This file is limited to 2Mb. Upload at most 15 seconds of sample voice. |
url_upload* | String | A well-formed URI starting with https://... and pointing to a .PNG or .JPG file. This file is limited to 2Mb. |
still_image | Boolean | Set to true to limit movement in the video; this is useful when the video shows a full body. Default = false |
blink | Boolean | Set to false if you do not want eye blinking animation. Default = true |
scale_image | Boolean | Set to true to use a GFPGAN-style AI to improve the quality of the images generated to make the video. Default = false |
preprocess | String | Takes one of these values: "crop", "resize", "full", "extcrop", or "extfull". Default = "crop" |
res | Number | The video output resolution, either 256 (for 256x256px), or 512 (for 512x512px). Default = 256 |
pose | Number | A "pose" indicator, between 0 and 45 |
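
The parameters above can be checked on the client side before posting. The following is a minimal sketch, not an official client: the helper names `build_sadtalker_payload` and `build_request` are hypothetical, and it assumes the endpoint accepts a JSON body and a Bearer token in the `Authorization` header, neither of which is confirmed by this page. It also assumes the file extension appears at the end of the URL path.

```python
import json
import urllib.request
from urllib.parse import urlparse

SADTALKER_URL = "https://opensourceais.com/api/v1/private/client/ai/sadtalker"
PREPROCESS_MODES = {"crop", "resize", "full", "extcrop", "extfull"}


def _check_https_file(name, url, extensions):
    """Client-side sanity check: https scheme and an expected file extension."""
    parsed = urlparse(url)
    if parsed.scheme != "https":
        raise ValueError(f"{name} must start with https://")
    if not parsed.path.lower().endswith(extensions):
        raise ValueError(f"{name} must point to one of {extensions}")


def build_sadtalker_payload(url_upload, url_audio=None, still_image=False,
                            blink=True, scale_image=False, preprocess="crop",
                            res=256, pose=None):
    """Validate the values against the parameter table and return a dict."""
    _check_https_file("url_upload", url_upload, (".png", ".jpg"))
    if url_audio is not None:
        _check_https_file("url_audio", url_audio, (".wav",))
    if preprocess not in PREPROCESS_MODES:
        raise ValueError(f"preprocess must be one of {sorted(PREPROCESS_MODES)}")
    if res not in (256, 512):
        raise ValueError("res must be 256 or 512")
    if pose is not None and not 0 <= pose <= 45:
        raise ValueError("pose must be between 0 and 45")

    payload = {
        "url_upload": url_upload,
        "still_image": still_image,
        "blink": blink,
        "scale_image": scale_image,
        "preprocess": preprocess,
        "res": res,
    }
    # Optional fields are only sent when the caller provides them.
    if url_audio is not None:
        payload["url_audio"] = url_audio
    if pose is not None:
        payload["pose"] = pose
    return payload


def build_request(payload, token):
    """Prepare (but do not send) the POST request.

    Assumption: JSON body and Bearer token authentication.
    """
    return urllib.request.Request(
        SADTALKER_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )
```

To send the request, pass the result of `build_request(...)` to `urllib.request.urlopen`. The size limit (2Mb) and the 15-second audio limit are not checked here, since they would require downloading the files.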