ai_sadtalker
An authenticated client can post a request to sadtalker
This AI can produce an animated video from a single uploaded image and a sound file.
This AI runs alongside a Daemon, on GPU only.
GET
https://opensourceais.com/api/v1/public/config/sadtalker
POST
https://opensourceais.com/api/v1/private/client/ai/sadtalker
url_audio
String
A well-formed URI starting with https://... and pointing to a .WAV file. Note that this file is limited to 2Mb. You should only upload voice samples of at most 15 seconds.
url_upload*
String
A well-formed URI starting with https://... and pointing to a .PNG or .JPG file. Note that this file is limited to 2Mb.
still_image
Boolean
Set to true to avoid too much movement in the video; this is useful if you have a full-body video. Default = false
blink
Boolean
Set to false if you do not want eye blinking animation. Default = true
scale_image
Boolean
Set to true to use a GFPGAN-style AI to improve the quality of the images generated for the video. Default = false
preprocess
String
Takes one of these values: "crop", "resize", "full", "extcrop", or "extfull". Default = "crop"
res
Number
The video output resolution, either 256 (for 256x256px), or 512 (for 512x512px). Default = 256
pose
Number
A "pose" indicator, between 0 and 45
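The parameters above could be assembled and checked on the client side before posting. The following is a minimal sketch, not the official client. The helper names (build_sadtalker_payload, build_request) are hypothetical, the JSON request body is an assumption, and the authentication scheme is not described on this page, so the caller passes its own headers.

```python
import json
import urllib.request

SADTALKER_URL = "https://opensourceais.com/api/v1/private/client/ai/sadtalker"
PREPROCESS_VALUES = {"crop", "resize", "full", "extcrop", "extfull"}


def build_sadtalker_payload(url_upload, url_audio=None, still_image=False,
                            blink=True, scale_image=False, preprocess="crop",
                            res=256, pose=None):
    """Validate the documented parameters and return them as a dict.

    Hypothetical helper: the defaults mirror the ones listed above.
    No default is documented for pose, so it is only sent when given.
    """
    if not url_upload.startswith("https://"):
        raise ValueError("url_upload must start with https://")
    if url_audio is not None and not url_audio.startswith("https://"):
        raise ValueError("url_audio must start with https://")
    if preprocess not in PREPROCESS_VALUES:
        raise ValueError("preprocess must be one of %s" % sorted(PREPROCESS_VALUES))
    if res not in (256, 512):
        raise ValueError("res must be 256 or 512")
    if pose is not None and not 0 <= pose <= 45:
        raise ValueError("pose must be between 0 and 45")

    payload = {
        "url_upload": url_upload,
        "still_image": still_image,
        "blink": blink,
        "scale_image": scale_image,
        "preprocess": preprocess,
        "res": res,
    }
    if url_audio is not None:
        payload["url_audio"] = url_audio
    if pose is not None:
        payload["pose"] = pose
    return payload


def build_request(payload, auth_headers):
    """Prepare (but do not send) the POST request.

    Assumption: the endpoint accepts a JSON body. auth_headers is whatever
    your authenticated client requires; it is not specified on this page.
    """
    headers = {"Content-Type": "application/json"}
    headers.update(auth_headers)
    return urllib.request.Request(
        SADTALKER_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers=headers,
        method="POST",
    )
```

A prepared request can then be sent with urllib.request.urlopen(request). The response format is not documented in this section, so the sketch stops before parsing it.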