ai_sadtalker

An authenticated client can post a request to sadtalker.

This AI produces an animated video from a single uploaded image and a sound file.

SadTalker GitHub page

Get the JSON config file for this AI

GET https://opensourceais.com/api/v1/public/config/sadtalker
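
As a minimal sketch, the config endpoint above could be read with Python's standard library as shown here. The helper name `fetch_config`, the `Accept` header, and the timeout value are illustrative assumptions; the structure of the returned JSON is not documented on this page, so the code only decodes it.

```python
import json
import urllib.request

# Public config endpoint, as listed in this page.
CONFIG_URL = "https://opensourceais.com/api/v1/public/config/sadtalker"


def fetch_config(url=CONFIG_URL, timeout=10.0):
    """Download a JSON document from `url` and return it as Python data.

    Hypothetical helper: the Accept header and timeout are assumptions,
    not requirements stated by the API.
    """
    request = urllib.request.Request(
        url, headers={"Accept": "application/json"}, method="GET"
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_config()` with no arguments performs the GET request against the public endpoint and returns the decoded config.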

Post a request to AI sadtalker via OpenSourceAIs

POST https://opensourceais.com/api/v1/private/client/ai/sadtalker

Request Body

| Name | Type | Description |
| --- | --- | --- |
| url_audio | String | A well-formed URI starting with https://... that points to a .WAV file. The file is limited to 2 MB and should contain at most 15 seconds of sample voice. |
| url_upload* | String | A well-formed URI starting with https://... that points to a .PNG or .JPG file. The file is limited to 2 MB. |
| still_image | Boolean | Set to true to avoid too much movement in the video; this is useful for a full-body video. Default = false |
| blink | Boolean | Set to false if you do not want eye-blinking animation. Default = true |
| scale_image | Boolean | Set to true to use a GFPGAN-style AI to improve the quality of the images generated for the video. Default = false |
| preprocess | String | Takes one of these values: "crop", "resize", "full", "extcrop", or "extfull". Default = "crop" |
| res | Number | The video output resolution, either 256 (for 256x256 px) or 512 (for 512x512 px). Default = 256 |
| pose | Number | A "pose" indicator, between 0 and 45 |
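
The request body described above can be assembled and checked client-side before it is posted. The following is a sketch under several assumptions that this page does not confirm: that the body is sent as JSON, that authentication uses an `Authorization: Bearer` header, and that `url_audio` and `pose` may be omitted. The helper names `build_body` and `build_request` are hypothetical.

```python
import json
import urllib.request

# Private endpoint, as listed in this page.
POST_URL = "https://opensourceais.com/api/v1/private/client/ai/sadtalker"
PREPROCESS_VALUES = {"crop", "resize", "full", "extcrop", "extfull"}


def build_body(url_upload, *, url_audio=None, still_image=False, blink=True,
               scale_image=False, preprocess="crop", res=256, pose=None):
    """Return a dict of request parameters, using the documented defaults."""
    if not url_upload.startswith("https://"):
        raise ValueError("url_upload must start with https://")
    if url_audio is not None and not url_audio.startswith("https://"):
        raise ValueError("url_audio must start with https://")
    if preprocess not in PREPROCESS_VALUES:
        raise ValueError("preprocess must be one of %s" % sorted(PREPROCESS_VALUES))
    if res not in (256, 512):
        raise ValueError("res must be 256 or 512")
    if pose is not None and not 0 <= pose <= 45:
        raise ValueError("pose must be between 0 and 45")

    body = {
        "url_upload": url_upload,
        "still_image": still_image,
        "blink": blink,
        "scale_image": scale_image,
        "preprocess": preprocess,
        "res": res,
    }
    # Optional fields are only sent when provided (assumption).
    if url_audio is not None:
        body["url_audio"] = url_audio
    if pose is not None:
        body["pose"] = pose
    return body


def build_request(body, token):
    """Wrap the body in a POST request object without sending it.

    ASSUMPTION: JSON encoding and the Bearer token header are illustrative;
    the actual authentication scheme is not described in this page.
    """
    return urllib.request.Request(
        POST_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": "Bearer " + token,
        },
        method="POST",
    )
```

The resulting object can be passed to `urllib.request.urlopen` to send it. Validating locally surfaces mistakes such as an unsupported `res` value before a request is made.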
