Lip-syncing videos using the pre-trained models (Inference)

Run inference by passing the checkpoint, the face video and the audio source to inference.py:

python inference.py --checkpoint_path <ckpt> --face <video.mp4> --audio <an-audio-source>

The result is saved (by default) in results/result_voice.mp4. You can specify the output path as an argument, as with several other available options. The audio source can be any file supported by FFMPEG that contains audio data: *.wav, *.mp3 or even a video file, from which the code will automatically extract the audio.

The available pre-trained weights include a model with slightly inferior lip-sync but better visual quality, and the weights of the visual discriminator trained in a GAN setup.

The pre-trained face detection model should be downloaded to face_detection/detection/sfd/s3fd.pth. An alternative link is available if the primary download does not work.

Instructions for using a Docker image are also provided. Have a look at the linked comment, and comment on the gist if you encounter any issues.
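The inference call described above can be wrapped in a small Python script that checks the required files before starting a long run. This is only a sketch: the helper names build_inference_command and missing_inputs are hypothetical and not part of the repository, the double-dash flag spelling is an assumption based on the usual argparse convention, and the file names in the usage note are placeholders.

```python
import sys
from pathlib import Path

# Location of the face detection weights, as stated in the text above
# (relative to the repository root).
FACE_DETECTOR_WEIGHTS = Path("face_detection/detection/sfd/s3fd.pth")


def build_inference_command(checkpoint_path, face, audio, python=sys.executable):
    """Return the argument list for one inference.py run.

    Hypothetical helper: it only assembles the three options named in the
    text and does not start the process itself.
    """
    return [
        python, "inference.py",
        "--checkpoint_path", str(checkpoint_path),
        "--face", str(face),
        "--audio", str(audio),
    ]


def missing_inputs(repo_root, checkpoint_path, face, audio):
    """List the required files that do not exist, so a run can fail early."""
    required = [
        Path(repo_root) / FACE_DETECTOR_WEIGHTS,
        Path(checkpoint_path),
        Path(face),
        Path(audio),
    ]
    return [str(path) for path in required if not path.exists()]
```

If missing_inputs returns an empty list, the command from build_inference_command can be handed to subprocess.run(cmd, cwd=repo_root, check=True), for example with a checkpoint file, an input video and a speech.wav of your own. With the default settings the output should then appear in results/result_voice.mp4.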