Generate the orientation of the characters in the video. 'Image': same orientation as the person in the picture (max 10s video). 'Video': consistent with the orientation of the characters in the video (max 30s video).
0/2000
Whether to retain original audio from reference video