Realistic Facial Expression Synthesis for an Image-based Talking Head



Kang Liu and Joern Ostermann

accepted in the preceedings of IEEE conference ICME 2011

    Abstract

      This paper presents an image-based talking head system that is able to synthesize realistic facial expressions accompanying speech, given arbitrary text input and control tags of facial expression. As an example of facial expression primitives, smile is used. First, three types of videos are recorded: a performer speaking without any expressions, smiling while speaking, and smiling after speaking. By analyzing the recorded audio-visual data, an expressive database is built and contains normalized neutral mouth images and smiling mouth images, as well as their associated features and expressive labels. The expressive talking head is synthesized by an unit selection algorithm, which selects and concatenates appropriate mouth image segments from the expressive database. Experimental results show that the smiles of talking heads are as realistic as the real ones objectively, and the viewers cannot distinguish the real smiles from the synthesized ones.

      Resutls

        Facial Expression Synthesis
        Pair No. Recorded Animated
        1 s001 s001_a
        2 s002 s002_a
        3 s003 s003_a
        4 s004 s004_a
        5 s005 s005_a

        Other exmaples of facial expression synthesis (big smiles):

        big smile 1

        big smile 2


        Leibniz Universitaet Hannover

          

        TNT Home | LUH | Search | Administrator
        Updated 21/02/2011
        Webpage design M.Sc. Kang Liu