3D Reconstruction of a Human Face with Monocular Camera Based on Head Movement

Yip, B. and Jin, J.S.

    Constructing three-dimensional model from two-dimensional images is an old problem in the area of computer vision. There are many publications and our approach is specifically designed for constructing the depth map of a human face, based on the head movement in a monocular setting. In our example, along with the front view image of the user, three additional images with various head movement are also captured. The objective of our algorithm is to construct the depth map of the front view image. The head pose of the images facing left, up and right are calculated with reference to the front image. The depth map is calculated through a triangular mesh. The nodes on the mesh are the feature points that we calculate the depth with. Through image registration process, the feature points on the front view image are mapped to the other three images. Based on the head pose and the newly mapped coordinate, we could calculate the depth of the feature point. The depth results calculated from each of the three images are combined together to find the final depth value. In this paper, we assumed that the only movement in the scene is the head movement. The result is not as accurate as we expect, and we believe it could be improved.
Cite as: Yip, B. and Jin, J.S. (2004). 3D Reconstruction of a Human Face with Monocular Camera Based on Head Movement. In Proc. 2003 Pan-Sydney Area Workshop on Visual Information Processing (VIP2003), Sydney, Australia. CRPIT, 36. Piccardi, M., Hintz, T., He, S., Huang, M. L. and Feng, D. D., Eds. ACS. 99-103.
pdf (from crpit.com) pdf (local if available) BibTeX EndNote GS