Abstract
In this paper, real-time tracking and information acquisition of lip region of a person during speech in active video scenes are addressed. The lip is deformed by speech, and the size and orientation are changed by camera motion. The difficulty is mainly due to change of these appearances of the lip. We use template matching with a genetic algorithm to overcome these problems. A high speed and accurate tracking scheme using Evolutionary Video Processing is proposed. Usually, a genetic algorithm is unsuitable for a tracking accuracy of 91.6% and an average processing time of 26.0 milliseconds per frame are achieved.