Audio-video synthesis methods for improving performance of biometric systems

Detta är en avhandling från Gothenburg : Department of Signals and Systems, Chalmers University of Technology

Sammanfattning: System security is important for any automation. It is even more so in the case of biometric systems due to the sensitive nature of the data it uses for enrollment and authentication - the subjects physical or biological trait. The performance quantification of biometric systems, such as face tracking and recognition, highly depend on the database used for testing the systems. Systems trained and tested on realistic and represenative databases evidently perform better. In fact, the main reason for evaluating any system on test data is that these data sets represent problems that system might face in the real world. However, building biometric databases that represent the real world is an expensive task due to its high demand on the side of the participants. This becomes even more difficult and unrealistic if the data is to be collected in a natural environment such as supermarkets, offices, streets, etc.This thesis presents a procedure to build a synthetic biometric database by damascening images from a studio recorded database with a realistic scenery. To this end, we developed an image segmenation procedure to spearate the background of a video recorded in studio conditions with the prupose to replace it with an arbitrary complex background. Furthermore, we present how several degradations such as affine transformation, imaging noise, and motion blur can be incorporated into the production of the new database to simulate natural recording environments. The system is applied to the entire XM2VTS database, which already consists of several terabytes of data, to produce the DXM2VTS - Damascened XM2VTS database.Moreover, the thesis presents a method to segment a video sequence in the time domain based on its audio concept. The video is then reshuffled and used for testing resilience of text-prompted biometric systems against playback attacks. The playback is supported by pyramid based frame interpolation method to reduce discontinuities created at the digit boundaries in time.

  Denna avhandling är EVENTUELLT nedladdningsbar som PDF. Kolla denna länk för att se om den går att ladda ner.