Research
Audio-Visual Signal Processing focuses on areas such as Automatic Speech Recognition (ASR) and Biometrics. Our most recent research includes facial tracking and feature extraction algorithms, information fusion across the audio and visual modalities, and ASR system architectures using dynamic Bayesian networks (DBNs).
Many multimedia communication applications require transporting of compressed video data over lossy channels that can exhibit wide variability in throughput, delay, and packet loss. Providing acceptable video quality in such environments is a demanding task for both the video encoder/decoder as well as the communication and networking infrastructure.
At IVPL we are working towards the development of cutting-edge video compression techniques which are to deliver HD quality video under parsimonious transmission and storage requirements. Projects we are involved include advancement of the current video encoding standard H.264 and development of pre-and-post processing algorithms for video.






