摘要

Techniques to auto-generate professional background music for the home-made video become highly demanded recently with the prevalence of digital video recorders and social media communities. An automated system far such purpose can significantly relieve users'burden of editing background music to accompany home-made video. However, the major obstacle lies in the fact that the assessment of the video-audio matching quality itself is always subjective. We therefore seek solutions from the rich online sharing professional videos to alleviate this difficulty. Mote specifically, a learning-from-Internet framework is proposed to uncover the underlying structures and rules of the video and audio track association patterns from professional online videos. After collecting a large corpus of online professional videos, a joint probabilistic framework is proposed to model prior knowledge from two aspects, namely, the correlation between video and audio tracks, as well as the transition mode from speech and music components. For novel pure-video input, candidate background music tracks are first selected according to the learnt joint probability model, which are further integrated into a smoothed matching sequence via dynamic programming. Qualitative experiments and comprehensive user studies well demonstrate the effectiveness of the proposed framework for background music auto-generation.

  • 出版日期2010

全文