英文摘要:
The mutual support of traditional audio-visual services and emerging haptic services will definitely bring more extreme interactive experience and scene experience to multimedia users. Owing to substantial differences among audio, video, and haptic signals in terms of physical characteristic, transmission requirement, and display form, cross-modal communications architecture based on audio-video-haptic is proposed, which mainly includes haptic signal codecs, heterogeneous streaming transmission, and cross-modal information reconstruction. Firstly, the current efficient and robust haptic signal coding schemes are introduced based on the user haptic perception mechanism to provide a theoretical basis for signal compression. Then, by fully leveraging the spatio-temporal transmission characteristics, heterogeneous streaming transmission strategy empowered by edge intelligence is proposed to meet transmission needs of ultra-low latency, ultra-high reliability, and large volume. Subsequently, the intelligent and complete cross-modal information reconstruction mechanism is explored by the fusion and sharing of semantic levels among heterogeneous modalities to improve users’ immersive experience. Finally, the challenges and future directions existing in cross-modal communications are prospected.
|