The item will be made available for download at the end of the ordering process.

MultiMedia Modeling

26th International Conference, MMM 2020, Daejeon, South Korea, January 5-8, 2020, Proceedings, Part II

Yong Man Ro

Springer International Publishing

Available immediately | Delivery time: available immediately

106,99 €*


ISBN-13: 9783030377342

Published: 2019

Pages: 820

Author: Yong Man Ro

Series: 11962, Lecture Notes in Computer Science; Information Systems and Applications, incl. Internet/Web, and HCI

eBook type: PDF

eBook format: EPUB

Copy protection: 1 - PDF Watermark

Language: English


Description:

The two-volume set LNCS 11961 and 11962 constitutes the thoroughly refereed proceedings of the 26th International Conference on MultiMedia Modeling, MMM 2020, held in Daejeon, South Korea, in January 2020.

Poster Papers.- Multi-Scale Comparison Network for Few-Shot Learning.- Semantic and Morphological Information guided Chinese Text Classification.- A Delay-aware Adaptation Framework for Cloud Gaming under the Computation Constraint of User Devices.- Efficient Edge Caching for High-Quality 360-Degree Video Delivery.- Inferring Emphasis for Real Voice Data: an Attentive Multimodal Neural Network Approach.- PRIME: Block-wise Missingness Handling for Multi-modalities in Intelligent Tutoring Systems.- A New Local Transformation Module for Few-shot Segmentation.- Background Segmentation for Vehicle Re-Identification.- Face Tells Detailed Expression: Generating Comprehensive Facial Expression Sentence through Facial Action Units.- A Deep Convolutional Deblurring and Detection Neural Network for Localizing Text in Videos.- Generate images with obfuscated attributes for private image classification.- Context-Aware Residual Network with Promotion Gates for Single Image Super-Resolution.- A Compact Deep Neural Network for Single Image Super-Resolution.- An Efficient Algorithm of Facial Expression Recognition by TSG-RNN Network.- Structured Neural Motifs: Scene Graph Parsing via Enhanced Context.- Perceptual Localization of Virtual Sound Source Based on Loudspeaker Triplet.- TK-Text: Multi-shaped Scene Text Detection via Instance Segmentation.- More-Natural Mimetic Words Generation for Fine-grained Gait Description.- Lite Hourglass Network for Multi-person Pose Estimation.

SS1: AI-Powered 3D Vision.- Single View Depth Estimation via Dense Convolution Network with Self-supervision.- Multi-Data UAV Images for Large Scale Reconstruction of Buildings.- Deformed Phase Prediction Using SVM for Structured Light Depth Generation.- Extraction of Multi-class Multi-instance Geometric Primitives from Point Clouds Using Energy Minimization.- Similarity Graph Convolutional Construction Network for Interactive Action Recognition.- Content-Aware Cubemap Projection for Panoramic Image via Deep Q-Learning.- Robust RGB-D Data Registration Based on Correntropy and Bi-directional Distance.- InSphereNet: a Concise Representation and Classification Method for 3D Object.- 3-D Oral Shape Retrieval Using Registration Algorithm.- Face Super-Resolution by Learning Multi-view Texture Compensation.- Light Field Salient Object Detection via Hybrid Priors.

SS2: Multimedia Analytics: Perspectives, Tools and Applications.- Multimedia Analytics Challenges and Opportunities for Creating Interactive Radio Content.- Interactive Search and Exploration in Discussion Forums Using Multimodal Embeddings.- An inverse mapping with manifold alignment for zero-shot learning.- Baseline Analysis of a Conventional and Virtual Reality Lifelog Retrieval System.- An Extensible Framework for Interactive Real-time Visualizations of Large-scale Heterogeneous Multimedia Information from Online Sources.

SS3: MDRE: Multimedia Datasets for Repeatable Experimentation.- GLENDA: Gynecologic Laparoscopy Endometriosis Dataset.- Kvasir-SEG: A Segmented Polyp Dataset.- Rethinking the Test Collection Methodology for Personal Self-Tracking Data.- Experiences and Insights from the Collection of a Novel Multimedia EEG Dataset.

SS4: MMAC: Multi-Modal Affective Computing of Large-Scale Multimedia Data.- Relation Modeling with Graph Convolutional Networks for Facial Action Unit Detection.- Enhanced Gaze Following via Object Detection and Human Pose Estimation.- Region Based Adversarial Synthesis of Facial Action Units.- Facial Expression Restoration Based on Improved Graph Convolutional Networks.- Global Affective Video Content Regression Based on Complementary Audio-Visual Features.

SS5: MULTIMED: Multimedia and Multimodal Analytics in the Medical Domain and Pervasive Environments.- Using Publicly Available Medical Images from the Open Access Literature and Social Networks for Model Training and Knowledge Extraction.- AttenNet: Deep Attention based Retinal Disease Classification in OCT Images.- NOVA: A Tool for Explanatory Multimodal Behavior Analysis and its Application to Psychotherapy.- Instrument Recognition in Laparoscopy for Technical Skill Assessment.- Real-time Recognition of Daily Actions Based on 3D Joint Movements and Fisher Encoding.- Model-based and Class-based Fusion of Multisensor Data.- Evaluating the Generalization Performance of Instrument Classification in Cataract Surgery Videos.

SS6: Intelligent Multimedia Security.- Compact Position-aware Attention Network for Image Semantic Segmentation.- Law is Order: Protecting Multimedia Network Transmission by Game Theory and Mechanism Design.- Rational Delegation Computing Using Information Theory and Game Theory Approach.- Multi-hop Interactive Cross-modal Retrieval.

Demo Papers.- Browsing Visual Sentiment Datasets using Psycholinguistic Groundings.- Framework Design for Multiplayer Motion Sensing Game in Mixture Reality.- Lyrics-Conditioned Neural Melody Generation.- A Web-based Visualization Tool for 3D Spatial Coverage Measurement of Aerial Images.- An Attention Based Speaker-Independent Audio-Visual Deep Learning Model for Speech Enhancement.- DIME: An Online Tool for the Visual Comparison of Cross-Modal Retrieval Models.- Real-time Demonstration of Personal Audio and 3D Audio Rendering Using Line Array Systems.- CNN-based Multi-Scale Super-Resolution Architecture on FPGA for 4K/8K UHD Applications.- Effective Utilization of Hybrid Residual Modules in Deep Neural Networks for Super Resolution.

VBS Papers.- diveXplore 4.0: The ITEC Deep Interactive Video Exploration System at VBS2020.- Combining Boolean and Multimedia Retrieval in vitrivr for Large-Scale Video Search.- An Interactive Video Search Platform for Multi-modal Retrieval with Advanced Concepts.- VIREO @ Video Browser Showdown 2020.- VERGE in VBS 2020.- VIRET at Video Browser Showdown 2020.- SOM-Hunter: Video Browsing with Relevance-to-SOM Feedback Loop.- Exquisitor at the Video Browser Showdown 2020.- Deep Learning-Based Video Retrieval using Object Relationships and Associated Audio Classes.- IVIST: Interactive Video Search Tool in VBS 2020.
