2024 International Symposium on Multimedia (ISM)

Dec. 11-13, 2024

Tokyo, Japan

ISBN: 979-8-3315-1111-1

Table of Contents

Title Page i, pp. 1-1
Title Page iii, pp. 3-3
Copyright Page, pp. 4-4
Table of Contents, pp. 5-12
Message from the General Chairs, pp. 13-13
Message from the Program Chairs, pp. 14-15
S2MGen: A synthetic skin mask generator for improving segmentation, pp. 1-8
StegoFusion-Net: Fusion of Convolutional Neural Networks for Spatial Image Steganalysis, pp. 17-23
Disparity Correction Method of the Monocular Omnidirectional Stereo Camera, pp. 24-25
Unveiling the Potential of SSL-Generated Audio Embeddings for Cross-Lingual Speaker Recognition, pp. 26-32
Two-stage instrument timbre transfer method using RAVE, pp. 33-40
Speaker Pseudonymization for Japanese Speech Using Duration Embeddings, pp. 41-48
Appeal prediction for AI up-scaled Images, pp. 55-62
Ensuring Color Consistency in RGB-D Multi-Camera Setup, pp. 79-84
Low Complexity Learning-based Lossless Event-based Compression, pp. 85-92
Flexible And Faithful Data Insights Generation, pp. 98-105
Holistic Visualization of Contextual Knowledge in Hotel Customer Reviews Using Self-Attention, pp. 106-109
Investigation of Feature Distribution and Network Weight Updates in the Machine Unlearning Process, pp. 110-113
Platform for Endangered Language Education, pp. 114-115
Homophonic Music Composition Using a GAN and LSTM Pipeline for Melody and Harmony Generation, pp. 116-119
Instrumentality Classification Evaluation System for Natural Sounds*, pp. 120-123
Generating Bass Phrases from Guitar Chord Backing with NMF, pp. 124-125
Watch your back! Dynamic thumbnails for a 360-degree video player to enhance viewing experience on 2D displays, pp. 126-132
VEMOCLAP: A video emotion classification web application, pp. 137-140
A Power-Law Transformation Approach for Template-Based Cross-Component Prediction, pp. 141-142
Investigating the Impact of High Frame Rate on Video Quality: A SAMVIQ Approach, pp. 143-144
A Server-driven View-aware Point Cloud Video Streaming Framework, pp. 145-148
Evaluation of strategies for efficient rate-distortion NeRF streaming, pp. 149-153
Perceptual Quality Driven Point Cloud Compression for 6DoF 3D Point Cloud Streaming, pp. 154-157
On Multi-CDN Delivery Costs Optimization Problem, pp. 158-161
Sliding Window Check: Repairing Object Identities, pp. 162-169
Data Augmentation with Diffusion Model for Hand Detection, pp. 170-173
Cross-Modal 3D Model Retrieval, pp. 176-180
Prevention of Unexpected Object Generation in Diffusion Model-Based Inpainting, pp. 181-184
LMM-Regularized CLIP Embeddings for Image Classification, pp. 185-188
Evaluation Framework for Novel View Synthesis, pp. 189-192
Ultra-low-latency 8K120p-video-transmission System Parallelizing SMPTE ST 2110, pp. 201-202
Low-latency Software-based Uncompressed Video Transmission, pp. 203-204
Visual Speech Recognition with Surrounding and Emotional Information, pp. 205-212
Synchronized Object Sharing for Augmented Reality Virtual Conferencing, pp. 213-218
Fusion-Based Human Pose Estimation Using RGB and IR Images with Transformer-Based Decoding, pp. 219-220
Occlusion-Aware Real-Time Tiny Facial Alignment Model for Makeup Virtual Try-On, pp. 221-224
A Study on Mental Stress Test using Cybersickness caused by Virtual Reality Contents, pp. 225-226
Human-in-the-loop knowledge base upkeep for retrieval augmented generation applications, pp. 232-233
LiveSkeleton: High-Quality Real-Time Human Tracking and Pose Estimation, pp. 234-235
A technical Concept for enhancing the Student Experience in Hybrid Lecture Scenarios, pp. 236-241
Evaluating Interactive Concept Maps Produced from E-Portfolios, pp. 255-260
Real-time Multi-modal Highlight Prediction for Simultaneous Viewing of Multiple Live Streams, pp. 275-278
The ≪Huh?≫ Button: Improving Understanding in Educational Videos with Large Language Models, pp. 285-289
Author Index, pp. 291-293