List of Accepted Papers – MAPR 2026 | 2026 International Conference on Multimedia Analysis and Pattern Recognition (MAPR)

Paper ID	Paper Title
1	ViTOED: A Dataset for Target-Oriented Emotion Detection on Vietnamese Social Media Texts
4	Improved Neighbor Feature Centralization for Person Re-Identification
7	Benchmarking UAV-based Vehicle Re-Identification under Simulated Weather Conditions
9	Topic Classification of Short and Context-Poor Turns in Imbalanced Vietnamese Dialogue Data
10	ViPanelTR: A Multi-Agent Framework for Vietnamese Table Question Answering
11	Random Token Sparsification for ViT-based Hand Representation
12	CGRL: Concept-Guided Pruning and Representation Learning for Whole-Slide Image Classification
13	Retrieval-Augmented Fine-Tuning with Reasoning Distillation for Vietnamese Medical Question Answering
14	A Multimodal Acoustic-Semantic Framework for Language Diarization in Code-Switched Speech
19	Improving Multimodal Skin Disease Classification via Feature-Space Augmentation with Bayesian Semantic Data Augmentation
20	Improving Vietnamese Text Classification with Multi-Head Attention Pooling in Pre-trained Language Models
24	VietTayNMT: Directional LoRA for Low-Resource Vietnamese–Tay Neural Machine Translation
25	HeAD-CP: Heterophily-Aware Diffused Conformal Prediction Sets for Graph Neural Networks
27	Integrating Temporal Supervision and Self-Attention for Audio-Driven Head Synthesis
28	Hardware implementation for traffic sign classification system using convolutional neural network
30	AFUNet: Arbitrary Feature Upscaling for medical images super-resolution
31	Shopfloor: Multimodal Retrieval for Spare-Part Identification from Visual and Textual Data
33	When Iterative Prompting Fails: An Empirical Study of Unit Test Generation with Open-Source LLMs
34	On the Instability of Saliency Maps under Sparse Perturbations
37	LVT-EG: Edge-Guided License Plate Recognition via Learned Visual Tactility
41	PIXEL: Pixel-proxy Integrated eXplainable Entropy-texture Layered-evidence malware classification
42	TRACE: Temporal Reconstruction with Aligned Context Explanations for anomaly detection in building energy systems
44	Representation-Aware Knowledge Distillation for Efficient Malware Family Classification
47	Scene Text Detection Based on Attention ConvNeXt
48	ViePAWS: A Vietnamese Adversarial Dataset for Paraphrase Identification under LLM-based Word Scrambling
50	CoFARS-Sparse: A Context-Aware Recommendation Framework for Sparse E-Commerce Data
56	Summarization of Movie Transcript and Relevance Evaluation Between Comments and Movie Content In Vietnamese
58	A Preliminary Study on Building an LLM-Powered Physics AI Tutor for High School Students
59	Efficient Hybrid Models for Multiclass Plant Recognition: From Mask-Guided Focusing to Mobile Deployment Optimization
63	AI-Driven Digital Design Cues and Impulsive Buying: A Control-Based Pathway from Algorithmic Personalization and Infinite Scrolling to Urge to Impulse Buy and Impulsive Buying
68	MoE-KAN-AD: Mixture-of-Experts Kolmogorov-Arnold Autoencoders for Univariate Time Series Anomaly Detection
69	A Greedy Skeleton Retrieval Framework for Vietnamese Text-to-Sign Generation
70	A Comprehensive Evaluation of Timestep Discretization Strategies in Text-to-Image Diffusion Models
71	Domain-Adaptive Object Detection via Pseudo-Label Self-Training and Depth Priors
72	ViFA-Council: Multi-Agent LLM Deliberation for Vietnamese Folk Art Generation
75	Adapting Medical and General-Purpose Vision-Language Models for ENT Endoscopic Report Generation
77	Artifact-driven Alignment of Project Planning and DevSecOps Pipelines in Microservice Systems
78	State-Synchronized Bayesian Risk Assessment for Partially Observable DevSecOps Pipelines
79	Latency-Coded Spiking Networks with Dual Attention for Subject-Independent Alzheimer’s Disease Detection from Resting-State EEG
83	Disagreement-Aware Multi-View Stacking for Robust Malware Classification
85	Test-Time Augmentation-Enhanced Knowledge Distillation for Brain Tumor MRI Classification
86	VietQuill: Quality-Controlled Paraphrase Generation for Vietnamese Language
88	Evaluating the Robustness of Retrieval-Augmented Generation Against Character-Level Poisoning Attacks in Vietnamese Question Answering
89	A Hierarchical Hardware-in-the-Loop Agentic Framework for Smart Home Automation
94	Verifiable AI Reviewers: Decentralized Skill Matching and Soulbound Reputation for Multi-Agent Peer Review
96	X-RAG: Explainable Retrieval-Augmented Generation via Argumentation Frameworks and Dempster-Shafer Theory
98	ViFEC: Evidence-Guided Distantly Supervised Learning for Vietnamese Factual Error Correction
99	Enhance-Then-Recognize: Early Cross-Frame Fusion for Multi-Frame License Plate Recognition
102	Transference of Classifier-Free Guidance strategies to Autoguidance
104	PRECISE: Precision-Driven Discovery of Token-centric MEV
108	SemLex: A Hybrid Semantic–Lexical Framework for Vietnamese Job Recommendation
109	Towards Automated Pollen Detection in Microscopy Images
112	LLM-powered Smart Contract Vulnerability Detection with RAG-based Knowledge Base and Corrective Quality Gating
113	SLIM-IBR: Intent Broadcast Residual for Vietnamese Joint Intent Detection and Slot Filling
116	Vi-FusionQA: A Unified Benchmark and Baseline Study for Vietnamese Question Answering
119	A Hybrid Two-Stage Framework for Twitter Bot Detection: Decoupling Graph Representation Learning and Classification
120	Deep Learning for Multiphasic CT-Based Liver Cancer Subtype Classification and Segmentation Refinement
121	Smoke-Aware 3D Gaussian Splatting with Pseudo-Clean Supervision for 3D Scene Restoration
128	HAPI: A Hybrid Knowledge Graph and Vector Search Framework for Efficient Vietnamese Traffic Law Retrieval
130	Mean-Field Critic with Linear Graph Transformer in Sparse-Reward Multi-Agent Reinforcement Learning
133	MaskFlow: Attention-Guided Localized Editing for Rectified Flow Models
134	Entropy-Adaptive Patch Weighting for Improving Gradient-Based Adversarial Attack on Vision Transformer
136	A Concept-Guided Fusion Framework for Interpretable Skin Lesion Classification
139	OPEN-VOCABULARY VISUAL RELATIONSHIP DETECTION VIA VISION-LANGUAGE MODELS AND ATTENTION MECHANISMS
141	MicroCharNet: Less is More for License Plate Character Detection
146	Toward Efficient Weakly Supervised Semantic Segmentation Using Only Low-Magnification Histopathological Images
147	Lightweight Stable Diffusion via StableKOT: Knowledge Distillation Meets Optimal Transport
151	Benchmarking Multimodal Medical Data Analysis for Cancer-Related Disease Diagnosis
152	Benchmarking Test-Time Adaptation for Multi-Label Chest X-ray Classification under Distribution Shift
155	Causal-Ordered Event Logs from Public Health Surveys: Mining Diabetes Process Heterogeneity from NHIS
156	SPARE: Spatial Pattern-Aware Reranking for Weakly-Supervised Referring Expression Comprehension
157	Prior-Guided Amodal Mask Completion: Revisiting Late-Stage Spatial Attention in Shape Reconstruction
160	Graph-Driven LLM-Augmented Stateful API Fuzzing for OWASP API Top 10
163	Region-Guided Search for Ultra-High-Resolution Small Object Detection
167	Noun Presence Loss with Dynamic Weighting for Compositional Text-to-Image Generation
168	Fast-Converging and Architecture-Agnostic 6-Bit Face Recognition via Gradient Coordination and Refined Data
173	Super-Image Reranking: Lightweight Temporal Context Aggregation for Ad-hoc Video Search
175	ViG: Grounded Adaptation of Transformer-Based Models for Low-Resource Vietnamese Image Captioning
177	Efficient fMRI and Textual Alignment for Image Reconstruction from Human Brain Activity
178	Holistic Feature Fusion for Fine-Grained Pen Classification
179	Segmentation-Guided Scene Perturbation for Passage-Based Top-View Person Re-Identification
180	An Empirical Study on the Transferability of Transformer-Based Models for Software Vulnerability Detection
181	Evaluating AutoEDA Without Human Raters: A Diagnostic Framework and Case Study of Structured Question Generation
182	MGTE: A Modular Multi-Granularity Text Ensemble for Interactive Image Retrieval
183	ViCorpReviews: A Benchmark Dataset for Multi-Dimensional Sentiment and Hate Speech Detection in Vietnamese Workplace Context
186	An Accuracy-Efficiency Trade-off Study of Lightweight CNNs for Plant Leaf Disease Classification
187	Assessing Evasion and Behavioral Trade-offs of Obfuscation-Conditioned LLMs in PowerShell Attack Synthesis
188	Efficient Model Pruning via Selective Layer-wise Distillation with Dynamic CKA-based Weighting
189	PrismTrack: Perspective-Aware Multi-Cue Association for Robust Multi-Object Tracking
190	Track-Level Aggregation for Low-Resolution License Plate Recognition
191	An Efficient HLS-Based Hardware Accelerator with Resource Optimization for Transformer Models
192	MGFace: Mask-Gated Face Matching via Conditional Similarity Routing
194	Auditing Population-Level XAI Agreement with cABC: Evidence from Diabetes Risk Prediction
196	A Cascade English–Vietnamese Speech-to-Speech Translation Framework for Online IT Education
200	From CSV to Fraud Graphs: An Automated LLM-Guided Pipeline for Graph-Based Fraud Detection and Querying
201	SCAmodal: Reasoning-Aware Amodal Completion via Semantic-Geometric Dual Guidance
203	Learning to Infer the Invisible: Conditional Mask Prediction for Occluded Regions
204	Questioning Matters: A Controlled Study of Question Generation in Conversational Image Retrieval
205	CQ-MoE: CLIP-Aligned Question-Conditioned Mixture-of-Experts for Micro-Expression Visual Question Answering
207	Beyond Prompting-Based Commercial VLMs: Task-Specific Fine-Tuning for Flowchart Question Answering
208	VietNorm: Vietnamese Dialect Normalization
209	Operationalization Matters: When Graph-Aware Learning Adds Value for IC-Based Influence Approximation
211	An In-Vehicle Child Cry Recognition System Using Deep Learning Techniques
212	Real-Time Object Detection and Active Learning Framework for Smart Waste Sorting from Newly Collected Data
213	A Two-Stage Privacy-Aware SQL Anomaly Detector for Inline Database Defense
214	SeRel-LightFM: Bridging Semantic and Relational Representations for Sparse Hybrid Recommendation
215	Reliability-Aware Early Correctness Recognition for Exercise Feedback under Partial 3D Skeleton Observation
216	SAM-Anchored DINOv3 Soft Calibration for Weakly Supervised Semantic Segmentation
217	Multiview Consistency Learning with Synthetic Camera Views for Hand-in-the-Wild Gesture Recognition
220	A Unified Framework for Automated Proctoring under Overhead Surveillance and Imperfect Supervision
221	A lightweight transformer-based YOLO for object detection in complex traffic scenarios
223	PestNet50: Pest Classification via Swin-Hybrid Networks with Auxiliary Supervision
230	CFML: Cross-Modal Feature-Gated Multi-Task Learning with T-Hybrid Loss for Alzheimer's Diagnosis and MMSE Prediction
231	Keypoint-Based Isolated Sign Language Recognition via Multi-Stream MLP and Temporal Attention BiLSTM
232	Artistic Image Generation with Visual Balance Constraints
234	S-AECR: Word Intelligence and Explanation Strategy for Cyber Threat Intelligence Extraction and Mapping
239	Benchmarking Open-Source Vietnamese-English Speech-to-Text Translation Systems
241	Graph-based Multi-Agent LLM Framework for OCR-based Visual Question Answering
242	KDTwin: Task-Aware Knowledge Distillation for Lightweight Multi-Task Driving Scene Segmentation
244	LLM-Augmented Hybrid Representations for Disease Category Classification from Clinical Notes
245	A Class-incremental and Few-shot learning model for Intrusion detection under Concept drift
250	Internal Collision Resolution for Prioritized EDCA in IEEE 802.11bn: Toward Low-Latency Wireless Transport for XR and Multimedia Streaming
252	Efficient Frame Retrieval for Traffic Law Question Answering over Dashcam Videos
255	Alignment of Superseded and Replacement Vietnamese Legal Documents: Task, Baseline Models and Challenges
256	HOPE-Based Temporal Modeling for Continuous Sign Language Recognition
257	Explainable Malware Detection from Noisy API Sequences with RAG-Based MITRE ATT&CK Mapping
258	A Leakage-Aware Deep Learning Framework for Reliable Chest X-ray Classification with Anatomically Constrained Explanations
259	VSL-HV8: A Vietnamese Sign Language Dataset for Sign Recognition and Translation Research
260	SMILESGNN: Interpretable Clinical Toxicity Prediction via SMILES-Graph Cross-Attention Fusion
261	Metabolic networks on PET-based C-atlas for diagnosis of Alzheimer's disease
263	Towards Explainable Brain Tumor Analysis with Spatial Knowledge Graphs and Large Language Models
264	ENEA-GS: Enhancing Single-Image 3D Gaussian Splatting with Normal-guided Texture Smoothness and Entropy-based Alpha Regularization
265	Direct Image-to-modern Vietnamese Translation of Han-Nom Manuscripts via MultiModal RLHF Preference Alignment
267	DongHoCrafter: Object-Aware Style Transfer for Vietnamese Dong Ho Folk Art
268	Seasonality-Aware Parallel Deep Model for Improved Long-Term Air Quality Prediction

Search form

Main Menu

List of Accepted Papers – MAPR 2026