Paper ID | Paper Title |
1 | Systematic evaluation of deep learning-based vehicle detection methods |
3 | A Multi-Loss Hybrid CNN-ViT Model for Efficient Image Retrieval |
4 | Enhanced MobileNets via Augmentation of Filtering-based Features |
5 | Vietnamese Automatic Speech Recognition Utilizing Audio and Visual Data |
11 | Detection of Varroa Mites from Bee Images Using YOLO Architecture |
12 | Can LLMs Play Ô Ăn Quan Game? A Study of Multi-Step Planning and Decision Making |
15 | Att2Search: an Attribute-based Pedestrian Search |
22 | ResEViT-Road: An Efficient Model for Road Quality Classification |
23 | Hybrid approach for Chest X-ray diagnosis with automated MLOps pipeline |
24 | Integrating Deep Learning And Explainable AI for Interpretable Educational Intervention Analysis |
26 | AI Agent Workflow for Reviewing AI-driven Sperm Assessment Literature |
27 | SATURN: Autoregressive Image Generation Guided by Scene Graphs |
28 | SAMURAI: Shape-Aware Multimodal Retrieval for 3D Object Identification |
29 | Trusted Proof: Verifying Photographic Authenticity via Multi-Camera Stereo Vision for Insurance and Forensic Applications |
31 | Adapting WavLM for Vietnamese Speaker Diarization in Real-world Conversations |
32 | Analyzing the Correlation and Impact of Speech Evaluation Metrics on Real-World Speaker Verification and Speech Recognition |
33 | Unified Acoustic Representation Learning for Vietnamese Speech Classification Tasks |
34 | HySUP: Seam-Aware Hand-Body Fusion for Accurate Whole-Body 3-D Reconstruction from a Single Image |
35 | Enhancing Multi-Camera People Tracking with Conflict-aware Cosine Tracking and Human Matching Re-assignment |
36 | OTADiff: Ovarian Tumor-Aware Diffusion Model for Ultrasound Image Augmentation and Detection |
37 | UnsMOT: Unified Framework for Unsupervised Multi-Object Tracking with Geometric Topology Guidance |
38 | PowerGNN: A source code structural and textual awareness approach for identifying malicious PowerShell scripts |
44 | Movie Character Retrieval with Few Examples |
46 | UDP-Edit: Union Dual-Prompt Attention for Local Image Editing in Fast-sampling Diffusion Models |
49 | Dropout Attacks on Knowledge Distillation |
51 | Improving Micro-Expression Recognition with Phase-Aware Temporal Augmentation |
52 | RAG-SmartVuln: Enhancing Smart Contract Vulnerability Detection via Retrieval-Augmented LLMs |
53 | Smart Contract Vulnerability Detection using Prompt Engineering with Reasoning Models |
54 | Undermining Trust: How Bit-Flip Attacks Compromise Anomaly-based Network Intrusion Detection Systems |
59 | Generative One-shot Camouflage Instance Segmentation |
61 | HAPX-CLIP: Human Activity Recognition with Visual Sequences and Language Prompts |
63 | Truth or Trick? A Vietnamese Dataset of Adversarial Logic in Geometry |
66 | Multimodal Windows Malware Detection via Hybrid Analysis and Enriched Graphs: Effectiveness and Explainability |
67 | Ovarian Malignance Risk Prediction from Medical Notes: a case study in Vietnam |
68 | VISION-LANGUAGE TRANSFORMER FRAMEWORK FOR AUTOMATED MEDICAL IMAGING REPORTING |
71 | ChainFLIP: A Unified Framework Integrating Blockchain, Federated Learning, and IPFS for Secure Supply Chain Management |
72 | Plant species identification from pollen grain images using fusion of advanced deep neural networks |
77 | Forensic Challenges in Face Manipulated Videos |
78 | Vietnamese Emotion Recognition from Voice and Text: A Confidence-Based Approach |
80 | OSNet-DCN: Integrating Deformable Feature Learning for Lesion Tracking in Endoscopic Videos |
81 | Towards Robust Cancer Diagnosis: A Hybrid Deep Learning Pipeline for Imbalanced Multi-Label Microscopic Image Analysis |
82 | SwahiliVQA: A Dataset for Visual Question Answering in Swahili Language |
86 | Improving Table Structure Recognition Based on Content-Based Post-Processing |
90 | 3DSMCOS: A 3D Model-Based Synthetic Data Pipeline for Military Camouflaged Object Segmentation with Distractor-Augmented Realism |
92 | Context-Aware Question Answering for Vietnamese University Admissions via Multi-LLM Architecture |
93 | Robust Traffic Vehicle Detection under Real-World Conditions with Misclassified Vehicles Minimization and Weighted Box Fusion |
96 | Enhancing Continuous Student Activity Recognition through Virtual Trajectory and Appearance Matching |
98 | A Semi-Supervised Sparsity-Aware Loss Function for Crack Segmentation |
99 | Superpixel-based Graph Neural Network with Hierarchical Topology for retinal image grading |
102 | Multimodal Deep Learning for ECG Heartbeat Classification with SHAP-Based Interpretability |
103 | Development of an Indoor Position Monitoring System for Rescue Personnel during Emergencies |
104 | Enhanced Class Incremental Semantic Segmentation with New Classifier Pre-Tuning |
105 | Cross-lingual XLSR-Wav2Vec2-based Spoofing Speech Detection for Vietnamese Speech |
106 | Eliminating False Positives in Single-Human Parsing via Prompt Filtering and Pose-Aware Segmentation |
107 | Enhancing User Intent Detection in Chatbots through LLM Distillation |
108 | CR4Re: Reformulating binary COPD classification using contrastive representations |
109 | Enhancing Video-Only Sign Language Recognition with Wasserstein Knowledge Distillation on the VSL Dataset |
112 | Multimodal Fusion for Vulnerability Detection: Integrating Sequence and Graph-Based Analysis with LLM Augmentation |
115 | DSC-SNN: A Depthwise Separable Convolutional Spiking Neural Network for Efficient, Privacy-Preserving Action Recognition from Event-Based Data |
122 | LLM-Based Compliance Analysis of Declared Policies and Actual Cookie Usage on Websites |
124 | PerSynth: Personalized Character Video Synthesis from Text Prompts |
127 | Few-Shot Instance Segmentation: An Exploration in the Frequency Domain for Camouflage Instances |
130 | Enhancing Ship Detection in Remote Sensing: A Data Augmentation Approach Using State-of-the-Art Text-to-Image Diffusion |
139 | Adversarial Robustness Evaluation of a Vietnamese Handwriting OCR System |
140 | Unsupervised Malicious Domain Detection Using Auto-Encoder Models with Reconstruction Error |
144 | Scalable Fashion Product Retrieval with Multi-Task Fine-Tuned Vision-Language Models and Real-Time Distributed Architecture |
145 | VietX-NLI: A Cross-lingual Natural Language Inference Dataset with Vietnamese as the Source Language |
147 | Efficient Sign Language Recognition with Skeleton Data: A Study of Keypoint Selection, Pose Estimators, and GCN Models |
151 | In Defense of Character-Level Answer Generation Methods for Text-based Visual Question Answering |
Mua sắm thiết bị dụng cụ toolsviet.com hàng đầu tại việt nam