Last updated: 2026-06-03 05:01 UTC
All documents
Number of pages: 165
| Author(s) | Title | Year | Publication | Keywords | ||
|---|---|---|---|---|---|---|
| Ricardo Yaben, Emmanouil Vasilomanolakis | Digital ghost ships: abandoned, neglected, and obsolete IoT & OT devices exposed to the Internet | 2026 | Early Access | Internet Security Protocols Probes Authentication Servers Internet of Things Measurement Encryption Conferences IoT OT vulnerability identification Internetwide scans Internet measurements active probing longitudinal study | The rapid adoption of Internet of Things (IoT) and Operational Technology (OT) devices to control systems remotely has introduced significant cybersecurity challenges. Attackers have compromised millions of such devices over the years, exploiting their lack of management and weak cybersecurity. This paper examines cybersecurity issues of neglected, obsolete, and abandoned IoT and OT devices exposed to the Internet. To unify these issues under an umbrella term, we coined the term Digital Ghost Ships (DGSs). Our work focuses on identifying DGSs using common scanning tools to find indicators of security misconfigurations and misuse. Moreover, we compare two Internet-wide scans conducted two years apart, focusing on security issues in eight IoT and OT protocols: MQTT, CoAP, XMPP, Modbus, OPC UA, RTPS, DNP3, and BACnet. During our first scan (S1) we found 675,896 DGSs, and 75,007 during our second scan (S2). Lastly, we examine the IP reputation of the vulnerable devices and find that 7,424 (S1) and 792 (S2) DGSs were reported at least once. | 10.1109/TNSM.2026.3699092 |
| Jing Zhang, Chao Luo, Rui Shao | MTG-GAN: A Masked Temporal Graph Generative Adversarial Network for Cross-Domain System Log Anomaly Detection | 2026 | Early Access | Anomaly detection Adaptation models Generative adversarial networks Feature extraction Data models Load modeling Accuracy Robustness Contrastive learning Chaos Log Anomaly Detection Generative Adversarial Networks (GANs) Temporal Data Analysis | Anomaly detection of system logs is crucial for the service management of large-scale information systems. Nowadays, log anomaly detection faces two main challenges: 1) capturing evolving temporal dependencies between log events to adaptively tackle with emerging anomaly patterns, 2) and maintaining high detection capabilities across varies data distributions. Existing methods rely heavily on domain-specific data features, making it challenging to handle the heterogeneity and temporal dynamics of log data. This limitation restricts the deployment of anomaly detection systems in practical environments. In this article, a novel framework, Masked Temporal Graph Generative Adversarial Network (MTG-GAN), is proposed for both conventional and cross-domain log anomaly detection. The model enhances the detection capability for emerging abnormal patterns in system log data by introducing an adaptive masking mechanism that combines generative adversarial networks with graph contrastive learning. Additionally, MTG-GAN reduces dependency on specific data distribution and improves model generalization by using diffused graph adjacency information deriving from temporal relevance of event sequence, which can be conducive to improve cross-domain detection performance. Experimental results demonstrate that MTG-GAN outperforms existing methods on multiple real-world datasets in both conventional and cross-domain log anomaly detection. | 10.1109/TNSM.2026.3654642 |
| Songtao Peng, Yiping Chen, Xincheng Shu, Wu Shuai, Shenhao Fang, Zhongyuan Ruan, Qi Xuan | MAD-MulW: A Multi-Window Anomaly Detection Framework for BGP Security Events | 2026 | Early Access | In recent years, various international security events have occurred frequently and interacted between real society and cyberspace. Traditional traffic monitoring mainly focuses on the local anomalous status of events due to a large amount of data. BGP-based event monitoring makes it possible to perform differential analysis of international events. For many existing traffic anomaly detection methods, we have observed that the window-based noise reduction strategy effectively improves the success rate of time series anomaly detection. Motivated by this, we propose an unsupervised anomaly detection model, MAD-MulW, which introduces a multi-window serial framework. The W-GAT module adaptively updates sample weights within the window to reduce noise, while the W-LAE module captures temporal trends through predictive reconstruction, enhancing inter-class separation. Our model has been experimentally validated on multiple BGP anomalous events with an average F1 score of over 90%, which demonstrates the significant improvement effect of the stage windows and adaptive strategy on the efficiency and stability of the timing model. The source code is available at://github.com/2024ChenYP/MAD-MulW. | 10.1109/TNSM.2026.3696319 | |
| Wenyi Wang, Junchang Wang, Yu Hong, Lei Han, Xin He, Weibei Fan, Zixuan Guan, Xiaolong Zheng, Fu Xiao | LLT: Lossless Transmission using Local Recirculation for WANs | 2026 | Early Access | As distributed applications increasingly span geographically distributed data centers, the demand for high-performance, long-distance transmission has been continuously growing. While intra-data-center networks have employed techniques like remote direct memory access (RDMA) to meet these design goals, extending these techniques toWANs presents unique challenges. WANs notably suffer from inherent packet losses due to buffer overflows in routers and switches, leading to decreased throughput and making distributed applications barely usable. This paper proposes Lossless Transmission (LLT), a novel buffer management scheme for enabling lossless WAN transport. LLT intelligently integrates on-chip switch buffers with an off-chip caching system to absorb traffic bursts that would otherwise cause packet loss. Its data plane logic uses a multi-level threshold system to selectively offload only critical flows during congestion. A closed-loop control protocol, managed by a stateful flow table, ensures these offloaded packets are later re-injected with guaranteed lossless and in-order delivery, effectively protecting latency-sensitive applications from retransmission overhead. We evaluate LLT using both ns-3 simulations and P4-programmable devices. The experimental results show that in typical use cases (RTT > 30ms), LLT improves link bandwidth utilization by 1.9% to 29.5% and reduces the P99 percentile tail latency by 17% to 66% in WANs compared to the state-of-the-art solutions. Overall, LLT provides a scalable, efficient, and reliable framework for long-distance data transmission, addressing critical challenges in WANs. Additionally, LLT eliminates the need for expensive WAN infrastructure modifications. | 10.1109/TNSM.2026.3699483 | |
| Huijuan Zhu, Chenhao Zheng, Zhongyuan Liu, Yuan Zhang | Reliable Interpretations of Deep Learning-based Malware Detectors via Deep Q-Networks | 2026 | Early Access | Deep learning has become widely used in Android malware detection, but its black-box nature raises trust concerns, limiting its use in critical security areas. To address this, various interpretation methods have been proposed. Unfortunately, these solutions often suffer from inconsistent results and poor adaptability to model updates. In this work, we propose XDQNMal, a Deep Q-Networks (DQN)-based global interpretation framework designed to uncover the critical features that drive decisions in deep learning-based malware detectors. To enhance the reliability of interpretation, XDQNMal captures API call frequency features derived from the runtime behavior of each application (App). Then, it unites a DQN model with the TabPFN detection model to work collaboratively, using variations in detection results as reward signals. These signals guide the DQN model to gradually identify the most impactful features as interpretations for the detection model’s decisions. Our experimental evaluation on real-world datasets demonstrates that the proposed XDQNMal framework generates reliable interpretation for deep learning-based malware detection models. For instance, suppressing the critical features identified by XDQNMal leads to an average decrease of 20.30% in the probability that the malicious sample is predicted as malicious, highlighting the pivotal role these features play in the model’s decision-making. | 10.1109/TNSM.2026.3699408 | |
| Ke Yu, Xiaofeng Tao, Shen Wang, Chaojie Guo | Game-Theoretic Defense of SYN Flood Attacks in B5G Cloud-Edge-Terminal Networks | 2026 | Early Access | The emergence of beyond-fifth-generation (B5G) networks and the increasing demand for Internet of Things (IoT) requires deploying a cloud-edge-terminal computing network with the Software Defined Network as the controller. However, this network is vulnerable to various threats, notably SYN flood attacks. This paper adopts queuing theory and game theory to explore Mobile Edge Computing (MEC) attack and defense interaction in different IoT businesses. Moreover, we propose a utility model of the packet flow in MEC networks featuring the delay and packet loss rate in the SYN flood attacks. For the attacker and defender’s strategy, we use game theory to model the interaction between strategy and resource allocation. A search algorithm analyzing MEC cell impact on strategy selection is developed, and we investigate the impact of the attacker’s possession of prior knowledge versus lack thereof regarding MEC cell characteristics under SYN flood attacks. The proposed game models are solved, and the results show that under the defender’s strategy, the attacker has no chance to launch SYN flood attacks under the defender’s defense cost of four times MEC computing resources; the cost of defense resources is lower than other related schemes. | 10.1109/TNSM.2026.3695918 | |
| Arad Kotzer, Tom Azoulay, Yoad Abels, Aviv Yaish, Ori Rottenstreich | SoK: DeFi Lending and Yield Aggregation Protocol Taxonomy, Empirical Measurements, and Security Challenges | 2026 | Early Access | Filtering Application specific integrated circuits Filters Protocols Smart contracts Communication systems Proof of stake Proof of Work Internet Amplitude shift keying Blockchain Decentralized Finance (DeFi) Lending Yield Aggregation | Decentralized Finance (DeFi) lending protocols implement programmable credit markets without intermediaries. This paper systematizes the DeFi lending ecosystem, spanning collateralized lending (including over- and under- collateralized designs, and zero-liquidation loans), uncollateralized primitives (e.g., flashloans), and yield aggregation protocols which allocate capital across underlying lending platforms. Beyond a taxonomy of mechanisms and comparing protocols, we provide empirical on-chain measurements of lending activity and user behavior, using Compound V2 and AAVE V2 as case studies, and connect empirical observations to protocol design choices (e.g., interestrate models and liquidation incentives). We then characterize vulnerabilities that arise due to notable designs, focusing on interestrate setting mechanisms and time-measurement approaches. Finally, we outline open questions at the intersection of mechanism design, empirical measurement and security for future research. | 10.1109/TNSM.2026.3682174 |
| Jiang Mo, Ke Zhao, Limei Peng, Hsiao-Chun Wu | PDO-SFCM: Prediction-Driven Orchestration for SFC Migration in SAGIN via Fine-Tuned Large Time-Series Model and DRL | 2026 | Early Access | Modeling Space-air-ground integrated networks Timing Costing Costs Tuning Delays Optimization Algorithms Joining processes Space-air-ground integrated network (SAGIN) service function chain (SFC) migration prediction-driven network orchestration large time-series model (LTM) deep reinforcement learning (DRL) cost-augmented enhanced timeexpanded graph (C-eTEG) | Space-air-ground integrated networks (SAGINs) have emerged as an appealing enabling technology for the next-generation ubiquitous connectivity. By extending terrestrial networks with aerial and space platforms, SAGIN can provide seamless coverage and flexible resource-access across various altitudes. However, dynamic link conditions, intermittent connectivity, and heterogeneous latency constraints would often introduce serious challenges to the service function chain (SFC) migration and orchestration. In this work, we introduce a novel PDO-SFCM (prediction-driven orchestration for SFC migration) approach, which utilizes a fine-tuned large time-series model (LTM) for network status prediction and a deep reinforcement learning (DRL) module for proactive SFC migration in SAGINs. In detail, the fine-tuned LTM predicts multi-horizon estimates of SFC arrivals and per virtual network function (per-VNF) resource demands, which will form the observation space of the DRL agent. The DRL module thus schedules appropriate migration actions on the cost-augmented time-expanded graph (C-eTEG), which can satisfy the feasibility subject to the bandwidth, buffering, and precedence constraints. Extensive simulation results demonstrate that our proposed new PDO-SFCM scheme consistently greatly improves the acceptance rate, reduces the end-to-end delay, and lowers the migration cost in comparison with DRL baselines under different prediction settings. Our proposed new scheme can significantly leverage the SAGIN performance by the devised foundation-level time-series prediction and learning-based orchestration mechanisms. | 10.1109/TNSM.2026.3694203 |
| Lizhuang Tan, Nguyen Van Tu, Xinhang Wang, Peiying Zhang, James Won-Ki Hong | SDNIE: A Software-Defined Approach to High-Performance Network Impairment Emulation using Programmable Switches | 2026 | Early Access | Emulation Central Processing Unit Testing Delays Switches Hardware Programming Information rates Throughput Limiting Software-Defined Networking Programmable Data Plane Network Testing Network Impairment Emulation Network Management | Network testing is critical for evaluating the performance, reliability, and security of modern computer networks. A key challenge is creating an accurate, cost-effective, and high-performance network emulation environment. Network Impairment Emulators (NIEs) emulate real-world network conditions such as bandwidth constraints, latency, and packet loss, but existing CPU- and FPGA-based solutions suffer from limited performance, high costs, and poor flexibility. This paper proposes Software-Defined Network Impairment Emulation (SDNIE), a novel framework that leverages programmable switches for scalable, cost-efficient network impairment emulation. SDNIE introduces three key techniques: (1) intent-driven network impairment configuration, automating impairment modeling; (2) serial-parallel combined execution, optimizing performance; and (3) CPU-Tofino collaborative deployment, offloading complex computations. Experimental results show that SDNIE matches commercial emulators in performance while significantly reducing costs. This work demonstrates the potential of programmable switches in network testing, offering a scalable, cost-effective, and high-performance alternative for next-generation network impairment emulation. | 10.1109/TNSM.2026.3694388 |
| Deemah H. Tashman, Soumaya Cherkaoui | Trustworthy AI-Driven Dynamic Hybrid RIS: Joint Optimization and Reward Poisoning-Resilient Control in Cognitive MISO Networks | 2026 | Early Access | Reconfigurable intelligent surfaces Reliability Optimization Security MISO Array signal processing Vectors Satellites Reflection Interference Beamforming cascaded channels cognitive radio networks deep reinforcement learning dynamic hybrid reconfigurable intelligent surfaces energy harvesting poisoning attacks | Cognitive radio networks (CRNs) are a key mechanism for alleviating spectrum scarcity by enabling secondary users (SUs) to opportunistically access licensed frequency bands without harmful interference to primary users (PUs). To address unreliable direct SU links and energy constraints common in next-generation wireless networks, this work introduces an adaptive, energy-aware hybrid reconfigurable intelligent surface (RIS) for underlay multiple-input single-output (MISO) CRNs. Distinct from prior approaches relying on static RIS architectures, our proposed RIS dynamically alternates between passive and active operation modes in real time according to harvested energy availability. We also model our scenario under practical hardware impairments and cascaded fading channels. We formulate and solve a joint transmit beamforming and RIS phase optimization problem via the soft actor-critic (SAC) deep reinforcement learning (DRL) method, leveraging its robustness in continuous and highly dynamic environments. Notably, we conduct the first systematic study of reward poisoning attacks on DRL agents in RIS-enhanced CRNs, and propose a lightweight, real-time defense based on reward clipping and statistical anomaly filtering. Numerical results demonstrate that the SAC-based approach consistently outperforms established DRL base-lines, and that the dynamic hybrid RIS strikes a superior trade-off between throughput and energy consumption compared to fully passive and fully active alternatives. We further show the effectiveness of our defense in maintaining SU performance even under adversarial conditions. Our results advance the practical and secure deployment of RIS-assisted CRNs, and highlight crucial design insights for energy-constrained wireless systems. | 10.1109/TNSM.2026.3660728 |
| Xiaomao Zhou, Zihao Shao, Qingmin Jia, Renchao Xie | ProxyLLM: Augmenting LLMs with Proxy Models for Tool Utilization in Network Service Generation | 2026 | Early Access | Tools Modeling Large language models Learning (artificial intelligence) Training Optimization Accuracy Planning Strontium Cognition Large Language Models Tool utilization knowledge distillation Deep Reinforcement Learning Network service generation Computing Power Network | This paper introduces ProxyLLM, a novel framework designed to enhance the tool utilization capabilities of Large Language Models (LLMs) by leveraging an ensemble of smaller, specialized proxy models. Specifically, instead of invoking tools directly, ProxyLLM delegates tasks to these proxy models, each of which is responsible for a distinct domain and equipped with a curated set of relevant tools. Meanwhile, ProxyLLM employs a two-step knowledge transfer mechanism, utilizing data generated by the LLM for knowledge distillation and LLM-guided Deep Reinforcement Learning (DRL) to enhance the decision-making abilities of the proxy models. During the data-driven knowledge distillation process, the introduction of rationales ensures that proxy models maintain a comprehensive understanding of tasks, thereby improving the learning effectiveness. In the DRL learning process, LLM guidance is separately integrated into both the actor and critic learning phases. This ensures consistency in strategy and uniformity in evaluating the action space, which enhances both the efficiency and effectiveness of the learning process. Extensive experiments, including real-world applications such as network service generation in a Computing Power Network (CPN) system, demonstrate that ProxyLLM significantly outperforms existing methods in terms of task accuracy and tool invocation efficiency. The proposed framework offers a promising solution for constructing generalizable, large-scale intelligent agents capable of effectively leveraging diverse tools to solve complex, cross-domain problems. | 10.1109/TNSM.2026.3695074 |
| Haoyu Luo, Ming Liu, Shaojian Qiu, Xiao Liu | FaaSAdapter: An Adaptive Resource Configuration Framework for Serverless Workflows at the Edge | 2026 | Early Access | Optimization Resource management Modeling Timing Costing Costs Runtime Matrices Conferences Modules (abstract algebra) Serverless computing resource configuration workflow service level objective edge computing | Serverless computing has emerged as a promising deployment paradigm for edge scenarios, owing to its efficient resource utilization and flexible provisioning enabled by Function-as-a-Service (FaaS). In Serverless environment, developers are required to configure resources for functions to balance cost efficiency and performance. However, determining appropriate resource allocations for the functions running at the edge is a challenge due to the dynamic nature of the environment. This challenge is further compounded when managing serverless workflows composed of multiple interconnected functions with complex dependencies. To address such an challenge, we present FaaSAdapter, an efficient runtime resource configuration framework for workflow functions, aiming at conserving computational resources at the edge while ensuring timely response to user requests. Different from existing dynamic resource configuration methods that incrementally determine resource schemes for only the immediate subsequent workflow function, FaaSAdapter predicts the execution times of all the unexecuted functions across various resource configurations and determines an optimal configuration schema for the function instances based on the current execution progress. Then, it updates the configuration schema as needed during runtime. Comprehensive experiments demonstrate that FaaSAdapter ensures satisfactory response time of user requests with lowest resource consumption. | 10.1109/TNSM.2026.3695591 |
| Arash Heidari, Jamal N. Al-Karaki | NOVA: A Self-Supervised Graph Framework for Real-Time Anomaly Detection in Internet of Vehicles | 2026 | Early Access | Context Internet of Vehicles Modeling Timing Vehicles Labeling Anomaly detection Matrices Vectors Joining processes Internet of Vehicles V2X Security Anomaly Detection Self-Supervised Learning Graph Neural Networks | The Internet of Vehicles (IoV) enables cooperative driving and real-time Vehicle-to-Everything (V2X) communication but remains vulnerable to behavioral and structural anomalies due to its dynamic, decentralized nature. Existing deep learning methods either overlook topological inconsistencies or ignore communication feature fidelity, while random-walk sampling introduces contextual noise. In this paper, we propose Network Observation for Vehicular Anomalies (NOVA), a self-supervised graph-based framework that detects both behavioral and structural anomalies in IoV networks without labeled data. NOVA models vehicular communications as attributed graphs and employs intimacy-guided subgraph sampling to extract meaningful neighborhoods. A Graph Convolutional Network (GCN)–based generative module reconstructs node attributes to reveal behavioral deviations, while a contrastive module validates structural coherence through embedding comparisons of real and perturbed contexts. Their hybrid anomaly score enables accurate, scalable, and real-time detection of compromised nodes. Performance results show that NOVA achieves state-of-the-art performance (98.7% accuracy, 98.1% F1), real-time throughput (~4.7k events/s at 5k msg/s), and strong robustness (AUROC 0.99, AUPRC 0.98, FAR 0.05) with near-linear scalability (≤40 ms latency for 50k vehicles). By integrating generative and contrastive self-supervised learning with context-aware sampling, NOVA significantly enhances IoV security, reliability, and adaptability. | 10.1109/TNSM.2026.3696324 |
| Soonbeom Kwon, Yusu Noh, Youngwoo Jang, Illyoung Choi, Byungchul Tak, In-geol Chun, Young-Kyoon Suh | Scalable and Robust Resource Provisioning via Adaptive Task Scheduling for Edge Devices | 2026 | Early Access | Schedules Scheduling Cloning Timing Educational institutions Computers Transcoding Videos Tail Edge computing Edge devices Edge server Resource augmentation Task distribution Kubernetes | Edge devices, such as wearables, drones, and CCTV systems, are vital for real-time data collection in urban intelligence. However, their limited computational and storage capacities pose significant challenges. While offloading to public clouds offers scalability, it often incurs high latency and operational costs. Conversely, centralizing workloads on edge servers may result in the underutilization of high-performance edge devices. To address these limitations, we introduce ERPF, a Kubernetes-based Edge Resource Provisioning Framework that augments the capabilities of heterogeneous edge environments. ERPF orchestrates dynamic volume provisioning, GPU-aware resource allocation, execution context migration, and adaptive task distribution to improve system flexibility and efficiency. Building on this, we propose a novel adaptive task scheduling technique, termed eATS, composed of three key mechanisms: (i) Partition Smoothing Scheme for stable task granularity control, (ii) Resilient Edge Reintegration for failure detection and task reassignment, and (iii) Competitive Task Cloning for speculative execution with fastest-result commitment. The proposed eATS scheme reduces task execution time by up to 27.6%, lowers partition size variability by 8.7×, and improves scheduling robustness across heterogeneous edge devices over the baseline. | 10.1109/TNSM.2026.3694238 |
| Kunpeng Zheng, Huibin Zhang, Yongli Zhao, Yuan Cao, Wei Wang, Xin Li, Zhuangzhuang Ma, Lihan Zhao, Jie Zhang | Sun-Outage-Aware Topology Modeling and Adaptive Routing for Optical Satellite Networks | 2026 | Early Access | Sun Interrupters Joining processes Satellites Routing Algorithms Modeling Timing Topology Interference Optical inter-satellite links optical service connections optical satellite network sun outage topology modeling | Optical satellite networks, supported by optical inter-satellite links (OISLs), provide reliable and low-latency optical connectivity. However, periodic and predictable sun outage events significantly compromise OISL availability, leading to frequent OISL interruptions and reduced network reliability. Existing routing algorithms often overlook the regularity of sun outage-induced interrupts and their differentiated impacts on services, resulting in degraded service performance. To address this challenge, this paper proposes a sun outage-enhanced time discretization OISL model and introduces a sun outage link-aware routing (SOLR) algorithm. By incorporating joint awareness of sun outage patterns and service requirements, SOLR employs an adaptive optimization mechanism to dynamically adjust routing decisions within temporal windows. Experimental results demonstrate that SOLR extends stable path durations by 39.9%, reduces interruption rates by 28.5%, and decreases blocking rates by 36.4%, significantly outperforming link-state-based routing algorithms. By effectively mitigating the impact of sun outages, SOLR ensures continuous optical service connections. This interruption-tolerant framework bridges network modeling and service provisioning, offering a robust solution for mission-critical service in optical satellite networks. | 10.1109/TNSM.2026.3697856 |
| Shuyun Luo, Dongmiao Ying, Zhiyi Luo, Weiqiang Xu | Edge-based Approximate Caching for Fast and Scalable Text-to-Image Diffusion Models | 2026 | Early Access | Modeling Clouds Servers Algorithms Image synthesis Text to image Silicon Diffusion models Costing Costs Approximate Caching Edge Computing Diffusion Models Text-to-image Generation | Text-to-image generation applications based on diffusion models face substantial challenges in computational efficiency and latency, particularly in time-sensitive scenarios, due to the inherently iterative denoising process. Although approximate caching techniques can reduce denoising iterations by reusing intermediate states of diffusion models, existing approaches fail to adequately capture user request behaviors. It is observed that users tend to issue a large number of prompt requests within short time intervals (bursty patterns), and that prompts from the same user often exhibit high similarity over short time periods (temporal locality). In this work, we define and formalize the Intermediate State Selection (ISS) problem to minimize denoising iterations. We further prove the NP-hardness of the ISS problem via a polynomial-time reduction from the Dominating Set problem. We then exploit both characteristics of prompt requests and present EdgeDiffusion, a novel edge-cloud cooperative framework in which the cloud retains image generation, while prompts caching and intermediate state selection are offloaded to edge servers. Specifically, we design an ISS algorithm that optimizes state reuse by leveraging temporal locality and an adaptive caching strategy tailored to bursty patterns. Experimental results on real-world datasets demonstrate that EdgeDiffusion achieves 18.3%-77.3% computational savings over baseline strategies (NIRVANA, qLRU-AC, LRU and LFU), while maintaining 98% quality of images. | 10.1109/TNSM.2026.3697787 |
| Emilio Paolini, Andrea Pinto, Luca Valcarenghi, Flavio Esposito | Programmable In-Network Aggregation for Communication-Aware Federated Learning in 5G RANs | 2026 | Early Access | Modeling Timing Training Federated learning Accuracy 5G mobile communication Convergence Aggregates Labeling Point cloud compression Federated Learning Mobile Networks Wireless In-Network Aggregation Grouping | Federated Learning (FL) enables collaborative model training without sharing raw data, making it attractive for privacy-preserving applications at the wireless edge. However, when executed over real 5G networks, FL performance degrades due to uplink congestion, heterogeneous client capabilities, and intermittent connectivity. Most existing approaches attempt to mitigate these issues indirectly by optimizing clients (through adaptive participation, local training, or selection strategies) or by optimizing models (via pruning, quantization, or compression), but they ignore potential network bottlenecks. This paper introduces FLAG, an FL architecture that embeds innetwork aggregation directly into 5G gNodeBs, transforming the network into an active participant in the learning process. In particular, FLAG performs parameter aggregation at line rate within the 5G Service Data Adaptation Protocol layer and incorporates three mechanisms: Partial-Contribution Correction for loss-tolerant averaging, a timer-driven pipeline for real-time scheduling, and a deadline-based grouping strategy to mitigate stragglers. Experiments with realistic wireless emulation show that FLAG achieves up to 5.1× faster time-to-accuracy and maintains accuracy within 0.8% of a loss-free baseline, while reducing gNB-to-server bandwidth by aggregating pergNB rather than per-client. FLAG requires no modifications to clients or the parameter server, demonstrating how 5G-aware system design can make federated learning scalable, efficient, and resilient under real-world wireless conditions. | 10.1109/TNSM.2026.3697723 |
| S Gangadhar, A Chinmayananda, Animesh Roy | Decentralized Adaptive Initial Congestion Window in 5G Using Handshake-Based Flow Classification and Online Learning | 2026 | Early Access | Fluid flow Modeling TCP Information rates Throughput Timing Training Accuracy 5G mobile communication Electronic learning Initial Congestion Window TCP Flow Classification Online Learning Next-Generation Networks Congestion Control Decentralized Framework | The growing prevalence of latency-sensitive applications necessitates increasingly adaptive network congestion control mechanisms. This is particularly critical in Next-Generation Networks, e.g., 5G and beyond 5G networks, which promise low-latency and high-throughput support for diverse traffic. Traditional TCP variants employ a fixed Initial Congestion Window (ICW), resulting in suboptimal performance for short and long flows. This is because a small, fixed ICW unnecessarily prolongs short flows, while also delaying the ramp-up of long flows, leading to under-utilized bandwidth and increased latency. While machine learning based solutions offer improvements, they often rely on centralized architectures, limiting their applicability in decentralized scenarios. This paper introduces a novel decentralized framework that combines lightweight machine learning -based flow classification with online learning for real-time adaptive ICW. We first generate a unique dataset using OMNeT++/Simu5G simulations and select a suitable classifier that leverages TCP handshake features to distinguish short flows from long flows with 97.1% accuracy. This prediction drives an online learning model to dynamically select the efficient ICW before data transmission. Rigorous evaluation across TCP variants such as Reno, NewReno, and Westwood through simulations shows that our proposed framework achieves a 27–36% reduction in flow completion time and a 104-204% increase in throughput compared to baseline implementations. Robustness of our framework is further validated via an ablation study, threshold sensitivity analysis, missing TCP options, statistical significance with 95% confidence intervals (10-seed experiments, p < 0.0001), and multi-UE fairness (Jain index > 0.99). Operating entirely at the UE, our proposed solution eliminates dependence on centralized control, offering a scalable and resilient strategy for Next-Generation Networks. | 10.1109/TNSM.2026.3698279 |
| Xin Hu, Xiantao Jiang, F. Richard Yu, Victor C.M. Leung | Enhancing Adaptive Video Streaming through Bandwidth Prediction with Deep Reinforcement Learning | 2026 | Early Access | Algorithms Videos Bit rate Modeling Bandwidth Training Quality of experience Timing Optimization Streams Adaptive Bitrate (ABR) deep reinforcement learning (DRL) quality of experience (QoE) bandwidth prediction Bidirectional Long Short-Term Memory (BiLSTM) | With the development of HTTP-based video streaming, Adaptive Bitrate (ABR) algorithms have become crucial for optimizing video quality. These algorithms dynamically select the bitrate of video chunks based on factors such as network throughput and playback buffer occupancy. However, the volatility of network throughput, conflicting Quality of Experience (QoE) objectives, and cascading effects in decision-making pose significant challenges for ABR algorithms to accurately determine bitrate selections, leading to substantial revenue losses for content providers. This paper proposes a bandwidth prediction-based ABR algorithm for video streaming, termed the BPA algorithm, which consists of two components: a Bandwidth Prediction Model (BPM) and a Bitrate Selection Model (BSM). The BPM leverages a Bidirectional Long Short-Term Memory (BiLSTM) network for bandwidth prediction, while the BSM adopts an Actor-Critic reinforcement learning framework. A reward function based on bandwidth prediction accuracy is proposed, and an end-to-end joint optimization loss function is designed to train the model for optimal video bitrate selection. Under various network conditions, the BPA algorithm outperforms existing baseline algorithms, achieving an improvement of nearly 31.9% compared to traditional heuristic methods and a 9% enhancement over other deep reinforcement learning-based approaches. The BPA algorithm demonstrates excellent performance in terms of bitrate smoothness and QoE. | 10.1109/TNSM.2026.3696658 |
| Xiang Li, Peijun Dong, Hang Tao, Siyao He, Hanjiang Luo, Jiehan Zhou | AOTSS: Acoustic-optical Communication based Multi-AUV Collaborative Target Search Scheme via Deep Reinforcement Learning | 2026 | Early Access | Algorithms Optical fiber communication Timing Probability Distance measurement Modeling Elementary particles Noise Training Optimization Multi-agent Reinforcement Learning Acoustic–optical Multimodal Communication Autonomous Underwater Vehicles AUV Swarm Network Underwater Target Search | In complex underwater environments, multiple autonomous underwater vehicles (AUVs) typically rely on under-water acoustic communication when performing collaborative target search tasks. However, traditional underwater acoustic technology has communication constraints (e.g., high latency and low bandwidth), which leads to poor information sharing and degrade the performance of multi-AUV collaboration target search missions. To address these challenges, this paper proposes a multi-AUV collaborative acoustic-optical communication based target search scheme (AOTSS), which consists of two main components: a particle filter-based path planning algorithm (PFPPA) and a multi-agent reinforcement learning-based multi-AUV Collaborative Search Algorithm (MASA). In PFPPA, to implement efficient multimodal communication among AUVs, we design a navigation algorithm based on particle filter method and deep reinforcement learning. This approach maximizes optical communication to enhance information sharing among AUVs. Furthermore, In MASA we leverage multimodal communication to enhance information sharing among AUVs to obtain precise probability maps, and incorporate pheromones into these maps to guide AUVs performing efficient cooperate search via the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) approach. Through extensive simulations, the results demonstrate that the proposed scheme significantly enhances the multi-AUV collaboration target search efficiency. | 10.1109/TNSM.2026.3698507 |