Last updated: 2026-07-03 05:01 UTC
All documents
Number of pages: 167
| Author(s) | Title | Year | Publication | Keywords | ||
|---|---|---|---|---|---|---|
| Deemah H. Tashman, Soumaya Cherkaoui | Trustworthy AI-Driven Dynamic Hybrid RIS: Joint Optimization and Reward Poisoning-Resilient Control in Cognitive MISO Networks | 2026 | Early Access | Reconfigurable intelligent surfaces Reliability Optimization Security MISO Array signal processing Vectors Satellites Reflection Interference Beamforming cascaded channels cognitive radio networks deep reinforcement learning dynamic hybrid reconfigurable intelligent surfaces energy harvesting poisoning attacks | Cognitive radio networks (CRNs) are a key mechanism for alleviating spectrum scarcity by enabling secondary users (SUs) to opportunistically access licensed frequency bands without harmful interference to primary users (PUs). To address unreliable direct SU links and energy constraints common in next-generation wireless networks, this work introduces an adaptive, energy-aware hybrid reconfigurable intelligent surface (RIS) for underlay multiple-input single-output (MISO) CRNs. Distinct from prior approaches relying on static RIS architectures, our proposed RIS dynamically alternates between passive and active operation modes in real time according to harvested energy availability. We also model our scenario under practical hardware impairments and cascaded fading channels. We formulate and solve a joint transmit beamforming and RIS phase optimization problem via the soft actor-critic (SAC) deep reinforcement learning (DRL) method, leveraging its robustness in continuous and highly dynamic environments. Notably, we conduct the first systematic study of reward poisoning attacks on DRL agents in RIS-enhanced CRNs, and propose a lightweight, real-time defense based on reward clipping and statistical anomaly filtering. Numerical results demonstrate that the SAC-based approach consistently outperforms established DRL base-lines, and that the dynamic hybrid RIS strikes a superior trade-off between throughput and energy consumption compared to fully passive and fully active alternatives. We further show the effectiveness of our defense in maintaining SU performance even under adversarial conditions. Our results advance the practical and secure deployment of RIS-assisted CRNs, and highlight crucial design insights for energy-constrained wireless systems. | 10.1109/TNSM.2026.3660728 |
| Kai Chen, Guangjie Liu, Jiangtao Zhai, Weiwei Liu, Yuewei Dai | SSH-CAM: Fine-Grained SSH Behavior Identification in Encrypted Tunnel Traffic using Curriculum-Adaptive Mixup | 2026 | Early Access | Encrypted tunneling mechanisms are widely deployed for privacy protection and secure communication, while also obscuring application-layer semantics, making fine-grained traffic analysis more challenging. When Secure Shell (SSH) traffic is encapsulated within encrypted tunnels, multiple internal behaviors can coexist within a tunneled flow, such that traffic captured at a tunnel observation point rarely corresponds to a single behavior. Existing tunnel analysis methods focus on protocol- or application-level identification and are not designed for fine-grained SSH behavior identification under complex tunnel scenarios. We present SSH-CAM, a curriculum-guided framework for inferring the dominant SSH behavior at encrypted tunnel observation points, robust to the presence of coexisting interfering behaviors within the captured traffic. SSH-CAM constructs packet-level representations that capture both structural attributes and temporal information, followed by sequence-level feature extraction. A Curriculum-Adaptive Mixup mechanism is introduced to gradually increase training difficulty through controlled structural interpolation. The framework also imposes a learnable Gaussian prototype constraint on the latent representations, fostering intra-class compactness and greater inter-class separation under significant interference. Experiments conducted on a dataset constructed from six widely used tunneling protocols demonstrate that SSH-CAM consistently outperforms existing baselines across varying interference levels, showing robustness in highly mixed tunnel traffic scenarios. | 10.1109/TNSM.2026.3705758 | |
| Madhura Adeppady, Yenchia Yu, Ali Rahmanian, Ahmed Ali-Eldin Hassan, Carla Fabiana Chiasserini | Efficient Management of Composite Heterogeneous Applications at the Network Edge | 2026 | Early Access | Edge computing is a promising paradigm for deploying latency-sensitive applications (Apps) as it brings resources closer to end users. Edge Apps often adopt a microservice (MS) architecture, breaking monolithic Apps into lightweight, containerized MSs that can be dynamically and independently deployed. However, managing such Apps involves three key challenges: (i) optimizing the placement of MSs to reduce both response time and resource overhead, (ii) handling MS migration or relocation as users move while minimizing App service disruption (App downtime), and (iii) enabling MS sharing across Apps while ensuring performance guarantees. We formulate this as an optimization problem, named Multi-microservice Application Placement (MAP), prove its NP-hardness, and introduce STEP (State and Topology-aware Edge-MS Placement), a polynomial-time heuristic. STEP distinguishes itself from prior work by: (i) jointly considering stateful and stateless MS characteristics in deployment decisions, (ii) exploiting MS shareability to reduce resource usage, (iii) balancing response latency, App downtime, and resource utilization, and (iv) leveraging multiple versions of the same MS to adapt quality of service to available edge resources. Our results in a small-scale scenario show that STEP achieves near-optimal performance with only 7% higher CPU cost than the optimal solution. Large-scale real-time experiments on a Kubernetes cluster demonstrate that STEP consistently outperforms competing methods, achieving up to 50% lower deployment costs while delivering 50% gain in app quality and saving 15% in radio resources with over 90% request success rates. | 10.1109/TNSM.2026.3709656 | |
| Kunpeng Zheng, Huibin Zhang, Yongli Zhao, Yuan Cao, Wei Wang, Xin Li, Zhuangzhuang Ma, Lihan Zhao, Jie Zhang | Sun-Outage-Aware Topology Modeling and Adaptive Routing for Optical Satellite Networks | 2026 | Early Access | Sun Interrupters Joining processes Satellites Routing Algorithms Modeling Timing Topology Interference Optical inter-satellite links optical service connections optical satellite network sun outage topology modeling | Optical satellite networks, supported by optical inter-satellite links (OISLs), provide reliable and low-latency optical connectivity. However, periodic and predictable sun outage events significantly compromise OISL availability, leading to frequent OISL interruptions and reduced network reliability. Existing routing algorithms often overlook the regularity of sun outage-induced interrupts and their differentiated impacts on services, resulting in degraded service performance. To address this challenge, this paper proposes a sun outage-enhanced time discretization OISL model and introduces a sun outage link-aware routing (SOLR) algorithm. By incorporating joint awareness of sun outage patterns and service requirements, SOLR employs an adaptive optimization mechanism to dynamically adjust routing decisions within temporal windows. Experimental results demonstrate that SOLR extends stable path durations by 39.9%, reduces interruption rates by 28.5%, and decreases blocking rates by 36.4%, significantly outperforming link-state-based routing algorithms. By effectively mitigating the impact of sun outages, SOLR ensures continuous optical service connections. This interruption-tolerant framework bridges network modeling and service provisioning, offering a robust solution for mission-critical service in optical satellite networks. | 10.1109/TNSM.2026.3697856 |
| Arash Heidari, Jamal N. Al-Karaki | NOVA: A Self-Supervised Graph Framework for Real-Time Anomaly Detection in Internet of Vehicles | 2026 | Early Access | Context Internet of Vehicles Modeling Timing Vehicles Labeling Anomaly detection Matrices Vectors Joining processes Internet of Vehicles V2X Security Anomaly Detection Self-Supervised Learning Graph Neural Networks | The Internet of Vehicles (IoV) enables cooperative driving and real-time Vehicle-to-Everything (V2X) communication but remains vulnerable to behavioral and structural anomalies due to its dynamic, decentralized nature. Existing deep learning methods either overlook topological inconsistencies or ignore communication feature fidelity, while random-walk sampling introduces contextual noise. In this paper, we propose Network Observation for Vehicular Anomalies (NOVA), a self-supervised graph-based framework that detects both behavioral and structural anomalies in IoV networks without labeled data. NOVA models vehicular communications as attributed graphs and employs intimacy-guided subgraph sampling to extract meaningful neighborhoods. A Graph Convolutional Network (GCN)–based generative module reconstructs node attributes to reveal behavioral deviations, while a contrastive module validates structural coherence through embedding comparisons of real and perturbed contexts. Their hybrid anomaly score enables accurate, scalable, and real-time detection of compromised nodes. Performance results show that NOVA achieves state-of-the-art performance (98.7% accuracy, 98.1% F1), real-time throughput (~4.7k events/s at 5k msg/s), and strong robustness (AUROC 0.99, AUPRC 0.98, FAR 0.05) with near-linear scalability (≤40 ms latency for 50k vehicles). By integrating generative and contrastive self-supervised learning with context-aware sampling, NOVA significantly enhances IoV security, reliability, and adaptability. | 10.1109/TNSM.2026.3696324 |
| Emilio Paolini, Andrea Pinto, Luca Valcarenghi, Flavio Esposito | Programmable In-Network Aggregation for Communication-Aware Federated Learning in 5G RANs | 2026 | Early Access | Modeling Timing Training Federated learning Accuracy 5G mobile communication Convergence Aggregates Labeling Point cloud compression Federated Learning Mobile Networks Wireless In-Network Aggregation Grouping | Federated Learning (FL) enables collaborative model training without sharing raw data, making it attractive for privacy-preserving applications at the wireless edge. However, when executed over real 5G networks, FL performance degrades due to uplink congestion, heterogeneous client capabilities, and intermittent connectivity. Most existing approaches attempt to mitigate these issues indirectly by optimizing clients (through adaptive participation, local training, or selection strategies) or by optimizing models (via pruning, quantization, or compression), but they ignore potential network bottlenecks. This paper introduces FLAG, an FL architecture that embeds innetwork aggregation directly into 5G gNodeBs, transforming the network into an active participant in the learning process. In particular, FLAG performs parameter aggregation at line rate within the 5G Service Data Adaptation Protocol layer and incorporates three mechanisms: Partial-Contribution Correction for loss-tolerant averaging, a timer-driven pipeline for real-time scheduling, and a deadline-based grouping strategy to mitigate stragglers. Experiments with realistic wireless emulation show that FLAG achieves up to 5.1× faster time-to-accuracy and maintains accuracy within 0.8% of a loss-free baseline, while reducing gNB-to-server bandwidth by aggregating pergNB rather than per-client. FLAG requires no modifications to clients or the parameter server, demonstrating how 5G-aware system design can make federated learning scalable, efficient, and resilient under real-world wireless conditions. | 10.1109/TNSM.2026.3697723 |
| Heewon Kim, Hochan Lee, Chanbin Bae, Haneul Ko, Sangheon Pack | Traffic- and Multi-Tenancy-Aware In-Network Aggregation Placement for Distributed Machine Learning | 2026 | Early Access | Memory Magnesium Modeling Algorithms Telecommunication traffic Timing Delays Switches Fluid flow Educational institutions In-Network Aggregation Distributed Machine Learning Programmable Data Plane P4 | Distributed machine learning is an effective method to alleviate the intensive computation costs of training; however, it suffers from network bottlenecks while collecting local results. The recent advent of programmable data planes has opened a new avenue, in-network aggregation, which executes gradient aggregations in the middle of the network, resolving network bottlenecks, and further accelerates distributed machine learning. However, due to resource-constrained features of current programmable data planes, deploying in-network aggregation functionalities throughout the network would impose an unacceptable burden, posing a need for sophisticated deployment. In this paper, a problem of deploying in-network aggregation functionalities is studied to minimize the total network traffic in multi-tenant distributed machine learning. We formulate the problem as an integer linear programming (ILP) problem and prove its NP-hardness. Since finding the optimal solution using the brute-force method is extremely complicated, we propose a traffic-aware in-network aggregation placement algorithm based on a two-stage many-to-one matching game (denoted TAPINA-MG). The simulation results demonstrate that TAPINA-MG shows nearoptimal performance with low complexity, achieving up to 22.5%, 38.9%, and 96.0% reduction for network traffic, maximum link utilization, and communication time, respectively, compared to state of the art, and effectively handles dynamic situations with minimal migration delay and comparable traffic performance. | 10.1109/TNSM.2026.3709103 |
| Daishi Kondo, Yuya Shibuya, Rie S. Yamaguchi, Tomohiro Ishihara, Yuji Sekiya, Toshiyuki Nakata, Tohru Asami | Assessing the Adoption of Email Security Measures After Google’s New Sender Guidelines | 2026 | Early Access | Electronic mail Security Modeling Internet Search engines Companies Guidelines Recording Educational institutions Business DKIM DMARC Email authentication Internet measurements Security protocol adoption SPF | The email sender guidelines introduced by Google on October 3, 2023, mandate authentication protocols like Sender Policy Framework (SPF), DomainKeys Identified Mail (DKIM), and Domain-based Message Authentication, Reporting, and Conformance (DMARC) to enhance email security. However, how such platform-driven policies can effectively promote the adoption of security measures across the global email ecosystem remains unclear. In this measurement study, we analyze the impact of these guidelines by examining the adoption of email security measures across globally popular domains and country-specific subsets. Our results show that the adoption of SPF, DKIM, and DMARC has not yet achieved widespread uptake and exhibits significant regional disparities. In particular, domains associated with China, South Korea, and Japan exhibit consistently low adoption rates. While low adoption in China and South Korea can be partially explained by Gmail’s limited influence in these countries, Japan presents a striking contradiction, with low adoption persisting despite Google’s dominance. Focusing on Japanese-stock market-listed companies, we observe a significant increase in DMARC adoption following the introduction of the guidelines; however, a substantial proportion of entities remain non-compliant. These findings suggest that platform-driven policies alone are insufficient to achieve widespread security adoption and highlight the need for broader, ecosystem-level, multi-stakeholder initiatives. | 10.1109/TNSM.2026.3707567 |
| Ashiqur Rahaman Ridoy, Arnab Kumar Biswas | Adaptive Intrusion Detection Systems: Leveraging Meta-Learning for Improved Cybersecurity | 2026 | Early Access | Modeling Fluid flow Labeling Accuracy Metalearning Learning (artificial intelligence) Training Timing Machine learning Optimization Intrusion Detection Systems Low-Shot Learning Anomaly Detection Network Security Metric-Based Adaptation | In the evolving landscape of cybersecurity, the integration of machine learning (ML) into Intrusion Detection Systems (IDS) has become critical for detecting both known and unknown attacks. This paper proposes a novel multi-stage hybrid IDS framework combining unsupervised anomaly detection, supervised classification, and low-shot adaptation for enhanced resilience to concept drift. The architecture comprises three interconnected stages: Stage 1 (unsupervised anomaly gating) and Stage 2 (supervised taxonomy learning) operate in parallel on a shared harmonized feature space; Stage 3 (Hybrid Low-Shot Adapter (H-LSA)) performs low-shot adaptation when the Stage 1 trigger fires, using transferred Stage 2 weights and a prototype-based cosine-kNN jury. Within the meta-learning family, we instantiate a metric-based low-shot adaptation approach eschewing second-order Model-Agnostic Meta-Learning (MAML) in favor of a partial-freeze, first-order protocol with a prototype-based cosine-kNN jury to enable rapid, low-resource adaptation. Extensive experiments were conducted on the CICIDS2017 (Source), CSECIC-IDS2018 (Target), and the modern BCCC-cPacket-Cloud-DDoS-2024 (Target) datasets (hereafter referred to as BCCC-2024). The results demonstrate that while static Stage 2 models suffer catastrophic failure under concept drift (dropping to 45.36% and 38.32% accuracy on CICIDS2018 and harmonized BCCC-2024, respectively), the proposed framework successfully adapts to new environments, achieving 90.64% accuracy on CICIDS2018 (Macro-F1: 0.8981) and 89.70% on BCCC-2024 (Macro-F1: 0.8801) with a low-resource support set of only 500 labeled samples per class. Furthermore, the system exhibits high computational efficiency, achieving a Stage 3 adapted inference latency between 0.0786 ms and 0.1667 ms per flow across diverse traffic profiles, proving its suitability for real-time, scalable deployment in modern cloud and edge network infrastructures. | 10.1109/TNSM.2026.3706597 |
| Jiayi Liu, Jinshuo Wang, Yizhi Huang, Chen Wang | LLM Deployment Strategies on Mobile Edge Servers for Dynamic Uncertain User Requests | 2026 | Early Access | Modeling Large language models Internet of Things Timing Training Algorithms Costing Costs Delays Optimization LLM Agent MEC IoT LLM deployment Task offloading | Leveraging on the task planning and solving capability of pretrained Large Language Models (LLMs), deploying LLM agents on Mobile Edge Computing (MEC) edge servers brings significant benefits for an Internet of Things (IoT) network for providing enhanced AI intelligence with acceptable delay. In this work, we consider the edge LLMs deployment strategy in an end-edge-cloud LLM agents system for the IoT services, which jointly determines the locations and number of LLM initializations and user requests offloading strategy in a dynamic network environment with stochastic user requests. We formulate this joint LLM Deployment and inference Tasks Offloading (LLMDTO) problem. Typically, we design an LLM service performance evaluation mechanism by measuring its processing delay with stochastic user requests arrivals by Stochastic Network Calculus (SNC). Due to the complexity of the LLMDTO problem, we decompose this joint optimization problem into two subproblems and propose an algorithm based on Multi Agent Deep Reinforcement Learning (MADRL) scheme. To accelerate the training process of the DRL, a reward model is designed by applying the Kolmogorov Arnold Networks (KAN) to return a fast reward estimation. Finally, we validate the proposed algorithm through extensive simulations and results show the effectiveness of the proposition on lower deployment cost and delay in a dynamic network environment. | 10.1109/TNSM.2026.3708677 |
| Ibirisol Fontes Ferreira, Eiji Oki | Forestall: A Prefetching Scheme for Domain Name System Resolver Cache Services | 2026 | Early Access | Prefetching Timing Servers Modeling Management Measurement Recording Ecosystems Tracking TV Domain name systems service architecture caching time-to-live renewal policy prefetching | The domain name system (DNS) is crucial to accessing Internet services by playing an essential role in facilitating this process for Internet users. Still, it affects the quality of experience within the Internet service chain. This impact includes the role of the resolver component, which can negatively influence the final user experience when consuming services. Some studies have developed strategies to reduce resolution time within the DNS resolver ecosystem by incorporating components into users’ devices to trigger resolution in advance, changing DNS service and cache algorithm implementation, or utilizing a complex and expensive service architecture that is not scalable for local DNS resolvers in edge deployments. This paper proposes a dynamic prefetching scheme called Forestall to reduce misses, including those caused by expired domain translation data, and to improve the overall performance of the resolver cache component. We model the prefetching scheme for DNS resolvers using DNS transactional information. We define a prefetching advising routine that advises on possible domains by observing past request patterns. We introduce two prefetching routines for efficient domain tracking and advising. We introduce miss-based metrics to measure the efficiency of the prefetching scheme and the potential resource trade-off associated with its deployment. The numerical results indicate that the prefetching scheme improves the performance of the DNS resolver cache component compared to well-deployed prefetching solutions on the Internet. Forestall reduces the miss ratio by more than 50%, depending on the dataset. In a specific workload, Forestall’s results with adjusted parameter combinations yield a decrease in the miss ratio of more than 16%, accompanied by a reasonable increase in additional fetches of around 35%. In terms of service latency that users perceive, Forestall achieves a reduction varying between 20% and 49%. | 10.1109/TNSM.2026.3704549 |
| Behrooz Farkiani, Fan Liu, Ke Yang, John DeHart, Jyoti Parwatikar, Patrick Crowley | Hermes: A General-Purpose Proxy-Enabled Networking Architecture | 2026 | Early Access | Tunneling HTTP Joining processes Planing IP networks Internet TCP Architecture Computer architecture Servers Overlay Networking Proxy HTTP Architecture Tunneling Service Delivery MASQUE NDN Envoy | We introduce Hermes, a general-purpose networking architecture that aims to improve service delivery over the Internet. Hermes delegates networking responsibilities from applications and services to proxies and is designed as a portable, adaptable solution to four fundamental challenges of efficient service delivery over the Internet: end-to-end traffic management, backward compatibility, data-plane security and privacy models, and adaptable communication layers. The design centers on an overlay of reconfigurable proxies and HTTP tunneling and proxying techniques, utilizing assisting components to extend proxy functionality when needed. Through prototyping and emulation, we demonstrate that Hermes improves key performance metrics across multiple use cases: it provides backward compatibility through protocol translation and tunneling, improves reliability by delegating retry logic to proxies, enables unified policy-based Layer 3 routing across network segments, and serves as an efficient substrate for future architectures like NDN, facilitating their operation over the Internet. Beyond evaluating Hermes across various use cases, we measured the overhead of Hermes’ HTTP tunneling and proxying mechanisms and found it to be modest, typically under 2 ms per proxy pair traversal in an isolated collocated setup. Although the HTTP proxying and tunneling techniques used by Hermes increase single-connection processing overhead, we also show that, with up to 1,000 concurrent requests, proxies can amortize connection setup time and reduce end-to-end latency by utilizing connection pooling and multiplexing. | 10.1109/TNSM.2026.3705327 |
| Yiyang Li, Wei Wang, Yibo Wang, Qiaojun Hu, Weiliang Zhang, Yongli Zhao, Xiaoyu Wang, Jie Zhang | Computing-State Driven Proactive Congestion Control for AI Cluster Interconnect Networks | 2026 | Early Access | Timing Modeling Fluid flow Information rates Throughput Switches Training Data centers Conferences Joining processes large language model remote direct memory access congestion control algorithms distributed training | The rapid upgrade of computing power and the prosperity of large language model (LLM) in data center networks (DCNs) lead to a rigorous demand for ultra-low latency and high throughput. To mitigate the overhead of collective communication during distributed training (DT), Remote Direct Memory Access (RDMA) has been widely adopted in DCNs. Particularly, congestion control algorithms (CCAs) designed for RDMA have attracted much attention to mitigate performance deterioration under network congestion. However, through comprehensive analysis, we investigate that, due to sluggish end-to-end reaction and slow rate convergence, existing widely used reactive CCAs have several limitations in handling bursty traffic (e.g., AllReduce). Specifically, excessive packets are transmitted before senders activate the reaction and converge to the fair rate, which builds up a deep queue and may incur subsequent significant throughput loss. In this paper, we propose a computing-state driven proactive congestion control (CSPCC) with easy deployability. CSPCC consists of the congestion prediction module and the active congestion response module. It leverages current computing state to predict network congestion time and inform corresponding sources in advance. We provide a detailed introduction to the implementation of CSPCC. Then, we conducted small-scale hardware tests and large-scale simulations to evaluate the performance of CSPCC. On our testbed, under NCCL-TESTs, CSPCC improves throughput by 1.67%–13.35% and decreases switch queue occupancy by 28.33%–58.33% compared to DCQCN. Furthermore, under concurrent multi-job LLaMA training, it reduces end-to-end job completion time (JCT) by 5.3%–9.0%. | 10.1109/TNSM.2026.3705429 |
| Huanlin Liu, Bing Ma, Yong Chen, Bo Liu, Haonan Chen, Jiachen Zou | Virtual Network Embedding Based on Hierarchical Reinforcement Learning for Admission Decision and Policy Fine-Tuning in Elastic Optical Network | 2026 | Early Access | Joining processes Elastic optical networks Algorithms Modeling Substrates Resource management Costing Costs Optimization Tuning Elastic optical network virtual network embedding graph convolutional network hierarchical reinforcement learning revenue-cost ratio | Network virtualization (NV) provides flexible services for diverse services by decoupling elastic optical network (EON) resources. Virtual optical network embedding aims to allocate the finite resources of EON to sequentially arriving virtual network requests (VNRs) with different resource demands. But existing methods have limitations, such as insufficient global optimization ability and a lack of awareness of link features. We propose a hierarchical reinforcement learning algorithm for admission decision and policy fine-tuning (HRL-ADPT), which achieves efficient virtual optical network embedding through a dual-layer collaborative optimization mechanism and a customized link-aware graph convolutional network (GCN) tailored for EON. The HRL framework decomposes the virtual network embedding process into two stages: 1) The upper-level agent generates admission decision and initial node embedding strategies based on topological and link features extracted by GCN, maximizing the revenue-cost ratio of individual VNR; 2) The lower-level agent dynamically fine-tunes the initial policy in combination with global resource load to optimize long-term resource utilization. The proximal policy optimization (PPO) algorithm is adopted as the basic training method. To address the sparse reward problem, the lower-level agent adopts a multi-objective intrinsic reward function, incorporating the revenue-cost ratio and load balancing to ensure local adjustments align with global objectives. Simulation experiments show that the proposed algorithm outperforms the compared NRM-VNE, MCTS-VNE, and HCMARL-VNE algorithms in terms of acceptance ratio, revenue-cost ratio, and spectrum utilization ratio. | 10.1109/TNSM.2026.3706998 |
| Yahuza Bello, Ahmed Refaey, Ping Yang | Secure Multi-Timescale Orchestration for Zero-Trust Cross-Datacenter Networks | 2026 | Early Access | Authentication Optimization Resource management Modeling Costing Costs Timing Data centers Learning (artificial intelligence) Security Zero trust architecture hierarchical deep reinforcement learning cross-datacenter networks multi-timescale optimization resource management | The widespread deployment of geographically distributed Data Centers (DCs) has intensified the need for scalable and secure access control mechanisms across Cross-Datacenter Networks (CDNs). Zero Trust Architecture (ZTA) addresses this need by enforcing continuous authentication and authorization through Policy Decision Points (PDPs); however, determining where to deploy PDPs and how to dynamically assign authentication requests in the CDNs remains a challenging and NP-hard problem. This challenge arises from the tight coupling between long-term placement decisions and short-term, stochastic authentication workloads. In this paper, we formulate a joint PDP placement and authentication assignment problem for zero-trust-enabled CDNs that minimizes deployment cost, authentication assignment cost, bandwidth consumption, and the number of active PDP instances under resource constraints. To efficiently solve the problem, we propose a Hybrid Hierarchical Deep Reinforcement Learning (HHDRL) framework that decomposes decision-making across multiple time scales. A high-level Double Deep Q-Network (DDQN) agent learns long-term PDP placement policies, while multiple low-level Asynchronous Advantage Actor–Critic (A3C) agents perform real-time authentication assignment within each DC. Extensive simulations demonstrate that the proposed DDQN–A3C framework converges reliably and consistently outperforms benchmark schemes, including DDQN–A2C, a single-agent DDQN approach, and a greedy baseline, achieving lower overall system cost and improved scalability with modest computational overhead. | 10.1109/TNSM.2026.3707392 |
| Juan Zhang, Yangjun Ma, Xunzheng Zhang, Zhao Huang, Qiuji Yi, Nauman Aslam | Multi-objective SFC Placement with Future Demand Awareness in Dynamic Cross-Domain Networks | 2026 | Early Access | Modeling Optimization Transformers Topology Resource management Tin Modules (abstract algebra) Availability Service function chaining Scalability Service function chaining cross-domain networks multi-objective optimization resource allocation predictive modeling | Efficient service function chain (SFC) placement is critical for optimizing network service delivery in dynamic cross-domain networks (CDNs), especially under resource-constrained and heterogeneous environments. However, existing approaches face fundamental limitations in achieving effective multi-objective optimization, particularly in balancing latency minimization with efficient resource utilization. These challenges are further compounded by the inability to capture future resource dynamics and limited visibility across multiple domains. To address these challenges, we propose a novel multi-objective framework for SFC placement that jointly considers latency and resource utilization. The framework integrates Transformer-based prediction with linear programming (LP) to explicitly model future deployability, enabling proactive and globally informed placement decisions. In addition, a dynamic modeling mechanism is developed using domain-aware detection and graph autoencoders (GAEs) to capture evolving network topologies and cross-domain structural dependencies. A Pareto-based optimization strategy is further employed to systematically balance latency and resource efficiency across heterogeneous domains and varying workload conditions. Extensive experiments across multiple network scales and diverse SFC configurations demonstrate that the proposed framework achieves a superior trade-off between latency and deployment capability, while improving scalability, robustness, and long-term resource efficiency in dynamic and large-scale CDN environments. | 10.1109/TNSM.2026.3708714 |
| Jing Zhang, Chao Luo, Rui Shao | MTG-GAN: A Masked Temporal Graph Generative Adversarial Network for Cross-Domain System Log Anomaly Detection | 2026 | Early Access | Anomaly detection Adaptation models Generative adversarial networks Feature extraction Data models Load modeling Accuracy Robustness Contrastive learning Chaos Log Anomaly Detection Generative Adversarial Networks (GANs) Temporal Data Analysis | Anomaly detection of system logs is crucial for the service management of large-scale information systems. Nowadays, log anomaly detection faces two main challenges: 1) capturing evolving temporal dependencies between log events to adaptively tackle with emerging anomaly patterns, 2) and maintaining high detection capabilities across varies data distributions. Existing methods rely heavily on domain-specific data features, making it challenging to handle the heterogeneity and temporal dynamics of log data. This limitation restricts the deployment of anomaly detection systems in practical environments. In this article, a novel framework, Masked Temporal Graph Generative Adversarial Network (MTG-GAN), is proposed for both conventional and cross-domain log anomaly detection. The model enhances the detection capability for emerging abnormal patterns in system log data by introducing an adaptive masking mechanism that combines generative adversarial networks with graph contrastive learning. Additionally, MTG-GAN reduces dependency on specific data distribution and improves model generalization by using diffused graph adjacency information deriving from temporal relevance of event sequence, which can be conducive to improve cross-domain detection performance. Experimental results demonstrate that MTG-GAN outperforms existing methods on multiple real-world datasets in both conventional and cross-domain log anomaly detection. | 10.1109/TNSM.2026.3654642 |
| Lion Steger, Liming Kuang, Johannes Zirngibl, Georg Carle, Oliver Gasser | Still on Target? An Evaluation of IPv6 Target Generation Algorithms | 2026 | Early Access | Internet measurements are a crucial foundation of IPv6-related research. Due to the infeasibility of full address space scans for IPv6 however, those measurements rely on collections of reliably responsive, unbiased addresses, as provided e.g., by the IPv6 Hitlist service. Although used for various use cases, the hitlist provides an unfiltered list of responsive addresses, the hosts behind which can come from a range of different networks and devices, such as web servers, customer-premises equipment (CPE) devices, and Internet infrastructure. In this paper, we demonstrate the importance of tailoring hitlists in accordance with the research goal in question. By using PeeringDB we classify hitlist addresses into six different network categories, uncovering that 42% of hitlist addresses are in ISP networks. Moreover, we show the different behavior of those addresses depending on their respective category, e.g., ISP addresses exhibiting a relatively low lifetime. Furthermore, we analyze different Target Generation Algorithms (TGAs), which are used to increase the coverage of IPv6 measurements by generating new responsive targets for scans. We use seed sets, e.g., based on the categorized Hitlist. We evaluate the performance of TGAs under various conditions and find generated addresses to show vastly differing responsiveness levels for different TGAs. Furthermore, we evaluate of algorithm run times and differences between multiple TGA runs. | 10.1109/TNSM.2026.3705935 | |
| Jeffrey Redondo, Nauman Aslam, Juan Zhang, Zhenhui Yuan | Optimising QoS in HD Map Updates: Cross-Layer Multi-Agent with Multi-task and Mixed-Dependence (MTMD) | 2026 | Early Access | Optimization Timing High definition video Quality of service Media Access Control Information rates Throughput Vehicles Modeling Videos Edge computing HD map hierarchical learning latency multi-agent offloading reinforcement learning | High-definition (HD) maps generated from autonomous vehicle (AV) sensor data are essential for enabling high levels of driving automation. However, offloading large volumes of raw sensory data to edge servers in dense vehicular ad hoc networks (VANETs) introduces significant latency due to network congestion and packet collisions. Existing solutions primarily focus on dynamically adjusting the minimum contention window (CWmin), while additional MAC-layer parameters — including the maximum contention window (CWmax) and interframe space number (IFSn) — remain largely underexplored. To address this, we propose a cross-layer multi-agent reinforcement learning (MARL) framework that jointly optimises CWmin–CWmax, IFSn, and transmission waiting time within IEEE 802.11p-compliant bounds. The proposed multi-task mixed-dependence (MTMD) framework decomposes the optimisation problem into specialised subtasks handled by selectively coupled agents, balancing coordination and scalability while avoiding the overhead of fully symmetric MARL or centralised hierarchical controllers. A lightweight orchestration layer coordinates agent interaction with the simulation environment via secure message exchange. Evaluated against standard EDCA and representative RL baselines, MTMD achieves latency reductions of 31%, 49%, 87.3%, and 64% for Voice, Video, HD Map, and Best-Effort traffic, respectively, confirming the effectiveness of structured multi-parameter optimisation for latency-critical vehicular applications. | 10.1109/TNSM.2026.3705270 |
| Ibirisol Fontes Ferreira, Cassio Vinicius Serafim Prazeres, Maycon Leone Maciel Peixoto, Eiji Oki, Gustavo Bittencourt Figueiredo | Narrow: A Fair Routing Multicast Algorithm for Distributed Interactive Applications in Edge Networks | 2026 | Early Access | Delays Algorithms Timing Routing Measurement Servers Modeling Games Topology Joining processes Distributed interactive application edge computing multicast routing network virtualization overlay network shortest path k-shortest path delay and delay variation fairness | Recent research in networking has increasingly focused on addressing the challenges of edge network services. A crucial issue in this context is routing, which must account for quality-of-service requirements. In particular, multicast routing provides optimized network services for groups of people using the same application, which is advantageous for operators and application providers. However, latency-constrained routing poses challenges when integrating diverse requirements into the routing computation, particularly when fairness among users is required. This work addresses the fairness requirement in multicast-overlaid and virtualized networks by presenting a solution that improves the equity of group interactions in the routing service. Our proposal, named Narrow, achieves fairer group interaction by selecting improved path options for multicast routing in edge networks. We compared Narrow with the Fair Shortest Path Tree (FSPT) and Chains algorithms from related studies on delay-constrained routing. Simulations indicated that Narrow reduced the inter-destination delay deviation by up to 84% and 49% relative to FSPT and Chains, respectively, across topologies of varying sizes. Similarly, Narrow improved by more than 99% against FSPT and by 70% against Chains across topologies with varying node degrees. Depending on the number of allowed alternative paths, Narrow reduced the inter-destination delay deviation by more than 99% compared with FSPT and by 38% compared with Chains. In emulated distributed interactive application session experiments, Narrow delivered the fairest response time, reducing it by 89% and 86% relative to FSPT and Chains, respectively. Furthermore, fairness in players’ scores improved by 20% and 16%, respectively, yielding more equitable group interaction from the application’s perspective. | 10.1109/TNSM.2026.3704927 |