This collection of papers explores diverse advancements in wireless communication and signal processing, ranging from channel modeling and optimization to novel applications of deep learning and reconfigurable intelligent surfaces (RIS). Several works focus on enhancing the performance and robustness of communication systems in challenging environments. Askari and Lampe (2024) Askari & Lampe (2024) delve into probabilistic shaping for nonlinearity tolerance in optical fiber communication, investigating the interplay between shaping distributions and nonlinear interference noise. Their analysis, based on a first-order time-domain perturbation approximation, revisits the concepts of linear and nonlinear shaping gain, particularly for probabilistic amplitude shaping.
Complementing this exploration of enhancing communication robustness, Chen et al. (2024) Chen et al. (2024) explore the nascent field of wireless-native big AI models (BAIMs). They highlight the peculiarities of wireless intelligence compared to language intelligence and propose methodologies for developing specialized BAIMs. This push towards leveraging AI for wireless systems is echoed in other works focusing on channel prediction and signal detection.
The application of RIS for improved communication and sensing is a recurring theme. Abrardo and Bartoli (2024) Abrardo & Bartoli (2024) propose a framework for RIS-aided covert communications using multiport network theory, considering practical aspects like mutual coupling and statistical CSI. Similarly, Ni et al. (2024) Ni et al. (2024) investigate the potential of RIS for the Internet of Robotic Things (IoRT), proposing solutions based on multi-agent deep reinforcement learning and multi-objective optimization for enhancing communication, sensing, computation, and energy harvesting in robotic networks. Kim and Jeong (2024) Kim & Jeong (2024) address the challenge of secure communication in cell-free ISAC systems, proposing joint fronthaul compression and beamforming optimization to maximize sensing performance while guaranteeing secrecy.
Several papers introduce novel methodologies and applications of deep learning in wireless systems. Liu et al. (2024) Liu et al. (2024) introduce WiFo, a wireless foundation model for channel prediction, leveraging masked autoencoders for self-supervised pre-training on a large-scale simulated CSI dataset. This approach aims to achieve zero-shot generalization across diverse CSI configurations. Pan et al. (2024) Pan et al. (2024) propose a residual channel-based data augmentation strategy for radio frequency fingerprint identification (RFFI), combined with a lightweight SimSiam contrastive learning framework, to improve generalization with limited data. In the realm of signal processing, Kim et al. (2024) Kim et al. (2024) present a fully Bayesian approach to wideband direction-of-arrival (DoA) estimation and detection, employing non-reversible jump Markov chain Monte Carlo (NRJMCMC) for efficient posterior inference. This Bayesian approach offers a comprehensive solution for both detection and estimation tasks.
Beyond deep learning, other innovative signal processing techniques are explored. Chen et al. (2024) Chen et al. (2024) propose a modified Baum-Welch algorithm for joint blind channel estimation and turbo equalization, demonstrating improved convergence and estimation accuracy. They also analyze scenarios where a joint design may not be advantageous. Similarly, Chen et al. (2024) Chen et al. (2024) investigate turbo receiver design for coded differential BPSK in bursty impulsive noise channels, comparing separate and joint detection and demapping approaches. Wang et al. (2024) Wang et al. (2024) introduce a parameter estimation-based automatic modulation recognition (AMR) method for RF signals, using power spectrum analysis for carrier frequency and bandwidth estimation, coupled with an LSTM network for efficient and accurate modulation classification. Finally, Liu et al. (2024) Liu et al. (2024) provide experimental analysis and modeling of penetration loss for building materials in FR1 and FR3 bands, contributing valuable data for channel modeling and standardization.
Several papers also explore specific applications and practical considerations in wireless systems. Liu et al. (2024) Liu et al. (2024) propose an active sampling and Gaussian reconstruction method for RF Radiance Field, offering a training-free alternative to neural network-based approaches. Meneses et al. (2024) Meneses et al. (2024) introduce a physics-informed latent diffusion model (PI-LDM) for synthesizing training datasets for MRI-based fat quantification, addressing the challenge of limited data in medical imaging applications. Bhatti et al. (2024) Bhatti et al. (2024) compare deep learning approaches for harmful brain activity detection using EEG, emphasizing the importance of training strategies and data preprocessing. Kristensen et al. (2024) Kristensen et al. (2024) present an SDR-based monostatic Wi-Fi system with analog self-interference cancellation for sensing, demonstrating its capability for vital sign monitoring. Bahingayi et al. (2024) Bahingayi et al. (2024) focus on joint beamforming design for large-scale downlink RIS-assisted MU-MIMO systems, proposing an efficient algorithm with linear complexity scaling. Finally, Zakerimanesh et al. (2024) Zakerimanesh et al. (2024) investigate the impact of rigid communication topologies on vehicular platoon performance, highlighting the benefits of forward-looking communication.
Further contributions include Jacobellis and Yadwadkar (2024) Jacobellis & Yadwadkar (2024), who introduce WaLLoC, a novel neural codec architecture for learned compression, suitable for compressed learning tasks. Knight and Saito (2024) Knight & Saito (2024) propose a novel multiscale spatial phase estimate using the structure multivector, demonstrating its robustness in low SNR scenarios. Mo et al. (2024) Mo et al. (2024) introduce a physics-informed neural network for capacitive touch sensor modeling, leveraging Maxwell's equations to create an efficient surrogate model. The COST INTERACT Whitepaper Burr et al. (2024) provides a broader overview of signal processing challenges and approaches for 6G communications, localization, and integrated sensing. Köppel et al. (2024) Köppel et al. (2024) demonstrate the application of machine learning for predicting NOx emissions in biochar production plants. Liu et al. (2024) Liu et al. (2024) present a data-and-model-driven framework for 5G NR monostatic positioning with array impairments, demonstrating improved positioning accuracy through deep learning and MUSIC. These diverse contributions highlight the ongoing efforts to optimize and enhance the performance of wireless communication systems across various applications and challenging environments.
Towards Wireless-Native Big AI Model: Insights into Its Ambitions, Peculiarities and Methodologies by Zirui Chen, Zhaoyang Zhang, Chenyu Liu, Ziqing Xing https://arxiv.org/abs/2412.09041
Caption: This diagram contrasts human-centric intelligence, exemplified by Large Language Models (LLMs) trained on human-generated data, with hyper-cognitive intelligence, the basis for Wireless Big AI Models (wBAIMs). wBAIMs leverage data from natural and industrial systems, like wireless networks, and are optimized through system feedback, aiming to surpass human cognitive limitations in understanding complex scientific laws.
The integration of Artificial Intelligence (AI) into wireless systems holds immense potential, but current approaches, often adapted from other domains like computer vision or natural language processing, face limitations. This paper champions the development of wireless-native Big AI Models (wBAIMs) specifically designed for the unique characteristics of wireless systems. A review of recent research reveals a growing interest in BAIMs for wireless, but a distinct lack of focus on truly wireless-native approaches. This paper aims to fill that gap.
The authors emphasize a fundamental difference in the type of intelligence being pursued. Large Language Models (LLMs) represent human-centric intelligence, learning from human-generated data and refined by human feedback. In contrast, wBAIMs embody hyper-cognitive intelligence, learning from the intricate workings of natural and industrial systems, like electromagnetic signals, and optimized by system feedback. This allows wBAIMs to potentially transcend human cognitive limitations, particularly in understanding and applying complex scientific laws like those governing wireless communication. Wireless networks, with their well-defined structures, high digitization, and abundant data, offer an ideal testing ground for this type of AI.
wBAIMs possess distinct characteristics that set them apart from LLMs and conventional wireless models. They must process multi-modal data like Channel State Information (CSI) and modulation symbols, operate under stringent latency requirements for real-time applications, and function within decentralized network architectures with distributed data and processing. Crucially, wBAIMs must capture the fundamental mechanisms of electromagnetic laws, moving beyond mere memorization towards a physics-driven model design.
The paper proposes a methodology for constructing wBAIMs inspired by the evolution of LLMs. This involves: hybrid data collection (combining real and simulated data); a physics-driven learning paradigm; establishing wireless scaling laws to guide model growth and emergent behavior; using structural prompts for efficient adaptation without retraining; and collaboration with AI agents for enhanced usability. These methods aim to leverage the unique strengths of wireless systems, such as vast data availability and system feedback, to build powerful and generalizable AI models.
By addressing the specific challenges and opportunities of the wireless domain, wBAIMs hold the potential to unlock unprecedented capabilities. This includes unifying multiple tasks and scenarios, enabling all-in-one scheduling, and ultimately achieving ubiquitous intelligence in wireless networks. The development of wBAIMs represents a significant step towards realizing the full potential of AI in revolutionizing wireless communications.
WiFo: Wireless Foundation Model for Channel Prediction by Boxun Liu, Shijian Gao, Xuanyu Liu, Xiang Cheng, Liuqing Yang https://arxiv.org/abs/2412.08908
Caption: This figure illustrates the WiFo (Wireless Foundation Model) architecture, a masked autoencoder (MAE) based model for channel prediction. The model takes space-time-frequency (STF) channel state information (CSI) as input, processes it through an encoder and decoder, and reconstructs the masked CSI during self-supervised pre-training. This pre-trained model can then perform zero-shot inference for time and frequency domain channel prediction tasks without further training.
Channel prediction, essential for acquiring CSI without excessive signaling overhead, has traditionally relied on task-specific models. This paper introduces WiFo, a novel wireless foundation model designed for a unified approach to channel prediction. WiFo leverages the power of foundation models and self-supervised pre-training to generalize across various prediction tasks and CSI configurations without fine-tuning.
WiFo employs a masked autoencoder (MAE)-based architecture, processing heterogeneous space-time-frequency (STF) CSI data through 3D patching and embedding. A novel positional encoding (STF-PE) structure captures 3D position information within the CSI. Self-supervised pre-training uses masked reconstruction tasks (random, time, and frequency-masked) to learn the inherent 3D variations of CSI and spatiotemporal correlations. The pre-trained model then performs zero-shot inference, eliminating retraining overhead.
The researchers built a large-scale simulated CSI dataset with diverse configurations for training and evaluation. WiFo's performance was assessed on time-domain channel prediction (predicting future CSI from historical data: H[T<sub>h</sub> + 1 : T, :, :] = Φ<sub>t</sub>(H[1 : T<sub>h</sub>, :,:])) and frequency-domain prediction (predicting CSI in adjacent frequency bands: H[:, K<sub>u</sub> + 1 : K, :] = Φ<sub>f</sub>(H[:, 1 : K<sub>u</sub>, :])). Comparisons were made against various baselines, including traditional and deep learning methods.
Results demonstrated WiFo's superior performance and remarkable zero-shot generalization. WiFo achieved significantly lower Normalized Mean Squared Error (NMSE) compared to baselines in multi-dataset unified learning. Impressively, its zero-shot performance on unseen scenarios surpassed the full-shot performance of all baselines, highlighting its adaptability to new configurations without retraining. Ablation studies confirmed the effectiveness of the proposed STF-PE and masked reconstruction tasks.
Learned Compression for Compressed Learning by Dan Jacobellis, Neeraja J. Yadwadkar https://arxiv.org/abs/2412.09405
This paper tackles the challenge of processing high-resolution data in resource-constrained machine learning systems. While compressed-domain learning offers a solution by operating on compact representations, existing compression methods are not ideal. The paper introduces WaLLoC (Wavelet Learned Lossy Compression), a novel neural codec architecture designed to address the limitations of current approaches.
WaLLoC distinguishes itself by combining linear transform coding with nonlinear dimensionality reduction. By sandwiching a shallow, asymmetric autoencoder and an entropy bottleneck between an invertible wavelet packet transform, WaLLoC achieves computationally efficient encoding. The wavelet transform pre-processes the signal, simplifying the autoencoder's task by operating at a lower resolution. The asymmetry of the autoencoder, with a simple linear analysis transform and a more complex nonlinear synthesis transform, further optimizes for encoding efficiency.
High compression ratios are achieved through the use of an entropy bottleneck during training. This technique enhances the resilience of latent representations to quantization, facilitating further compression using standard lossless codecs. Importantly, WaLLoC provides uniform dimensionality reduction, making it a direct replacement for conventional resolution reduction methods in machine learning pipelines. The encoder efficiently projects high-dimensional signal patches to low-dimensional latent representations.
Evaluations on image and audio compression tasks showcased WaLLoC's superior performance. For images, WaLLoC achieved significantly higher compression ratios than existing methods while maintaining comparable quality and offering higher dimensionality reduction. For audio, WaLLoC demonstrated substantial improvements in spatial quality and encoding throughput. Furthermore, the efficacy of WaLLoC for compressed-domain learning was demonstrated across various tasks, including image classification, colorization, document understanding, and music source separation. In each case, WaLLoC outperformed resolution reduction, demonstrating significant accuracy gains while maintaining computational efficiency.
This newsletter highlights a convergence of trends in wireless communication and signal processing. The push for more intelligent and adaptable systems is evident in the development of wireless-native BAIMs, like those proposed by Chen et al., and foundation models like WiFo, which promises zero-shot generalization for channel prediction. The need for efficient processing of high-resolution data is addressed by innovative compression techniques like WaLLoC, enabling compressed-domain learning with minimal performance loss. These advancements, alongside explorations in RIS, Bayesian DoA estimation, and novel signal processing algorithms, represent a significant step towards more robust, efficient, and intelligent wireless communication systems of the future.