Publications

You can also find my articles on my Google Scholar profile.

Category:

Type:

/ publications

A Lightweight Neural Speech Compression Method for Edge Devices
Yu Lu, Yongjian Fu^&, Hao Pan , Dian Ding, Guangtao Xue, Ju Ren
Acta Electronica Sinica (电子学报 CCF T1) · 2025
We redesign the encoder module using a streamlined convolutional neural network architecture and introduce a multi-objective knowledge distillation strategy that integrates perceptual alignment, spectral constraints and adversarial training.
Paper
Show citation
LU Yu, FU Yong-jian, DING Dian, et al. A Lightweight Neural Speech Compression Method for Edge Devices[J/OL]. ACTA ELECTRONICA SINICA, 2025, 1-14. DOI：10.12263/DZXB.20250524.
Aucom: Extreme Compression for Real-Time Edge-to-Server Universal Audio Streaming
Yu Lu, Ran Wang, Dian Ding^&, Yijie Li, Longyuan Ge, Juntao Zhou, Yongzhao Zhang, Yi-Chao Chen, Jiannong Cao, Guangtao Xue
IEEE TMC (CCF A) · 2025
We propose a novel edge-to-server audio streaming architecture that leverages Mel filter bank spectral features to achieve ultra-high compression efficiency. Our system integrates audio denoising, Mel feature extraction,and quantization-based compression at the edge, effectively suppressing environmental and device-induced noise while achieving an extreme compression ratio relative to the original uncompressed audio.
Paper
Show citation
Y. Lu et al., "Aucom: Extreme Compression for Real-Time Edge-to-Server Universal Audio Streaming," in IEEE Transactions on Mobile Computing, doi: 10.1109/TMC.2025.3634628.
MODepth: Benchmarking Mobile Multi-frame Monocular Depth Estimation with Optical Image Stabilization
Yu Lu, Hao Pan^&, Dian Ding^&, Jiatong Ding, Yongjian Fu, Yi-Chao Chen, Ju Ren, Guangtao Xue
ACM SIGGRAPH Asia (CCF A) · 2025
Leveraging multiframe images captured under OIS-controlled lens movements, we design a high-precision depth estimation network, MODNet, and introduce the principal point offset estimation module and pose estimation modules to fully exploit geometric information across frames.
Paper
Show citation
Lu, Yu, et al. "MODepth: Benchmarking Mobile Multi-frame Monocular Depth Estimation with Optical Image Stabilization." Proceedings of the SIGGRAPH Asia 2025 Conference Papers. 2025.
STELLAR: Pacemaker Recognition Using 12-Lead ECG and Spatio-Temporal Harmonic Mechanism
Han Zhang*, Zeyuan Ding*, Leping Yang, Yu Lu, Jiatong Ding, Dian Ding^&, Yiding Qi, Ruogu Li, Guanghui Gao, Yi-Chao Chen, Guangtao Xue
IEEE BIBM (CCF B) · 2025
We propose STELLAR, a novel deep learning framework that integrates a Spatio-Temporal Lead-Harmonic Mechanism to model both the temporal dynamics of ECG waveforms and the spatial coherence across leads.
Paper
A Transform-Domain Approach with Symmetric and Edge Constraints for MRI Super-Resolution
Han Zhang*, Yu Lu*, Ran Wang, Dian Ding^&, Mengying Zhu, Shengyun He, Ling Ma, Yi-Chao Chen, Ruokun Li, Shikui Tu, Guangyu Wu, Guangtao Xue
IEEE BIBM (CCF B) · 2025
In this paper, we propose a novel super-resolution up-sampling pipeline that enhances both the high-frequency and low-frequency components of magnetic resonance imaging.
Paper
Toward High Spatial Resolution and Low Ambiguity in Wideband Signal Receiver Beamforming for VR/AR
Longyuan Ge, Juntao Zhou, Dian Ding, Yi-Chao Chen, Yu Lu, Yida Wang, Guangtao Xue
ACM Ubicomp (CCF A) Workshop MIMSVAI · 2025
We propose a novel receiver beamforming system that employs a dynamically reconfigurable circular microphone array with an adjustable radius, paired with a frequency-adaptive ambiguity suppression algorithm.
Paper
Show citation
Ge, Longyuan, et al. "Toward High Spatial Resolution and Low Ambiguity in Wideband Signal Receiver Beamforming for VR/AR." Companion of the 2025 ACM International Joint Conference on Pervasive and Ubiquitous Computing. 2025.
High-resolution mmWave Imaging using Metasurface and Diffusion
Yida Wang, Yu Lu, Yuxuan Zhou, Yifei Shen, Lili Qiu, Zeyuan Lai, Yi-Chao Chen^&, Hao Pan, Juntao Zhou, Dian Ding, Mei Wang, Guangtao Xue, Qian Zhang
ACM Mobisys (CCF B) · 2025
In this paper, we propose a novel high-resolution mmWave imaging technique that operates with a small, off-the-shelf mmWave module and eliminates the need for any mechanical movement, offering a streamlined, portable solution.
Paper
Show citation
Wang, Yida, et al. "High-resolution mmWave Imaging using Metasurface and Diffusion." Proceedings of the 23rd Annual International Conference on Mobile Systems, Applications and Services. 2025.
Moir´eComm: Secure Screen-camera Communication based on Moir´e Cryptography
Hao Pan, Yongjian Fu^&, Yu Lu, Feitong Tan, Yi-Chao Chen, and Ju Ren
IEEE TDSC(CCF A) · 2025
Addressing this, we propose a novel Moire encryption technique-based secure screen-camera communication system, named MoireComm. The Moire encryption can enhance security by using distinct spatial frequency patterns for camouflage.
Paper
Show citation
Pan, Hao, et al. "MoireComm: Secure Screen-camera Communication Based on Moire Cryptography." IEEE Transactions on Dependable and Secure Computing (2025).
Effective Local Texture Estimation Using Wavelet Transforms for Arbitrary-Scale Super-Resolution
Baihong Qian*, Yu Lu*, Dian Ding^&, Yi-Chao Chen, Qiaoling Xiao, Guanghui Gao, Zhengguang Xiao, Guangtao Xue^&
Springer ICIC (CCF C) Poster · 2025
In this work, we propose a novel Local Wavelet Transformer (LWT) framework that leverages the Discrete Wavelet Transform (DWT) to capture both local textures and global structures, improving the accuracy of fine grained detail restoration.
Paper
Show citation
Qian, Baihong, et al. "Effective Local Texture Estimation Using Wavelet Trans-forms for Arbitrary-Scale Super-Resolution."
Amser+: Accelerating Mobile Speech Emotion Recognition in IoT Environments with Mel Feature Compression
Yu Lu, Ran Wang, Dian Ding^&, Yijie Li, Yongzhao Zhang, Lanqing Yang, Yi-Chao Chen, Guangtao Xue^&
IEEE IoTJ (CCF C & JCR Q1) · 2025
This article proposes Amser+, a real-time SER framework using signal compression and task offloading. Amser+ utilizes logarithmic Mel-filter bank coefficients (Fbank) and singular value decomposition (SVD) for feature extraction and compression.
Paper
Show citation
Lu, Yu, et al. "Amser+: Accelerating Mobile Speech Emotion Recognition in IoT Environments With Mel Feature Compression." IEEE Internet of Things Journal (2025).
TouchHBC: Touch-based Human Body Communication via Leakage Current
Dian Ding, Yijie Li, Hao Pan,Yu Lu, Yongzhao Zhang, Yi-Chao Chen, Guangtao Xue^&
IEEE TMC (CCF A) · 2025
This study introduces TouchHBC, a secure and reliable communication scheme leveraging a smartwatch’s built-in electrodes. This system establishes a touch-based human communication system utilizing a laptop’s leakage current.
Paper
Show citation
Ding, Dian, et al. "TouchHBC: Touch-based Human Body Communication via Leakage Current." IEEE Transactions on Mobile Computing (2025).
M2Silent: Enabling Multi-user Silent Speech Interactions via Multi-directional Speakers in Shared Spaces
Juntao Zhou, Dian Ding^&, Yijie Li, Yu Lu, Yida Wang, Yongzhao Zhang, Yi-Chao Chen^&, Guangtao Xue
ACM CHI (CCF A) · 2025
M2Silent addresses this by allowing users to communicate silently, without producing audible speech, using acoustic sensing integrated into directional speakers. We leverage FMCW signals as audio carriers, simultaneously playing audio and sensing the user’s silent speech.
Paper
Show citation
Zhou, Juntao, et al. "M2SILENT: Enabling Multi-user Silent Speech Interactions via Multi-directional Speakers in Shared Spaces." Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems. 2025.
AMSER: Accelerate Mobile Speech Emotion Recognition with Signal Compression
Yu Lu*, Ran Wang*, Dian Ding^&, Han Zhang, Liyun Zhang, Lanqing Yang, Yi-Chao Chen, Guangtao Xue^&
IEEE ICASSP (CCF B) · 2025
This paper proposes AMSER, a real-time speech emotion recognition framework using signal compression and task offloading. AMSER utilizes logarithmic Mel-filter bank coefficients (Fbank) and singular value decomposition (SVD) for feature extraction and compression.
Paper
Show citation
Lu, Yu, et al. "AMSER: Accelerate Mobile Speech Emotion Recognition with Signal Compression." ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2025.
DASIV: Directional Acoustic Sensing based Intelligent Vehicle Interaction System
Dinghua Zhao*, Juntao Zhou*, Dian Ding^&, Yu Lu, Yijie Li, Hang Yang, Yi-Chao Chen, Guangtao Xue
IEEE IPCCC (CCF C) · 2025
In this paper, we propose DASIV, which utilizes the highly directional nature of ultrasonic signals to achieve fine-grained directional acoustic sensing in vehicle environments.
Paper
Show citation
Zhao, Dinghua, et al. "DASIV: Directional Acoustic Sensing based Intelligent Vehicle Interaction System." 2024 IEEE International Performance, Computing, and Communications Conference (IPCCC). IEEE, 2024.
M3Cam: Extreme Super-resolution via Multi-Modal Optical Flow for Mobile Cameras
Yu Lu*, Dian Ding*, Hao Pan^&, Yongjian Fu, Liyun Zhang, Feitong Tan, Ran Wang, Yi-Chao Chen, Guangtao Xue, Ju Ren
ACM SENSYS (CCF B) · 2024
Our proposed system can generate a 16× SR image from four captured low-resolution images in real-time, with low computational load, low inference latency, and minimal reliance on runtime RAM.
Paper Slides
Show citation
Lu, Yu, et al. "M3cam: Extreme super-resolution via multi-modal optical flow for mobile cameras." Proceedings of the 22nd ACM Conference on Embedded Networked Sensor Systems. 2024.
HandPad: Make Your Hand an On-the-go Writing Pad via Human Capacitance
Yu Lu, Dian Ding^&, Hao Pan,Yijie Li,Juntao Zhou,Yongjian Fu, Yongzhao Zhang, Yi-Chao Chen, Guangtao Xue
ACM UIST (CCF A) · 2024
This paper introduces HandPad, the system that turns the hand into an on-the-go touchscreen, which realizes interaction on the hand via human capacitance. HandPad achieves keystroke and handwriting inputs for letters, numbers, and Chinese characters, reducing the dependency on capacitive or pressure sensor arrays.
Paper Slides
Show citation
Lu, Yu, et al. "HandPad: Make your hand an on-the-go writing pad via human capacitance." Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology. 2024.
Enable Touch-based Communication between Laptop and Smartwatch
Dian Ding, Yijie Li, Hao Pan, Yu Lu, Yi-Chao Chen, Guangtao Xue^&
ACM Ubicomp (CCF A) Poster · 2024
This study introduces TouchHBC, a secure and reliable communication scheme leveraging a smartwatch built-in electrodes. This system establishes a touch-based human communication system utilizing a laptop leakage current.
Paper
Show citation
Ding, Dian, et al. "Enable Touch-based Communication between Laptop and Smartwatch." Companion of the 2024 on ACM International Joint Conference on Pervasive and Ubiquitous Computing. 2024.
HCMG: Human-Capacitance based Micro Gesture for VR/AR
Yu Lu, Dian Ding^&, Ran Wang, Guangtao Xue
ACM Ubicomp (CCF A) Workshop MIMSVAI · 2024
Building on the foundation of human capacitance, this paper introduces a novel approach termed human capacitance-based micro gesture (HCMG) recognition.
Paper
Show citation
Lu, Yu, et al. "HCMG: Human-Capacitance based Micro Gesture for VR/AR." Companion of the 2024 on ACM International Joint Conference on Pervasive and Ubiquitous Computing. 2024.
Visar: Projecting Virtual Sound Spots for Acoustic Augmented Reality Using Air Nonlinearity
Juntao Zhou, Yijie Li^&, Yida Wang, Dian Ding, Yu Lu, Yi-Chao Chen^&, and Guangtao Xue
ACM IMWUT (CCF A) · 2024
This paper proposes Visar, a device-free virtual sound spots projection system leveraging air nonlinearity. Visar achieves simultaneous tracking and sound spot generation while suppressing unintended audio leakages caused by grating lobes and nonlinear effects in mixing lobes through optimization.
Paper
Show citation
Zhou, Juntao, et al. "Visar: Projecting Virtual Sound Spots for Acoustic Augmented Reality Using Air Nonlinearity." Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 8.3 (2024): 1-30.
CarbonNet: Enterprise-Level Carbon Emission Prediction with Large-Scale Datasets
Jinghua Tang, Nan Fang, Lanqing Yang^&, Yuqiao Pei, Ran Wang, Dian Ding, Yu Lu, Guangtao Xue
Springer ICIC (CCF C) · 2024
We propose CarbonNet, a novel firm-level carbon emission prediction scheme. To build large-scale firm-level datasets, we crawled carbon emission data and reporting data (e.g., financial statements) of 3346 companies over 31 years containing 688 data fields, and combined them together.
Paper
Show citation
Tang, Jinghua, et al. "CarbonNet: enterprise-level carbon emission prediction with large-scale datasets." International Conference on Intelligent Computing. Singapore: Springer Nature Singapore, 2024.
VCEMO: Multi-Modal Emotion Recognition for Chinese Voiceprints
Jinghua Tang*, Liyun Zhang*, Yu Lu, Dian Ding^&, Lanqing Yang, YiChao Chen, Minjie Bian, Xiaoshan Li^&, Guangtao Xue
Springer ICIC (CCF C) Poster · 2024
This paper introduces the VCEMO dataset to address this deficiency. The proposed dataset is constructed from everyday conversations and comprises over 100 users and 7,747 textual samples.
Paper
Show citation
Tang, Jinghua, et al. "VCEMO: Multi-Modal Emotion Recognition for Chinese Voiceprints." arXiv preprint arXiv:2408.13019 (2024).
Adaptive Metasurface-Based Acoustic Imaging using Joint Optimization
Yongjian Fu, Yongzhao Zhang, Yu Lu, Lili Qiu, Yi-Chao Chen, Yezhou Wang, Mei Wang, Yijie Li, Ju Ren^&, Yaoxue Zhang
ACM Mobisys (CCF B) · 2024
We leverage a 3D-printed passive acoustic metasurface to significantly enhance the diversity of the measurement data, thereby improving the imaging quality. Specifically, we jointly design the transmission signal, transceivers beamforming weights, metasurface, and imaging algorithm to minimize the imaging reconstruction error in an end-to-end manner.
Paper
Show citation
Fu, Yongjian, et al. "Adaptive metasurface-based acoustic imaging using joint optimization." Proceedings of the 22nd Annual International Conference on Mobile Systems, Applications and Services. 2024.
Pushing the Limits of Acoustic Spatial Perception via Incident Angle Encoding
Yongjian Fu, Yongzhao Zhang, Hao Pan, Yu Lu, Xinyi Li, Lili Chen, Ju Ren^&, Xiong Li, Xiaosong Zhang, Yaoxue Zhang
ACM IMWUT (CCF A) · 2024
In this paper, we introduce MetaAng, a system designed to augment microphone arrays by enabling wideband spatial perception across both speech signals and inaudible sounds by leveraging the spatial encoding capabilities of acoustic metasurfaces.
Paper
Show citation
Fu, Yongjian, et al. "Pushing the limits of acoustic spatial perception via incident angle encoding." Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 8.2 (2024): 1-28.
Effectively Learning Moiré QR Code Decryption from Simulated Data
Yu Lu, Hao Pan^&, Feitong Tan, Yi-Chao Chen,Jiadi Yu, Jinghai He, Guangtao Xue^&
IEEE INFOCOM (CCF A) · 2023
In this work, we propose a deep learning-based Moiré QR code decryption framework and achieve an excellent decryption performance.
Paper Slides
Show citation
Lu, Yu, et al. "Effectively learning moiré qr code decryption from simulated data." IEEE INFOCOM 2023-IEEE Conference on Computer Communications. IEEE, 2023.
Addressing Practical Challenges in Acoustic Sensing To Enable Fast Motion Tracking
Yongzhao Zhang, Hao Pan, Yi-Chao Chen, Lili Qiu, Yu Lu, Guangtao Xue, Jiadi Yu, Feng Lyu, and Haonan Wang
ACM IPSN (CCF B) · 2023
Motivated by many potential applications that could be enabled by acoustic motion tracking, in this paper we systematically examine the factors that limit the accuracy of acoustic tracking in practical scenarios.
Paper
Show citation
Zhang, Yongzhao, et al. "Addressing practical challenges in acoustic sensing to enable fast motion tracking." Proceedings of the 22nd International Conference on Information Processing in Sensor Networks. 2023.

Yu Lu (鲁昱)

Publications