Publications

You can also find my articles on my Google Scholar profile.
/ publications
  1. A Lightweight Neural Speech Compression Method for Edge Devices
    Yu Lu, Yongjian Fu&, Hao Pan , Dian Ding, Guangtao Xue, Ju Ren
    Acta Electronica Sinica (电子学报 CCF T1) · 2025
    We redesign the encoder module using a streamlined convolutional neural network architecture and introduce a multi-objective knowledge distillation strategy that integrates perceptual alignment, spectral constraints and adversarial training.
  2. Aucom: Extreme Compression for Real-Time Edge-to-Server Universal Audio Streaming
    Yu Lu, Ran Wang, Dian Ding&, Yijie Li, Longyuan Ge, Juntao Zhou, Yongzhao Zhang, Yi-Chao Chen, Jiannong Cao, Guangtao Xue
    IEEE TMC (CCF A) · 2025
    We propose a novel edge-to-server audio streaming architecture that leverages Mel filter bank spectral features to achieve ultra-high compression efficiency. Our system integrates audio denoising, Mel feature extraction,and quantization-based compression at the edge, effectively suppressing environmental and device-induced noise while achieving an extreme compression ratio relative to the original uncompressed audio.
  3. MODepth: Benchmarking Mobile Multi-frame Monocular Depth Estimation with Optical Image Stabilization
    Yu Lu, Hao Pan&, Dian Ding&, Jiatong Ding, Yongjian Fu, Yi-Chao Chen, Ju Ren, Guangtao Xue
    ACM SIGGRAPH Asia (CCF A) · 2025
    Leveraging multiframe images captured under OIS-controlled lens movements, we design a high-precision depth estimation network, MODNet, and introduce the principal point offset estimation module and pose estimation modules to fully exploit geometric information across frames.
  4. STELLAR: Pacemaker Recognition Using 12-Lead ECG and Spatio-Temporal Harmonic Mechanism
    Han Zhang*, Zeyuan Ding*, Leping Yang, Yu Lu, Jiatong Ding, Dian Ding&, Yiding Qi, Ruogu Li, Guanghui Gao, Yi-Chao Chen, Guangtao Xue
    IEEE BIBM (CCF B) · 2025
    We propose STELLAR, a novel deep learning framework that integrates a Spatio-Temporal Lead-Harmonic Mechanism to model both the temporal dynamics of ECG waveforms and the spatial coherence across leads.
  5. A Transform-Domain Approach with Symmetric and Edge Constraints for MRI Super-Resolution
    Han Zhang*, Yu Lu*, Ran Wang, Dian Ding&, Mengying Zhu, Shengyun He, Ling Ma, Yi-Chao Chen, Ruokun Li, Shikui Tu, Guangyu Wu, Guangtao Xue
    IEEE BIBM (CCF B) · 2025
    In this paper, we propose a novel super-resolution up-sampling pipeline that enhances both the high-frequency and low-frequency components of magnetic resonance imaging.
  6. Toward High Spatial Resolution and Low Ambiguity in Wideband Signal Receiver Beamforming for VR/AR
    Longyuan Ge, Juntao Zhou, Dian Ding, Yi-Chao Chen, Yu Lu, Yida Wang, Guangtao Xue
    ACM Ubicomp (CCF A) Workshop MIMSVAI · 2025
    We propose a novel receiver beamforming system that employs a dynamically reconfigurable circular microphone array with an adjustable radius, paired with a frequency-adaptive ambiguity suppression algorithm.
  7. High-resolution mmWave Imaging using Metasurface and Diffusion
    Yida Wang, Yu Lu, Yuxuan Zhou, Yifei Shen, Lili Qiu, Zeyuan Lai, Yi-Chao Chen&, Hao Pan, Juntao Zhou, Dian Ding, Mei Wang, Guangtao Xue, Qian Zhang
    ACM Mobisys (CCF B) · 2025
    In this paper, we propose a novel high-resolution mmWave imaging technique that operates with a small, off-the-shelf mmWave module and eliminates the need for any mechanical movement, offering a streamlined, portable solution.
  8. Moir´eComm: Secure Screen-camera Communication based on Moir´e Cryptography
    Hao Pan, Yongjian Fu&, Yu Lu, Feitong Tan, Yi-Chao Chen, and Ju Ren
    IEEE TDSC(CCF A) · 2025
    Addressing this, we propose a novel Moire encryption technique-based secure screen-camera communication system, named MoireComm. The Moire encryption can enhance security by using distinct spatial frequency patterns for camouflage.
  9. Effective Local Texture Estimation Using Wavelet Transforms for Arbitrary-Scale Super-Resolution
    Baihong Qian*, Yu Lu*, Dian Ding&, Yi-Chao Chen, Qiaoling Xiao, Guanghui Gao, Zhengguang Xiao, Guangtao Xue&
    Springer ICIC (CCF C) Poster · 2025
    In this work, we propose a novel Local Wavelet Transformer (LWT) framework that leverages the Discrete Wavelet Transform (DWT) to capture both local textures and global structures, improving the accuracy of fine grained detail restoration.
  10. Amser+: Accelerating Mobile Speech Emotion Recognition in IoT Environments with Mel Feature Compression
    Yu Lu, Ran Wang, Dian Ding&, Yijie Li, Yongzhao Zhang, Lanqing Yang, Yi-Chao Chen, Guangtao Xue&
    IEEE IoTJ (CCF C & JCR Q1) · 2025
    This article proposes Amser+, a real-time SER framework using signal compression and task offloading. Amser+ utilizes logarithmic Mel-filter bank coefficients (Fbank) and singular value decomposition (SVD) for feature extraction and compression.
  11. TouchHBC: Touch-based Human Body Communication via Leakage Current
    Dian Ding, Yijie Li, Hao Pan,Yu Lu, Yongzhao Zhang, Yi-Chao Chen, Guangtao Xue&
    IEEE TMC (CCF A) · 2025
    This study introduces TouchHBC, a secure and reliable communication scheme leveraging a smartwatch’s built-in electrodes. This system establishes a touch-based human communication system utilizing a laptop’s leakage current.
  12. M2Silent: Enabling Multi-user Silent Speech Interactions via Multi-directional Speakers in Shared Spaces
    Juntao Zhou, Dian Ding&, Yijie Li, Yu Lu, Yida Wang, Yongzhao Zhang, Yi-Chao Chen&, Guangtao Xue
    ACM CHI (CCF A) · 2025
    M2Silent addresses this by allowing users to communicate silently, without producing audible speech, using acoustic sensing integrated into directional speakers. We leverage FMCW signals as audio carriers, simultaneously playing audio and sensing the user’s silent speech.
  13. AMSER: Accelerate Mobile Speech Emotion Recognition with Signal Compression
    Yu Lu*, Ran Wang*, Dian Ding&, Han Zhang, Liyun Zhang, Lanqing Yang, Yi-Chao Chen, Guangtao Xue&
    IEEE ICASSP (CCF B) · 2025
    This paper proposes AMSER, a real-time speech emotion recognition framework using signal compression and task offloading. AMSER utilizes logarithmic Mel-filter bank coefficients (Fbank) and singular value decomposition (SVD) for feature extraction and compression.
  14. DASIV: Directional Acoustic Sensing based Intelligent Vehicle Interaction System
    Dinghua Zhao*, Juntao Zhou*, Dian Ding&, Yu Lu, Yijie Li, Hang Yang, Yi-Chao Chen, Guangtao Xue
    IEEE IPCCC (CCF C) · 2025
    In this paper, we propose DASIV, which utilizes the highly directional nature of ultrasonic signals to achieve fine-grained directional acoustic sensing in vehicle environments.
  15. M3Cam: Extreme Super-resolution via Multi-Modal Optical Flow for Mobile Cameras
    Yu Lu*, Dian Ding*, Hao Pan&, Yongjian Fu, Liyun Zhang, Feitong Tan, Ran Wang, Yi-Chao Chen, Guangtao Xue, Ju Ren
    ACM SENSYS (CCF B) · 2024
    Our proposed system can generate a 16× SR image from four captured low-resolution images in real-time, with low computational load, low inference latency, and minimal reliance on runtime RAM.
  16. HandPad: Make Your Hand an On-the-go Writing Pad via Human Capacitance
    Yu Lu, Dian Ding&, Hao Pan,Yijie Li,Juntao Zhou,Yongjian Fu, Yongzhao Zhang, Yi-Chao Chen, Guangtao Xue
    ACM UIST (CCF A) · 2024
    This paper introduces HandPad, the system that turns the hand into an on-the-go touchscreen, which realizes interaction on the hand via human capacitance. HandPad achieves keystroke and handwriting inputs for letters, numbers, and Chinese characters, reducing the dependency on capacitive or pressure sensor arrays.
  17. Enable Touch-based Communication between Laptop and Smartwatch
    Dian Ding, Yijie Li, Hao Pan, Yu Lu, Yi-Chao Chen, Guangtao Xue&
    ACM Ubicomp (CCF A) Poster · 2024
    This study introduces TouchHBC, a secure and reliable communication scheme leveraging a smartwatch built-in electrodes. This system establishes a touch-based human communication system utilizing a laptop leakage current.
  18. HCMG: Human-Capacitance based Micro Gesture for VR/AR
    Yu Lu, Dian Ding&, Ran Wang, Guangtao Xue
    ACM Ubicomp (CCF A) Workshop MIMSVAI · 2024
    Building on the foundation of human capacitance, this paper introduces a novel approach termed human capacitance-based micro gesture (HCMG) recognition.
  19. Visar: Projecting Virtual Sound Spots for Acoustic Augmented Reality Using Air Nonlinearity
    Juntao Zhou, Yijie Li&, Yida Wang, Dian Ding, Yu Lu, Yi-Chao Chen&, and Guangtao Xue
    ACM IMWUT (CCF A) · 2024
    This paper proposes Visar, a device-free virtual sound spots projection system leveraging air nonlinearity. Visar achieves simultaneous tracking and sound spot generation while suppressing unintended audio leakages caused by grating lobes and nonlinear effects in mixing lobes through optimization.
  20. CarbonNet: Enterprise-Level Carbon Emission Prediction with Large-Scale Datasets
    Jinghua Tang, Nan Fang, Lanqing Yang&, Yuqiao Pei, Ran Wang, Dian Ding, Yu Lu, Guangtao Xue
    Springer ICIC (CCF C) · 2024
    We propose CarbonNet, a novel firm-level carbon emission prediction scheme. To build large-scale firm-level datasets, we crawled carbon emission data and reporting data (e.g., financial statements) of 3346 companies over 31 years containing 688 data fields, and combined them together.
  21. VCEMO: Multi-Modal Emotion Recognition for Chinese Voiceprints
    Jinghua Tang*, Liyun Zhang*, Yu Lu, Dian Ding&, Lanqing Yang, YiChao Chen, Minjie Bian, Xiaoshan Li&, Guangtao Xue
    Springer ICIC (CCF C) Poster · 2024
    This paper introduces the VCEMO dataset to address this deficiency. The proposed dataset is constructed from everyday conversations and comprises over 100 users and 7,747 textual samples.
  22. Adaptive Metasurface-Based Acoustic Imaging using Joint Optimization
    Yongjian Fu, Yongzhao Zhang, Yu Lu, Lili Qiu, Yi-Chao Chen, Yezhou Wang, Mei Wang, Yijie Li, Ju Ren&, Yaoxue Zhang
    ACM Mobisys (CCF B) · 2024
    We leverage a 3D-printed passive acoustic metasurface to significantly enhance the diversity of the measurement data, thereby improving the imaging quality. Specifically, we jointly design the transmission signal, transceivers beamforming weights, metasurface, and imaging algorithm to minimize the imaging reconstruction error in an end-to-end manner.
  23. Pushing the Limits of Acoustic Spatial Perception via Incident Angle Encoding
    Yongjian Fu, Yongzhao Zhang, Hao Pan, Yu Lu, Xinyi Li, Lili Chen, Ju Ren&, Xiong Li, Xiaosong Zhang, Yaoxue Zhang
    ACM IMWUT (CCF A) · 2024
    In this paper, we introduce MetaAng, a system designed to augment microphone arrays by enabling wideband spatial perception across both speech signals and inaudible sounds by leveraging the spatial encoding capabilities of acoustic metasurfaces.
  24. Effectively Learning Moiré QR Code Decryption from Simulated Data
    Yu Lu, Hao Pan&, Feitong Tan, Yi-Chao Chen,Jiadi Yu, Jinghai He, Guangtao Xue&
    IEEE INFOCOM (CCF A) · 2023
    In this work, we propose a deep learning-based Moiré QR code decryption framework and achieve an excellent decryption performance.
  25. Addressing Practical Challenges in Acoustic Sensing To Enable Fast Motion Tracking
    Yongzhao Zhang, Hao Pan, Yi-Chao Chen, Lili Qiu, Yu Lu, Guangtao Xue, Jiadi Yu, Feng Lyu, and Haonan Wang
    ACM IPSN (CCF B) · 2023
    Motivated by many potential applications that could be enabled by acoustic motion tracking, in this paper we systematically examine the factors that limit the accuracy of acoustic tracking in practical scenarios.