Astra: Activation-Space Tail-Eigenvector Low-Rank Adaptation of Large Language Models
ACL 2026 Findings
Kainan Liu*, Yong Zhang*, Ning Cheng†, Yun Zhu, Yanmeng Wang, Shaojun Wang, Jing Xiao
Proposes a novel PEFT method that estimates the activation covariance matrix from a task-specific calibration set (as few as 64 samples), performs eigendecomposition to extract the tail eigenvector subspace, and constrains low-rank updates within this weakly-activated subspace to avoid interfering with the pretrained model's dominant representations. Evaluated on 16 benchmarks spanning GLUE, mathematical reasoning, code generation, and commonsense reasoning, Astra significantly outperforms LoRA, DoRA, PiSSA, LoRA-GA and other PEFT methods — achieving substantial gains on CoLA and MRPC where standard LoRA collapses — and surpasses full fine-tuning in certain scenarios. Code open-sourced.
GRASP: Replace Redundant Layers with Adaptive Singular Parameters for Efficient Model Compression
EMNLP 2025 Main Conference
Kainan Liu*, Yong Zhang*, Ning Cheng†, Zhitao Li, Shaojun Wang, Jing Xiao
A gradient-guided hybrid compression framework that combines redundant layer identification with adaptive singular parameter selection. It locates functionally redundant Transformer layers via cosine similarity of hidden states between adjacent layers, then uses gradient-attributed importance scores (rather than magnitude-based heuristics) to adaptively retain critical singular components from SVD, replacing full layers with lightweight parameterizations. At 20% compression, it preserves ~90% of original performance across 19 datasets spanning 5 model families (LLaMA, LLaMA 2/3, Mistral-7B), outperforming SliceGPT, LaCo, LLM-Pruner, and others. The compression process is training-free and completes in ~0.16 hours on a single A100 GPU.
Detecting and dissecting anomalous anatomic regions in spatial transcriptomics with STANDS
Nature Communications
Kaichen Xu†, Yan Lu, Suyang Hou, Kainan Liu, Yihang Du, Mengqian Huang, Hao Feng, Hao Wu, Xiaobo Sun*
A GAN-based multi-task deep learning framework that jointly models gene expression and histology image information for detection, cross-sample alignment, and subtyping of anomalous tissue domains in spatial transcriptomics data. It integrates multi-modal features via graph attention networks and Transformer fusion modules, validated across multiple platforms including 10x Visium, Slide-seqV2, and Stereo-seq. STANDS can detect early-stage cancerous regions visually indistinguishable in histology images, and is the only method that accurately performs cross-sample alignment and anomaly subtyping in multi-sample settings.
* Equal contribution † Corresponding author