Chenyu Wang

I am a researcher at Harvard University. I work on AI and systems, with a focus on autonomous AI agents for systems engineering.

This semester, I'm on the course staff for CS249r: Agentic AI for Computer Systems Design with Professor Vijay Janapa Reddi.

I curated a collection of AI for Hardware/Systems papers as a small contribution to the community. Feel free to explore and share!

Research Interests

Artificial Intelligence

Computer Architecture

AI for Systems

My Professor's Vision

Recent Works

SLM-MUX: Orchestrating Small Language Models for Reasoning

C Wang*, Z Wan*, H Kang, E Chen, Z Xie, T Krishna, V Janapa Reddi, Y Du

Harvard University, Georgia Tech, Stanford University, 2025

A multi-model architecture that effectively coordinates multiple small language models, achieving up to 13.4% improvement on MATH and outperforming Qwen 2.5 72B with just two SLMs.

Project Page

EPIM: Efficient Processing-In-Memory Accelerators based on Epitome

C Wang, Z Dong, D Zhou, Z Zhu, Y Wang, J Feng, K Keutzer

Design Automation Conference (DAC), 2024 • 3 citations

Developed a novel processing-in-memory architecture for efficient neural network acceleration, demonstrating significant improvements in energy efficiency and throughput.

Evaluating Zero-Shot Long-Context LLM Compression

C Wang, Y Wang, K Li

arXiv preprint, 2024 • 1 citation

Comprehensive evaluation of compression techniques for large language models with extended context windows, addressing practical deployment challenges.

Gibbon: Efficient co-exploration of NN model and processing-in-memory architecture

H Sun, C Wang, Z Zhu, X Ning, G Dai, H Yang, Y Wang

Design, Automation & Test in Europe Conference (DATE), 2022 • 20 citations

Designed an efficient co-exploration framework for jointly optimizing neural network models and processing-in-memory architectures, achieving state-of-the-art performance.

View All Publications