Welcome to the MGHPCC Virtual Booth

University of Massachusetts Amherst

Adaptive Deep Learning Systems Towards Edge Intelligence

Researchers at UMass Amherst use the MGHPCC to build adaptive deep learning systems, with the aim of deploying deep learning techniques more effectively across diverse applications and environments.

Hui Guan’s research enhances the speed, scalability, and reliability of machine learning through innovations in algorithms and systems. Her work draws on insights from applications, algorithms, and high-performance computing techniques to reduce the cost of model development and to enable deep learning in resource-constrained and distributed edge environments.

Edge intelligence pushes intelligent data processing using deep neural networks (DNNs) to the edge of the network, closer to data sources. It enables applications across various fields and has garnered significant attention from both industry and academia. However, the limited resources on edge platforms, such as edge servers and Internet of Things devices, make it hard to deliver fast and accurate responses to deep learning prediction queries. As a result, only a subset of deep learning tasks, and only smaller DNN models, can feasibly be deployed at the edge.

To overcome this limitation, this project explores a new adaptive approach to building deep learning systems. The systems will adjust, in real time, the DNNs executed for prediction tasks in response to varying resource demands arising from three critical dimensions: variable task complexity, fluctuating inference workloads, and resource contention in multi-tenant edge environments. The goal is to optimize both system efficiency and accuracy. Realizing the envisioned adaptiveness will facilitate the effective deployment of deep learning techniques across diverse applications and environments.
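As a rough illustration of what a real-time adjustment could look like, the Python sketch below picks, for each query, the most accurate model variant whose estimated latency still fits a latency budget. It is not the project's implementation. The names `ModelVariant` and `pick_variant`, the accuracy and latency figures, and the simple latency estimate (base latency times a contention factor times the number of queued queries plus one) are all hypothetical, and the sketch covers only the workload and contention dimensions, not task complexity.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelVariant:
    name: str
    accuracy: float    # hypothetical offline validation accuracy
    latency_ms: float  # hypothetical per-query latency on an idle device


def pick_variant(variants, queue_length, contention, latency_budget_ms):
    """Return the most accurate variant whose estimated latency fits the budget.

    Assumed estimate: base latency * contention factor * (queue_length + 1).
    If no variant fits, fall back to the fastest one.
    """
    def estimated_latency(variant):
        return variant.latency_ms * contention * (queue_length + 1)

    feasible = [v for v in variants if estimated_latency(v) <= latency_budget_ms]
    if feasible:
        return max(feasible, key=lambda v: v.accuracy)
    return min(variants, key=lambda v: v.latency_ms)


# Illustrative variants only; the numbers are made up for this sketch.
VARIANTS = [
    ModelVariant("small", 0.70, 5.0),
    ModelVariant("medium", 0.76, 12.0),
    ModelVariant("large", 0.80, 30.0),
]

if __name__ == "__main__":
    # Idle device, light load: the largest model fits the budget.
    print(pick_variant(VARIANTS, queue_length=0, contention=1.0, latency_budget_ms=50).name)
    # Longer queue: a smaller model is chosen to stay within the budget.
    print(pick_variant(VARIANTS, queue_length=2, contention=1.0, latency_budget_ms=50).name)
```

Under these assumptions, the selector trades accuracy for responsiveness as the queue grows or as co-located tenants slow the device down, which is the kind of efficiency-versus-accuracy balance the paragraph above describes.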

Over the past year, the team advanced adaptive inference with four systems: CACTUS (context-aware micro-classifiers), Proteus (dynamic model scaling), GMorph (multi-DNN fusion), and DiffServe (query-aware diffusion model serving). These systems significantly improved accuracy, latency, and throughput across diverse platforms, outperforming static and prior state-of-the-art approaches.

Hui Guan
Assistant Professor in the College of Information and Computer Sciences (CICS) at the University of Massachusetts Amherst

100 Bigelow Street, Holyoke, MA 01040