Preprint
LithOS: An Operating System for Efficient Machine Learning on GPUs
Address Scaling: Architectural Support for Fine-Grained Thread-Safe Metadata Management
ED-Batch: Efficient Automatic Batching of Dynamic Neural Networks via Learned Finite State Machines
ED-Batch: Efficient Automatic Batching of Dynamic Neural Networks via Learned Finite State Machines
Memento: Architectural Support for Ephemeral Memory Management in Serverless Environments
Permutable Compiled Queries: Dynamically Adapting Compiled Queries without Recompiling