Show HN: Granite Switch - compose multiple LoRA adapters to one deployable model

Category: infrastructure

Tags: lora, model-composition, fine-tuning

Score: 7.0/10 (Innovation: 8, Technical: 7, Documentation: 7, Utility: 6)

Granite Switch is a tool from IBM Research that allows composing multiple LoRA adapters into a single deployable LLM checkpoint, enabling task-specific fine-tuning without requiring separate models per task. Its novel switch-layer mechanism and KV cache normalization make adapter composition practical and inference-efficient, addressing a significant operational pain point in deploying fine-tuned models.

Target audience: backend devs

Repository: https://github.com/generative-computing/granite-switch · Jupyter Notebook · Apache-2.0 · 15 stars

View on Hacker News