Documentation
How to use Adaptive effectively
Ultra-fast routing with multi-tier caching and optimized Go architecture
L1: Prompt-response cache (microseconds) L2: Semantic cache (1-2ms) L3: Router caches (5-10ms)
Model Selection: Under 1ms Cache Lookup: Under 1ms Provider Routing: Under 1ms Total Overhead: Under 3ms