My LLM Is Too Slow: Latency Fixes That Actually Work