👋 Need help with code?
Power analysis for LLM evals: how big does your eval set need to be to catch a 5% regression? | TechForDev