Use feature flags generously to keep tests safe. Tune thresholds per segment; one size rarely fits all. This quick checklist saved us time.
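
To make the first two tips concrete, here is a minimal sketch of a flag gate with a per-segment threshold. It assumes "threshold" means a rollout fraction between 0 and 1; the same lookup pattern would apply to any other numeric cutoff. The names `Flag`, `bucket`, `is_enabled`, and the segment labels are hypothetical and not taken from any particular flagging library.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class Flag:
    """Hypothetical flag definition: a kill switch plus rollout thresholds."""
    name: str
    enabled: bool = False            # global kill switch
    default_rollout: float = 0.0     # fallback fraction for unlisted segments
    segment_rollout: dict = field(default_factory=dict)  # segment -> fraction


def bucket(flag_name: str, user_id: str) -> float:
    """Map (flag, user) to a stable value in [0, 1) so a user's assignment
    does not change between calls."""
    digest = hashlib.sha256(f"{flag_name}:{user_id}".encode()).digest()
    return int.from_bytes(digest[:4], "big") / 2**32


def is_enabled(flag: Flag, user_id: str, segment: str) -> bool:
    """Return True if the flag is on for this user in this segment."""
    if not flag.enabled:
        return False
    # Per-segment threshold, falling back to the default for unknown segments.
    threshold = flag.segment_rollout.get(segment, flag.default_rollout)
    return bucket(flag.name, user_id) < threshold


# Assumed example: fully on for "beta", 5% for "enterprise", off elsewhere.
checkout = Flag(
    name="new_checkout",
    enabled=True,
    default_rollout=0.0,
    segment_rollout={"beta": 1.0, "enterprise": 0.05},
)
print(is_enabled(checkout, "user-123", "beta"))
```

Hashing the flag name together with the user ID keeps each user's bucket stable for a given flag while avoiding the same users always landing in the early cohort across different flags. Setting `enabled=False` turns the feature off for every segment without touching the thresholds.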