58% of Cold Email Replies Fire on Step 1: Rewrite, Don't Extend

Instantly's 2026 Benchmark Report — pulled from billions of emails sent through their platform — landed with one stat that should rewire how outbound teams build sequences: 58% of all cold email replies come from the first send. The remaining 42% trickle across steps 2 through 7. Average reply rate sits at 3.43%. Top performers clear 10%.
Most teams read that and ask the wrong question: "Should I send fewer emails?" The right question is: "Is my Step 1 setting a ceiling I can never break through?"
This is a teardown of what a step-1-dominant sequence looks like in practice — the word counts, the CTA structure, the cadence — and how to rebuild yours without burning your domain or your list.
The First-Touch Ceiling Math Nobody Talks About
Here's the part the headline stat hides. If 58% of your replies come from email #1, your Step 1 reply rate effectively caps the whole sequence.
Run the numbers on a 5-step sequence:
- Step 1 reply rate of 2% → sequence tops out around 3.4% total
- Step 1 reply rate of 5% → sequence lands near 8.6%
- Step 1 reply rate of 8% → sequence clears 13–14%
This is why top performers in the Instantly dataset don't just "follow up more." They invest disproportionately in the first email because every percentage point at Step 1 compounds across the entire sequence. Adding a 6th touch to a broken Step 1 produces marginal returns. Rewriting Step 1 from 2% to 5% more than doubles your pipeline.
If your first email underperforms, no amount of follow-up volume will save the sequence. You're stacking 0.5% lifts on top of a broken foundation.
Diagnose Before You Rewrite: Is Step 1 Actually the Bottleneck?
Before you tear apart your copy, rule out the two upstream killers: deliverability and list quality. Reply rate below 1% across all steps is rarely a copy problem.
Quick diagnostic checklist:
- Inbox placement: Run a seed test through GlockApps or Mailreach. If you're below 80% primary inbox, fix infrastructure first.
- List freshness: Anything older than 60 days on a verified list is decaying. Re-verify before testing copy.
- Open rate sanity check: Apple Mail Privacy Protection inflates opens to 60–80% artificially. Reply rate is the only honest signal in 2026.
- Reply distribution: If your replies are evenly spread across steps 1–5, your Step 1 is likely under-indexing on the 58% benchmark.
If infrastructure checks out and Step 1 still produces fewer than half your replies, you have a copy problem — not a cadence problem.
What a Step-1-Dominant First Email Looks Like
The Instantly data is consistent with what RAIN Group and Gong have published independently: emails between 50 and 125 words outperform longer formats by a wide margin, and a single, specific CTA beats multi-option CTAs by roughly 2x.
Here's the anatomy of a rewritten Step 1 under 80 words:
Line 1 — Signal-based opener (1 sentence). Reference something specific and recent. A funding round, a job posting, a podcast appearance, a stack change. Generic personalization ("Saw you're VP of Sales at...") is now actively penalized — LinkedIn's 360Brew patterns are bleeding into email filters too. We covered the broader detection shift in our breakdown of how AI openers are being deprioritized.
Line 2 — Relevance bridge (1 sentence). Why this signal made you reach out. Tie it to a specific outcome or problem.
Line 3 — Soft proof (optional, 1 sentence). A single number or named customer. No case study links.
Line 4 — Single CTA (1 sentence). Ask one question or propose one specific next step. "Open to a 15-minute call next Tuesday?" beats "Would love to connect sometime."
That's it. No P.S. No calendar link in Step 1. No three-paragraph value pitch. The job of email #1 is to earn a reply, not to close.
The 4–7 Touchpoint Rule and the Death of 12-Step Sequences
The Instantly dataset shows a clear inflection: sequences longer than 7 touches show negligible lift and meaningfully higher unsubscribe and spam-complaint rates. The 12-step sequences that defined 2021–2023 outbound are now actively damaging sender reputation.
The sweet spot for 2026:
- 4 touches for warm or signal-rich audiences (recent funding, hiring, tech changes)
- 5–6 touches for cold ICP outreach with clear pain alignment
- 7 touches maximum for enterprise multi-threaded plays
Cadence matters as much as length. The pattern that performs best across the Instantly data and matches what we see in LinkedCamp customer accounts is 3-7-7: three days between Step 1 and Step 2, then seven days between each subsequent touch. Faster than that reads as desperate. Slower loses thread continuity.
Step 2: The "Feels Like a Reply" Pattern
If 58% of replies come from Step 1, the next biggest chunk — roughly 22% based on the Instantly distribution — comes from Step 2. And Step 2 is where most teams blow it by writing a "just checking in" follow-up.
The pattern that lifts Step 2 reply rates by approximately 30% in our customer data: make Step 2 feel like the reply to Step 1, not a new pitch. Drop the salutation. Skip the recap. Open with a single sentence that adds new information or asks a sharper question.
Example structure:
Quick thought — [specific observation about their business or industry]. Worth a 15-minute conversation, or not the right time?
No "Hi {firstName}." No "Following up on my last email." No re-pitch. It reads as continuation, not pursuit.
LinkedCamp runs AI-personalized LinkedIn + email sequences on dedicated IPs, with AI agents that book meetings while you focus on closing.
Steps 3–5: Handle the Residual 42% Without Burning the List
The remaining replies are won by accumulated context, not repetition. Each subsequent touch should add a new dimension — not restate the original ask.
Here's how to allocate the back half of a 5-step sequence:
- Step 3 (day 14): Switch the angle. If Step 1 led with a pain point, lead Step 3 with a customer outcome. If Step 1 led with a signal, lead Step 3 with a relevant benchmark.
- Step 4 (day 21): Multi-thread. Reference that you're also reaching out to a peer (CFO, RevOps lead, head of marketing). Creates urgency and signals seriousness.
- Step 5 (day 28): The breakup. Short, polite, no guilt-trip. "Closing the loop — should I check back in Q2 or is this not a fit at all?" The breakup email consistently produces 8–12% of total sequence replies in our data.
Notice what's missing: the "bump" email, the "resending in case you missed this," the meme-based pattern interrupt. These tactics produced lift in 2022. They now correlate with spam complaints and reduced primary inbox placement.
A/B Testing Step 1 Without Torching Your Domain
Once you accept that Step 1 sets the ceiling, testing it becomes the highest-leverage activity in your outbound program. But running tests at scale on a single domain is how you end up in spam folders.
The testing framework that works:
- Split traffic across at least 2 sending domains with separate warm-up histories. This isolates copy performance from infrastructure variance.
- Test one variable at a time: opener line, CTA phrasing, or word count — never all three at once.
- Minimum sample: 400 sends per variant before calling a winner. Below that, you're reading noise.
- Decision metric: positive reply rate, not raw reply rate. Auto-replies and "unsubscribe" responses should not count.
- Test cycle: two-week sprints. Anything longer and your list quality drifts enough to confound the result.
Teams running this discipline typically lift Step 1 reply rate from baseline 2–3% into the 6–9% range within 6–8 weeks. That's the difference between a sequence that produces 4 meetings per 1,000 sends and one that produces 14.
What This Means for Your Stack
The operational implication of step-1-dominance is that your tooling needs to support fast Step 1 iteration with clean attribution — not just "send more emails." Most legacy sequencers were built for the 12-step era and treat each step as equivalent.
What you actually need: rapid variant testing on Step 1, clean reply attribution per step, sending-domain isolation, and signal-based personalization that doesn't trip the AI-detection patterns now baked into both LinkedIn and major email providers. We've written more on the post-Apollo, post-Seamless tooling shift and how the stack is consolidating around platforms that handle this natively.
- 58% of cold email replies come from Step 1 per Instantly's 2026 benchmark — meaning your first email sets a hard ceiling on the entire sequence's performance.
- Average reply rate is 3.43%; top performers clear 10% — the gap is almost entirely explained by Step 1 quality, not sequence length.
- Rewrite Step 1 around a 50–125 word, single-CTA structure with signal-based personalization. Cap sequences at 4–7 touches on a 3-7-7 cadence.
- Step 2 should feel like a continuation, not a follow-up — drop the salutation and add new information for ~30% lift on that step.
- Test Step 1 variants on isolated domains, 400+ sends per variant, two-week sprints — this is the single highest-leverage activity in modern outbound.
Keep reading

Signal-Stacked Outreach: Hit 15-25% Replies With 3 Triggers
The exact Clay → LinkedCamp recipe for stacking funding, hiring, and tool gap signals into one opener — with reply-rate benchmarks per stack tier.

LinkedIn's 360Brew Broke Your Outreach: Fix It Before Q2
LinkedIn's 360Brew AI now scores outreach against your profile for coherence — and reply rates cratered in Feb-March. Here's the 7-day audit before Q2.

360Brew Is Deprioritizing AI Openers: Q1 2026 A/B Data
Fresh Q1 2026 A/B data from LinkedCamp campaigns shows human-written openers now out-reply AI-only messages by 2.4x. Here's the mechanism and the workflow that still works.
Ready to try LinkedCamp?
14-day free trial, dedicated IP, AI agents — start outbound in under an hour.