58% of Cold Email Replies Fire on Step 1: Rewrite, Don't Extend

Brian·May 1, 2026·8 min read

Editorial illustration of a tall staircase where the first step is dramatically wider and brighter than the seven shrink

Instantly's 2026 Benchmark Report — pulled from billions of emails sent through their platform — landed with one stat that should rewire how outbound teams build sequences: 58% of all cold email replies come from the first send. The remaining 42% trickle across steps 2 through 7. Average reply rate sits at 3.43%. Top performers clear 10%.

Most teams read that and ask the wrong question: "Should I send fewer emails?" The right question is: "Is my Step 1 setting a ceiling I can never break through?"

This is a teardown of what a step-1-dominant sequence looks like in practice — the word counts, the CTA structure, the cadence — and how to rebuild yours without burning your domain or your list.

The First-Touch Ceiling Math Nobody Talks About

Here's the part the headline stat hides. If 58% of your replies come from email #1, your Step 1 reply rate effectively caps the whole sequence.

Run the numbers on a 5-step sequence:

Step 1 reply rate of 2% → sequence tops out around 3.4% total
Step 1 reply rate of 5% → sequence lands near 8.6%
Step 1 reply rate of 8% → sequence clears 13–14%

This is why top performers in the Instantly dataset don't just "follow up more." They invest disproportionately in the first email because every percentage point at Step 1 compounds across the entire sequence. Adding a 6th touch to a broken Step 1 produces marginal returns. Rewriting Step 1 from 2% to 5% more than doubles your pipeline.

If your first email underperforms, no amount of follow-up volume will save the sequence. You're stacking 0.5% lifts on top of a broken foundation.

Diagnose Before You Rewrite: Is Step 1 Actually the Bottleneck?

Before you tear apart your copy, rule out the two upstream killers: deliverability and list quality. Reply rate below 1% across all steps is rarely a copy problem.

Quick diagnostic checklist:

Inbox placement: Run a seed test through GlockApps or Mailreach. If you're below 80% primary inbox, fix infrastructure first.
List freshness: Anything older than 60 days on a verified list is decaying. Re-verify before testing copy.
Open rate sanity check: Apple Mail Privacy Protection inflates opens to 60–80% artificially. Reply rate is the only honest signal in 2026.
Reply distribution: If your replies are evenly spread across steps 1–5, your Step 1 is likely under-indexing on the 58% benchmark.

If infrastructure checks out and Step 1 still produces fewer than half your replies, you have a copy problem — not a cadence problem.

What a Step-1-Dominant First Email Looks Like

The Instantly data is consistent with what RAIN Group and Gong have published independently: emails between 50 and 125 words outperform longer formats by a wide margin, and a single, specific CTA beats multi-option CTAs by roughly 2x.

Here's the anatomy of a rewritten Step 1 under 80 words:

Line 1 — Signal-based opener (1 sentence). Reference something specific and recent. A funding round, a job posting, a podcast appearance, a stack change. Generic personalization ("Saw you're VP of Sales at...") is now actively penalized — LinkedIn's 360Brew patterns are bleeding into email filters too. We covered the broader detection shift in our breakdown of how AI openers are being deprioritized.

Line 2 — Relevance bridge (1 sentence). Why this signal made you reach out. Tie it to a specific outcome or problem.

Line 3 — Soft proof (optional, 1 sentence). A single number or named customer. No case study links.

Line 4 — Single CTA (1 sentence). Ask one question or propose one specific next step. "Open to a 15-minute call next Tuesday?" beats "Would love to connect sometime."

That's it. No P.S. No calendar link in Step 1. No three-paragraph value pitch. The job of email #1 is to earn a reply, not to close.

The 4–7 Touchpoint Rule and the Death of 12-Step Sequences

The Instantly dataset shows a clear inflection: sequences longer than 7 touches show negligible lift and meaningfully higher unsubscribe and spam-complaint rates. The 12-step sequences that defined 2021–2023 outbound are now actively damaging sender reputation.

The sweet spot for 2026:

4 touches for warm or signal-rich audiences (recent funding, hiring, tech changes)
5–6 touches for cold ICP outreach with clear pain alignment
7 touches maximum for enterprise multi-threaded plays

Cadence matters as much as length. The pattern that performs best across the Instantly data and matches what we see in LinkedCamp customer accounts is 3-7-7: three days between Step 1 and Step 2, then seven days between each subsequent touch. Faster than that reads as desperate. Slower loses thread continuity.

Step 2: The "Feels Like a Reply" Pattern

If 58% of replies come from Step 1, the next biggest chunk — roughly 22% based on the Instantly distribution — comes from Step 2. And Step 2 is where most teams blow it by writing a "just checking in" follow-up.

The pattern that lifts Step 2 reply rates by approximately 30% in our customer data: make Step 2 feel like the reply to Step 1, not a new pitch. Drop the salutation. Skip the recap. Open with a single sentence that adds new information or asks a sharper question.

Example structure:

Quick thought — [specific observation about their business or industry]. Worth a 15-minute conversation, or not the right time?

No "Hi {firstName}." No "Following up on my last email." No re-pitch. It reads as continuation, not pursuit.

Ready to scale your outbound?

Put what you just read into practice — free for 14 days.

LinkedCamp runs AI-personalized LinkedIn + email sequences on dedicated IPs, with AI agents that book meetings while you focus on closing.

Start 14-day free trial See pricing

Steps 3–5: Handle the Residual 42% Without Burning the List

The remaining replies are won by accumulated context, not repetition. Each subsequent touch should add a new dimension — not restate the original ask.

Here's how to allocate the back half of a 5-step sequence:

Step 3 (day 14): Switch the angle. If Step 1 led with a pain point, lead Step 3 with a customer outcome. If Step 1 led with a signal, lead Step 3 with a relevant benchmark.
Step 4 (day 21): Multi-thread. Reference that you're also reaching out to a peer (CFO, RevOps lead, head of marketing). Creates urgency and signals seriousness.
Step 5 (day 28): The breakup. Short, polite, no guilt-trip. "Closing the loop — should I check back in Q2 or is this not a fit at all?" The breakup email consistently produces 8–12% of total sequence replies in our data.

Notice what's missing: the "bump" email, the "resending in case you missed this," the meme-based pattern interrupt. These tactics produced lift in 2022. They now correlate with spam complaints and reduced primary inbox placement.

A/B Testing Step 1 Without Torching Your Domain

Once you accept that Step 1 sets the ceiling, testing it becomes the highest-leverage activity in your outbound program. But running tests at scale on a single domain is how you end up in spam folders.

The testing framework that works:

Split traffic across at least 2 sending domains with separate warm-up histories. This isolates copy performance from infrastructure variance.
Test one variable at a time: opener line, CTA phrasing, or word count — never all three at once.
Minimum sample: 400 sends per variant before calling a winner. Below that, you're reading noise.
Decision metric: positive reply rate, not raw reply rate. Auto-replies and "unsubscribe" responses should not count.
Test cycle: two-week sprints. Anything longer and your list quality drifts enough to confound the result.

Teams running this discipline typically lift Step 1 reply rate from baseline 2–3% into the 6–9% range within 6–8 weeks. That's the difference between a sequence that produces 4 meetings per 1,000 sends and one that produces 14.

What This Means for Your Stack

The operational implication of step-1-dominance is that your tooling needs to support fast Step 1 iteration with clean attribution — not just "send more emails." Most legacy sequencers were built for the 12-step era and treat each step as equivalent.

What you actually need: rapid variant testing on Step 1, clean reply attribution per step, sending-domain isolation, and signal-based personalization that doesn't trip the AI-detection patterns now baked into both LinkedIn and major email providers. We've written more on the post-Apollo, post-Seamless tooling shift and how the stack is consolidating around platforms that handle this natively.

TL;DR

58% of cold email replies come from Step 1 per Instantly's 2026 benchmark — meaning your first email sets a hard ceiling on the entire sequence's performance.
Average reply rate is 3.43%; top performers clear 10% — the gap is almost entirely explained by Step 1 quality, not sequence length.
Rewrite Step 1 around a 50–125 word, single-CTA structure with signal-based personalization. Cap sequences at 4–7 touches on a 3-7-7 cadence.
Step 2 should feel like a continuation, not a follow-up — drop the salutation and add new information for ~30% lift on that step.
Test Step 1 variants on isolated domains, 400+ sends per variant, two-week sprints — this is the single highest-leverage activity in modern outbound.

Keep reading

The January 2026 LinkedIn Cap: 100 Requests/Week

LinkedIn's January 2026 update made 100 connection requests/week the practical hard ceiling. Here's the pacing agencies should run per account tier.

AI SDR Post-Mortem: Why 50-70% Churn in Under a Year

AI SDR tools churn at 50-70% annually — double human SDR turnover. Inside the 11x and Artisan failures, and what actually works for outbound teams.

LinkedIn Volume Tax: Why 100+ Requests/Week Kills Trust

LinkedIn's algorithm now silently throttles high-volume senders. Here's how to diagnose if your account is already taxed — and the recovery protocol that works.

Ready to try LinkedCamp?

14-day free trial, dedicated IP, AI agents — start outbound in under an hour.

Start free trial See pricing