Recovery · 12 min read

HRV, Sleep and Training Readiness

How to read a wearable the way a coach does — trend over single day, context over absolute number.

The short answer

Trend beats single reads.Compare a rolling 7-day average against your own 30–60 day baseline. Never compare your number to anyone else's.
Use RMSSD.It reflects vagal tone, is validated in the literature, and is what modern wearables compute under the hood.
10% below baseline for 3+ days = flag.One bad day is noise. Three consecutive days is the system telling you it's accumulating stress faster than it's clearing it.
Combine HRV, sleep, mood, soreness.No single metric predicts next-day performance well. Four cheap inputs together do.
Adjust the session, don't skip blindly.Cap intensity, swap VO2 work for Z2, reduce duration — in that order. Full rest only on red.

What HRV actually measures

Heart rate variability is the millisecond-level variation between successive heartbeats. A healthy, well-rested heart does notbeat like a metronome — parasympathetic tone (the “rest and digest” branch of the autonomic nervous system) is constantly nudging each beat slightly earlier or later than the last. More variability means more vagal control. Less variability means your sympathetic system is dominating — training load, psychological stress, alcohol, poor sleep, impending illness, over-reaching.

Two HRV metrics matter for athletes:

RMSSD(root mean square of successive differences) — dominated by parasympathetic input. This is the metric used in virtually all sports-science HRV research from the last decade and the one your wearable exposes under the hood. It's robust, responds fast to stress, and works well on short (2–5 min) samples.
SDNN (standard deviation of normal-to- normal intervals) — captures both parasympathetic and sympathetic input, needs 24-hour recordings to be meaningful, more common in clinical cardiology than sport. Safe to ignore for training decisions.

Gold-standard measurement is a medical-grade ECG. The practical gold standard for athletes is overnight nocturnal measurement from a good wearable — several hours of quiet, supine, post-absorptive data averaged into one value. Far less noisy than a 2-minute morning reading, and fully automated.

Trend vs single read

HRV is a noisy signal. Day-to-day swings of 10–20% are routine even in well-rested athletes. A bad night's sleep, a spicy meal, dehydration, a late workout — any of these can punch a hole in tonight's reading without telling you anything useful about your training state.

Every reliable HRV framework — from Kiviniemi and Plews onward — converges on the same rule: compare a 7-day rolling average to a personal 30-to-60 day baseline. The 7-day window smooths out random noise. The 30-day baseline adapts slowly enough to reflect genuine fitness state, fast enough to respond to a new training block.

The shape of that signal looks like this:

Typical two-week pattern. Days 1–10: normal scatter around baseline — ignore. Days 11–14: sustained drop below the −10% line — act.

The decision rule: three to four consecutive days of the 7-day average more than 10% below your baseline is a genuine signal. Fewer than that — noise. Don't rearrange your training around a single bad Tuesday.

Sleep metrics that matter

Wearables report a wardrobe of sleep metrics. Only four predict training readiness reliably. The rest are estimates with large error bars and should be read as trends, not absolutes.

Total sleep time

The most validated sleep metric on any consumer device, typically within 10–15 minutes of polysomnography. Target 7–9 hours. Dropping to 4–6 hours costs 5–12% of endurance output. Below 5 hours, downgrade the session.

Sleep consistency

The standard deviation of your bed- and wake-times across a week. High consistency (< 30 min variation) correlates with better HRV, lower resting heart rate, and improved glucose regulation — independently of total duration. Going to bed at 22:00 every night matters more than any given night reaching 8 hours.

Deep and REM percentage

Deep (slow-wave) sleep is when growth hormone pulses and muscle repair happens. REM consolidates motor learning and emotional regulation. Wearables approximate these stages from heart rate and movement patterns — directional accuracy is decent, absolute numbers less so. Track the trend: a sustained drop in deep-sleep percentage often precedes a drop in HRV by a few days.

Resting / nocturnal heart rate

A rise of 5–7 bpm in overnight resting heart rate that isn't explained by training load is the earliest detectable sign of something off — illness incubating, dehydration, severe sleep debt. Much less noisy than single-day HRV. Watch RHR and HRV together; they tell you the same story from two angles.

Wearable validity — who's right?

Consumer HRV accuracy has improved dramatically since 2023. A 2025 validation study compared modern wearables against medical-grade ECG and found most top devices hit 0.94–0.99 concordance for nocturnal RMSSD. Device choice matters less than picking one and sticking with it.

Device	Nocturnal HRV accuracy	Known gotchas
Oura (Gen 3 / 4)	Exceptional. ~0.94–0.99 vs ECG.	Finger pulse + skin temperature is the cleanest consumer setup. Reading quality degrades if the ring is loose or worn on a finger with cold extremities.
Whoop 4.0	Exceptional. ~0.94–0.99 vs ECG.	The “Recovery” score weights deep-sleep parasympathetic rebound heavily, so it can say “green” on nights with short total sleep but good deep-sleep density. Cross-check with total sleep time.
Garmin (Enhanced BBI)	Research-grade on newer models with Enhanced BBI (Fenix 7+, Epix Gen 2, Forerunner 255/265/955/965).	Older Garmins under-report high HRV and over-report low HRV — systematic bias. If your watch predates Enhanced BBI, don't use it for HRV tracking.
Apple Watch	Spot-check only. Not continuous nocturnal.	Samples HRV in 1-minute windows at irregular times. Not suitable for a rolling baseline without third-party apps that force scheduled readings.
Polar H10 + app	Medical-grade — chest strap gold standard.	Requires a 3–5 min supine reading every morning. Most accurate single measurement available to consumers, but compliance is harder than wearing a ring.

Absolute HRV numbers don't transfer between devices. Pick one, build a 30-day baseline on it, and only compare within that baseline.

Combining signals into a readiness call

No single signal predicts next-day performance well enough to drive training decisions on its own. The literature converges on a four-input framework:

HRV deviation — 7-day rolling average vs 30-day baseline.
Last night's sleep duration.
Two subjective ratings — mood/stress and muscle soreness, each 1–5.
Days since last hard session (Z4/Z5 intervals or long run).

Score each input, sum, and apply a threshold. This is the algorithm behind the recovery readiness score. The adjustment rules below show what to do with the verdict.

Adjustment rules — what to actually do

The one decision most runners get wrong: they either skip training entirely or do it exactly as planned. Neither is usually right. The correct move is almost always a graded modification.

HRV deviation (7-day vs baseline)	Sleep	Today's session
At or above baseline	7+ h	Green. Proceed as planned. Primed for intervals, long run, tempo.
1–5% below	7+ h	Green with awareness. Proceed. Reassess mid-warm-up; abort intervals if legs feel genuinely bad.
5–10% below	5–7 h	Amber — cap intensity. Keep the session but cut VO2 work to threshold, threshold to tempo. Hold duration.
10–20% below	Any	Amber — swap modality. Replace quality work with upper Z2 aerobic. Cut volume by ~30%.
>20% below, 3+ day streak	< 5 h	Red. Full rest or 30 min lower-Z2 active recovery. Eat. Sleep. Re-test tomorrow.

When the HRV row and the sleep row disagree, defer to the more conservative signal.

Special cases — when the normal rules bend

Female cycle phases

Luteal-phase HRV suppression (roughly days 15–28) is a hormonal artefact, not a training flag. Progesterone raises core body temperature and sympathetic tone, pulling RMSSD down a few percent and nudging resting heart rate up. Build a phase-specific baseline by averaging readings across 2–3 cycles and compare within phase. A 10% drop from your luteal-phase baseline still matters; a 5% drop from the follicular-phase mean into luteal territory is expected.

Masters athletes (>50)

Absolute RMSSD naturally declines with age — a healthy 55-year-old can have a baseline 40–50% below a 25-year- old's. Percentage-deviation rules still apply, but clearance from hard sessions takes longer (48–72 h is typical for VO2 work instead of 24–48 h). Skew amber calls toward rest more aggressively than a younger athlete would.

Alcohol, jet lag, illness

All three crush HRV in ways that have nothing to do with training load. If you drank the night before, flew across two or more time zones in the last 48 h, or are fighting something off, the reading is measuring that — not your fitness. Don't let it drive training decisions in either direction.

RED-S / under-fueling

Chronic energy deficit (Relative Energy Deficiency in Sport) produces persistently suppressed HRV that does not respond to rest or deload. The fix is nutritional, not training-based. If your baseline is drifting down over months rather than weeks, or you're hitting missed menstrual cycles, weight loss, or recurrent bone stress injuries, stop training adjustments and talk to a sports-medicine clinician. The IOC 2023 consensus on REDs is the reference.

Common mistakes

Chasing the number.Trying to “raise your HRV” through ice baths, breathwork, cold showers, red-light panels. HRV is a mirror, not a dashboard to optimise. Sleep more, drink less, train consistently — the number tracks.
Training hard when the trend is red. The trend line exists to tell you something. Ignoring three consecutive days of sustained drop is how overreaching turns into overtraining syndrome, which takes months to resolve.
Skipping a workout because of one bad reading. The opposite failure mode. Single-day noise is not a training signal. If the other three inputs (sleep, mood, soreness) are fine, run.
Comparing your HRV to someone else's. RMSSD is as individual as height. Your training partner's 85 ms vs your 50 ms means nothing about either of your fitness states.
Ignoring subjective feel when the device says green. Mood and soreness ratings predict next-day performance about as well as HRV in the research. When the body and the watch disagree, default to the body.

Bottom line

Readiness data is useful in direct proportion to how coach-like you are about reading it. Track RMSSD overnight on one device. Build a 30-day baseline. Compare a rolling 7-day average against it. Combine with sleep duration, mood, and soreness. Act on trends of three or more days, not single reads. Adjust the session — cap intensity, swap modality — before you skip it outright.

If you want the decision automated: the recovery readiness score implements this framework as a ten-second check. If you want the wider context on sleep, fueling, mobility and temperature work, the pillar is Recovery for Runners: 6 Rules for Free Speed.

Check today's readiness with Pallie

Frequently Asked Questions

What's the most reliable HRV metric for runners?

RMSSD (root mean square of successive differences). It reflects vagal / parasympathetic tone, is the metric validated in endurance research, and is what all the major wearables (Oura, Whoop, Garmin Enhanced BBI) expose under the hood. Ignore single-number "scores" that mix in other inputs — track RMSSD against your own rolling baseline.

How many days of HRV data do I need before the trend is meaningful?

A minimum of 30 days to establish a personal baseline, 60 is better. Until then, individual readings are almost noise — there's no reference to deviate from. Keep measuring, don't act on numbers, and once a baseline is built, track the 7-day rolling average against it.

Is a morning-on-wake reading or overnight reading more accurate?

Overnight recordings captured during deep sleep are cleaner and more reproducible than a morning-supine-on-wake reading, because nocturnal measurements average across hours of quiet, supine, post-absorptive data. Wearables that sample continuously during sleep (Oura, Whoop, newer Garmins) are the gold standard for consumer HRV. A single 2-minute morning reading is fine if that's what you have, but take the mean of at least 3 readings across the week.

My Oura and my chest strap give different HRV numbers. Which is right?

Both can be right. The absolute number depends on sampling method, window length, and algorithm — devices aren't cross-comparable. What matters is consistency within one device. Pick one source, stick with it, and compare each day's reading only to that device's 30–60 day baseline. Switching devices resets the baseline.

My HRV dropped 15% last night — should I skip today's workout?

Not automatically. One reading that low can mean alcohol, a late meal, poor sleep, allergies, or illness coming on. Check subjective feel: if mood, sleep quality, and soreness all say "fine," do the workout but cap intensity. If the low reading is the third or fourth in a row, or if subjective signals also tank, swap the session to easy Z2 or take a rest day.

Why does my HRV drop during the second half of my menstrual cycle?

Luteal-phase HRV suppression is a normal hormonal effect, not a training-readiness flag. Progesterone raises core body temperature and sympathetic tone, pulling RMSSD down by a few percent. Build a luteal-phase baseline (average across 2–3 cycles) and compare within phase rather than against the full-month average.

My watch says "ready" but I feel terrible. Who do I trust?

Your body. Subjective soreness and mood predict next-day performance as accurately as any wearable metric in most published studies. When the device and the human disagree, default to the more conservative signal — easy day, not hard day. The watch doesn't know about the argument last night or the cold you're fighting off.

Do older runners need different HRV thresholds?

Yes. Absolute RMSSD falls with age — a 55-year-old's healthy baseline can be half of a 25-year-old's. That's why you compare against your own baseline, never absolute numbers or anyone else's. The percentage-deviation rules (>10% below for 3+ days = flag) apply regardless of age, but masters athletes should skew amber calls toward rest because recovery capacity is genuinely reduced.

How much does alcohol affect HRV?

A lot, and for longer than most people expect. Even one or two drinks with dinner can suppress overnight RMSSD by 20–40% and keep resting heart rate elevated until the following afternoon. If you drink the night before, don't act on the number — it's measuring alcohol, not training load. Plan accordingly around hard sessions.

What's the single best thing I can do to raise my HRV baseline?

Sleep. Nothing else is close. Consistent 7–9 hours nightly does more for HRV than any supplement, cold plunge, or breathing protocol. After sleep: aerobic base volume, alcohol reduction, and stress management. HRV is a lagging indicator of lifestyle more than training.