Question 1

Which wearable readiness score is most accurate?

Accepted Answer

No consumer wearable has been validated against gold-standard performance metrics with consistently strong results (Düking et al. 2018). Oura has the most published sleep validation data. Whoop has more athlete-focused training load integration. Accuracy varies by individual physiology, skin tone, and consistency of wear. None should be used as a sole training decision tool.

Question 2

Can I use two devices simultaneously to cross-validate?

Accepted Answer

Cross-validation is useful for identifying consistent signals. If both Whoop and Oura show low readiness on the same morning, the signal is more reliable. Disagreement between devices on a given day is common and normal — treat it as data uncertainty rather than conflicting ground truth.

Question 3

How much should I adjust training based on a low readiness score?

Accepted Answer

A single low score warrants attention, not automatic deload. A trend of 3+ consecutive low scores is more meaningful. The research on using readiness scores for training modification shows mixed results — athletes who adjust training responsively based on HRV-anchored scores tend to perform slightly better over multi-week blocks, but the evidence is not strong for single-session decisions.

Question 4

Does the Garmin Body Battery measure recovery differently from HRV-based scores?

Accepted Answer

Yes. Body Battery uses a proprietary energy reservoir model that depletes with activity (using accelerometer and heart rate data) and recharges during sleep (using HRV). It is more activity-context-aware than pure HRV scores but less directly tied to autonomic nervous system state. It integrates more data types but with less physiological specificity.

Question 5

What is the biggest limitation of all these scores?

Accepted Answer

All readiness scores are backward-looking — they summarize recovery from recent stress. They do not measure readiness for a specific type of future effort. A high readiness score does not mean optimal performance for a maximal power session; it means recovery from recent load is good. Training context and accumulated fatigue over weeks require human interpretation beyond any single daily score.

Measure	Value	Unit	Notes
HRV measurement agreement (vs. ECG)	r=0.82-0.96	correlation	Photoplethysmography (PPG) wrist-based HRV approximates ECG-derived HRV; accuracy varies by motion and skin tone
Readiness score vs. lab performance	Low-to-moderate	agreement	Düking et al. 2018 found wearable readiness indices do not reliably predict same-day performance test outcomes
Oura sleep stage accuracy	~79	% vs. PSG	Polysomnography comparison from Altini & Kinnunen 2021; best among consumer wearables
Whoop Recovery update frequency	Every 24	hours	Updates after each sleep period; requires consistent sleep data for accurate daily score
Garmin Body Battery range	5-100	points	Proprietary energy reservoir model based on HRV, sleep, and activity; recharges during sleep, depletes with activity
Oura Readiness score range	0-100	points	Composite of resting HR, HRV, body temperature, sleep, and activity balance factors

Device	Algorithm Inputs	Update Frequency	Validation Studies	Reliability	Key Limitation
Whoop Recovery	HRV (rMSSD), RHR, sleep duration/stages, respiratory rate	Every 24h (post-sleep)	Limited; primarily internal	Moderate within-subject	Requires consistent wear; no display
Garmin Body Battery	HRV, RHR, sleep, accelerometer (activity drain)	Continuous (depletes in real-time)	Limited independent studies	Moderate	Activity model is proprietary; poor with shift work
Oura Readiness	HRV, RHR, body temperature, sleep stages, activity balance	Every 24h (post-sleep)	Most published (Altini & Kinnunen 2021)	Moderate-to-good	Ring fit affects PPG accuracy
Apple Health readiness	RHR trend, HRV trend, sleep, walking HRV	Daily; limited synthesis	Minimal peer-reviewed data	Low-to-moderate	No unified readiness score; fragmented
HRV4Training (app)	Morning camera HRV, subjective wellness survey	Daily (manual measurement)	Plachta et al. 2022 — strongest independent validation	Good for HRV trends	Requires active morning routine; no wearable passivity
Manual RMSSD (chest strap)	Single-lead ECG via Polar H10	Daily (60-second morning measurement)	Gold standard consumer method	High — matches clinical ECG closely	Requires dedicated hardware + app (Elite HRV, etc.)

Recovery: Readiness Scores Compared

Device Comparison Table

Algorithm Inputs in Depth

Related Pages

Sources

Frequently Asked Questions

Which wearable readiness score is most accurate?

Can I use two devices simultaneously to cross-validate?

How much should I adjust training based on a low readiness score?

Does the Garmin Body Battery measure recovery differently from HRV-based scores?

What is the biggest limitation of all these scores?