Editorial Scaffolding
How HormoneScore works
Our editorial trust signals
8.5/10- Transparency9.0/10
Methodology + weights published publicly.
- Methodology rigor8.5/10
Composite of 6 weighted sub-dimensions with published thresholds.
- Sources cited8.0/10
Outcome stats require verifiable source URLs.
- Clinical reviewers9.0/10
NAMS-certified team on review every brand.
- Conflict disclosure8.0/10
Affiliate disclosure on every page. Scoring unaffected by revenue.
Every brand reviewed here gets a 0-100 HormoneScore composite. It's the only metric we publish that influences ranking. Errors update within 7 days — see corrections log.
Six weighted dimensions
Total = 100 points. No category exceeds 25% to prevent single-dimension dominance.
- Clinical rigor25 pts
Provider credentials (board cert, NAMS), prescribing model (sync vs async), labs required, follow-up cadence
- Editorial transparency20 pts
Disclosure compliance (FTC), conflict-of-interest declarations, source citation density, correction history
- Pricing transparency15 pts
Itemized cost breakdown, insurance acceptance, no-hidden-fees policy, refund/cancellation clarity
- Patient experience15 pts
Trustpilot/BBB ratings (filtered for fake reviews), Reddit sentiment, Sitejabber, response time data
- Medication quality15 pts
Pharmacy partnerships (503A vs 503B for compounded), FDA-approved vs custom-formulated, lot transparency
- Access & continuity10 pts
States served, provider continuity (same vs rotating), evening/weekend coverage, in-person referral pathway
Automatic disqualifiers
Any of these = brand cannot appear in ranked lists, regardless of score:
- Pending FDA Warning Letter (active)
- DOJ/FTC settlement in past 24 months involving consumer harm
- State medical board action against ≥3 affiliated providers
- Pattern of fake-review allegations with documented evidence
- Refusal to disclose ownership structure
- No licensed clinician on record (clinic-only model)
What we test ourselves
- Signup flow — every brand reviewed gets a manual signup test by a real human (not bot/scrape)
- Time-to-prescription measured from account creation to first script availability
- Provider message response time over 7-day window
- Refund process tested on subset (we cancel within 14d to test friction)
- Pricing screenshots archived monthly — see "Last priced" stamp on each brand page
Update cadence
- Pricing — refreshed monthly, more often if brand changes pricing midcycle
- Scores — recomputed quarterly
- Time-sensitive content (FDA guidance, drug shortages) — within 72 hours of change
- Corrections — 7-day SLA from report to public update
Conflicts of interest
We accept affiliate commissions from some (not all) brands listed. Commission rates do not affect score or ranking — this is enforced by structural separation: the editor scoring a brand cannot see commission rate data. We also publish non-affiliate alternatives (e.g., Cost Plus Drugs) when they're materially cheaper. See full disclosure on affiliate disclosure.
What would change our mind
- New peer-reviewed evidence overturning a clinical claim we cite
- Documented pattern of patient harm (escalating from anecdote to data)
- Reader-submitted evidence of a fact we got wrong
- Provider posting publicly-verifiable data we couldn't independently access (we update upward)
How we filter fake reviews
Most aggregator sites quote raw Trustpilot stars. Raw ratings include fake reviews, incentivized reviews, and review-bombing campaigns. Here's our filter pipeline:
- Pull raw data from Trustpilot, BBB, ComplaintsBoard, Sitejabber, Reddit, and Google Business Profile
- Exclude reviews younger than 14 days — too easy for brands to seed before launch
- Exclude reviewer accounts created within 7 days of review — fake-account signal
- Cluster by language patternsusing cosine similarity — flag clusters where 5+ reviews share >80% phrase overlap
- Cross-reference IP geolocation (where available) — flag if non-USA IPs concentrated about brand HQ region
- Weight 5★ reviews 50% lesswhen account history is < 3 distinct reviews
- Reddit sentiment — pull r/Menopause, r/Perimenopause, r/PCOS, r/glp1 for brand mentions; weight by upvote ratio + comment depth
We publish detailed fake-review audits when patterns are egregious. See recent audits.
Source weighting formula
When sources disagree, we weight by source reliability. Higher weight = more influence on final score.
- FDA labels / peer-reviewed studies weight: 1.0
- Medical society guidelines (NAMS, ACOG, AACE, ENDO) weight: 0.9
- Brand-published outcomes data (verified via lookup) weight: 0.7
- Manual signup test (our staff) weight: 0.8
- Trustpilot/BBB (post-filter) weight: 0.5
- Reddit sentiment (post-filter) weight: 0.4
- Brand marketing copy weight: 0.1
Brand marketing copy gets 0.1 because it's promotional — we don't ignore it, but we anchor heavily to independent data.
Report an error
Found something wrong? Email the editorial team. We respond within 48 hours and publish corrections within 7 days.