How to Use H-Index to Prioritize Researcher Outreach (Without Getting It Wrong)
H-index is the fastest way to rank researchers by influence — but it's also the easiest metric to misuse. Here's how to use it for prospecting without burning your best leads.
If you’re prospecting academic researchers and you have a list of 500 names, you can’t email all of them with the same effort. You need to rank. And the single most available signal for ranking a researcher’s influence is their H-index.
It’s not a perfect metric. People in academia argue about it endlessly. But for the purpose of prioritizing outreach — deciding who gets a hand-written email versus who gets a templated one — it’s good enough, fast, and free. The trick is knowing exactly what it measures and where it lies to you.
What H-index actually measures
A researcher has an H-index of h if they have published h papers that have each been cited at least h times. An H-index of 40 means 40 papers with 40+ citations each.
What it captures well: sustained, consistent impact. You can’t get a high H-index from one viral paper. You need a body of work that other researchers repeatedly cite. So a high H-index reliably tells you someone is established, productive, and influential in their field.
What it does not capture: recency, single breakthroughs, or field norms. And those gaps are exactly where prospecting goes wrong.
The three mistakes that wreck H-index prioritization
Mistake 1: Comparing across fields. H-index is wildly field-dependent. A biomedical researcher with thousands of co-authors per paper accumulates citations far faster than a pure mathematician. An H-index of 25 in math can represent more eminence than an H-index of 60 in genomics. If your list spans multiple disciplines, never sort the whole thing by raw H-index. Bucket by field first, then rank within each bucket.
Mistake 2: Ignoring career stage. H-index only goes up over time, because citations accumulate. A 30-year veteran will out-rank a brilliant 35-year-old simply by having more time on the clock. If you sort by raw H-index, you systematically over-weight people near retirement and under-weight rising stars who are often more receptive to industry conversations. Look at the m-index (H-index divided by years since first publication) when you want to surface momentum rather than tenure.
Mistake 3: Treating it as a buying signal. A high H-index tells you someone is influential. It tells you nothing about whether they want to talk to a vendor, license a technology, or join a company. Influence and intent are different axes. Use H-index for influence; use other signals (patents, startup affiliations, recent grants) for intent.
How to actually segment a list with H-index
Here’s a practical three-tier scheme that works for most academic GTM motions.
Tier 1: High influence, recent activity
Top quartile of H-index within their field, with at least one publication in the last 18 months. These are your key opinion leaders. Worth a fully personalized email referencing their specific recent work. They influence what their peers buy, cite, and adopt — landing one is worth landing ten mid-tier researchers.
Tier 2: Rising influence
Strong m-index but moderate absolute H-index — meaning they’re accumulating citations fast for their career stage. These researchers are often the highest-converting segment for industry outreach. They’re ambitious, building their reputation, and far more open to collaboration, advisory roles, or new tools than entrenched senior faculty. Don’t let raw H-index hide them.
Tier 3: Active but lower influence
Publishing regularly but lower citation impact. Fine for templated, scaled outreach. Don’t spend personalization budget here until tiers 1 and 2 are exhausted.
Where to pull H-index data
You have a few options, and they don’t agree with each other:
- Google Scholar has the broadest coverage and tends to report the highest numbers (it counts more sources). Good for ranking, but not API-friendly.
- Scopus and Web of Science are more curated and stricter, so numbers run lower. Common in formal research-analytics tools.
- OpenAlex is open, free, and API-accessible. It doesn’t hand you an H-index directly, but you can compute one from its citation data — which means you control the field-normalization yourself.
The important thing: pick one source and use it consistently across your whole list. Comparing a Google Scholar H-index to a Scopus one is comparing two different rulers.
Combine H-index with intent — don’t rank on it alone
The mature version of this looks like a two-dimensional grid. One axis is influence (H-index, field-normalized). The other is intent (industry bridge signals — patents, startup roles, industry co-authorship, recent industry-funded grants).
The researchers you want first are high on both axes: influential and commercially engaged. A field-leading scientist who also holds three patents and advises a startup is the highest-value target on any list. A field-leading scientist with a purely academic trajectory might be eminent but completely uninterested in your pitch.
Automating it
Computing field-normalized H-index across a list of hundreds of researchers by hand is a non-starter. This is exactly the kind of enrichment Sci-Buy runs automatically — pulling citation data, normalizing by field, and layering it against industry bridge signals so your list comes back ranked by both influence and intent.
sci-buy search "single-cell genomics" --institution "Broad Institute"
sci-buy rank --by influence,intent
The output is a prioritized list: who to write by hand, who to template, and who to skip. That’s the whole point of ranking — spend your best effort where it converts.
npx sci-buy@latest